Nutch - Dev

This forum is an archive for the mailing list dev@nutch.apache.org (more options) Messages posted here will be sent to this mailing list.
If you'd like to contribute to Nutch, please subscribe to the Nutch developer mailing list.
1 ... 554555556557558559560 ... 584
Topics (20421)
Replies Last Post Views
[jira] Created: (NUTCH-134) Summarizer doesn't select the best snippets by JIRA jira@apache.org
16
by JIRA jira@apache.org
Re: [Nutch-cvs] svn commit: r406044 - /lucene/nutch/trunk/src/plugin/build.xml by Andrzej Białecki-2
1
by Jérôme Charron
HEADS UP: Config changes related to scoring API by Andrzej Białecki-2
0
by Andrzej Białecki-2
[jira] Created: (NUTCH-240) Scoring API: extension point, scoring filters and an OPIC plugin by JIRA jira@apache.org
16
by JIRA jira@apache.org
Experiment on crawler behaviour by Andrzej Białecki-2
0
by Andrzej Białecki-2
summarizer.setConf(conf) should be removed. by Stefan Groschupf-2
0
by Stefan Groschupf-2
distance between words by luti
3
by luti
Re: svn commit: r405565 - in /lucene/nutch/trunk/src: java/org/apache/nutch/searcher/ test/org/apache/nutch/searcher/ web/jsp/ by Doug Cutting
18
by Dawid Weiss
new location! nutch user meeting San Francisco by Stefan Groschupf-2
0
by Stefan Groschupf-2
Preventing overlapped search results. by Brian Hill-3
0
by Brian Hill-3
[jira] Created: (NUTCH-267) Indexer doesn't consider linkdb when calculating boost value by JIRA jira@apache.org
5
by JIRA jira@apache.org
Interleaved (parallel) fetch cycles by Andrzej Białecki-2
1
by Doug Cutting
Issues to work on by Dennis Kubes
1
by Chris Fellows-3
dfs -report by Marko Bauhardt-2
1
by Doug Cutting
New tools: CrawlDbMerger, LinkDbMerger, SegmentMerger by Andrzej Białecki-2
6
by Andrzej Białecki-2
[jira] Created: (NUTCH-257) Summary#toString always Entity encodes -- problem for OpenSearchServlet#description field by JIRA jira@apache.org
3
by JIRA jira@apache.org
Creating different binary databases for indexing by Dennis Kubes
4
by Dennis Kubes
PATCH - Fixes for 0.8 tutorial by Lukáš Vlček
0
by Lukáš Vlček
http chunked content by Stefan Groschupf-2
8
by Chris Fellows-3
[jira] Created: (NUTCH-263) MapWritable.equals() doesn't work properly by JIRA jira@apache.org
4
by JIRA jira@apache.org
[jira] Created: (NUTCH-264) Tools for merging and filtering CrawlDb-s and LinkDb-s by JIRA jira@apache.org
2
by JIRA jira@apache.org
plugins in job file. by Stefan Groschupf-2
8
by Andrzej Białecki-2
nutch is loosing not modified pages by Stefan Groschupf-2
1
by Andrzej Białecki-2
Feature idea - Indexing Text Lengths by Hero Doug
2
by Hero Doug
generate.max.per.host is per reduce task by Chris Schneider-2
1
by Doug Cutting
CommerceNet Events » Blog Archive » T3 5/11: Stefan Groschupf on Extending Nutch by Doug Cutting
0
by Doug Cutting
Re: svn commit: r399515 - /lucene/nutch/trunk/src/java/org/apache/nutch/segment/SegmentReader.java by Doug Cutting
0
by Doug Cutting
nutch inject bug(fix) by Jochen Frey-2
0
by Jochen Frey-2
Classloader by Christopher Burkey
0
by Christopher Burkey
A Developer's getting started doc? by Andrew Libby
7
by Lukáš Vlček
Content-Type inconsistency? by Jérôme Charron
11
by Jérôme Charron
0.8 tutorial typos in Whole-web indexing? by Lukáš Vlček
0
by Lukáš Vlček
[jira] Created: (NUTCH-260) Three new plugins that parse, index and query meta tags defined in the configuration by JIRA jira@apache.org
1
by JIRA jira@apache.org
Creating a throttle by Fankhauser, Alain
4
by Doug Cutting
Php frontend by ocramp
1
by Andrew Libby
1 ... 554555556557558559560 ... 584