Nutch - Dev

This forum is an archive for the mailing list dev@nutch.apache.org (more options) Messages posted here will be sent to this mailing list.
If you'd like to contribute to Nutch, please subscribe to the Nutch developer mailing list.
1 ... 597598599600601602603 ... 616
Topics (21530)
Replies Last Post Views
[jira] Created: (NUTCH-157) Problem during parsing msword document . It fetching properly but parsing is not working. Please show me the way how can i parse it by Mihir Sharma (Jira)
0
by Mihir Sharma (Jira)
[jira] Created: (NUTCH-92) DistributedSearch incorrectly scores results by Mihir Sharma (Jira)
5
by Mihir Sharma (Jira)
[jira] Closed: (NUTCH-55) Create dmoz.org search plugin - incorporate the dmoz.org title/category/description if available & by Mihir Sharma (Jira)
0
by Mihir Sharma (Jira)
[jira] Created: (NUTCH-154) Unable to add/update new files to fetchlist/fetcher and thus index, when u rerun crawl tool on same db. by Mihir Sharma (Jira)
1
by Mihir Sharma (Jira)
[jira] Commented: (NUTCH-55) Create dmoz.org search plugin - incorporate the dmoz.org title/category/description if available & by Mihir Sharma (Jira)
0
by Mihir Sharma (Jira)
[jira] Created: (NUTCH-128) second configuration nodes overwrites first node by Mihir Sharma (Jira)
4
by Mihir Sharma (Jira)
Fwd: bug in Nutch wiki - FAQ by Stefan Groschupf-2
0
by Stefan Groschupf-2
failure with crawl using 12/23 trunk by Byron Miller-2
0
by Byron Miller-2
[jira] Created: (NUTCH-147) nutch map reduce does not work in windows map reduce runs in a loop by Mihir Sharma (Jira)
2
by Mihir Sharma (Jira)
[jira] Created: (NUTCH-148) org.apache.nutch.tools.CrawlTool throws error while doing deleteduplicates by Mihir Sharma (Jira)
5
by Mihir Sharma (Jira)
Removing old classes from trunk/ by Andrzej Białecki-2
1
by Stefan Groschupf-2
Static initializers by Andrzej Białecki-2
6
by marcel.schnippe
Commons HttpClient 3.0 released by Stefan Groschupf-2
1
by Andrzej Białecki-2
nutch-0.8-dev *mapred.input.subdir* problem ? by Lukáš Vlček
4
by Paul E. Baclace
Crawling a nutch index with Lucene by Oliver Hummel
2
by Oliver Hummel
nightly build by tigger .
1
by Stefan Groschupf-2
NDFS Connection reset by Jack.Tang
3
by Paul E. Baclace
[bug] overwriting job properties until runtime is not possible by Stefan Groschupf-2
2
by Stefan Groschupf-2
[jira] Created: (NUTCH-145) ant build of the war fie fails on Chinese (zh) .xml files due to UTF-8 BOM by Mihir Sharma (Jira)
4
by Mihir Sharma (Jira)
[jira] Created: (NUTCH-146) mapred.job.tracker.info.port is defined 2 times in the nutch-default.xml by Mihir Sharma (Jira)
1
by Mihir Sharma (Jira)
Re: [Nutch-dev] distributed seach by Stefan Groschupf-2
11
by Rolando H. Martinell...
nutch and google suggestion by Jack.Tang
2
by Jack.Tang
Latest version of Mapred by Rafi Iz
3
by Jérôme Charron
no nightly builds until 27 December by Doug Cutting-2
0
by Doug Cutting-2
[jira] Commented: (NUTCH-3) multi values of header discarded by Mihir Sharma (Jira)
0
by Mihir Sharma (Jira)
Re: svn commit: r357334 - in /lucene/nutch/trunk: conf/nutch-default.xml src/java/org/apache/nutch/protocol/Content.java src/java/org/apache/nutch/protocol/ContentProperties.java by Doug Cutting-2
2
by Doug Cutting-2
[jira] Created: (NUTCH-144) corrupt language identifier tri files and bad language recognition for german by Mihir Sharma (Jira)
2
by Mihir Sharma (Jira)
[jira] Updated: (NUTCH-3) multi values of header discarded by Mihir Sharma (Jira)
0
by Mihir Sharma (Jira)
[jira] Reopened: (NUTCH-3) multi values of header discarded by Mihir Sharma (Jira)
0
by Mihir Sharma (Jira)
[jira] Commented: (NUTCH-3) multi values of header discarded by Mihir Sharma (Jira)
0
by Mihir Sharma (Jira)
[jira] Commented: (NUTCH-3) multi values of header discarded by Mihir Sharma (Jira)
0
by Mihir Sharma (Jira)
[jira] Commented: (NUTCH-3) multi values of header discarded by Mihir Sharma (Jira)
0
by Mihir Sharma (Jira)
Boolean search support by Nguyen Ngoc Giang
0
by Nguyen Ngoc Giang
[jira] Commented: (NUTCH-3) multi values of header discarded by Mihir Sharma (Jira)
0
by Mihir Sharma (Jira)
[jira] Closed: (NUTCH-3) multi values of header discarded by Mihir Sharma (Jira)
0
by Mihir Sharma (Jira)
1 ... 597598599600601602603 ... 616