Nutch - Dev

This forum is an archive for the mailing list dev@nutch.apache.org (more options) Messages posted here will be sent to this mailing list.
If you'd like to contribute to Nutch, please subscribe to the Nutch developer mailing list.
1 ... 480481482483484485486 ... 537
Topics (18768)
Replies Last Post Views
NutchSimilarity#coord() by Enis Soztutar
0
by Enis Soztutar
nutch plugin-analyser language identifier by saran
0
by saran
Is there any chance that my patches will be considered? by Marcin Okraszewski-3
3
by Doğacan Güney-3
[jira] Created: (NUTCH-535) ParseData's contentMeta accumulates unnecessary values during parse by JIRA jira@apache.org
7
by JIRA jira@apache.org
[jira] Created: (NUTCH-536) Reduce number of warnings in nutch core by JIRA jira@apache.org
5
by JIRA jira@apache.org
[jira] Created: (NUTCH-522) Use URLValidator in the Injector by JIRA jira@apache.org
19
by JIRA jira@apache.org
[jira] Updated: (NUTCH-25) needs 'character encoding' detector by JIRA jira@apache.org
0
by JIRA jira@apache.org
[jira] Created: (NUTCH-537) TestMP3Parser.java, TestRTFParser.java, TestMSWordParser.java compile by JIRA jira@apache.org
4
by JIRA jira@apache.org
[jira] Updated: (NUTCH-25) needs 'character encoding' detector by JIRA jira@apache.org
0
by JIRA jira@apache.org
[jira] Issue Comment Edited: (NUTCH-25) needs 'character encoding' detector by JIRA jira@apache.org
0
by JIRA jira@apache.org
nutch stops randomly while crawling by Eric Benavente
0
by Eric Benavente
[jira] Commented: (NUTCH-25) needs 'character encoding' detector by JIRA jira@apache.org
0
by JIRA jira@apache.org
[jira] Commented: (NUTCH-25) needs 'character encoding' detector by JIRA jira@apache.org
0
by JIRA jira@apache.org
[jira] Issue Comment Edited: (NUTCH-25) needs 'character encoding' detector by JIRA jira@apache.org
0
by JIRA jira@apache.org
[jira] Updated: (NUTCH-25) needs 'character encoding' detector by JIRA jira@apache.org
0
by JIRA jira@apache.org
[jira] Created: (NUTCH-533) LinkDbMerger: url normlaized is not updated in the key and inlinks list by JIRA jira@apache.org
8
by JIRA jira@apache.org
[jira] Created: (NUTCH-520) A common infrastructure for different index backends by JIRA jira@apache.org
5
by JIRA jira@apache.org
Pages in UTF-16 by Blaž Smolnikar
0
by Blaž Smolnikar
[jira] Created: (NUTCH-514) Indexer should only index pages with fetch status SUCCESS by JIRA jira@apache.org
6
by JIRA jira@apache.org
[jira] Created: (NUTCH-516) Next fetch time is not set when it is a CrawlDatum.STATUS_FETCH_GONE by JIRA jira@apache.org
9
by JIRA jira@apache.org
[jira] Commented: (NUTCH-25) needs 'character encoding' detector by JIRA jira@apache.org
0
by JIRA jira@apache.org
[jira] Commented: (NUTCH-25) needs 'character encoding' detector by JIRA jira@apache.org
0
by JIRA jira@apache.org
[jira] Commented: (NUTCH-25) needs 'character encoding' detector by JIRA jira@apache.org
0
by JIRA jira@apache.org
CrawlDbReader TopN by Emmanuel JOKE
0
by Emmanuel JOKE
[jira] Commented: (NUTCH-25) needs 'character encoding' detector by JIRA jira@apache.org
0
by JIRA jira@apache.org
[jira] Updated: (NUTCH-25) needs 'character encoding' detector by JIRA jira@apache.org
0
by JIRA jira@apache.org
[jira] Updated: (NUTCH-25) needs 'character encoding' detector by JIRA jira@apache.org
0
by JIRA jira@apache.org
[jira] Updated: (NUTCH-25) needs 'character encoding' detector by JIRA jira@apache.org
0
by JIRA jira@apache.org
[jira] Commented: (NUTCH-25) needs 'character encoding' detector by JIRA jira@apache.org
0
by JIRA jira@apache.org
searchserver failover problem by Nathan Wilkinson
0
by Nathan Wilkinson
[jira] Created: (NUTCH-523) web2 searchform problems with patch by JIRA jira@apache.org
1
by JIRA jira@apache.org
[jira] Commented: (NUTCH-25) needs 'character encoding' detector by JIRA jira@apache.org
0
by JIRA jira@apache.org
[jira] Issue Comment Edited: (NUTCH-25) needs 'character encoding' detector by JIRA jira@apache.org
0
by JIRA jira@apache.org
[jira] Commented: (NUTCH-25) needs 'character encoding' detector by JIRA jira@apache.org
0
by JIRA jira@apache.org
[jira] Commented: (NUTCH-25) needs 'character encoding' detector by JIRA jira@apache.org
0
by JIRA jira@apache.org
1 ... 480481482483484485486 ... 537