Nutch - Dev

This forum is an archive for the mailing list dev@nutch.apache.org (more options) Messages posted here will be sent to this mailing list.
If you'd like to contribute to Nutch, please subscribe to the Nutch developer mailing list.
1 ... 555556557558559560561 ... 612
Topics (21405)
Replies Last Post Views
Nutch developer needed by Geoffrey McCaleb
0
by Geoffrey McCaleb
[jira] Updated: (NUTCH-25) needs 'character encoding' detector by Nick Burch (Jira)
0
by Nick Burch (Jira)
Clustering patches ready for review/ commit. by Dawid Weiss
0
by Dawid Weiss
[jira] Closed: (NUTCH-237) Carrot2 clustering plugin upgrade. by Nick Burch (Jira)
0
by Nick Burch (Jira)
[jira] Created: (NUTCH-397) porting clustering-carrot2 plugin to carrot2 v2.0 by Nick Burch (Jira)
4
by Nick Burch (Jira)
Redirects and alias handling (LONG) by Andrzej Białecki-2
10
by Doğacan Güney-3
[jira] Created: (NUTCH-439) Top Level Domains Indexing / Scoring by Nick Burch (Jira)
19
by Nick Burch (Jira)
[jira] Created: (NUTCH-543) CLONE -some problem about the Nutch cache by Nick Burch (Jira)
1
by Nick Burch (Jira)
[jira] Created: (NUTCH-542) Null Pointer Exception on getSummary when segment no longer exists by Nick Burch (Jira)
0
by Nick Burch (Jira)
How to get results without a query based on the date by aditya naga hemanth ...
1
by wuqi-2
Using Nutch LanguageIdentifierPlugin in Apache UIMA by Michael Baessler
2
by Michael Baessler
Has anybody successfully used org.apache.lucene.search.similar.MoreLikeThis by Doan, Tan
1
by Doan, Tan
NutchSimilarity#coord() by Enis Soztutar
0
by Enis Soztutar
nutch plugin-analyser language identifier by saran
0
by saran
Is there any chance that my patches will be considered? by Marcin Okraszewski-3
3
by Doğacan Güney-3
[jira] Created: (NUTCH-535) ParseData's contentMeta accumulates unnecessary values during parse by Nick Burch (Jira)
7
by Nick Burch (Jira)
[jira] Created: (NUTCH-536) Reduce number of warnings in nutch core by Nick Burch (Jira)
5
by Nick Burch (Jira)
[jira] Created: (NUTCH-522) Use URLValidator in the Injector by Nick Burch (Jira)
19
by Nick Burch (Jira)
[jira] Updated: (NUTCH-25) needs 'character encoding' detector by Nick Burch (Jira)
0
by Nick Burch (Jira)
[jira] Created: (NUTCH-537) TestMP3Parser.java, TestRTFParser.java, TestMSWordParser.java compile by Nick Burch (Jira)
4
by Nick Burch (Jira)
[jira] Updated: (NUTCH-25) needs 'character encoding' detector by Nick Burch (Jira)
0
by Nick Burch (Jira)
[jira] Issue Comment Edited: (NUTCH-25) needs 'character encoding' detector by Nick Burch (Jira)
0
by Nick Burch (Jira)
nutch stops randomly while crawling by Eric Benavente
0
by Eric Benavente
[jira] Commented: (NUTCH-25) needs 'character encoding' detector by Nick Burch (Jira)
0
by Nick Burch (Jira)
[jira] Commented: (NUTCH-25) needs 'character encoding' detector by Nick Burch (Jira)
0
by Nick Burch (Jira)
[jira] Issue Comment Edited: (NUTCH-25) needs 'character encoding' detector by Nick Burch (Jira)
0
by Nick Burch (Jira)
[jira] Updated: (NUTCH-25) needs 'character encoding' detector by Nick Burch (Jira)
0
by Nick Burch (Jira)
[jira] Created: (NUTCH-533) LinkDbMerger: url normlaized is not updated in the key and inlinks list by Nick Burch (Jira)
8
by Nick Burch (Jira)
[jira] Created: (NUTCH-520) A common infrastructure for different index backends by Nick Burch (Jira)
5
by Nick Burch (Jira)
Pages in UTF-16 by Blaž Smolnikar
0
by Blaž Smolnikar
[jira] Created: (NUTCH-514) Indexer should only index pages with fetch status SUCCESS by Nick Burch (Jira)
6
by Nick Burch (Jira)
[jira] Created: (NUTCH-516) Next fetch time is not set when it is a CrawlDatum.STATUS_FETCH_GONE by Nick Burch (Jira)
9
by Nick Burch (Jira)
[jira] Commented: (NUTCH-25) needs 'character encoding' detector by Nick Burch (Jira)
0
by Nick Burch (Jira)
[jira] Commented: (NUTCH-25) needs 'character encoding' detector by Nick Burch (Jira)
0
by Nick Burch (Jira)
[jira] Commented: (NUTCH-25) needs 'character encoding' detector by Nick Burch (Jira)
0
by Nick Burch (Jira)
1 ... 555556557558559560561 ... 612