Nutch - Dev

This forum is an archive for the mailing list dev@nutch.apache.org (more options) Messages posted here will be sent to this mailing list.
If you'd like to contribute to Nutch, please subscribe to the Nutch developer mailing list.
1 ... 558559560561562563564 ... 623
Topics (21789)
Replies Last Post Views
[jira] Created: (NUTCH-617) Cached Text Only by Isabelle Giguere (Ji...
1
by Isabelle Giguere (Ji...
nutch latest build - inject operation failing by DS jha
13
by esmithers
Build failed in Hudson: Nutch-trunk #362 by Apache Hudson Server
16
by Apache Hudson Server
Failing Hudson Builds by Dennis Kubes-2
5
by Nigel Daley
Filter fetching by mime type by Nynodata Development...
0
by Nynodata Development...
cannot locate "default.properties" in filesystem by David Weiser
0
by David Weiser
Java error crawling by lupin1979
0
by lupin1979
Next release? by Andrzej Białecki-2
3
by Andrzej Białecki-2
[jira] Commented: (NUTCH-44) too many search results by Isabelle Giguere (Ji...
0
by Isabelle Giguere (Ji...
[jira] Resolved: (NUTCH-44) too many search results by Isabelle Giguere (Ji...
0
by Isabelle Giguere (Ji...
[jira] Commented: (NUTCH-44) too many search results by Isabelle Giguere (Ji...
0
by Isabelle Giguere (Ji...
[jira] Updated: (NUTCH-44) too many search results by Isabelle Giguere (Ji...
0
by Isabelle Giguere (Ji...
[jira] Commented: (NUTCH-44) too many search results by Isabelle Giguere (Ji...
0
by Isabelle Giguere (Ji...
[jira] Commented: (NUTCH-44) too many search results by Isabelle Giguere (Ji...
0
by Isabelle Giguere (Ji...
[jira] Commented: (NUTCH-44) too many search results by Isabelle Giguere (Ji...
0
by Isabelle Giguere (Ji...
[jira] Assigned: (NUTCH-44) too many search results by Isabelle Giguere (Ji...
0
by Isabelle Giguere (Ji...
[jira] Created: (NUTCH-603) Add more default url normalizations by Isabelle Giguere (Ji...
7
by Isabelle Giguere (Ji...
[jira] Created: (NUTCH-608) Upgrade nutch to use released apache-tika-0.1-incubating by Isabelle Giguere (Ji...
17
by Isabelle Giguere (Ji...
[jira] Created: (NUTCH-606) Refactoring of Generator, run all urls through checks by Isabelle Giguere (Ji...
11
by Isabelle Giguere (Ji...
[jira] Created: (NUTCH-605) Change deprecated configuration methods for Hadoop by Isabelle Giguere (Ji...
5
by Isabelle Giguere (Ji...
[jira] Created: (NUTCH-607) Update build.xml to include tika jar by Isabelle Giguere (Ji...
5
by Isabelle Giguere (Ji...
[jira] Created: (NUTCH-602) Allow configurable number of handlers for search servers by Isabelle Giguere (Ji...
6
by Isabelle Giguere (Ji...
Maybe doing a 0.9.1 release by Dennis Kubes-2
2
by Dennis Kubes-2
JIRAClient by Andrzej Białecki-2
3
by Sami Siren-2
[jira] Created: (NUTCH-604) Upgrade Nutch to Lucene 2.3.0 by Isabelle Giguere (Ji...
3
by Isabelle Giguere (Ji...
[jira] Created: (NUTCH-593) Nutch crawl problem by Isabelle Giguere (Ji...
2
by Isabelle Giguere (Ji...
[jira] Created: (NUTCH-551) performance for generate is often really bad by Isabelle Giguere (Ji...
8
by Isabelle Giguere (Ji...
[jira] Created: (NUTCH-382) Fix for NUTCH-365 introduced a bug if generate.max.per.host.by.ip is enabled by Isabelle Giguere (Ji...
3
by Isabelle Giguere (Ji...
[jira] Commented: (NUTCH-339) Refactor nutch to allow fetcher improvements by Isabelle Giguere (Ji...
0
by Isabelle Giguere (Ji...
[jira] Closed: (NUTCH-339) Refactor nutch to allow fetcher improvements by Isabelle Giguere (Ji...
0
by Isabelle Giguere (Ji...
[jira] Created: (NUTCH-553) Add more normalization rules to regex-normalize file. by Isabelle Giguere (Ji...
2
by Isabelle Giguere (Ji...
Cant run twice get in SegmentReader by nadav hashimshony
0
by nadav hashimshony
problem with reading more then one urls from the DB by nadav hashimshony
0
by nadav hashimshony
ApacheCon Europe BoF for Lucene/Nutch/Solr by Grant Ingersoll-2
0
by Grant Ingersoll-2
read crawldb. by nadav hashimshony
9
by nadav hashimshony
1 ... 558559560561562563564 ... 623