Nutch - Dev

This forum is an archive for the mailing list dev@nutch.apache.org. Messages posted here will be sent to this mailing list.
If you'd like to contribute to Nutch, please subscribe to the Nutch developer mailing list.
Topics (20421)
Replies Last Post Views
Failing Hudson Builds by Dennis Kubes-2
5
by Nigel Daley
Filter fetching by mime type by Nynodata Development...
0
by Nynodata Development...
cannot locate "default.properties" in filesystem by David Weiser
0
by David Weiser
Java error crawling by lupin1979
0
by lupin1979
Next release? by Andrzej Białecki-2
3
by Andrzej Białecki-2
[jira] Commented: (NUTCH-44) too many search results by JIRA jira@apache.org
0
by JIRA jira@apache.org
[jira] Resolved: (NUTCH-44) too many search results by JIRA jira@apache.org
0
by JIRA jira@apache.org
[jira] Commented: (NUTCH-44) too many search results by JIRA jira@apache.org
0
by JIRA jira@apache.org
[jira] Updated: (NUTCH-44) too many search results by JIRA jira@apache.org
0
by JIRA jira@apache.org
[jira] Commented: (NUTCH-44) too many search results by JIRA jira@apache.org
0
by JIRA jira@apache.org
[jira] Commented: (NUTCH-44) too many search results by JIRA jira@apache.org
0
by JIRA jira@apache.org
[jira] Commented: (NUTCH-44) too many search results by JIRA jira@apache.org
0
by JIRA jira@apache.org
[jira] Assigned: (NUTCH-44) too many search results by JIRA jira@apache.org
0
by JIRA jira@apache.org
[jira] Created: (NUTCH-603) Add more default url normalizations by JIRA jira@apache.org
7
by JIRA jira@apache.org
[jira] Created: (NUTCH-608) Upgrade nutch to use released apache-tika-0.1-incubating by JIRA jira@apache.org
17
by JIRA jira@apache.org
[jira] Created: (NUTCH-606) Refactoring of Generator, run all urls through checks by JIRA jira@apache.org
11
by JIRA jira@apache.org
[jira] Created: (NUTCH-605) Change deprecated configuration methods for Hadoop by JIRA jira@apache.org
5
by JIRA jira@apache.org
[jira] Created: (NUTCH-607) Update build.xml to include tika jar by JIRA jira@apache.org
5
by JIRA jira@apache.org
[jira] Created: (NUTCH-602) Allow configurable number of handlers for search servers by JIRA jira@apache.org
6
by JIRA jira@apache.org
Maybe doing a 0.9.1 release by Dennis Kubes-2
2
by Dennis Kubes-2
JIRAClient by Andrzej Białecki-2
3
by Sami Siren-2
[jira] Created: (NUTCH-604) Upgrade Nutch to Lucene 2.3.0 by JIRA jira@apache.org
3
by JIRA jira@apache.org
[jira] Created: (NUTCH-593) Nutch crawl problem by JIRA jira@apache.org
2
by JIRA jira@apache.org
[jira] Created: (NUTCH-551) performance for generate is often really bad by JIRA jira@apache.org
8
by JIRA jira@apache.org
[jira] Created: (NUTCH-382) Fix for NUTCH-365 introduced a bug if generate.max.per.host.by.ip is enabled by JIRA jira@apache.org
3
by JIRA jira@apache.org
[jira] Commented: (NUTCH-339) Refactor nutch to allow fetcher improvements by JIRA jira@apache.org
0
by JIRA jira@apache.org
[jira] Closed: (NUTCH-339) Refactor nutch to allow fetcher improvements by JIRA jira@apache.org
0
by JIRA jira@apache.org
[jira] Created: (NUTCH-553) Add more normalization rules to regex-normalize file. by JIRA jira@apache.org
2
by JIRA jira@apache.org
Cant run twice get in SegmentReader by nadav hashimshony
0
by nadav hashimshony
problem with reading more then one urls from the DB by nadav hashimshony
0
by nadav hashimshony
ApacheCon Europe BoF for Lucene/Nutch/Solr by Grant Ingersoll-2
0
by Grant Ingersoll-2
read crawldb. by nadav hashimshony
9
by nadav hashimshony
cache page return http 500 in 1.0-dev (rev 616745) by Vinci
2
by Andrzej Białecki-2
Reg: Nutch Admin GUI by prafulla
1
by Andrzej Białecki-2
Build failed in Hudson: Nutch-trunk #343 by Apache Hudson Server
2
by Apache Hudson Server