Nutch - Dev

This forum is an archive for the mailing list dev@nutch.apache.org (more options) Messages posted here will be sent to this mailing list.
If you'd like to contribute to Nutch, please subscribe to the Nutch developer mailing list.
1 ... 552553554555556557558 ... 617
Topics (21573)
Replies Last Post Views
Java error crawling by lupin1979
0
by lupin1979
Next release? by Andrzej Białecki-2
3
by Andrzej Białecki-2
[jira] Commented: (NUTCH-44) too many search results by Clark Perkins (Jira)
0
by Clark Perkins (Jira)
[jira] Resolved: (NUTCH-44) too many search results by Clark Perkins (Jira)
0
by Clark Perkins (Jira)
[jira] Commented: (NUTCH-44) too many search results by Clark Perkins (Jira)
0
by Clark Perkins (Jira)
[jira] Updated: (NUTCH-44) too many search results by Clark Perkins (Jira)
0
by Clark Perkins (Jira)
[jira] Commented: (NUTCH-44) too many search results by Clark Perkins (Jira)
0
by Clark Perkins (Jira)
[jira] Commented: (NUTCH-44) too many search results by Clark Perkins (Jira)
0
by Clark Perkins (Jira)
[jira] Commented: (NUTCH-44) too many search results by Clark Perkins (Jira)
0
by Clark Perkins (Jira)
[jira] Assigned: (NUTCH-44) too many search results by Clark Perkins (Jira)
0
by Clark Perkins (Jira)
[jira] Created: (NUTCH-603) Add more default url normalizations by Clark Perkins (Jira)
7
by Clark Perkins (Jira)
[jira] Created: (NUTCH-608) Upgrade nutch to use released apache-tika-0.1-incubating by Clark Perkins (Jira)
17
by Clark Perkins (Jira)
[jira] Created: (NUTCH-606) Refactoring of Generator, run all urls through checks by Clark Perkins (Jira)
11
by Clark Perkins (Jira)
[jira] Created: (NUTCH-605) Change deprecated configuration methods for Hadoop by Clark Perkins (Jira)
5
by Clark Perkins (Jira)
[jira] Created: (NUTCH-607) Update build.xml to include tika jar by Clark Perkins (Jira)
5
by Clark Perkins (Jira)
[jira] Created: (NUTCH-602) Allow configurable number of handlers for search servers by Clark Perkins (Jira)
6
by Clark Perkins (Jira)
Maybe doing a 0.9.1 release by Dennis Kubes-2
2
by Dennis Kubes-2
JIRAClient by Andrzej Białecki-2
3
by Sami Siren-2
[jira] Created: (NUTCH-604) Upgrade Nutch to Lucene 2.3.0 by Clark Perkins (Jira)
3
by Clark Perkins (Jira)
[jira] Created: (NUTCH-593) Nutch crawl problem by Clark Perkins (Jira)
2
by Clark Perkins (Jira)
[jira] Created: (NUTCH-551) performance for generate is often really bad by Clark Perkins (Jira)
8
by Clark Perkins (Jira)
[jira] Created: (NUTCH-382) Fix for NUTCH-365 introduced a bug if generate.max.per.host.by.ip is enabled by Clark Perkins (Jira)
3
by Clark Perkins (Jira)
[jira] Commented: (NUTCH-339) Refactor nutch to allow fetcher improvements by Clark Perkins (Jira)
0
by Clark Perkins (Jira)
[jira] Closed: (NUTCH-339) Refactor nutch to allow fetcher improvements by Clark Perkins (Jira)
0
by Clark Perkins (Jira)
[jira] Created: (NUTCH-553) Add more normalization rules to regex-normalize file. by Clark Perkins (Jira)
2
by Clark Perkins (Jira)
Cant run twice get in SegmentReader by nadav hashimshony
0
by nadav hashimshony
problem with reading more then one urls from the DB by nadav hashimshony
0
by nadav hashimshony
ApacheCon Europe BoF for Lucene/Nutch/Solr by Grant Ingersoll-2
0
by Grant Ingersoll-2
read crawldb. by nadav hashimshony
9
by nadav hashimshony
cache page return http 500 in 1.0-dev (rev 616745) by Vinci
2
by Andrzej Białecki-2
Reg: Nutch Admin GUI by prafulla
1
by Andrzej Białecki-2
Build failed in Hudson: Nutch-trunk #343 by Apache Hudson Server
2
by Apache Hudson Server
[jira] Created: (NUTCH-587) Upgrade Nutch to use Hadoop 0.15.1 release by Clark Perkins (Jira)
6
by Clark Perkins (Jira)
Help needed? by showWayer
0
by showWayer
Build failed in Hudson: Nutch-trunk #340 by Apache Hudson Server
1
by Apache Hudson Server
1 ... 552553554555556557558 ... 617