Nutch - Dev

This forum is an archive for the mailing list dev@nutch.apache.org. Messages posted here will be sent to this mailing list.
If you'd like to contribute to Nutch, please subscribe to the Nutch developer mailing list.
Pages: 1 ... 544 545 546 547 548 549 550 ... 583
Topics (20384)
Replies Last Post Views
[jira] Created: (NUTCH-377) Add possibility to search for multiple values by JIRA jira@apache.org
2
by JIRA jira@apache.org
[jira] Created: (NUTCH-376) Add methods to control runtime behaviour of NutchBean by JIRA jira@apache.org
0
by JIRA jira@apache.org
[jira] Created: (NUTCH-374) when http.content.limit be set to -1 and Response.CONTENT_ENCODING is gzip or x-gzip , it can not fetch any thing. by JIRA jira@apache.org
4
by JIRA jira@apache.org
[jira] Created: (NUTCH-361) generator create fetchlist randomly by JIRA jira@apache.org
19
by JIRA jira@apache.org
[jira] Created: (NUTCH-375) Link to 0.8.x apidocs broken on website by JIRA jira@apache.org
1
by JIRA jira@apache.org
[jira] Created: (NUTCH-351) Protocol forward proxy by JIRA jira@apache.org
3
by JIRA jira@apache.org
Searching on fields with uppercase letters by Enrico Triolo-2
2
by Enrico Triolo-2
[jira] Created: (NUTCH-373) Fetcher halting and throttling by JIRA jira@apache.org
1
by JIRA jira@apache.org
[jira] Created: (NUTCH-372) Fetcher halting and throttling by JIRA jira@apache.org
1
by JIRA jira@apache.org
[jira] Created: (NUTCH-368) Message queueing system by JIRA jira@apache.org
7
by JIRA jira@apache.org
Modifications necessary to upgrade to Hadoop 0.6.2 by Marcel Petrisor
0
by Marcel Petrisor
[jira] Created: (NUTCH-370) Generator loosed urls when run with LocalJobRunner by JIRA jira@apache.org
2
by JIRA jira@apache.org
[jira] Created: (NUTCH-344) Fetcher threads blocked on synchronized block in cleanExpiredServerBlocks by JIRA jira@apache.org
7
by JIRA jira@apache.org
[jira] Created: (NUTCH-338) Remove the text parser as an option for parsing PDF files in parse-plugins.xml by JIRA jira@apache.org
7
by JIRA jira@apache.org
[jira] Created: (NUTCH-318) log4j not proper configured, readdb doesnt give any information by JIRA jira@apache.org
12
by JIRA jira@apache.org
[jira] Created: (NUTCH-266) hadoop bug when doing updatedb by JIRA jira@apache.org
23
by JIRA jira@apache.org
[jira] Created: (NUTCH-105) Network error during robots.txt fetch causes file to be ignored by JIRA jira@apache.org
6
by JIRA jira@apache.org
0.8.1 by Sami Siren-2
4
by Sami Siren-2
[jira] Created: (NUTCH-205) Wrong 'fetch date' for non available pages by JIRA jira@apache.org
4
by JIRA jira@apache.org
[jira] Created: (NUTCH-276) db.score.link.internal problem by JIRA jira@apache.org
1
by JIRA jira@apache.org
[jira] Created: (NUTCH-350) urls blocked db.fetch.retry.max * http.max.delays times during fetching are marked as STATUS_DB_GONE by JIRA jira@apache.org
2
by JIRA jira@apache.org
[jira] Created: (NUTCH-253) Normalize Host during Generate by JIRA jira@apache.org
2
by JIRA jira@apache.org
[jira] Created: (NUTCH-337) Fetcher ignores the fetcher.parse value configured in config file by JIRA jira@apache.org
7
by JIRA jira@apache.org
[jira] Created: (NUTCH-336) Harvested links shouldn't get db.score.injected in addition to inbound contributions by JIRA jira@apache.org
3
by JIRA jira@apache.org
[jira] Created: (NUTCH-332) doubling score causes by page internal anchors. by JIRA jira@apache.org
4
by JIRA jira@apache.org
[jira] Created: (NUTCH-365) Flexible URL normalization by JIRA jira@apache.org
9
by JIRA jira@apache.org
Ant tasks/build.xml file for running Nutch in debug mode? by Jp Mutch
0
by Jp Mutch
Empty "incoming anchor text" by Zhen Zhen
3
by Richard Braman-2
CrawlDatum.modifiedTime ? by Kim, Greg
0
by Kim, Greg
[jira] Created: (NUTCH-367) DistributedSearch thown ClassCastException by JIRA jira@apache.org
1
by JIRA jira@apache.org
[jira] Created: (NUTCH-364) Javascript parser creates some fairly bogus URLs by JIRA jira@apache.org
1
by JIRA jira@apache.org
I cann't fetch wml page by yin chunhui
0
by yin chunhui
Speed of reading local files by Zhen Zhen
0
by Zhen Zhen
Time of Reading Local Files by Jane Zhen
0
by Jane Zhen