Nutch - Dev

This forum is an archive for the mailing list dev@nutch.apache.org (more options) Messages posted here will be sent to this mailing list.
If you'd like to contribute to Nutch, please subscribe to the Nutch developer mailing list.
1 ... 550551552553554555556 ... 588
Topics (20565)
Replies Last Post Views
CrawlDatum.modifiedTime ? by Kim, Greg
0
by Kim, Greg
[jira] Created: (NUTCH-367) DistributedSearch thown ClassCastException by Michael Gibney (Jira...
1
by Michael Gibney (Jira...
[jira] Created: (NUTCH-364) Javascript parser creates some fairly bogus URLs by Michael Gibney (Jira...
1
by Michael Gibney (Jira...
I cann't fetch wml page by yin chunhui
0
by yin chunhui
Speed of reading local files by Zhen Zhen
0
by Zhen Zhen
Time of Reading Local Files by Jane Zhen
0
by Jane Zhen
[jira] Created: (NUTCH-369) StringUtil.resolveEncodingAlias is unuseful. by Michael Gibney (Jira...
0
by Michael Gibney (Jira...
A Problem about Nutch Plugin by yin chunhui
0
by yin chunhui
ask a problem about nutch (from China) by yin chunhui
2
by Howie Wang
Re: Any plans to move to build Nutchusing Maven? by sshingler
8
by Otis Gospodnetic-2-2
Patch Available status? by chrismattmann
11
by Otis Gospodnetic-2-2
File system watching for intranets by Ben Ogle
2
by Ben Ogle
[jira] Created: (NUTCH-366) Merge URLFilters and URLNormalizers by Michael Gibney (Jira...
1
by Federico Dal Maso
I use eclipse to run NutchAnalysis.java, but it meet QueryFilter RunTime error by heack
0
by heack
Help: DistributedSearch thown ClassCastException by emanihc
0
by emanihc
How could I test my modify to NutchAnalysis.jj? by heack
2
by heack
[jira] Created: (NUTCH-363) Fetcher normalizes everything at least twice by Michael Gibney (Jira...
0
by Michael Gibney (Jira...
[jira] Created: (NUTCH-339) Refactor nutch to allow fetcher improvements by Michael Gibney (Jira...
18
by Michael Gibney (Jira...
Ontology compile bug by Michael Wechner
3
by Michael Wechner
HTTP/1.1 problem by Doğacan Güney-2
1
by Otis Gospodnetic-2-2
[jira] Created: (NUTCH-359) extraction of links will fail for whole page if one single link cannot be parsed by Michael Gibney (Jira...
1
by Michael Gibney (Jira...
[jira] Created: (NUTCH-273) When a page is redirected, the original url is NOT updated. by Michael Gibney (Jira...
4
by Michael Gibney (Jira...
[jira] Created: (NUTCH-362) Remove parse-text from unsupported filetypes in parse-plugins.xml by Michael Gibney (Jira...
0
by Michael Gibney (Jira...
[jira] Created: (NUTCH-208) http: proxy exception list: by Michael Gibney (Jira...
4
by Michael Gibney (Jira...
log error in deploying nutch-0.9-dev.jar by AJ Chen-2
1
by AJ Chen-2
[Fwd: Re: get CrawlDatum] by Uroš Gruber-2
2
by Uroš Gruber-2
Nutch nightly build failure by Nutch - Dev mailing ...
0
by Nutch - Dev mailing ...
Content-type detection for Tika by Jukka Zitting
1
by Jérôme Charron
problem with hadoop by Richard Braman
2
by Richard Braman
Nutch nightly build failure by Nutch - Dev mailing ...
0
by Nutch - Dev mailing ...
[jira] Created: (NUTCH-249) black- white list url filtering by Michael Gibney (Jira...
10
by Uroš Gruber-2
several url to search for [multiple url] by dee-2
0
by dee-2
[jira] Created: (NUTCH-360) Switch nutch to use java 5 source format by Michael Gibney (Jira...
1
by Michael Gibney (Jira...
LuceneQueryOptimizer and no query by daniel rosher
0
by daniel rosher
[jira] Created: (NUTCH-358) Language Switching by Michael Gibney (Jira...
2
by Michael Gibney (Jira...
1 ... 550551552553554555556 ... 588