Nutch - Dev

This forum is an archive for the mailing list dev@nutch.apache.org (more options) Messages posted here will be sent to this mailing list.
If you'd like to contribute to Nutch, please subscribe to the Nutch developer mailing list.
1 ... 561562563564565566567 ... 588
Topics (20565)
Replies Last Post Views
[jira] Created: (NUTCH-196) lib-xml and lib-log4j plugins by Michael Gibney (Jira...
9
by Andrzej Białecki-2
Cygwin broken (Re: [Nutch-cvs] svn commit: r388310 - ...) by Andrzej Białecki-2
1
by Doug Cutting
Spelling suggestion for RSS Feed by Aled Jones
1
by Jérôme Charron
[jira] Created: (NUTCH-232) Search.jsp has multiple search forms creating invalid html / incorrect focus function by Michael Gibney (Jira...
1
by Michael Gibney (Jira...
hi,how to use the ICTCLASCall by 吴志敏
4
by 吴志敏
ICTCLAS with nutch 0.7.1. by 吴志敏
0
by 吴志敏
[jira] Created: (NUTCH-231) Invalid CSS entries by Michael Gibney (Jira...
2
by Michael Gibney (Jira...
[jira] Commented: (NUTCH-48) "Did you mean" query enhancement/refignment feature request by Michael Gibney (Jira...
0
by Michael Gibney (Jira...
[jira] Created: (NUTCH-210) Context.xml file for Nutch web application by Michael Gibney (Jira...
4
by Michael Gibney (Jira...
[jira] Created: (NUTCH-239) I changed httpclient to use javax.net.ssl instead of com.sun.net.ssl by Michael Gibney (Jira...
2
by Richard Braman
Nutch 0.7.2 by Piotr Kosiorowski
4
by Piotr Kosiorowski
[jira] Created: (NUTCH-117) Crawl crashes with java.io.IOException: already exists: C:\nutch\crawl.intranet\oct18\db\webdb.new\pagesByURL by Michael Gibney (Jira...
5
by Michael Gibney (Jira...
[jira] Closed: (NUTCH-14) NullPointerException NutchBean.getSummary by Michael Gibney (Jira...
0
by Michael Gibney (Jira...
[jira] Closed: (NUTCH-23) content text/xml parser by Michael Gibney (Jira...
0
by Michael Gibney (Jira...
[jira] Resolved: (NUTCH-23) content text/xml parser by Michael Gibney (Jira...
0
by Michael Gibney (Jira...
[jira] Created: (NUTCH-96) MapFile.Writer throws directory exists exception if run multiple times in the same JVM or server JVM. by Michael Gibney (Jira...
2
by Michael Gibney (Jira...
[jira] Created: (NUTCH-94) MapFile.Writer throwing 'File exists error'. by Michael Gibney (Jira...
3
by Michael Gibney (Jira...
[jira] Closed: (NUTCH-165) object pooling for nutch bean --- to impriove performance by Michael Gibney (Jira...
0
by Michael Gibney (Jira...
[jira] Closed: (NUTCH-24) Cannot handle incorrectly cased Content-Type by Michael Gibney (Jira...
0
by Michael Gibney (Jira...
[jira] Resolved: (NUTCH-24) Cannot handle incorrectly cased Content-Type by Michael Gibney (Jira...
0
by Michael Gibney (Jira...
[jira] Closed: (NUTCH-34) Parsing different content formats by Michael Gibney (Jira...
0
by Michael Gibney (Jira...
[jira] Resolved: (NUTCH-34) Parsing different content formats by Michael Gibney (Jira...
0
by Michael Gibney (Jira...
[jira] Created: (NUTCH-185) XMLParser is configurable plugin. It use XPath and namespaces to do the mapping between the XML elements and Lucene fields. by Michael Gibney (Jira...
5
by Michael Gibney (Jira...
Carrot2 upgrade patch by Dawid Weiss
0
by Dawid Weiss
[jira] Created: (NUTCH-234) Clustering extension code cleanups and a real JUnit test case for the current implementation. by Michael Gibney (Jira...
2
by Michael Gibney (Jira...
[jira] Created: (NUTCH-235) Duplicate Inlink values by Michael Gibney (Jira...
7
by Michael Gibney (Jira...
RegexURLFilter file attribute by Jérôme Charron
0
by Jérôme Charron
Duplicate Inlink problem by Andrzej Białecki-2
0
by Andrzej Białecki-2
Much faster RegExp lib needed in nutch? by Jack.Tang
26
by Stefan Groschupf-2
Crawling Accuracy by carmmello
1
by kkrugler
Searching on a particular domain by MagRaj
0
by MagRaj
update linkdb by Marko Bauhardt-2
3
by Andrzej Białecki-2
OPIC score calculation issues by Andrzej Białecki-2
3
by Doug Cutting
[proposal] catching session-id urls by kangas
2
by kangas
Null Pointer exception in AnalyzerFactory? by chrismattmann
2
by chrismattmann
1 ... 561562563564565566567 ... 588