Nutch - Dev

This forum is an archive for the mailing list dev@nutch.apache.org (more options) Messages posted here will be sent to this mailing list.
If you'd like to contribute to Nutch, please subscribe to the Nutch developer mailing list.
1 ... 523524525526527528529 ... 583
Topics (20390)
Replies Last Post Views
open source enterprise content search solution based on Nutch - http://nutch-iice.sourceforge.net by joel gump
0
by joel gump
Quote Please? by James Phillips-2
0
by James Phillips-2
Upgrading Nutch to Hadoop 0.14 or 0.15 by Dennis Kubes-2
2
by Dennis Kubes-2
Update to URL ordering from Generator.java by Ned Rockson-3
5
by kkrugler
Optimizing nutch crawl for fastest performance by Tranquil
0
by Tranquil
How to write a parse plugin and not get NullPointerException on ParseData by Tranquil
1
by Tranquil
web2 plugin by Rajasekar Karthik
0
by Rajasekar Karthik
[jira] Created: (NUTCH-569) Protocol plugins should report progress to the fetcher by JIRA jira@apache.org
0
by JIRA jira@apache.org
[jira] Created: (NUTCH-568) Indexer does not update the Lucene "TITLE" field by JIRA jira@apache.org
2
by JIRA jira@apache.org
Nutch/Lucene unique ID for every item crawled? by Sagar Vibhute-2
3
by Sagar Naik-2
Out of order key while in reduce phase by Ned Rockson
1
by Sagar Naik-2
[jira] Created: (NUTCH-488) Avoid parsing uneccessary links and get a more relevant outlink list by JIRA jira@apache.org
12
by JIRA jira@apache.org
JIRA, Resolving and Closing Issues by Dennis Kubes-2
2
by Sami Siren-2
Scoring API issues (LONG) by Andrzej Białecki-2
6
by Andrzej Białecki-2
Re: writing a new parse-exe plugin [NullPointerException] by Tranquil
0
by Tranquil
writing a new parse-exe plugin by Tranquil
3
by Tranquil
Anyone looked for a better HTML parser? by Doug Cook
3
by Dawid Weiss
Cached PDF files? by Sagar Vibhute-2
0
by Sagar Vibhute-2
Selective/Configurable HTML Parsing? by Sagar Vibhute-2
1
by Andrzej Białecki-2
[jira] Created: (NUTCH-436) Incorrect handling of relative paths when the embedded URL path is empty by JIRA jira@apache.org
15
by JIRA jira@apache.org
How to add a field to results? by Sagar Vibhute-2
0
by Sagar Vibhute-2
Choices in Nutch Web interface? by Christopher Bader-2
2
by Susam Pal
[jira] Created: (NUTCH-562) Port mime type framework to use Tika mime detection framework by JIRA jira@apache.org
12
by chrismattmann
download code works in fetch class but not in plugins class by Tranquil
0
by Tranquil
Solved: Downloading file types to file system by Tranquil
0
by Tranquil
Downloading file types to file system by Tranquil
3
by Tranquil
Disregard last post by Ned Rockson
0
by Ned Rockson
InvertLinks logical problem? by Ned Rockson
0
by Ned Rockson
Java Packages (missing) by Sagar Vibhute-2
2
by Sagar Vibhute-2
[jira] Created: (NUTCH-508) ${hadoop.log.dir} and ${hadoop.log.file} are not propagated to the tasktracker by JIRA jira@apache.org
7
by JIRA jira@apache.org
bug with generate performance by misc
6
by misc
Strange RemoteException thrown while doing a parse of ~64m documents by Ned Rockson
2
by Ned Rockson
First Plugin by Sagar Vibhute-2
6
by Sagar Vibhute-2
Failed Fetch Pages - Index Verification and Optimization by Rajasekar Karthik
0
by Rajasekar Karthik
Hits estimation? by Hal Fulton-2
3
by Sagar Vibhute
1 ... 523524525526527528529 ... 583