Nutch - Dev

This forum is an archive for the mailing list dev@nutch.apache.org (more options) Messages posted here will be sent to this mailing list.
If you'd like to contribute to Nutch, please subscribe to the Nutch developer mailing list.
1 ... 556557558559560561562 ... 617
Topics (21573)
Replies Last Post Views
Usage of mapred-default.xml is deprecated in hadoop0.15.0 by Ned Rockson-3
0
by Ned Rockson-3
[jira] Created: (NUTCH-465) I download nutch 0.9 used tar zxvf nutch-0.9.tar.gz at last A lone zero block by Clark Perkins (Jira)
2
by Clark Perkins (Jira)
[jira] Created: (NUTCH-411) Parse ignores meta refresh redirection by Clark Perkins (Jira)
3
by Clark Perkins (Jira)
db.ignore.internal.links and ranking algorithms by Rajasekar Karthik
4
by Rajasekar Karthik
NullPointerException in FetchedSegments.getSummary() by John Doe-37
0
by John Doe-37
[jira] Commented: (NUTCH-572) Scoring and redirected Urls by Clark Perkins (Jira)
0
by Clark Perkins (Jira)
[jira] Commented: (NUTCH-356) Plugin repository cache can lead to memory leak by Clark Perkins (Jira)
0
by Clark Perkins (Jira)
[jira] Commented: (NUTCH-572) Scoring and redirected Urls by Clark Perkins (Jira)
0
by Clark Perkins (Jira)
Tika API by Ned Rockson-3
5
by Ned Rockson-3
JIRA emails and Nutch by Dennis Kubes-2
4
by Dennis Kubes-2
adding dmoz meta data to index. by ned@bcit
1
by Sebastian Steinmetz
MD5 vs TextProfile Signature by Rajasekar Karthik
0
by Rajasekar Karthik
[jira] Issue Comment Edited: (NUTCH-356) Plugin repository cache can lead to memory leak by Clark Perkins (Jira)
0
by Clark Perkins (Jira)
[jira] Issue Comment Edited: (NUTCH-356) Plugin repository cache can lead to memory leak by Clark Perkins (Jira)
0
by Clark Perkins (Jira)
[jira] Issue Comment Edited: (NUTCH-356) Plugin repository cache can lead to memory leak by Clark Perkins (Jira)
0
by Clark Perkins (Jira)
[jira] Issue Comment Edited: (NUTCH-356) Plugin repository cache can lead to memory leak by Clark Perkins (Jira)
0
by Clark Perkins (Jira)
[jira] Issue Comment Edited: (NUTCH-356) Plugin repository cache can lead to memory leak by Clark Perkins (Jira)
0
by Clark Perkins (Jira)
How dose the Nutch-0.9 read the configuration file? by Xin Zhang-2
1
by Tranquil
How to extract specified information from html? by jqq
4
by jqq
Nutch automatically deleting sites from search results by Rajasekar Karthik
0
by Rajasekar Karthik
plugin analyzer by Robert Benea
4
by Rajasekar Karthik
When is the Clause.getQuery().getBoost == 0? by Ned Rockson-3
1
by Andrzej Białecki-2
Next move with JIRA ticket by Ned Rockson-3
2
by Ned Rockson
[jira] Created: (NUTCH-501) implementing a different caching mechanism for objects by Clark Perkins (Jira)
16
by Clark Perkins (Jira)
Adding new class to nutch by Tranquil
2
by Tranquil
What are the side effects of running crawl multiple times? by Paolo Castagna-2
1
by Andrzej Białecki-2
[jira] Created: (NUTCH-565) Arc File to Nutch Segments Converter by Clark Perkins (Jira)
19
by Clark Perkins (Jira)
nutch to search local filesystem by Prem Kumar L
0
by Prem Kumar L
open source enterprise content search solution based on Nutch - http://nutch-iice.sourceforge.net by joel gump
0
by joel gump
Quote Please? by James Phillips-2
0
by James Phillips-2
Upgrading Nutch to Hadoop 0.14 or 0.15 by Dennis Kubes-2
2
by Dennis Kubes-2
Update to URL ordering from Generator.java by Ned Rockson-3
5
by kkrugler
Optimizing nutch crawl for fastest performance by Tranquil
0
by Tranquil
How to write a parse plugin and not get NullPointerException on ParseData by Tranquil
1
by Tranquil
web2 plugin by Rajasekar Karthik
0
by Rajasekar Karthik
1 ... 556557558559560561562 ... 617