Nutch - Dev

This forum is an archive for the mailing list dev@nutch.apache.org (more options) Messages posted here will be sent to this mailing list.
If you'd like to contribute to Nutch, please subscribe to the Nutch developer mailing list.
1 ... 593594595596597598599 ... 617
Topics (21573)
Replies Last Post Views
[jira] Assigned: (NUTCH-48) "Did you mean" query enhancement/refignment feature request by Clark Perkins (Jira)
0
by Clark Perkins (Jira)
[jira] Created: (NUTCH-184) Serbian (sr, Cyrilic) and Serbo-Croatian (sh, Latin) translation by Clark Perkins (Jira)
3
by Clark Perkins (Jira)
[jira] Created: (NUTCH-123) Cache.jsp some times generate NullPointerException by Clark Perkins (Jira)
3
by Clark Perkins (Jira)
[jira] Commented: (NUTCH-165) object pooling for nutch bean --- to impriove performance by Clark Perkins (Jira)
0
by Clark Perkins (Jira)
[jira] Created: (NUTCH-118) FAQ link points to invalid URL by Clark Perkins (Jira)
1
by Clark Perkins (Jira)
[jira] Created: (NUTCH-137) footer is not displayed in search result page by Clark Perkins (Jira)
1
by Clark Perkins (Jira)
A little hack: retrieve only new urls by Enrico Triolo-2
0
by Enrico Triolo-2
RE: Authentication / Content-type by T. Kuro Kurosaka
2
by Thushara Wijeratna
[jira] Created: (NUTCH-198) SWF parser by Clark Perkins (Jira)
7
by Clark Perkins (Jira)
[jira] Commented: (NUTCH-23) content text/xml parser by Clark Perkins (Jira)
1
by Rida Benjelloun
[jira] Commented: (NUTCH-53) Parser plugin for Zip files by Clark Perkins (Jira)
0
by Clark Perkins (Jira)
[jira] Resolved: (NUTCH-52) Parser plugin for MS Excel files by Clark Perkins (Jira)
0
by Clark Perkins (Jira)
Word, Powerpoint and Excel parsers by Jérôme Charron
0
by Jérôme Charron
Fetch same URL two times by Fuad Efendi
0
by Fuad Efendi
[jira] Created: (NUTCH-192) meta data support for CrawlDatum by Clark Perkins (Jira)
27
by Clark Perkins (Jira)
ignore eclipse .project and .classpath by chrismattmann
3
by chrismattmann
[jira] Created: (NUTCH-209) include nutch jar in mapred jobs by Clark Perkins (Jira)
5
by Clark Perkins (Jira)
Empty Parse by Jérôme Charron
3
by Jérôme Charron
whitespaces was: meta data support for CrawlDatum by Stefan Groschupf-2
4
by Stefan Groschupf-2
Jakarta-POI 3.0-alpha1 by Jérôme Charron
0
by Jérôme Charron
process/create/hand over: crawl meta data by Stefan Groschupf-2
1
by Jack.Tang
[jira] Created: (NUTCH-139) Standard metadata property names in the ParseData metadata by Clark Perkins (Jira)
67
by Clark Perkins (Jira)
Success with Nutch & GCJ by Andrzej Białecki-2
0
by Andrzej Białecki-2
No node available for block <blockID> errors by Chris Schneider-2
0
by Chris Schneider-2
[jira] Created: (NUTCH-149) outlinks not shown properly in cached.jsp by Clark Perkins (Jira)
3
by Clark Perkins (Jira)
[jira] Created: (NUTCH-158) Process Sitemap data in text, rss or xml format as well as OAI-PMH by Clark Perkins (Jira)
1
by Clark Perkins (Jira)
tool to mount nutch filesystem by John X
8
by John X
[OT] Mailing lists by Andrew McNabb
1
by Doug Cutting
[jira] Created: (NUTCH-207) Bandwidth target for fetcher rather than a thread count by Clark Perkins (Jira)
2
by Clark Perkins (Jira)
[jira] Created: (NUTCH-193) move NDFS and MapReduce to a separate project by Clark Perkins (Jira)
15
by Clark Perkins (Jira)
Some bugs I'm trying to characterize.... by Bryan A. P. Pendleto...
1
by michael_cafarella
[jira] Created: (NUTCH-81) Webapp only works when deployed in root by Clark Perkins (Jira)
6
by Clark Perkins (Jira)
RE: takes too long to remove a page from WEBDB by Fuad Efendi
2
by Stefan Groschupf-2
javaswf.jar by Jérôme Charron
2
by Stefan Groschupf-2
[jira] Created: (NUTCH-200) OpenSearch Servlet ist broken by Clark Perkins (Jira)
3
by Clark Perkins (Jira)
1 ... 593594595596597598599 ... 617