Nutch - Dev

This forum is an archive for the mailing list dev@nutch.apache.org (more options) Messages posted here will be sent to this mailing list.
If you'd like to contribute to Nutch, please subscribe to the Nutch developer mailing list.
1 ... 599600601602603604605 ... 623
Topics (21785)
Replies Last Post Views
[jira] Created: (NUTCH-90) reduce logging output of IndexSegment by Steve Loughran (Jira...
1
by Steve Loughran (Jira...
[jira] Created: (NUTCH-64) no results after a restart of a search--server (without tomcat restart) by Steve Loughran (Jira...
8
by Steve Loughran (Jira...
[jira] Assigned: (NUTCH-48) "Did you mean" query enhancement/refignment feature request by Steve Loughran (Jira...
0
by Steve Loughran (Jira...
[jira] Created: (NUTCH-184) Serbian (sr, Cyrilic) and Serbo-Croatian (sh, Latin) translation by Steve Loughran (Jira...
3
by Steve Loughran (Jira...
[jira] Created: (NUTCH-123) Cache.jsp some times generate NullPointerException by Steve Loughran (Jira...
3
by Steve Loughran (Jira...
[jira] Commented: (NUTCH-165) object pooling for nutch bean --- to impriove performance by Steve Loughran (Jira...
0
by Steve Loughran (Jira...
[jira] Created: (NUTCH-118) FAQ link points to invalid URL by Steve Loughran (Jira...
1
by Steve Loughran (Jira...
[jira] Created: (NUTCH-137) footer is not displayed in search result page by Steve Loughran (Jira...
1
by Steve Loughran (Jira...
A little hack: retrieve only new urls by Enrico Triolo-2
0
by Enrico Triolo-2
RE: Authentication / Content-type by T. Kuro Kurosaka
2
by Thushara Wijeratna
[jira] Created: (NUTCH-198) SWF parser by Steve Loughran (Jira...
7
by Steve Loughran (Jira...
[jira] Commented: (NUTCH-23) content text/xml parser by Steve Loughran (Jira...
1
by Rida Benjelloun
[jira] Commented: (NUTCH-53) Parser plugin for Zip files by Steve Loughran (Jira...
0
by Steve Loughran (Jira...
[jira] Resolved: (NUTCH-52) Parser plugin for MS Excel files by Steve Loughran (Jira...
0
by Steve Loughran (Jira...
Word, Powerpoint and Excel parsers by Jérôme Charron
0
by Jérôme Charron
Fetch same URL two times by Fuad Efendi
0
by Fuad Efendi
[jira] Created: (NUTCH-192) meta data support for CrawlDatum by Steve Loughran (Jira...
27
by Steve Loughran (Jira...
ignore eclipse .project and .classpath by chrismattmann
3
by chrismattmann
[jira] Created: (NUTCH-209) include nutch jar in mapred jobs by Steve Loughran (Jira...
5
by Steve Loughran (Jira...
Empty Parse by Jérôme Charron
3
by Jérôme Charron
whitespaces was: meta data support for CrawlDatum by Stefan Groschupf-2
4
by Stefan Groschupf-2
Jakarta-POI 3.0-alpha1 by Jérôme Charron
0
by Jérôme Charron
process/create/hand over: crawl meta data by Stefan Groschupf-2
1
by Jack.Tang
[jira] Created: (NUTCH-139) Standard metadata property names in the ParseData metadata by Steve Loughran (Jira...
67
by Steve Loughran (Jira...
Success with Nutch & GCJ by Andrzej Białecki-2
0
by Andrzej Białecki-2
No node available for block <blockID> errors by Chris Schneider-2
0
by Chris Schneider-2
[jira] Created: (NUTCH-149) outlinks not shown properly in cached.jsp by Steve Loughran (Jira...
3
by Steve Loughran (Jira...
[jira] Created: (NUTCH-158) Process Sitemap data in text, rss or xml format as well as OAI-PMH by Steve Loughran (Jira...
1
by Steve Loughran (Jira...
tool to mount nutch filesystem by John X
8
by John X
[OT] Mailing lists by Andrew McNabb
1
by Doug Cutting
[jira] Created: (NUTCH-207) Bandwidth target for fetcher rather than a thread count by Steve Loughran (Jira...
2
by Steve Loughran (Jira...
[jira] Created: (NUTCH-193) move NDFS and MapReduce to a separate project by Steve Loughran (Jira...
15
by Steve Loughran (Jira...
Some bugs I'm trying to characterize.... by Bryan A. P. Pendleto...
1
by michael_cafarella
[jira] Created: (NUTCH-81) Webapp only works when deployed in root by Steve Loughran (Jira...
6
by Steve Loughran (Jira...
RE: takes too long to remove a page from WEBDB by Fuad Efendi
2
by Stefan Groschupf-2
1 ... 599600601602603604605 ... 623