Nutch - Dev

This forum is an archive for the mailing list dev@nutch.apache.org. Messages posted here will be sent to this mailing list.
If you'd like to contribute to Nutch, please subscribe to the Nutch developer mailing list.
Topics (20421)
Topic | Replies | Last Post
Build failed in Hudson: Nutch-trunk #404 by Apache Hudson Server
2
by Apache Hudson Server
[jira] Assigned: (NUTCH-16) boost documents matching a url pattern by JIRA jira@apache.org
0
by JIRA jira@apache.org
[jira] Assigned: (NUTCH-48) "Did you mean" query enhancement/refinement feature request by JIRA jira@apache.org
0
by JIRA jira@apache.org
[jira] Closed: (NUTCH-75) Patch for WebDBReader to get more detailed information about WebDBs by JIRA jira@apache.org
0
by JIRA jira@apache.org
[jira] Assigned: (NUTCH-295) More description for fetcher.threads.fetch property by JIRA jira@apache.org
0
by JIRA jira@apache.org
[jira] Assigned: (NUTCH-291) OpenSearchServlet should return "date" as well as "lastModified" by JIRA jira@apache.org
0
by JIRA jira@apache.org
[jira] Assigned: (NUTCH-213) checkstyle by JIRA jira@apache.org
0
by JIRA jira@apache.org
[jira] Assigned: (NUTCH-249) black- white list url filtering by JIRA jira@apache.org
0
by JIRA jira@apache.org
Created: (NUTCH-447) Dmoz Structure Parser Tool by JIRA jira@apache.org
5
by JIRA jira@apache.org
[jira] Created: (NUTCH-555) StackOverflowError in DomContentUtils by JIRA jira@apache.org
9
by JIRA jira@apache.org
Why is Nutch not involved in Google Summer of Code - 2008? by Susam Pal
10
by Otis Gospodnetic-2-2
siteinfo.xml by Chen, Tao
0
by Chen, Tao
[jira] Created: (NUTCH-623) Change name of plugin source directory from "languageidentifier" to "language-identifier" by JIRA jira@apache.org
1
by JIRA jira@apache.org
Glitches debugging on Eclipse with languageidentifier plugin by Nacho (Derecho.com)
0
by Nacho (Derecho.com)
[jira] Created: (NUTCH-622) Support for application/x-suggestions+json by JIRA jira@apache.org
0
by JIRA jira@apache.org
Build failed in Hudson: Nutch-trunk #398 by Apache Hudson Server
3
by Apache Hudson Server
Multiple readseg requests. by nadav hashimshony
0
by nadav hashimshony
Build failed in Hudson: Nutch-trunk #396 by Apache Hudson Server
1
by Apache Hudson Server
[jira] Created: (NUTCH-598) Remove deprecated use of ToolBase, Migration to the new implementation by JIRA jira@apache.org
10
by JIRA jira@apache.org
[jira] Created: (NUTCH-620) BasicURLNormalizer should collapse runs of slashes with a single slash by JIRA jira@apache.org
9
by JIRA jira@apache.org
Build failed in Hudson: Nutch-trunk #393 by Apache Hudson Server
1
by Apache Hudson Server
Compilation errors at revision 638548 by Andrew York
0
by Andrew York
Current OPIC implementation by Siddhartha Reddy
1
by Andrzej Białecki-2
[jira] Commented: (NUTCH-220) PDF Box can't parse document: java.lang.NullPointerException by JIRA jira@apache.org
0
by JIRA jira@apache.org
[jira] Created: (NUTCH-615) Redirected URLs are fetched without setting any FetchInterval by JIRA jira@apache.org
7
by JIRA jira@apache.org
[jira] Commented: (NUTCH-223) Crawl.java uses Integer.MAX_VALUE for -topN where Generator.java uses Long.MAX_VALUE for -topN by JIRA jira@apache.org
0
by JIRA jira@apache.org
[jira] Created: (NUTCH-616) Reset Fetch Retry counter when fetch is successful by JIRA jira@apache.org
6
by JIRA jira@apache.org
[jira] Created: (NUTCH-610) Can't Update or modify an index while web gui is running by JIRA jira@apache.org
6
by JIRA jira@apache.org
[jira] Closed: (NUTCH-243) Some meta-refresh urls get ignored due to matching regular expression by JIRA jira@apache.org
0
by JIRA jira@apache.org
[jira] Commented: (NUTCH-243) Some meta-refresh urls get ignored due to matching regular expression by JIRA jira@apache.org
0
by JIRA jira@apache.org
[jira] Closed: (NUTCH-223) Crawl.java uses Integer.MAX_VALUE for -topN where Generator.java uses Long.MAX_VALUE for -topN by JIRA jira@apache.org
0
by JIRA jira@apache.org
[jira] Commented: (NUTCH-223) Crawl.java uses Integer.MAX_VALUE for -topN where Generator.java uses Long.MAX_VALUE for -topN by JIRA jira@apache.org
0
by JIRA jira@apache.org
[jira] Commented: (NUTCH-220) PDF Box can't parse document: java.lang.NullPointerException by JIRA jira@apache.org
0
by JIRA jira@apache.org
[jira] Closed: (NUTCH-220) PDF Box can't parse document: java.lang.NullPointerException by JIRA jira@apache.org
0
by JIRA jira@apache.org
Retire the original Fetcher before the release? by Andrzej Białecki-2
4
by Andrzej Białecki-2