Nutch - Dev

This forum is an archive for the mailing list dev@nutch.apache.org. Messages posted here will be sent to this mailing list.
If you'd like to contribute to Nutch, please subscribe to the Nutch developer mailing list.
Topics (21573)
Replies Last Post Views
[jira] Closed: (NUTCH-75) Patch for WebDBReader to get more detailed information about WebDBs by Clark Perkins (Jira)
0
by Clark Perkins (Jira)
[jira] Assigned: (NUTCH-295) More description for fetcher.threads.fetch property by Clark Perkins (Jira)
0
by Clark Perkins (Jira)
[jira] Assigned: (NUTCH-291) OpenSearchServlet should return "date" as well as "lastModified" by Clark Perkins (Jira)
0
by Clark Perkins (Jira)
[jira] Assigned: (NUTCH-213) checkstyle by Clark Perkins (Jira)
0
by Clark Perkins (Jira)
[jira] Assigned: (NUTCH-249) black- white list url filtering by Clark Perkins (Jira)
0
by Clark Perkins (Jira)
Created: (NUTCH-447) Dmoz Structure Parser Tool by Clark Perkins (Jira)
5
by Clark Perkins (Jira)
[jira] Created: (NUTCH-555) StackOverflowError in DomContentUtils by Clark Perkins (Jira)
9
by Clark Perkins (Jira)
Why is Nutch not involved in Google Summer of Code - 2008? by Susam Pal
10
by Otis Gospodnetic-2-2
siteinfo.xml by Chen, Tao
0
by Chen, Tao
[jira] Created: (NUTCH-623) Change name of plugin source directory from "languageidentifier" to "language-identifier" by Clark Perkins (Jira)
1
by Clark Perkins (Jira)
Glitches debugging on Eclipse with languageidentifier plugin by Nacho (Derecho.com)
0
by Nacho (Derecho.com)
[jira] Created: (NUTCH-622) Support for application/x-suggestions+json by Clark Perkins (Jira)
0
by Clark Perkins (Jira)
Build failed in Hudson: Nutch-trunk #398 by Apache Hudson Server
3
by Apache Hudson Server
Multiple readseg requests. by nadav hashimshony
0
by nadav hashimshony
Build failed in Hudson: Nutch-trunk #396 by Apache Hudson Server
1
by Apache Hudson Server
[jira] Created: (NUTCH-598) Remove deprecated use of ToolBase, Migration to the new implementation by Clark Perkins (Jira)
10
by Clark Perkins (Jira)
[jira] Created: (NUTCH-620) BasicURLNormalizer should collapse runs of slashes with a single slash by Clark Perkins (Jira)
9
by Clark Perkins (Jira)
Build failed in Hudson: Nutch-trunk #393 by Apache Hudson Server
1
by Apache Hudson Server
Compilation errors at revision 638548 by Andrew York
0
by Andrew York
Current OPIC implementation by Siddhartha Reddy
1
by Andrzej Białecki-2
[jira] Commented: (NUTCH-220) PDF Box can't parse document: java.lang.NullPointerException by Clark Perkins (Jira)
0
by Clark Perkins (Jira)
[jira] Created: (NUTCH-615) Redirected URLs are fetched without setting any FetchInterval by Clark Perkins (Jira)
7
by Clark Perkins (Jira)
[jira] Commented: (NUTCH-223) Crawl.java uses Integer.MAX_VALUE for -topN where Generator.java uses Long.MAX_VALUE for -topN by Clark Perkins (Jira)
0
by Clark Perkins (Jira)
[jira] Created: (NUTCH-616) Reset Fetch Retry counter when fetch is successful by Clark Perkins (Jira)
6
by Clark Perkins (Jira)
[jira] Created: (NUTCH-610) Can't Update or modify an index while web gui is running by Clark Perkins (Jira)
6
by Clark Perkins (Jira)
[jira] Closed: (NUTCH-243) Some meta-refresh urls get ignored due to matching regular expression by Clark Perkins (Jira)
0
by Clark Perkins (Jira)
[jira] Commented: (NUTCH-243) Some meta-refresh urls get ignored due to matching regular expression by Clark Perkins (Jira)
0
by Clark Perkins (Jira)
[jira] Closed: (NUTCH-223) Crawl.java uses Integer.MAX_VALUE for -topN where Generator.java uses Long.MAX_VALUE for -topN by Clark Perkins (Jira)
0
by Clark Perkins (Jira)
[jira] Commented: (NUTCH-223) Crawl.java uses Integer.MAX_VALUE for -topN where Generator.java uses Long.MAX_VALUE for -topN by Clark Perkins (Jira)
0
by Clark Perkins (Jira)
[jira] Commented: (NUTCH-220) PDF Box can't parse document: java.lang.NullPointerException by Clark Perkins (Jira)
0
by Clark Perkins (Jira)
[jira] Closed: (NUTCH-220) PDF Box can't parse document: java.lang.NullPointerException by Clark Perkins (Jira)
0
by Clark Perkins (Jira)
Retire the original Fetcher before the release? by Andrzej Białecki-2
4
by Andrzej Białecki-2
(nutch 1.0) Query processing problem: NutchBeans and webapps search fail, but Luke succeeds by Vinci
0
by Vinci
Cached page - can it be changed? by Vinci
0
by Vinci
Change the Analyzer by plugin - how to deal with the query? by Vinci
1
by Vinci