Nutch - Dev

This forum is an archive for the mailing list dev@nutch.apache.org (more options) Messages posted here will be sent to this mailing list.
If you'd like to contribute to Nutch, please subscribe to the Nutch developer mailing list.
1 ... 595596597598599600601 ... 617
Topics (21573)
Replies Last Post Views
[jira] Commented: (NUTCH-79) Fault tolerant searching. by Clark Perkins (Jira)
0
by Clark Perkins (Jira)
[jira] Created: (NUTCH-95) DeleteDuplicates depends on the order of input segments by Clark Perkins (Jira)
7
by Clark Perkins (Jira)
[jira] Commented: (NUTCH-16) boost documents matching a url pattern by Clark Perkins (Jira)
0
by Clark Perkins (Jira)
Nutch - New Features (?) by Fuad Efendi
0
by Fuad Efendi
Nutch - New Features (?) by Fuad Efendi
0
by Fuad Efendi
older Nutch list archives (@sf.net)? by Gordon Mohr
4
by Gordon Mohr
[jira] Created: (NUTCH-189) Injection infinite loop by Clark Perkins (Jira)
2
by Clark Perkins (Jira)
Re: svn commit: r372810 - /lucene/nutch/trunk/bin/nutch by Rod Taylor-2
5
by Rod Taylor-2
A Nutch config editor... by Dominik Friedrich
2
by Dominik Friedrich
Re: [Nutch-cvs] svn commit: r372810 - /lucene/nutch/trunk/bin/nutch by Andrzej Białecki-2
3
by Doug Cutting-2
[jira] Created: (NUTCH-190) ParseUtil drops reason for failed parse by Clark Perkins (Jira)
4
by Clark Perkins (Jira)
[jira] Created: (NUTCH-186) mapred-default.xml is over ridden by nutch-site.xml by Clark Perkins (Jira)
8
by Clark Perkins (Jira)
Searchable mailing lists on nutch.org? by Andy Liu-3
3
by Doug Cutting-2
Optimizing which links to fetch by kkrugler
1
by Doug Cutting-2
[jira] Created: (NUTCH-136) mapreduce segment generator generates 50 % less than excepted urls by Clark Perkins (Jira)
7
by Clark Perkins (Jira)
[jira] Created: (NUTCH-183) MapReduce has a series of problems concerning task-allocation to worker nodes by Clark Perkins (Jira)
7
by Clark Perkins (Jira)
Two possible extensions by Guenter, Matthias
2
by Stefan Groschupf-2
xml-parser plugin contribution by Rida Benjelloun
3
by Stefan Groschupf-2
lang identifier and nutch analyzer in trunk by Jack.Tang
15
by Andrzej Białecki-2
Nutch merge problem after fetch is aborted with hung threads. by Lukáš Vlček
0
by Lukáš Vlček
patch for nutch and nutch-daemon.sh by Zaheed Haque
0
by Zaheed Haque
Patch for NDFS's df.java by Dominik Friedrich
2
by Stefan Groschupf-2
protocol-httpclient; maximum total connections by orkunt.sabuncu
1
by Stefan Groschupf-2
[jira] Created: (NUTCH-127) uncorrect values using -du, or ls does not return items by Clark Perkins (Jira)
2
by Clark Perkins (Jira)
Using org.apache.nutch.indexer.IndexMerger (Nutch 0.7) by Chun Wei Ho
0
by Chun Wei Ho
[jira] Closed: (NUTCH-45) Log corrupt segments in SegmentMergeTool by Clark Perkins (Jira)
0
by Clark Perkins (Jira)
[jira] Created: (NUTCH-68) A tool to generate arbitrary fetchlists by Clark Perkins (Jira)
2
by Clark Perkins (Jira)
number of block duplicated by Stefan Groschupf-2
5
by Pashabhai
[jira] Created: (NUTCH-182) Log when db.max configuration limits reached by Clark Perkins (Jira)
1
by Clark Perkins (Jira)
[jira] Created: (NUTCH-87) Efficient site-specific crawling for a large number of sites by Clark Perkins (Jira)
14
by Clark Perkins (Jira)
Authentication / Content-type by Thushara Wijeratna
0
by Thushara Wijeratna
Generating multiple fetchlists between updates by Andrzej Białecki-2
1
by Doug Cutting-2
[jira] Created: (NUTCH-176) Using -dir: creates an error, when the directory already exists by Clark Perkins (Jira)
1
by Clark Perkins (Jira)
[jira] Created: (NUTCH-177) Default installation seems to produce working entity of nutch by Clark Perkins (Jira)
3
by Clark Perkins (Jira)
[jira] Created: (NUTCH-179) Proposition: Enable Nutch to use a parser plugin not just based on content type by Clark Perkins (Jira)
4
by Clark Perkins (Jira)
1 ... 595596597598599600601 ... 617