Nutch - Dev

This forum is an archive for the mailing list dev@nutch.apache.org. Messages posted here will be sent to this mailing list.
If you'd like to contribute to Nutch, please subscribe to the Nutch developer mailing list.
Topics (20798)
Topic / Replies / Last Post
Checking if crawl dir exists ... by Michael Wechner
4
by Michael Wechner
nutch/lucene question... by aaaaa
1
by Dennis Kubes
reading crawl dir from nutch-default.xml by dee-2
0
by dee-2
Nutch as caching web proxy by Neil Ireson-4
5
by Anton Potekhin
Re: [Fwd: Re: [Nutch Wiki] Update of "RenaudRichardet" by RenaudRichardet] by Renaud Richardet-3
1
by Stefan Groschupf
Single Search Server, Multiple Indexes on Separate Disks by Dennis Kubes
0
by Dennis Kubes
Re: [Nutch Wiki] Update of "RenaudRichardet" by RenaudRichardet by Stefan Groschupf
0
by Stefan Groschupf
How to debug War/Tomcat? by Chris Stephens-3
0
by Chris Stephens-3
[jira] Created: (NUTCH-357) crawling simulation by Sebastian Nagel (Jir...
3
by Stefan Groschupf
Injector calls Map with blank lines by Dennis Kubes
0
by Dennis Kubes
differ search in filesystem or webpages by dee-2
0
by dee-2
0.8 not loading plugins by Chris Stephens-3
11
by Chris Stephens-3
Fwd: [webspam-announces] Web Spam Collection Announced by Stefan Groschupf
0
by Stefan Groschupf
[jira] Created: (NUTCH-354) MapWritable, nextEntry is not reset when Entries are recycled by Sebastian Nagel (Jir...
3
by Sebastian Nagel (Jir...
[jira] Created: (NUTCH-355) The title of query result could like the summary have the highlight?? by Sebastian Nagel (Jir...
1
by Sebastian Nagel (Jir...
[jira] Created: (NUTCH-346) Improve readability of logs/hadoop.log by Sebastian Nagel (Jir...
2
by Sebastian Nagel (Jir...
the implementation code of explanation.jsp in Search Page by Feng Ji
1
by Andrzej Białecki-2
show new data in search result page by Feng Ji
0
by Feng Ji
Thoughts on Parser design and dependencies by Jukka Zitting
10
by Andrzej Białecki-2
architecture question/thoughts by aaaaa
0
by aaaaa
[jira] Created: (NUTCH-341) IndexMerger now deletes entire <workingdir> after completing by Sebastian Nagel (Jir...
4
by Sebastian Nagel (Jir...
[jira] Created: (NUTCH-347) Build: plugins' Jars not found by Sebastian Nagel (Jir...
4
by Sebastian Nagel (Jir...
Adding Database Field by Levent Ulutas
0
by Levent Ulutas
[jira] Created: (NUTCH-345) Add support for Content-Encoding: deflated by Sebastian Nagel (Jir...
3
by Sebastian Nagel (Jir...
[jira] Created: (NUTCH-322) Fetcher discards ProtocolStatus, doesn't store redirected pages by Sebastian Nagel (Jir...
11
by Andrzej Białecki-2
[jira] Created: (NUTCH-342) Nutch commands log to nutch/logs/hadoop.logs by default by Sebastian Nagel (Jir...
4
by Anton Potekhin
[jira] Created: (NUTCH-343) Index MP3 SHA1 hashes by Sebastian Nagel (Jir...
1
by Sebastian Nagel (Jir...
[jira] Created: (NUTCH-352) Add jar command to bin/nutch to allow launching hadoop job jars by Sebastian Nagel (Jir...
1
by Sebastian Nagel (Jir...
Neko parsing fix inadvertently reverted? by Benjamin Higgins
2
by Andrzej Białecki-2
[jira] Created: (NUTCH-348) Generator is building fetch list using *lowest* scoring URLs by Sebastian Nagel (Jir...
2
by Sebastian Nagel (Jir...
HTTP Accept Header seems to be missing by Michael Wechner
2
by Michael Wechner
Nutch, samba and urls... by Bugzilla from treffe...
1
by Sami Siren-2
[jira] Created: (NUTCH-233) wrong regular expression hang reduce process for ever by Sebastian Nagel (Jir...
6
by Sebastian Nagel (Jir...
[jira] Created: (NUTCH-349) Port Nutch to use Hadoop Text instead of UTF8 by Sebastian Nagel (Jir...
2
by Sebastian Nagel (Jir...
Tika update by Jukka Zitting
4
by Jukka Zitting