Nutch - Dev

This forum is an archive for the mailing list dev@nutch.apache.org (more options) Messages posted here will be sent to this mailing list.
If you'd like to contribute to Nutch, please subscribe to the Nutch developer mailing list.
123456 ... 558
Topics (19512)
Replies Last Post Views
[jira] [Updated] (NUTCH-2613) Documentation for exchange component by JIRA jira@apache.org
0
by JIRA jira@apache.org
[jira] [Resolved] (NUTCH-2390) No documentation on pluggable indexing by JIRA jira@apache.org
0
by JIRA jira@apache.org
[jira] [Commented] (NUTCH-2390) No documentation on pluggable indexing by JIRA jira@apache.org
0
by JIRA jira@apache.org
[Nutch Wiki] Update of "bin/nutch index" by SebastianNagel by Apache Wiki
0
by Apache Wiki
[jira] [Created] (NUTCH-2626) bin/crawl: remove option -noParsing from fetch command by JIRA jira@apache.org
0
by JIRA jira@apache.org
[Nutch Wiki] Update of "bin/nutch fetch" by SebastianNagel by Apache Wiki
0
by Apache Wiki
[jira] [Commented] (NUTCH-2602) Configuration values in the description of index writers by JIRA jira@apache.org
0
by JIRA jira@apache.org
[jira] [Commented] (NUTCH-2622) Unbundle LGPL-licensed jars from binary release by JIRA jira@apache.org
0
by JIRA jira@apache.org
[jira] [Commented] (NUTCH-2624) protocol-okhttp resource leak by JIRA jira@apache.org
0
by JIRA jira@apache.org
[jira] [Updated] (NUTCH-2290) Update licenses of bundled libraries by JIRA jira@apache.org
0
by JIRA jira@apache.org
[jira] [Resolved] (NUTCH-2622) Unbundle LGPL-licensed jars from binary release by JIRA jira@apache.org
0
by JIRA jira@apache.org
[jira] [Commented] (NUTCH-2622) Unbundle LGPL-licensed jars from binary release by JIRA jira@apache.org
0
by JIRA jira@apache.org
[jira] [Resolved] (NUTCH-2624) protocol-okhttp resource leak by JIRA jira@apache.org
0
by JIRA jira@apache.org
[jira] [Commented] (NUTCH-2624) protocol-okhttp resource leak by JIRA jira@apache.org
0
by JIRA jira@apache.org
[jira] [Commented] (NUTCH-2602) Configuration values in the description of index writers by JIRA jira@apache.org
0
by JIRA jira@apache.org
[jira] [Updated] (NUTCH-2602) Configuration values in the description of index writers by JIRA jira@apache.org
0
by JIRA jira@apache.org
[jira] [Commented] (NUTCH-2625) ProtocolFactory.getProtocol(url) may create multiple plugin instances by JIRA jira@apache.org
0
by JIRA jira@apache.org
[jira] [Created] (NUTCH-2625) ProtocolFactory.getProtocol(url) may create multiple plugin instances by JIRA jira@apache.org
0
by JIRA jira@apache.org
[jira] [Comment Edited] (NUTCH-2612) Support for sitemap processing by hostname by JIRA jira@apache.org
0
by JIRA jira@apache.org
[jira] [Updated] (NUTCH-2612) Support for sitemap processing by hostname by JIRA jira@apache.org
0
by JIRA jira@apache.org
[jira] [Commented] (NUTCH-2612) Support for sitemap processing by hostname by JIRA jira@apache.org
0
by JIRA jira@apache.org
[jira] [Commented] (NUTCH-2624) protocol-okhttp resource leak by JIRA jira@apache.org
0
by JIRA jira@apache.org
[jira] [Commented] (NUTCH-2624) protocol-okhttp resource leak by JIRA jira@apache.org
0
by JIRA jira@apache.org
[jira] [Commented] (NUTCH-2624) protocol-okhttp resource leak by JIRA jira@apache.org
0
by JIRA jira@apache.org
[jira] [Commented] (NUTCH-2624) protocol-okhttp resource leak by JIRA jira@apache.org
0
by JIRA jira@apache.org
[jira] [Comment Edited] (NUTCH-2624) protocol-okhttp resource leak by JIRA jira@apache.org
0
by JIRA jira@apache.org
[jira] [Commented] (NUTCH-2624) protocol-okhttp resource leak by JIRA jira@apache.org
0
by JIRA jira@apache.org
[jira] [Created] (NUTCH-2624) protocol-okhttp resource leak by JIRA jira@apache.org
0
by JIRA jira@apache.org
[jira] [Created] (NUTCH-2623) Fetcher to guarantee delay for same host/domain/ip independent of http/https protocol by JIRA jira@apache.org
0
by JIRA jira@apache.org
[jira] [Updated] (NUTCH-1926) AdaptiveFetchScheduler reads nutch-default settings as float but it needs integer. by JIRA jira@apache.org
0
by JIRA jira@apache.org
[jira] [Resolved] (NUTCH-2095) WARC exporter for the CommonCrawlDataDumper by JIRA jira@apache.org
0
by JIRA jira@apache.org
[jira] [Updated] (NUTCH-2192) Get rid of oro by JIRA jira@apache.org
0
by JIRA jira@apache.org
[jira] [Resolved] (NUTCH-2226) SOLR mismatch in deploy mode by JIRA jira@apache.org
0
by JIRA jira@apache.org
[jira] [Updated] (NUTCH-2279) LinkRank fails when using Hadoop MR output compression by JIRA jira@apache.org
0
by JIRA jira@apache.org
[jira] [Updated] (NUTCH-2457) Embedded documents likely not correctly parsed by Tika by JIRA jira@apache.org
0
by JIRA jira@apache.org
123456 ... 558