Nutch - Dev

This forum is an archive for the mailing list dev@nutch.apache.org (more options) Messages posted here will be sent to this mailing list.
If you'd like to contribute to Nutch, please subscribe to the Nutch developer mailing list.
12345678 ... 604
Topics (21119)
Replies Last Post Views
[jira] [Closed] (NUTCH-1984) Eliminate unnecessary dependencies by Sebastian Nagel (Jir...
0
by Sebastian Nagel (Jir...
[jira] [Updated] (NUTCH-2002) ParserChecker to check robots.txt by Sebastian Nagel (Jir...
0
by Sebastian Nagel (Jir...
[jira] [Updated] (NUTCH-1999) Add http://nutch.apache.org/robots.txt by Sebastian Nagel (Jir...
0
by Sebastian Nagel (Jir...
[jira] [Closed] (NUTCH-2003) topN is not work correctly by Sebastian Nagel (Jir...
0
by Sebastian Nagel (Jir...
[jira] [Updated] (NUTCH-2002) ParserChecker to check robots.txt by Sebastian Nagel (Jir...
0
by Sebastian Nagel (Jir...
[jira] [Resolved] (NUTCH-2003) topN is not work correctly by Sebastian Nagel (Jir...
0
by Sebastian Nagel (Jir...
[jira] [Resolved] (NUTCH-2024) httpcore classpath jar conflict when invoking protocol-selenium by Sebastian Nagel (Jir...
0
by Sebastian Nagel (Jir...
[jira] [Resolved] (NUTCH-2032) Plugin to index the raw content of a readable document. by Sebastian Nagel (Jir...
0
by Sebastian Nagel (Jir...
[jira] [Closed] (NUTCH-2532) Throw error if HBase is not available while running nutch commands. by Sebastian Nagel (Jir...
0
by Sebastian Nagel (Jir...
[jira] [Closed] (NUTCH-2075) Generate will not choose URL without distance marker by Sebastian Nagel (Jir...
0
by Sebastian Nagel (Jir...
[jira] [Closed] (NUTCH-2076) exceptions are not handled when using method waitForCompletion in a try block by Sebastian Nagel (Jir...
0
by Sebastian Nagel (Jir...
[jira] [Resolved] (NUTCH-2103) Nutch 2.3 has an old version of hbase jar in runtime/lib folder by Sebastian Nagel (Jir...
0
by Sebastian Nagel (Jir...
[jira] [Updated] (NUTCH-2103) Nutch 2.3 has an old version of hbase jar in runtime/lib folder by Sebastian Nagel (Jir...
0
by Sebastian Nagel (Jir...
[jira] [Resolved] (NUTCH-2113) Need documentation for using various Gora backends by Sebastian Nagel (Jir...
0
by Sebastian Nagel (Jir...
[jira] [Updated] (NUTCH-2118) browser requests sometimes timeout when using the selenium grid because of port access issues by Sebastian Nagel (Jir...
0
by Sebastian Nagel (Jir...
[jira] [Updated] (NUTCH-2113) Need documentation for using various Gora backends by Sebastian Nagel (Jir...
0
by Sebastian Nagel (Jir...
[jira] [Resolved] (NUTCH-2126) Use selenium protocol for specific sites by Sebastian Nagel (Jir...
0
by Sebastian Nagel (Jir...
[jira] [Resolved] (NUTCH-2131) Problem running nutch(crawl) with selenium by Sebastian Nagel (Jir...
0
by Sebastian Nagel (Jir...
[jira] [Updated] (NUTCH-2134) Redirection and cookie handling using protocol plugins by Sebastian Nagel (Jir...
0
by Sebastian Nagel (Jir...
[jira] [Resolved] (NUTCH-2240) ava.lang.NoSuchFieldError: INSTANCE selenium nutch by Sebastian Nagel (Jir...
0
by Sebastian Nagel (Jir...
[jira] [Updated] (NUTCH-2249) WordNet Integration for Cosine Similarity by Sebastian Nagel (Jir...
0
by Sebastian Nagel (Jir...
[jira] [Resolved] (NUTCH-2253) ProtocolFactory still not thread-safe by Sebastian Nagel (Jir...
0
by Sebastian Nagel (Jir...
[jira] [Updated] (NUTCH-2265) Write A Test Package for Scoring Similarity by Sebastian Nagel (Jir...
0
by Sebastian Nagel (Jir...
[jira] [Resolved] (NUTCH-2268) SolrIndexerJob: java.lang.RuntimeException by Sebastian Nagel (Jir...
0
by Sebastian Nagel (Jir...
[jira] [Updated] (NUTCH-2268) SolrIndexerJob: java.lang.RuntimeException by Sebastian Nagel (Jir...
0
by Sebastian Nagel (Jir...
[jira] [Updated] (NUTCH-2274) InteractiveSelenium Plugin's DefaultHandler Returns Null by Sebastian Nagel (Jir...
0
by Sebastian Nagel (Jir...
[jira] [Commented] (NUTCH-2275) MD5Signature by default doesn't take in account parse by Sebastian Nagel (Jir...
0
by Sebastian Nagel (Jir...
[jira] [Updated] (NUTCH-2275) MD5Signature by default doesn't take in account parse by Sebastian Nagel (Jir...
0
by Sebastian Nagel (Jir...
[jira] [Updated] (NUTCH-2277) Adding goldstandard.txt default file in conf by Sebastian Nagel (Jir...
0
by Sebastian Nagel (Jir...
[jira] [Updated] (NUTCH-2277) Adding goldstandard.txt default file in conf by Sebastian Nagel (Jir...
0
by Sebastian Nagel (Jir...
[jira] [Updated] (NUTCH-2293) Make the unit tests which requires "plugin.folders" as integration tests by Sebastian Nagel (Jir...
0
by Sebastian Nagel (Jir...
[jira] [Commented] (NUTCH-2318) Text extraction in HtmlParser adds too much whitespace. by Sebastian Nagel (Jir...
0
by Sebastian Nagel (Jir...
[jira] [Commented] (NUTCH-2739) indexer-elastic: Upgrade ES and migrate to REST client by Sebastian Nagel (Jir...
0
by Sebastian Nagel (Jir...
Build failed in Jenkins: Nutch-trunk #3653 by Apache Jenkins Serve...
1
by Apache Jenkins Serve...
[jira] [Updated] (NUTCH-2318) Text extraction in HtmlParser adds too much whitespace. by Sebastian Nagel (Jir...
0
by Sebastian Nagel (Jir...
12345678 ... 604