Nutch - Dev

This forum is an archive for the mailing list dev@nutch.apache.org. Messages posted here will be sent to this mailing list.
If you'd like to contribute to Nutch, please subscribe to the Nutch developer mailing list.
Topics (21864)
Replies Last Post Views
[jira] [Commented] (NUTCH-2818) Ant build: upgrade Apache Rat report task by Steve Loughran (Jira...
0
by Steve Loughran (Jira...
[jira] [Updated] (NUTCH-2812) Methods returning array may expose internal representation by Steve Loughran (Jira...
0
by Steve Loughran (Jira...
[jira] [Resolved] (NUTCH-2805) Rename plugin urlfilter-domainblacklist by Steve Loughran (Jira...
0
by Steve Loughran (Jira...
[jira] [Updated] (NUTCH-2807) SitemapProcessor to warn that ignoring robots.txt affects detection of sitemaps by Steve Loughran (Jira...
0
by Steve Loughran (Jira...
[jira] [Commented] (NUTCH-2818) Ant build: upgrade Apache Rat report task by Steve Loughran (Jira...
0
by Steve Loughran (Jira...
[jira] [Resolved] (NUTCH-2818) Ant build: upgrade Apache Rat report task by Steve Loughran (Jira...
0
by Steve Loughran (Jira...
[GitHub] [nutch] sebastian-nagel opened a new pull request #549: NUTCH-2818 Fix Apache Rat task to check sources for license headers by GitBox
1
by GitBox
[jira] [Resolved] (NUTCH-2814) HttpDateFormat's internal time zone may change after parsing a date by Steve Loughran (Jira...
0
by Steve Loughran (Jira...
[jira] [Commented] (NUTCH-2814) HttpDateFormat's internal time zone may change after parsing a date by Steve Loughran (Jira...
0
by Steve Loughran (Jira...
[jira] [Assigned] (NUTCH-2814) HttpDateFormat's internal time zone may change after parsing a date by Steve Loughran (Jira...
0
by Steve Loughran (Jira...
[GitHub] [nutch] sebastian-nagel opened a new pull request #546: NUTCH-2814 HttpDateFormat's internal time zone may change after parsing a date by GitBox
1
by GitBox
[jira] [Commented] (NUTCH-2803) Rename property http.robot.rules.whitelist by Steve Loughran (Jira...
0
by Steve Loughran (Jira...
[jira] [Commented] (NUTCH-2809) Upgrade any23 plugin dependency by Steve Loughran (Jira...
0
by Steve Loughran (Jira...
[jira] [Commented] (NUTCH-2823) IllegalStateException in IndexWriters.describe() when validating url param for SolrIndexer by Steve Loughran (Jira...
0
by Steve Loughran (Jira...
[jira] [Commented] (NUTCH-2823) IllegalStateException in IndexWriters.describe() when validating url param for SolrIndexer by Steve Loughran (Jira...
0
by Steve Loughran (Jira...
[jira] [Updated] (NUTCH-2823) IllegalStateException in IndexWriters.describe() when validating url param for SolrIndexer by Steve Loughran (Jira...
0
by Steve Loughran (Jira...
[jira] [Assigned] (NUTCH-2823) IllegalStateException in IndexWriters.describe() when validating url param for SolrIndexer by Steve Loughran (Jira...
0
by Steve Loughran (Jira...
[jira] [Commented] (NUTCH-2697) Upgrade Ivy to fix the issue of an unset packaging.type property by Steve Loughran (Jira...
0
by Steve Loughran (Jira...
[jira] [Commented] (NUTCH-2669) Reliable solution for javax.ws packaging.type by Steve Loughran (Jira...
0
by Steve Loughran (Jira...
[jira] [Resolved] (NUTCH-2672) Ant build erroneously installs *-test.jar instead of *.jar for target "nightly" by Steve Loughran (Jira...
0
by Steve Loughran (Jira...
[jira] [Commented] (NUTCH-2671) Upgrade ant ivy library by Steve Loughran (Jira...
0
by Steve Loughran (Jira...
[jira] [Resolved] (NUTCH-2669) Reliable solution for javax.ws packaging.type by Steve Loughran (Jira...
0
by Steve Loughran (Jira...
[jira] [Resolved] (NUTCH-2697) Upgrade Ivy to fix the issue of an unset packaging.type property by Steve Loughran (Jira...
0
by Steve Loughran (Jira...
[jira] [Resolved] (NUTCH-2671) Upgrade ant ivy library by Steve Loughran (Jira...
0
by Steve Loughran (Jira...
[jira] [Commented] (NUTCH-2697) Upgrade Ivy to fix the issue of an unset packaging.type property by Steve Loughran (Jira...
0
by Steve Loughran (Jira...
[GitHub] [nutch] sebastian-nagel opened a new pull request #550: NUTCH-2697 Upgrade Ivy to 2.5.0 by GitBox
1
by GitBox
[jira] [Commented] (NUTCH-2801) RobotsRulesParser command-line checker to use http.robots.agents as fall-back by Steve Loughran (Jira...
0
by Steve Loughran (Jira...
[jira] [Commented] (NUTCH-2799) Add .asf.yaml file by Steve Loughran (Jira...
0
by Steve Loughran (Jira...
[jira] [Commented] (NUTCH-2753) Add -listen option to command-line help of CrawlDbReader and LinkDbReader by Steve Loughran (Jira...
0
by Steve Loughran (Jira...
[jira] [Commented] (NUTCH-2810) FreeGenerator to actually apply configured number of fetch lists by Steve Loughran (Jira...
0
by Steve Loughran (Jira...
[jira] [Commented] (NUTCH-2790) CSVIndexWriter does not escape leading quotes properly by Steve Loughran (Jira...
0
by Steve Loughran (Jira...
[jira] [Commented] (NUTCH-2791) domainstats, protocolstats and crawlcomplete do not handle GCS URLs by Steve Loughran (Jira...
0
by Steve Loughran (Jira...
[jira] [Commented] (NUTCH-2434) Add methods to reset parameters HTMLMetaTags by Steve Loughran (Jira...
0
by Steve Loughran (Jira...
[jira] [Commented] (NUTCH-2730) SitemapProcessor to treat sitemap URLs as Set instead of List by Steve Loughran (Jira...
0
by Steve Loughran (Jira...
[jira] [Commented] (NUTCH-2796) Upgrade to crawler-commons 1.1 by Steve Loughran (Jira...
0
by Steve Loughran (Jira...