Nutch - Dev

This forum is an archive for the mailing list dev@nutch.apache.org (more options) Messages posted here will be sent to this mailing list.
If you'd like to contribute to Nutch, please subscribe to the Nutch developer mailing list.
1 ... 526527528529530531532 ... 612
Topics (21405)
Replies Last Post Views
[ANNOUNCE] New Nutch Committer: Julien Nioche by Mattmann, Chris A (3...
3
by Futebol DotInfo
[Nutch Wiki] Update of "PublicServers" by RBalmes by Apache Wiki
0
by Apache Wiki
[Nutch Wiki] Update of "PublicServers" by search2.net by Apache Wiki
0
by Apache Wiki
[Nutch Wiki] Update of "PublicServers" by search2.net by Apache Wiki
0
by Apache Wiki
unsubscribe by 宫照
0
by 宫照
答复: unsubscribe by Boycott-2
0
by Boycott-2
[jira] Created: (NUTCH-768) Upgrade Nutch 1.0 to use Hadoop 0.20 by Parth (Jira)
8
by Parth (Jira)
[jira] Created: (NUTCH-777) Upgrading to jetty6 broke unit tests by Parth (Jira)
6
by Parth (Jira)
Build failed in Hudson: Nutch-trunk #1007 by Apache Hudson Server
10
by Apache Hudson Server
Creating an alternative Linkdb with part of the outlinks by Santiago Pérez
0
by Santiago Pérez
[Nutch Wiki] Update of "TikaPlugin" by JulienNioche by Apache Wiki
0
by Apache Wiki
[Nutch Wiki] Update of "TikaPlugin" by JulienNioche by Apache Wiki
0
by Apache Wiki
[Nutch Wiki] Update of "TikaPlugin" by JulienNioche by Apache Wiki
0
by Apache Wiki
[Nutch Wiki] Update of "FrontPage" by JulienNioche by Apache Wiki
0
by Apache Wiki
Filtering ParseSegment by MilleBii
0
by MilleBii
java.net.URL synchronization by Otis Gospodnetic-2-2
2
by Fuad Efendi
[jira] Created: (NUTCH-770) Timebomb for Fetcher by Parth (Jira)
14
by Parth (Jira)
Build failed in Hudson: Nutch-trunk #998 by Apache Hudson Server
4
by Apache Hudson Server
[jira] Created: (NUTCH-769) Fetcher to skip queues for URLS getting repeated exceptions by Parth (Jira)
6
by Parth (Jira)
wrong wiki front page by Alban Mouton
4
by Alban Mouton
[jira] Created: (NUTCH-712) ParseOutputFormat should catch java.net.MalformedURLException coming from normalizers by Parth (Jira)
7
by Parth (Jira)
[jira] Created: (NUTCH-739) SolrDeleteDuplications too slow when using hadoop by Parth (Jira)
17
by Parth (Jira)
[jira] Created: (NUTCH-746) NutchBeanConstructor does not close NutchBean upon contextDestroyed, causing resource leak in the container. by Parth (Jira)
5
by Parth (Jira)
[jira] Created: (NUTCH-741) Job file includes multiple copies of nutch config files. by Parth (Jira)
4
by Parth (Jira)
[jira] Created: (NUTCH-738) Close SegmentUpdater when FetchedSegments is closed by Parth (Jira)
7
by Parth (Jira)
[Nutch Wiki] Trivial Update of "Automating_Fetches_with_Python" by newacct by Apache Wiki
0
by Apache Wiki
[jira] Created: (NUTCH-761) Avoid cloningCrawlDatum in CrawlDbReducer by Parth (Jira)
4
by Parth (Jira)
[jira] Created: (NUTCH-773) some minor bugs in AbstractFetchSchedule.java by Parth (Jira)
5
by Parth (Jira)
[jira] Created: (NUTCH-760) Allow field mapping from nutch to solr index by Parth (Jira)
12
by Parth (Jira)
[jira] Created: (NUTCH-772) Upgrade Nutch to use Lucene 2.9.1 by Parth (Jira)
4
by Parth (Jira)
[jira] Created: (NUTCH-753) Prevent new Fetcher to retrieve the robots twice by Parth (Jira)
4
by Parth (Jira)
[jira] Created: (NUTCH-765) Allow Crawl class to call Either Solr or Lucene Indexer by Parth (Jira)
5
by Parth (Jira)
[Nutch Wiki] Update of "FrontPage" by Davinder by Apache Wiki
0
by Apache Wiki
[Nutch Wiki] Update of "FrontPage" by Davinder by Apache Wiki
0
by Apache Wiki
[Nutch Wiki] Update of "FrontPage" by Davinder by Apache Wiki
0
by Apache Wiki
1 ... 526527528529530531532 ... 612