Nutch - Dev

This forum is an archive for the mailing list dev@nutch.apache.org (more options) Messages posted here will be sent to this mailing list.
If you'd like to contribute to Nutch, please subscribe to the Nutch developer mailing list.
1 ... 559560561562563564565 ... 621
Topics (21706)
Replies Last Post Views
[jira] Created: (NUTCH-589) Hierarchical Classloaders by Tim Allison (Jira)
0
by Tim Allison (Jira)
Task process exit with nonzero status of 65 by Ned Rockson-3
0
by Ned Rockson-3
Image Search Engine Input by sseveran
6
by Trey Spiva-2
some question about development by 颜韵旋
2
by 吕召刚
Parsing ppt with mimetype application/x-mspowerpoint by pavan kumar donepudi
0
by pavan kumar donepudi
Issue with IndexSearcher initialization in NuchBean by Frederic Ciminera
0
by Frederic Ciminera
Maintaining source url data (father) during runtime by Tranquil
4
by Tranquil
Applicant for Nutch Project by shaowen yu
1
by Grant Ingersoll-2
Backwards compatibility strategy by Sami Siren-2
1
by Doğacan Güney-3
Commit Times for Issues by Dennis Kubes-2
6
by Andrzej Białecki-2
[jira] Created: (NUTCH-552) Upgrade Nutch to Hadoop 0.14.x by Tim Allison (Jira)
14
by Tim Allison (Jira)
[jira] Created: (NUTCH-444) Possibly use a different library to parse RSS feed for improved performance and compatibility by Tim Allison (Jira)
45
by Tim Allison (Jira)
about heritrix crawl,Who will tell me in this Nutch forum?thanks by xingjian
0
by xingjian
Nutch trunk js-parser problem with extremely long and meaningless Elements by Ned Rockson-3
0
by Ned Rockson-3
[jira] Created: (NUTCH-576) Different Analyzers Support by Tim Allison (Jira)
0
by Tim Allison (Jira)
Need help in updating url in runtime in [Fetcher.java] by Tranquil
0
by Tranquil
takes the URI info, Content, headers, ect into a MYSQL database during crawl. by xingjian
2
by xingjian
[jira] Created: (NUTCH-548) Move URLNormalizer from Outlink to ParseOutputFormat by Tim Allison (Jira)
12
by Tim Allison (Jira)
[jira] Created: (NUTCH-538) Delete unused classes under o.a.n.util by Tim Allison (Jira)
5
by Tim Allison (Jira)
Hudson build is back to normal: Nutch-Nightly #262 by hudson-6
0
by hudson-6
wiki faq by misc
0
by misc
Generator speed by misc
0
by misc
Auto complete by misc
0
by misc
Can we add this to nutch? by misc
1
by Dennis Kubes-2
EOF exception while fetching by Ned Rockson-3
0
by Ned Rockson-3
Build failed in Hudson: Nutch-Nightly #261 by hudson-6
1
by Doğacan Güney-3
[jira] Created: (NUTCH-547) Redirection handling: YahooSlurp's algorithm by Tim Allison (Jira)
15
by Tim Allison (Jira)
[jira] Created: (NUTCH-494) FindBugs: CrawlDbReader and DeleteDuplicates by Tim Allison (Jira)
4
by Tim Allison (Jira)
Usage of mapred-default.xml is deprecated in hadoop0.15.0 by Ned Rockson-3
0
by Ned Rockson-3
[jira] Created: (NUTCH-465) I download nutch 0.9 used tar zxvf nutch-0.9.tar.gz at last A lone zero block by Tim Allison (Jira)
2
by Tim Allison (Jira)
[jira] Created: (NUTCH-411) Parse ignores meta refresh redirection by Tim Allison (Jira)
3
by Tim Allison (Jira)
db.ignore.internal.links and ranking algorithms by Rajasekar Karthik
4
by Rajasekar Karthik
NullPointerException in FetchedSegments.getSummary() by John Doe-37
0
by John Doe-37
[jira] Commented: (NUTCH-572) Scoring and redirected Urls by Tim Allison (Jira)
0
by Tim Allison (Jira)
[jira] Commented: (NUTCH-356) Plugin repository cache can lead to memory leak by Tim Allison (Jira)
0
by Tim Allison (Jira)
1 ... 559560561562563564565 ... 621