$ tree path/to/segments/
The folder crawl_generate is empty! Generator is complex with multiple steps working in temporary
folders. Eventually, it's only the final copying which is broken. But I see no other way as to debug
the fetch list generation to find out what the reason is.
I strongly recommend to test also all other tools from command-line. For the bulk of them just run
a sample crawl via bin/crawl.
On 08/09/2017 11:56 AM, Omkar Reddy wrote:
> Hello dev@,
> I am facing an EOFException in the file TestGenerator.java and I cannot get my hands on the way in
> which I can solve it. The Exception is as follows :
> 1. 2017-08-09 12:57:06,026 WARN fs.FSInputChecker (ChecksumFileSystem.java:<init>(157)) - Problem
> opening checksum file:
> Ignoring exception:
> 2. java.io.EOFException
> I cannot understand the reason for it. This PR is the part of an effort to upgrade Nutch to use
> new MapReduce API.
> Please find the detailed log of the test here. Any suggestions/help would be appreciated.
>  https://paste.apache.org/e1cQ >  https://github.com/apache/nutch/pull/188