Two days ago I posted this message below to the nutch-user list already.
Because nobody answered yet I think this is more an developer than an
(for me it seems to be a bug).
I would like to discuss it with a nutch developer.
just a view days ago we started to use Nutch (0.7.1).
It's really nice and I would like to see it evolve.
Here's my issue/question:
While fetching our URLs, we got some errors like this:
60202 154316 fetch of http://www.test-domain.de/crawl_html/page_2.html failed with: java.lang.Exception:
org.apache.nutch.protocol.RetryLater: Exceeded http.max.delays: retry
That seems to be ok and indicates some network problems.
The problem is that the entry in the Webdb shows the following: