I met that problem before, after I change the
http.timeout and max.delay values to 100 times the
default setting, the problem is gone,
you might look at nutch-default.xml and make a
overwritten in nutch-site.xml,
Michael,
--- Drew Farris <
[hidden email]> wrote:
> By any chance are you crawling many pages stored on
> a single server or
> small number of servers? If so, take a look at:
>
>
http://www.mail-archive.com/nutch-developers%40lists.sourceforge.net/msg04414.html>
http://www.mail-archive.com/nutch-developers%40lists.sourceforge.net/msg04427.html>
> On 7/27/05, Christophe Noel
> <
[hidden email]> wrote:
> > Hello,
> >
> > When I'm fetching , I really have too many Http
> Timeout with default
> > nutch parameters.
> >
> > Does anyone have tips to improve that point ?
> >
> > Thanks very much.
> >
> > Christophe No?l.
> > www.cetic.be
> >
> > =====
> >
> > org.apache.nutch.protocol.RetryLater: Exceeded
> http.max.delays: retry later.
> > at
> >
>
org.apache.nutch.protocol.httpclient.Http.blockAddr(Http.java:133)
> > at
> >
>
org.apache.nutch.protocol.httpclient.Http.getProtocolOutput(Http.java:201)
> > at
> >
>
org.apache.nutch.fetcher.Fetcher$FetcherThread.run(Fetcher.java:135)
> > org.apache.nutch.protocol.RetryLater: Exceeded
> http.max.delays: retry later.
> > at
> >
>
org.apache.nutch.protocol.httpclient.Http.blockAddr(Http.java:133)
> > at
> >
>
org.apache.nutch.protocol.httpclient.Http.getProtocolOutput(Http.java:201)
> > at
> >
>
org.apache.nutch.fetcher.Fetcher$FetcherThread.run(Fetcher.java:135)
> >
>
____________________________________________________
Start your day with Yahoo! - make it your home page
http://www.yahoo.com/r/hs