RE: problems with file protocol

classic Classic list List threaded Threaded
7 messages Options
Reply | Threaded
Open this post in threaded view
|

RE: problems with file protocol

Marc DELERUE-2
Hi,

I'm trying to crawl a lan with Nutch. When I'm crawling, all files
(html, txt, doc, xls) return : java.lang.IllegalArgumentException: null
type.
But when I'm indexing websites, I don't get this error and every files
are perfectly indexed.

Could somebody help me ?
 Best Regards.

Marc Delerue.

Reply | Threaded
Open this post in threaded view
|

Re: problems with file protocol

Jérôme Charron
> I'm trying to crawl a lan with Nutch. When I'm crawling, all files
> (html, txt, doc, xls) return : java.lang.IllegalArgumentException: null
> type.
Could you please send back a full stack trace?
I will check the code of the MimeType detector ("null type" message is strange).

Jerome

--
http://motrech.free.fr/
http://frutch.free.fr/
Reply | Threaded
Open this post in threaded view
|

RE: problems with file protocol

Marc DELERUE-2
In reply to this post by Marc DELERUE-2

-----Message d'origine-----
De : Jérôme Charron [mailto:[hidden email]]
Envoyé : lundi 30 mai 2005 14:45
À : [hidden email]
Objet : Re: [Nutch-dev] problems with file protocol

>> I'm trying to crawl a lan with Nutch. When I'm crawling, all files
>> (html, txt, doc, xls) return : java.lang.IllegalArgumentException: null
>> type.
>Could you please send back a full stack trace?
>I will check the code of the MimeType detector ("null type" message is >strange).
>
>Jerome


Here is a part of the stack trace, everything until this line is normal.

050530 144927 Processing /opt/nutch-nightly/FILE/segments/20050530144927/fetchlist.unsorted: Sorted 119 entries in 0.0020 seconds.
050530 144927 Processing /opt/nutch-nightly/FILE/segments/20050530144927/fetchlist.unsorted: Sorted 59500.0 entries/second
050530 144927 Overall processing: Sorted 119 entries in 0.0020 seconds.
050530 144927 Overall processing: Sorted 1.680672268907563E-5 entries/second
050530 144927 FetchListTool completed
050530 144928 logging at INFO
050530 144928 fetching file:/home/samba/public/marco/documents/Rapportdesynthèse(manquelesannexes).doc
050530 144928 fetching file:/home/samba/public/marco/documents/Bilan 4eme semaine.doc
050530 144928 fetch of file:/home/samba/public/marco/documents/Rapportdesynthèse(manquelesannexes).doc failed with: java.lang.IllegalArgumentException: null type
050530 144928 fetching file:/home/samba/public/marco/nutch-nightlyPourTomcat/WEB-INF/
050530 144928 fetching file:/home/samba/public/marco/nutch-nightlyPourOpt/build.xml
050530 144928 fetch of file:/home/samba/public/marco/nutch-nightlyPourOpt/build.xml failed with: java.lang.IllegalArgumentException: null type
050530 144928 fetching file:/home/samba/public/test/nutch-dev/jp/
050530 144928 fetching file:/home/samba/public/test/nutch-dev/hu/
050530 144928 fetch of file:/home/samba/public/marco/documents/Bilan 4eme semaine.doc failed with: java.lang.IllegalArgumentException: null type
050530 144928 fetching file:/home/samba/public/test/nutch-dev/search.jsp~
050530 144928 fetch of file:/home/samba/public/test/nutch-dev/search.jsp~ failed with: java.lang.IllegalArgumentException:null type
050530 144928 fetching file:/home/samba/public/test/nutch-dev/pl/
050530 144928 fetching file:/home/sfk/
050530 144928 fetching file:/home/samba/public/marco/documents/demande de conges.doc
050530 144928 fetch of file:/home/samba/public/marco/documents/demande de conges.doc failed with: java.lang.IllegalArgumentException: null type

Marc
Reply | Threaded
Open this post in threaded view
|

Re: problems with file protocol

Jérôme Charron
> Here is a part of the stack trace, everything until this line is normal.
I can't reproduce your problem...
It seems that the files you tried to fetch are on a samba mount point.
Please, could you try to perform the same test but the same files on
your local hard drive.

Jerome

--
http://motrech.free.fr/
http://frutch.free.fr/
Reply | Threaded
Open this post in threaded view
|

RE: problems with file protocol

Marc DELERUE-2
In reply to this post by Marc DELERUE-2
Same result, i don't know why.
I think I'm damned... ;)

Marc


-----Message d'origine-----
De : Jérôme Charron [mailto:[hidden email]]
Envoyé : lundi 30 mai 2005 16:32
À : Marc DELERUE
Cc : [hidden email]
Objet : Re: [Nutch-dev] problems with file protocol

> Here is a part of the stack trace, everything until this line is normal.
I can't reproduce your problem...
It seems that the files you tried to fetch are on a samba mount point.
Please, could you try to perform the same test but the same files on
your local hard drive.

Jerome

--
http://motrech.free.fr/
http://frutch.free.fr/
Reply | Threaded
Open this post in threaded view
|

Re: problems with file protocol

Jérôme Charron
> Same result, i don't know why.
> I think I'm damned... ;)
Which Nutch version do you use?

--
http://motrech.free.fr/
http://frutch.free.fr/
Reply | Threaded
Open this post in threaded view
|

RE: problems with file protocol

Marc DELERUE-2
In reply to this post by Marc DELERUE-2

I use nutch-0.7-dev.
It's the same since may the 15th.
...

-----Message d'origine-----
De : Jérôme Charron [mailto:[hidden email]]
Envoyé : lundi 30 mai 2005 16:38
À : Marc DELERUE
Cc : [hidden email]
Objet : Re: [Nutch-dev] problems with file protocol

> Same result, i don't know why.
> I think I'm damned... ;)
Which Nutch version do you use?

--
http://motrech.free.fr/
http://frutch.free.fr/