fnm frq like files are not creating while crwaling some site

classic Classic list List threaded Threaded
1 message Options
Reply | Threaded
Open this post in threaded view
|

fnm frq like files are not creating while crwaling some site

patil-2
I commented dedup in crwal.java, if i uncomment it.. its raising exception like...

Exception in thread "main" java.io.IOException: Job failed!
        at org.apache.hadoop.mapred.JobClient.runJob(JobClient.java:604)
        at org.apache.nutch.indexer.DeleteDuplicates.dedup(DeleteDuplicates.java:447)
        at org.apache.nutch.crawl.Crawl.main(Crawl.java:136)

else

no exception but facing below problem...

please help out... am not able to generate some files under index folder... when i crwal a site...

i need to generate below files... please help... tried nearly a week.. to solve.


_0.fdt
_0.tis
_0.fdx
_0.prx
._0.fdt.crc
_0.tii
._0.fdx.crc
_0.nrm
._0.fnm.crc
._0.frq.crc
_0.fnm
_0.frq
._0.nrm.crc
._0.tii.crc
._0.tis.crc
._0.prx.crc


response in the form of solutions is appreciated.
Thanks
Patil