nutch-dev mailing list archives

From patil <sbpatil_...@yahoo.co.in>
Subject Files like fnm and frq are not created while crawling some sites
Date Wed, 12 Dec 2007 09:59:06 GMT

I commented out the dedup step in Crawl.java. If I uncomment it, it raises an
exception like this:

Exception in thread "main" java.io.IOException: Job failed!
        at org.apache.hadoop.mapred.JobClient.runJob(JobClient.java:604)
        at org.apache.nutch.indexer.DeleteDuplicates.dedup(DeleteDuplicates.java:447)
        at org.apache.nutch.crawl.Crawl.main(Crawl.java:136)

If I leave dedup commented out, there is no exception, but I run into the
problem below.

Please help. When I crawl a site, some files are not generated under the
index folder.

I need the files below to be generated. I have been trying to solve this
for nearly a week.


_0.fdt
_0.tis
_0.fdx
_0.prx
._0.fdt.crc
_0.tii
._0.fdx.crc
_0.nrm
._0.fnm.crc
._0.frq.crc
_0.fnm
_0.frq
._0.nrm.crc
._0.tii.crc
._0.tis.crc
._0.prx.crc
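
To make it easier to report exactly which of the files above are missing after a crawl, here is a minimal sketch in Java. It is not part of Nutch. The class name `IndexFileCheck`, the method `missingFiles`, and the default path `crawl/index` are hypothetical names chosen for illustration, and the segment prefix `_0` is taken from the list above. The `.crc` entries look like checksum sidecar files and are not checked here.

```java
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.Paths;
import java.util.ArrayList;
import java.util.List;

public class IndexFileCheck {
    // Extensions copied from the file list above (without the .crc sidecars).
    static final String[] EXTENSIONS = {"fdt", "fdx", "fnm", "frq", "nrm", "prx", "tii", "tis"};

    /** Returns the names of expected segment files that are absent from indexDir. */
    static List<String> missingFiles(Path indexDir, String segment) {
        List<String> missing = new ArrayList<>();
        for (String ext : EXTENSIONS) {
            String name = segment + "." + ext;
            // A file counts as present only if it exists and is a regular file.
            if (!Files.isRegularFile(indexDir.resolve(name))) {
                missing.add(name);
            }
        }
        return missing;
    }

    public static void main(String[] args) {
        // Hypothetical default path; pass the real index directory as the first argument.
        Path dir = Paths.get(args.length > 0 ? args[0] : "crawl/index");
        List<String> missing = missingFiles(dir, "_0");
        System.out.println(missing.isEmpty()
                ? "All expected files present"
                : "Missing: " + missing);
    }
}
```

Running it against the index folder after a crawl shows which of the expected files were not written, which can be included in a follow-up message.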


Any solutions would be appreciated.
Thanks
Patil