[jira] Created: (NUTCH-259) Problem in IndexSorter after dedup

classic Classic list List threaded Threaded
1 message Options
Reply | Threaded
Open this post in threaded view
|

[jira] Created: (NUTCH-259) Problem in IndexSorter after dedup

JIRA jira@apache.org
Problem in IndexSorter after dedup
----------------------------------

         Key: NUTCH-259
         URL: http://issues.apache.org/jira/browse/NUTCH-259
     Project: Nutch
        Type: Bug

  Components: indexer  
    Reporter: Michael
    Priority: Minor


When trying to run IndexSorter i'm getting an error:

Exception in thread "main" java.lang.IllegalArgumentException: attempt to access a deleted document
        at org.apache.lucene.index.SegmentReader.document(SegmentReader.java:282)
        at org.apache.lucene.index.FilterIndexReader.document(FilterIndexReader.java:104)
        at org.apache.nutch.indexer.IndexSorter$SortingReader.document(IndexSorter.java:170)
        at org.apache.lucene.index.SegmentMerger.mergeFields(SegmentMerger.java:186)
        at org.apache.lucene.index.SegmentMerger.merge(SegmentMerger.java:88)
        at org.apache.lucene.index.IndexWriter.addIndexes(IndexWriter.java:579)
        at org.apache.nutch.indexer.IndexSorter.sort(IndexSorter.java:240)
        at org.apache.nutch.indexer.IndexSorter.main(IndexSorter.java:291)


--
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators:
   http://issues.apache.org/jira/secure/Administrators.jspa
-
For more information on JIRA, see:
   http://www.atlassian.com/software/jira