[jira] Created: (NUTCH-467) DeleteDuplicate fails if Segment index directory has 0 documents

classic Classic list List threaded Threaded
3 messages Options
Reply | Threaded
Open this post in threaded view
|

[jira] Created: (NUTCH-467) DeleteDuplicate fails if Segment index directory has 0 documents

Tim Allison (Jira)
DeleteDuplicate fails if Segment index directory has 0 documents
----------------------------------------------------------------

                 Key: NUTCH-467
                 URL: https://issues.apache.org/jira/browse/NUTCH-467
             Project: Nutch
          Issue Type: Bug
          Components: indexer
    Affects Versions: 0.9.0
         Environment: all
            Reporter: Dennis Kubes
             Fix For: 0.9.0


If any of the segment indexes have 0 documents, then the DDRecordReader in DeleteDuplicates throws an IndexOutOfBoundsException.  The record reader needs to check for empty document segment indexes.

--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply | Threaded
Open this post in threaded view
|

[jira] Updated: (NUTCH-467) DeleteDuplicate fails if Segment index directory has 0 documents

Tim Allison (Jira)

     [ https://issues.apache.org/jira/browse/NUTCH-467?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Dennis Kubes updated NUTCH-467:
-------------------------------

    Attachment: nutch-467.patch

Submitted by Andrzej Bialecki.

> DeleteDuplicate fails if Segment index directory has 0 documents
> ----------------------------------------------------------------
>
>                 Key: NUTCH-467
>                 URL: https://issues.apache.org/jira/browse/NUTCH-467
>             Project: Nutch
>          Issue Type: Bug
>          Components: indexer
>    Affects Versions: 0.9.0
>         Environment: all
>            Reporter: Dennis Kubes
>             Fix For: 0.9.0
>
>         Attachments: nutch-467.patch
>
>
> If any of the segment indexes have 0 documents, then the DDRecordReader in DeleteDuplicates throws an IndexOutOfBoundsException.  The record reader needs to check for empty document segment indexes.

--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply | Threaded
Open this post in threaded view
|

[jira] Resolved: (NUTCH-467) DeleteDuplicate fails if Segment index directory has 0 documents

Tim Allison (Jira)
In reply to this post by Tim Allison (Jira)

     [ https://issues.apache.org/jira/browse/NUTCH-467?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Andrzej Bialecki  resolved NUTCH-467.
-------------------------------------

    Resolution: Fixed
      Assignee: Andrzej Bialecki

Patch applied in rev. 532105.

> DeleteDuplicate fails if Segment index directory has 0 documents
> ----------------------------------------------------------------
>
>                 Key: NUTCH-467
>                 URL: https://issues.apache.org/jira/browse/NUTCH-467
>             Project: Nutch
>          Issue Type: Bug
>          Components: indexer
>    Affects Versions: 0.9.0
>         Environment: all
>            Reporter: Dennis Kubes
>         Assigned To: Andrzej Bialecki
>             Fix For: 1.0.0
>
>         Attachments: nutch-467.patch
>
>
> If any of the segment indexes have 0 documents, then the DDRecordReader in DeleteDuplicates throws an IndexOutOfBoundsException.  The record reader needs to check for empty document segment indexes.

--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.