[jira] Created: (TIKA-139) Add a composite parser

[jira] Created: (TIKA-139) Add a composite parser

JIRA jira@apache.org
Add a composite parser
----------------------

                 Key: TIKA-139
                 URL: https://issues.apache.org/jira/browse/TIKA-139
             Project: Tika
          Issue Type: New Feature
          Components: parser
            Reporter: Jukka Zitting
            Assignee: Jukka Zitting
            Priority: Minor


There should be a generic parser class that delegates parsing calls to underlying component parsers based on the incoming document type metadata.

--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Commented: (TIKA-139) Add a composite parser

JIRA jira@apache.org

    [ https://issues.apache.org/jira/browse/TIKA-139?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12588036#action_12588036 ]

Chris A. Mattmann commented on TIKA-139:
----------------------------------------

Hey Jukka:

How is this different from the AutoDetectParser?

Thanks!

Cheers,
 Chris



[jira] Commented: (TIKA-139) Add a composite parser

JIRA jira@apache.org

    [ https://issues.apache.org/jira/browse/TIKA-139?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12588405#action_12588405 ]

Jukka Zitting commented on TIKA-139:
------------------------------------

> How is this different from the AutoDetectParser

Separation of concerns. Before this change, AutoDetectParser had to both 1) auto-detect the content type and 2) use that type to select the underlying parser to which the parse call is delegated.

The CompositeParser as added in revision 647181 takes care of the latter concern. AutoDetectParser now extends CompositeParser and just adds the auto-detection functionality.
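
For readers who want to picture the split described above, here is a minimal sketch. It is not the code from revision 647181 and does not use Tika's actual Parser interface: the SketchParser contract, the class names, the "Content-Type" metadata key and the toy detect() helper are all assumptions made for illustration. The composite class only selects a delegate from the declared type; the auto-detecting subclass only fills in the type and then reuses the inherited delegation.

```java
import java.nio.charset.StandardCharsets;
import java.util.HashMap;
import java.util.Map;

// Hypothetical, simplified parser contract; NOT Tika's actual Parser interface.
interface SketchParser {
    String parse(byte[] content, Map<String, String> metadata);
}

// Concern 2 only: pick a component parser based on the declared content type.
class SketchCompositeParser implements SketchParser {
    private final Map<String, SketchParser> parsers = new HashMap<>();
    private final SketchParser fallback;

    SketchCompositeParser(SketchParser fallback) {
        this.fallback = fallback;
    }

    void register(String contentType, SketchParser parser) {
        parsers.put(contentType, parser);
    }

    @Override
    public String parse(byte[] content, Map<String, String> metadata) {
        // "Content-Type" as the metadata key is an assumption of this sketch.
        String type = metadata.get("Content-Type");
        SketchParser delegate = parsers.getOrDefault(type, fallback);
        return delegate.parse(content, metadata);
    }
}

// Concern 1 only: detect the type, record it, then reuse the inherited delegation.
class SketchAutoDetectParser extends SketchCompositeParser {
    SketchAutoDetectParser(SketchParser fallback) {
        super(fallback);
    }

    @Override
    public String parse(byte[] content, Map<String, String> metadata) {
        // Copy the metadata so the caller's map is left untouched.
        Map<String, String> detected = new HashMap<>(metadata);
        detected.put("Content-Type", detect(content));
        return super.parse(content, detected);
    }

    // Toy detection for illustration; a real detector would inspect much more.
    private static String detect(byte[] content) {
        String text = new String(content, StandardCharsets.UTF_8).trim();
        return text.startsWith("<") ? "application/xml" : "text/plain";
    }
}

class CompositeParserSketch {
    public static void main(String[] args) {
        SketchAutoDetectParser parser = new SketchAutoDetectParser((c, m) -> "unknown");
        parser.register("text/plain", (c, m) -> "plain:" + new String(c, StandardCharsets.UTF_8));
        parser.register("application/xml", (c, m) -> "xml");

        // prints "xml"
        System.out.println(parser.parse("<a/>".getBytes(StandardCharsets.UTF_8), new HashMap<>()));
        // prints "plain:hello"
        System.out.println(parser.parse("hello".getBytes(StandardCharsets.UTF_8), new HashMap<>()));
    }
}
```

With this shape, a caller that already knows the content type can use the composite class directly and skip detection entirely, which is the practical benefit of keeping the two concerns apart.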



[jira] Commented: (TIKA-139) Add a composite parser

JIRA jira@apache.org

    [ https://issues.apache.org/jira/browse/TIKA-139?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12588460#action_12588460 ]

Chris A. Mattmann commented on TIKA-139:
----------------------------------------

Hey Jukka:

+1, AWESOME! Yep, totally makes sense.

Thanks!

Cheers,
 Chris



[jira] Resolved: (TIKA-139) Add a composite parser

JIRA jira@apache.org

     [ https://issues.apache.org/jira/browse/TIKA-139?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Jukka Zitting resolved TIKA-139.
--------------------------------

       Resolution: Fixed
    Fix Version/s: 0.2-incubating

Implemented in revision 647181.
