[jira] [Commented] (TIKA-2644) Improve RecursiveParserWrapper API

classic Classic list List threaded Threaded
1 message Options
Reply | Threaded
Open this post in threaded view
|

[jira] [Commented] (TIKA-2644) Improve RecursiveParserWrapper API

JIRA jira@apache.org

    [ https://issues.apache.org/jira/browse/TIKA-2644?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16479543#comment-16479543 ]

Hudson commented on TIKA-2644:
------------------------------

UNSTABLE: Integrated in Jenkins build tika-2.x-windows #252 (See [https://builds.apache.org/job/tika-2.x-windows/252/])
TIKA-2644 - refactor recursiveparserwrapper api (tallison: rev 5f05b511d7d1184f6f25a2b644b615c4f21b8e68)
* (edit) tika-app/src/main/java/org/apache/tika/gui/TikaGUI.java
* (add) tika-core/src/main/java/org/apache/tika/sax/RecursiveParserWrapperHandler.java
* (edit) tika-eval/src/test/java/org/apache/tika/eval/SimpleComparerTest.java
* (edit) tika-parsers/src/test/java/org/apache/tika/parser/ocr/TesseractOCRParserTest.java
* (edit) tika-parsers/src/test/java/org/apache/tika/parser/rtf/RTFParserTest.java
* (edit) tika-batch/src/main/java/org/apache/tika/batch/fs/RecursiveParserWrapperFSConsumer.java
* (edit) tika-core/src/main/java/org/apache/tika/sax/ContentHandlerFactory.java
* (edit) tika-eval/src/test/java/org/apache/tika/eval/io/ExtractReaderTest.java
* (edit) tika-parsers/src/test/java/org/apache/tika/parser/pdf/PDFParserTest.java
* (edit) tika-parsers/src/test/java/org/apache/tika/parser/pkg/ZipParserTest.java
* (edit) tika-parsers/src/test/java/org/apache/tika/parser/RecursiveParserWrapperTest.java
* (edit) tika-parsers/src/test/java/org/apache/tika/parser/jdbc/SQLite3ParserTest.java
* (edit) tika-parsers/src/test/java/org/apache/tika/parser/pkg/CompressorParserTest.java
* (edit) tika-app/src/test/java/org/apache/tika/cli/TikaCLIBatchIntegrationTest.java
* (edit) tika-server/src/main/java/org/apache/tika/server/resource/RecursiveMetadataResource.java
* (edit) tika-core/src/test/java/org/apache/tika/TikaTest.java
* (edit) tika-serialization/src/main/java/org/apache/tika/metadata/serialization/PrettyMetadataKeyComparator.java
* (edit) tika-app/src/main/java/org/apache/tika/cli/TikaCLI.java
* (add) tika-core/src/main/java/org/apache/tika/sax/AbstractRecursiveParserWrapperHandler.java
* (edit) tika-eval/src/main/java/org/apache/tika/eval/AbstractProfiler.java
* (edit) tika-eval/src/main/java/org/apache/tika/eval/ExtractProfiler.java
* (edit) tika-batch/src/test/java/org/apache/tika/batch/RecursiveParserWrapperFSConsumerTest.java
* (edit) tika-batch/src/main/java/org/apache/tika/batch/fs/builders/BasicTikaFSConsumersBuilder.java
* (edit) tika-example/src/main/java/org/apache/tika/example/ParsingExample.java
* (edit) tika-parsers/src/test/java/org/apache/tika/parser/mbox/MboxParserTest.java
* (edit) tika-parsers/src/test/java/org/apache/tika/parser/mail/RFC822ParserTest.java
* (edit) tika-core/src/main/java/org/apache/tika/sax/BasicContentHandlerFactory.java
* (edit) tika-parsers/src/test/java/org/apache/tika/parser/microsoft/JackcessParserTest.java
* (edit) tika-server/src/test/java/org/apache/tika/server/RecursiveMetadataResourceTest.java
* (edit) tika-eval/src/main/java/org/apache/tika/eval/io/ExtractReader.java
* (edit) tika-parsers/src/main/java/org/apache/tika/parser/multiple/PickBestTextEncodingParser.java
* (edit) tika-parsers/src/test/java/org/apache/tika/parser/html/HtmlParserTest.java
* (edit) tika-core/src/test/java/org/apache/tika/MultiThreadedTikaTest.java
* (edit) tika-batch/src/main/java/org/apache/tika/batch/fs/BasicTikaFSConsumer.java
* (edit) CHANGES.txt
* (edit) tika-core/src/main/java/org/apache/tika/parser/RecursiveParserWrapper.java
* (edit) tika-eval/src/main/java/org/apache/tika/eval/ExtractComparer.java


> Improve RecursiveParserWrapper API
> ----------------------------------
>
>                 Key: TIKA-2644
>                 URL: https://issues.apache.org/jira/browse/TIKA-2644
>             Project: Tika
>          Issue Type: Task
>            Reporter: Tim Allison
>            Priority: Major
>
> The RecursiveParserWrapper stores the data in the wrapper, which makes it not thread safe, and, um, different from the other parsers, API-wise.
> Let's create a RecursiveParserWrapperHandler that puts more of the handling in the handler.  



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)