[jira] Created: (TIKA-136) Exception during command line calling

classic Classic list List threaded Threaded
3 messages Options
Reply | Threaded
Open this post in threaded view
|

[jira] Created: (TIKA-136) Exception during command line calling

JIRA jira@apache.org
Exception during command line calling
-------------------------------------

                 Key: TIKA-136
                 URL: https://issues.apache.org/jira/browse/TIKA-136
             Project: Tika
          Issue Type: Bug
          Components: config
    Affects Versions: 0.2-incubating
         Environment: Windows XP; Java 1.5
            Reporter: Karl Heinz Marbaise
            Priority: Blocker


Exception in thread "main" java.lang.NoClassDefFoundError: org/fontbox/afm/AFMParser
        at org.pdfbox.pdmodel.font.PDFont.getAFM(PDFont.java:350)
        at org.pdfbox.pdmodel.font.PDFont.getAverageFontWidthFromAFMFile(PDFont.java:313)
        at org.pdfbox.pdmodel.font.PDSimpleFont.getAverageFontWidth(PDSimpleFont.java:231)
        at org.pdfbox.util.PDFStreamEngine.showString(PDFStreamEngine.java:276)
        at org.pdfbox.util.operator.ShowTextGlyph.process(ShowTextGlyph.java:80)
        at org.pdfbox.util.PDFStreamEngine.processOperator(PDFStreamEngine.java:452)
        at org.pdfbox.util.PDFStreamEngine.processSubStream(PDFStreamEngine.java:215)
        at org.pdfbox.util.PDFStreamEngine.processStream(PDFStreamEngine.java:174)
        at org.pdfbox.util.PDFTextStripper.processPage(PDFTextStripper.java:336)
        at org.pdfbox.util.PDFTextStripper.processPages(PDFTextStripper.java:259
)
        at org.pdfbox.util.PDFTextStripper.writeText(PDFTextStripper.java:216)
        at org.pdfbox.util.PDFTextStripper.getText(PDFTextStripper.java:149)
        at org.apache.tika.parser.pdf.PDF2XHTML.process(PDF2XHTML.java:53)
        at org.apache.tika.parser.pdf.PDFParser.parse(PDFParser.java:69)
        at org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:8
4)
        at org.apache.tika.cli.TikaCLI.process(TikaCLI.java:118)
        at org.apache.tika.cli.TikaCLI.main(TikaCLI.java:64)


--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply | Threaded
Open this post in threaded view
|

[jira] Commented: (TIKA-136) Exception during command line calling

JIRA jira@apache.org

    [ https://issues.apache.org/jira/browse/TIKA-136?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12583184#action_12583184 ]

Jukka Zitting commented on TIKA-136:
------------------------------------

Karl in http://markmail.org/message/ejeddz5aeeblfbw2:
> i have taken a look into it and found that the above ticket based on an missing dependency in the pom file.
>
> You have only add the following parts:
> <dependency>
>   <groupId>org.fontbox</groupId>
>   <artifactId>fontbox</artifactId>
>   <verison>0.1.0</version>
> </dependency>

FontBox should come in as a transitive dependency from PDFBox 0.7.3, so AFAIK we don't need to explicitly add it as a dependency.

My version of the -bin packages do contain fontbox and I have no problem parsing PDF files with the Tika command line.

> Exception during command line calling
> -------------------------------------
>
>                 Key: TIKA-136
>                 URL: https://issues.apache.org/jira/browse/TIKA-136
>             Project: Tika
>          Issue Type: Bug
>          Components: config
>    Affects Versions: 0.2-incubating
>         Environment: Windows XP; Java 1.5
>            Reporter: Karl Heinz Marbaise
>            Priority: Blocker
>
> Exception in thread "main" java.lang.NoClassDefFoundError: org/fontbox/afm/AFMParser
>         at org.pdfbox.pdmodel.font.PDFont.getAFM(PDFont.java:350)
>         at org.pdfbox.pdmodel.font.PDFont.getAverageFontWidthFromAFMFile(PDFont.java:313)
>         at org.pdfbox.pdmodel.font.PDSimpleFont.getAverageFontWidth(PDSimpleFont.java:231)
>         at org.pdfbox.util.PDFStreamEngine.showString(PDFStreamEngine.java:276)
>         at org.pdfbox.util.operator.ShowTextGlyph.process(ShowTextGlyph.java:80)
>         at org.pdfbox.util.PDFStreamEngine.processOperator(PDFStreamEngine.java:452)
>         at org.pdfbox.util.PDFStreamEngine.processSubStream(PDFStreamEngine.java:215)
>         at org.pdfbox.util.PDFStreamEngine.processStream(PDFStreamEngine.java:174)
>         at org.pdfbox.util.PDFTextStripper.processPage(PDFTextStripper.java:336)
>         at org.pdfbox.util.PDFTextStripper.processPages(PDFTextStripper.java:259
> )
>         at org.pdfbox.util.PDFTextStripper.writeText(PDFTextStripper.java:216)
>         at org.pdfbox.util.PDFTextStripper.getText(PDFTextStripper.java:149)
>         at org.apache.tika.parser.pdf.PDF2XHTML.process(PDF2XHTML.java:53)
>         at org.apache.tika.parser.pdf.PDFParser.parse(PDFParser.java:69)
>         at org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:8
> 4)
>         at org.apache.tika.cli.TikaCLI.process(TikaCLI.java:118)
>         at org.apache.tika.cli.TikaCLI.main(TikaCLI.java:64)

--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply | Threaded
Open this post in threaded view
|

[jira] Resolved: (TIKA-136) Exception during command line calling

JIRA jira@apache.org
In reply to this post by JIRA jira@apache.org

     [ https://issues.apache.org/jira/browse/TIKA-136?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Jukka Zitting resolved TIKA-136.
--------------------------------

    Resolution: Cannot Reproduce

Resolved as Cannot Reproduce. Please reopen with more details if you still see the issue.

> Exception during command line calling
> -------------------------------------
>
>                 Key: TIKA-136
>                 URL: https://issues.apache.org/jira/browse/TIKA-136
>             Project: Tika
>          Issue Type: Bug
>          Components: config
>    Affects Versions: 0.2-incubating
>         Environment: Windows XP; Java 1.5
>            Reporter: Karl Heinz Marbaise
>            Priority: Blocker
>
> Exception in thread "main" java.lang.NoClassDefFoundError: org/fontbox/afm/AFMParser
>         at org.pdfbox.pdmodel.font.PDFont.getAFM(PDFont.java:350)
>         at org.pdfbox.pdmodel.font.PDFont.getAverageFontWidthFromAFMFile(PDFont.java:313)
>         at org.pdfbox.pdmodel.font.PDSimpleFont.getAverageFontWidth(PDSimpleFont.java:231)
>         at org.pdfbox.util.PDFStreamEngine.showString(PDFStreamEngine.java:276)
>         at org.pdfbox.util.operator.ShowTextGlyph.process(ShowTextGlyph.java:80)
>         at org.pdfbox.util.PDFStreamEngine.processOperator(PDFStreamEngine.java:452)
>         at org.pdfbox.util.PDFStreamEngine.processSubStream(PDFStreamEngine.java:215)
>         at org.pdfbox.util.PDFStreamEngine.processStream(PDFStreamEngine.java:174)
>         at org.pdfbox.util.PDFTextStripper.processPage(PDFTextStripper.java:336)
>         at org.pdfbox.util.PDFTextStripper.processPages(PDFTextStripper.java:259
> )
>         at org.pdfbox.util.PDFTextStripper.writeText(PDFTextStripper.java:216)
>         at org.pdfbox.util.PDFTextStripper.getText(PDFTextStripper.java:149)
>         at org.apache.tika.parser.pdf.PDF2XHTML.process(PDF2XHTML.java:53)
>         at org.apache.tika.parser.pdf.PDFParser.parse(PDFParser.java:69)
>         at org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:8
> 4)
>         at org.apache.tika.cli.TikaCLI.process(TikaCLI.java:118)
>         at org.apache.tika.cli.TikaCLI.main(TikaCLI.java:64)

--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.