[jira] Created: (TIKA-606) NumberFormatException when parsing an mp3-file

Previous Topic Next Topic
 
classic Classic list List threaded Threaded
5 messages Options
Reply | Threaded
Open this post in threaded view
|

[jira] Created: (TIKA-606) NumberFormatException when parsing an mp3-file

Hudson (Jira)
NumberFormatException when parsing an mp3-file
----------------------------------------------

                 Key: TIKA-606
                 URL: https://issues.apache.org/jira/browse/TIKA-606
             Project: Tika
          Issue Type: Bug
          Components: parser
    Affects Versions: 0.9
            Reporter: Erna de Groot


When parsing an mp3-file with lyrics in the metadata I get the following error:

org.apache.tika.exception.TikaException: Unexpected RuntimeException from org.apache.tika.parser.mp3.Mp3Parser@cf546f8
        at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:199)
        at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:197)
        at org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:137)
        at org.apache.tika.gui.TikaGUI.importStream(TikaGUI.java:186)
        at org.apache.tika.gui.ParsingTransferHandler.importData(ParsingTransferHandler.java:89)
        at javax.swing.TransferHandler.importData(TransferHandler.java:762)
        at javax.swing.TransferHandler$DropHandler.drop(TransferHandler.java:1485)
        at java.awt.dnd.DropTarget.drop(DropTarget.java:446)
        at javax.swing.TransferHandler$SwingDropTarget.drop(TransferHandler.java:1210)
        at sun.awt.dnd.SunDropTargetContextPeer.processDropMessage(SunDropTargetContextPeer.java:517)
        at sun.awt.X11.XDropTargetContextPeer.processDropMessage(XDropTargetContextPeer.java:183)
        at sun.awt.dnd.SunDropTargetContextPeer$EventDispatcher.dispatchDropEvent(SunDropTargetContextPeer.java:830)
        at sun.awt.dnd.SunDropTargetContextPeer$EventDispatcher.dispatchEvent(SunDropTargetContextPeer.java:754)
        at sun.awt.dnd.SunDropTargetEvent.dispatch(SunDropTargetEvent.java:48)
        at java.awt.Component.dispatchEventImpl(Component.java:4324)
        at java.awt.Container.dispatchEventImpl(Container.java:2163)
        at java.awt.Component.dispatchEvent(Component.java:4295)
        at java.awt.LightweightDispatcher.retargetMouseEvent(Container.java:4461)
        at java.awt.LightweightDispatcher.processDropTargetEvent(Container.java:4196)
        at java.awt.LightweightDispatcher.dispatchEvent(Container.java:4050)
        at java.awt.Container.dispatchEventImpl(Container.java:2149)
        at java.awt.Window.dispatchEventImpl(Window.java:2478)
        at java.awt.Component.dispatchEvent(Component.java:4295)
        at java.awt.EventQueue.dispatchEvent(EventQueue.java:604)
        at java.awt.EventDispatchThread.pumpOneEventForFilters(EventDispatchThread.java:275)
        at java.awt.EventDispatchThread.pumpEventsForFilter(EventDispatchThread.java:200)
        at java.awt.EventDispatchThread.pumpEventsForHierarchy(EventDispatchThread.java:190)
        at java.awt.EventDispatchThread.pumpEvents(EventDispatchThread.java:185)
        at java.awt.EventDispatchThread.pumpEvents(EventDispatchThread.java:177)
        at java.awt.EventDispatchThread.run(EventDispatchThread.java:138)
Caused by: java.lang.NumberFormatException: For input string: "peg is"
        at java.lang.NumberFormatException.forInputString(NumberFormatException.java:65)
        at java.lang.Integer.parseInt(Integer.java:481)
        at java.lang.Integer.parseInt(Integer.java:514)
        at org.apache.tika.parser.mp3.LyricsHandler.<init>(LyricsHandler.java:92)
        at org.apache.tika.parser.mp3.LyricsHandler.<init>(LyricsHandler.java:42)
        at org.apache.tika.parser.mp3.Mp3Parser.getAllTagHandlers(Mp3Parser.java:151)
        at org.apache.tika.parser.mp3.Mp3Parser.parse(Mp3Parser.java:64)
        at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:197)
        ... 29 more


I tried to do it with tika version 0.8 and 0.9. They had the same result.
I can send you the mp3's if necessary.


--
This message is automatically generated by JIRA.
-
For more information on JIRA, see: http://www.atlassian.com/software/jira

       
Reply | Threaded
Open this post in threaded view
|

[jira] Commented: (TIKA-606) NumberFormatException when parsing an mp3-file

Hudson (Jira)

    [ https://issues.apache.org/jira/browse/TIKA-606?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12998367#comment-12998367 ]

Nick Burch commented on TIKA-606:
---------------------------------

Can you please upload the problem mp3 so we can take a look?

We have a sample mp3 with lyrics in our test suite that works fine (metadata, lyrics etc), so we'll need to try to identify what's going wrong with your file

> NumberFormatException when parsing an mp3-file
> ----------------------------------------------
>
>                 Key: TIKA-606
>                 URL: https://issues.apache.org/jira/browse/TIKA-606
>             Project: Tika
>          Issue Type: Bug
>          Components: parser
>    Affects Versions: 0.9
>            Reporter: Erna de Groot
>
> When parsing an mp3-file with lyrics in the metadata I get the following error:
> org.apache.tika.exception.TikaException: Unexpected RuntimeException from org.apache.tika.parser.mp3.Mp3Parser@cf546f8
> at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:199)
> at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:197)
> at org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:137)
> at org.apache.tika.gui.TikaGUI.importStream(TikaGUI.java:186)
> at org.apache.tika.gui.ParsingTransferHandler.importData(ParsingTransferHandler.java:89)
> at javax.swing.TransferHandler.importData(TransferHandler.java:762)
> at javax.swing.TransferHandler$DropHandler.drop(TransferHandler.java:1485)
> at java.awt.dnd.DropTarget.drop(DropTarget.java:446)
> at javax.swing.TransferHandler$SwingDropTarget.drop(TransferHandler.java:1210)
> at sun.awt.dnd.SunDropTargetContextPeer.processDropMessage(SunDropTargetContextPeer.java:517)
> at sun.awt.X11.XDropTargetContextPeer.processDropMessage(XDropTargetContextPeer.java:183)
> at sun.awt.dnd.SunDropTargetContextPeer$EventDispatcher.dispatchDropEvent(SunDropTargetContextPeer.java:830)
> at sun.awt.dnd.SunDropTargetContextPeer$EventDispatcher.dispatchEvent(SunDropTargetContextPeer.java:754)
> at sun.awt.dnd.SunDropTargetEvent.dispatch(SunDropTargetEvent.java:48)
> at java.awt.Component.dispatchEventImpl(Component.java:4324)
> at java.awt.Container.dispatchEventImpl(Container.java:2163)
> at java.awt.Component.dispatchEvent(Component.java:4295)
> at java.awt.LightweightDispatcher.retargetMouseEvent(Container.java:4461)
> at java.awt.LightweightDispatcher.processDropTargetEvent(Container.java:4196)
> at java.awt.LightweightDispatcher.dispatchEvent(Container.java:4050)
> at java.awt.Container.dispatchEventImpl(Container.java:2149)
> at java.awt.Window.dispatchEventImpl(Window.java:2478)
> at java.awt.Component.dispatchEvent(Component.java:4295)
> at java.awt.EventQueue.dispatchEvent(EventQueue.java:604)
> at java.awt.EventDispatchThread.pumpOneEventForFilters(EventDispatchThread.java:275)
> at java.awt.EventDispatchThread.pumpEventsForFilter(EventDispatchThread.java:200)
> at java.awt.EventDispatchThread.pumpEventsForHierarchy(EventDispatchThread.java:190)
> at java.awt.EventDispatchThread.pumpEvents(EventDispatchThread.java:185)
> at java.awt.EventDispatchThread.pumpEvents(EventDispatchThread.java:177)
> at java.awt.EventDispatchThread.run(EventDispatchThread.java:138)
> Caused by: java.lang.NumberFormatException: For input string: "peg is"
> at java.lang.NumberFormatException.forInputString(NumberFormatException.java:65)
> at java.lang.Integer.parseInt(Integer.java:481)
> at java.lang.Integer.parseInt(Integer.java:514)
> at org.apache.tika.parser.mp3.LyricsHandler.<init>(LyricsHandler.java:92)
> at org.apache.tika.parser.mp3.LyricsHandler.<init>(LyricsHandler.java:42)
> at org.apache.tika.parser.mp3.Mp3Parser.getAllTagHandlers(Mp3Parser.java:151)
> at org.apache.tika.parser.mp3.Mp3Parser.parse(Mp3Parser.java:64)
> at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:197)
> ... 29 more
> I tried to do it with tika version 0.8 and 0.9. They had the same result.
> I can send you the mp3's if necessary.

--
This message is automatically generated by JIRA.
-
For more information on JIRA, see: http://www.atlassian.com/software/jira

       
Reply | Threaded
Open this post in threaded view
|

[jira] Updated: (TIKA-606) NumberFormatException when parsing an mp3-file

Hudson (Jira)
In reply to this post by Hudson (Jira)

     [ https://issues.apache.org/jira/browse/TIKA-606?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Erna de Groot updated TIKA-606:
-------------------------------

    Attachment: winnipeg is a frozen shithole.mp3

I send you one of the mp3's hereby. We have several from the same  
album that has this problem.

regards,
Erna






----------------------------------------------------------------
This message was sent using IMP, the Internet Messaging Program.



> NumberFormatException when parsing an mp3-file
> ----------------------------------------------
>
>                 Key: TIKA-606
>                 URL: https://issues.apache.org/jira/browse/TIKA-606
>             Project: Tika
>          Issue Type: Bug
>          Components: parser
>    Affects Versions: 0.9
>            Reporter: Erna de Groot
>         Attachments: winnipeg is a frozen shithole.mp3
>
>
> When parsing an mp3-file with lyrics in the metadata I get the following error:
> org.apache.tika.exception.TikaException: Unexpected RuntimeException from org.apache.tika.parser.mp3.Mp3Parser@cf546f8
> at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:199)
> at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:197)
> at org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:137)
> at org.apache.tika.gui.TikaGUI.importStream(TikaGUI.java:186)
> at org.apache.tika.gui.ParsingTransferHandler.importData(ParsingTransferHandler.java:89)
> at javax.swing.TransferHandler.importData(TransferHandler.java:762)
> at javax.swing.TransferHandler$DropHandler.drop(TransferHandler.java:1485)
> at java.awt.dnd.DropTarget.drop(DropTarget.java:446)
> at javax.swing.TransferHandler$SwingDropTarget.drop(TransferHandler.java:1210)
> at sun.awt.dnd.SunDropTargetContextPeer.processDropMessage(SunDropTargetContextPeer.java:517)
> at sun.awt.X11.XDropTargetContextPeer.processDropMessage(XDropTargetContextPeer.java:183)
> at sun.awt.dnd.SunDropTargetContextPeer$EventDispatcher.dispatchDropEvent(SunDropTargetContextPeer.java:830)
> at sun.awt.dnd.SunDropTargetContextPeer$EventDispatcher.dispatchEvent(SunDropTargetContextPeer.java:754)
> at sun.awt.dnd.SunDropTargetEvent.dispatch(SunDropTargetEvent.java:48)
> at java.awt.Component.dispatchEventImpl(Component.java:4324)
> at java.awt.Container.dispatchEventImpl(Container.java:2163)
> at java.awt.Component.dispatchEvent(Component.java:4295)
> at java.awt.LightweightDispatcher.retargetMouseEvent(Container.java:4461)
> at java.awt.LightweightDispatcher.processDropTargetEvent(Container.java:4196)
> at java.awt.LightweightDispatcher.dispatchEvent(Container.java:4050)
> at java.awt.Container.dispatchEventImpl(Container.java:2149)
> at java.awt.Window.dispatchEventImpl(Window.java:2478)
> at java.awt.Component.dispatchEvent(Component.java:4295)
> at java.awt.EventQueue.dispatchEvent(EventQueue.java:604)
> at java.awt.EventDispatchThread.pumpOneEventForFilters(EventDispatchThread.java:275)
> at java.awt.EventDispatchThread.pumpEventsForFilter(EventDispatchThread.java:200)
> at java.awt.EventDispatchThread.pumpEventsForHierarchy(EventDispatchThread.java:190)
> at java.awt.EventDispatchThread.pumpEvents(EventDispatchThread.java:185)
> at java.awt.EventDispatchThread.pumpEvents(EventDispatchThread.java:177)
> at java.awt.EventDispatchThread.run(EventDispatchThread.java:138)
> Caused by: java.lang.NumberFormatException: For input string: "peg is"
> at java.lang.NumberFormatException.forInputString(NumberFormatException.java:65)
> at java.lang.Integer.parseInt(Integer.java:481)
> at java.lang.Integer.parseInt(Integer.java:514)
> at org.apache.tika.parser.mp3.LyricsHandler.<init>(LyricsHandler.java:92)
> at org.apache.tika.parser.mp3.LyricsHandler.<init>(LyricsHandler.java:42)
> at org.apache.tika.parser.mp3.Mp3Parser.getAllTagHandlers(Mp3Parser.java:151)
> at org.apache.tika.parser.mp3.Mp3Parser.parse(Mp3Parser.java:64)
> at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:197)
> ... 29 more
> I tried to do it with tika version 0.8 and 0.9. They had the same result.
> I can send you the mp3's if necessary.

--
This message is automatically generated by JIRA.
-
For more information on JIRA, see: http://www.atlassian.com/software/jira

       
Reply | Threaded
Open this post in threaded view
|

[jira] Commented: (TIKA-606) NumberFormatException when parsing an mp3-file

Hudson (Jira)
In reply to this post by Hudson (Jira)

    [ https://issues.apache.org/jira/browse/TIKA-606?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13002686#comment-13002686 ]

Nick Burch commented on TIKA-606:
---------------------------------

Found it - LYRICS overall uses a 6 digit length, but each tag only has a 5 digit length

Fixed in r1078031.

Incidentally, We could use a small (100kb max) mp3 that's free to distribute which includes lyrics. Currently, we do have a file with the lyrics tag in it, which is used for some tests, but it's an empty tag with no text contents (which is why the test hadn't flagged the issue)

> NumberFormatException when parsing an mp3-file
> ----------------------------------------------
>
>                 Key: TIKA-606
>                 URL: https://issues.apache.org/jira/browse/TIKA-606
>             Project: Tika
>          Issue Type: Bug
>          Components: parser
>    Affects Versions: 0.9
>            Reporter: Erna de Groot
>             Fix For: 1.0
>
>         Attachments: winnipeg is a frozen shithole.mp3
>
>
> When parsing an mp3-file with lyrics in the metadata I get the following error:
> org.apache.tika.exception.TikaException: Unexpected RuntimeException from org.apache.tika.parser.mp3.Mp3Parser@cf546f8
> at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:199)
> at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:197)
> at org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:137)
> at org.apache.tika.gui.TikaGUI.importStream(TikaGUI.java:186)
> at org.apache.tika.gui.ParsingTransferHandler.importData(ParsingTransferHandler.java:89)
> at javax.swing.TransferHandler.importData(TransferHandler.java:762)
> at javax.swing.TransferHandler$DropHandler.drop(TransferHandler.java:1485)
> at java.awt.dnd.DropTarget.drop(DropTarget.java:446)
> at javax.swing.TransferHandler$SwingDropTarget.drop(TransferHandler.java:1210)
> at sun.awt.dnd.SunDropTargetContextPeer.processDropMessage(SunDropTargetContextPeer.java:517)
> at sun.awt.X11.XDropTargetContextPeer.processDropMessage(XDropTargetContextPeer.java:183)
> at sun.awt.dnd.SunDropTargetContextPeer$EventDispatcher.dispatchDropEvent(SunDropTargetContextPeer.java:830)
> at sun.awt.dnd.SunDropTargetContextPeer$EventDispatcher.dispatchEvent(SunDropTargetContextPeer.java:754)
> at sun.awt.dnd.SunDropTargetEvent.dispatch(SunDropTargetEvent.java:48)
> at java.awt.Component.dispatchEventImpl(Component.java:4324)
> at java.awt.Container.dispatchEventImpl(Container.java:2163)
> at java.awt.Component.dispatchEvent(Component.java:4295)
> at java.awt.LightweightDispatcher.retargetMouseEvent(Container.java:4461)
> at java.awt.LightweightDispatcher.processDropTargetEvent(Container.java:4196)
> at java.awt.LightweightDispatcher.dispatchEvent(Container.java:4050)
> at java.awt.Container.dispatchEventImpl(Container.java:2149)
> at java.awt.Window.dispatchEventImpl(Window.java:2478)
> at java.awt.Component.dispatchEvent(Component.java:4295)
> at java.awt.EventQueue.dispatchEvent(EventQueue.java:604)
> at java.awt.EventDispatchThread.pumpOneEventForFilters(EventDispatchThread.java:275)
> at java.awt.EventDispatchThread.pumpEventsForFilter(EventDispatchThread.java:200)
> at java.awt.EventDispatchThread.pumpEventsForHierarchy(EventDispatchThread.java:190)
> at java.awt.EventDispatchThread.pumpEvents(EventDispatchThread.java:185)
> at java.awt.EventDispatchThread.pumpEvents(EventDispatchThread.java:177)
> at java.awt.EventDispatchThread.run(EventDispatchThread.java:138)
> Caused by: java.lang.NumberFormatException: For input string: "peg is"
> at java.lang.NumberFormatException.forInputString(NumberFormatException.java:65)
> at java.lang.Integer.parseInt(Integer.java:481)
> at java.lang.Integer.parseInt(Integer.java:514)
> at org.apache.tika.parser.mp3.LyricsHandler.<init>(LyricsHandler.java:92)
> at org.apache.tika.parser.mp3.LyricsHandler.<init>(LyricsHandler.java:42)
> at org.apache.tika.parser.mp3.Mp3Parser.getAllTagHandlers(Mp3Parser.java:151)
> at org.apache.tika.parser.mp3.Mp3Parser.parse(Mp3Parser.java:64)
> at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:197)
> ... 29 more
> I tried to do it with tika version 0.8 and 0.9. They had the same result.
> I can send you the mp3's if necessary.

--
This message is automatically generated by JIRA.
-
For more information on JIRA, see: http://www.atlassian.com/software/jira

       
Reply | Threaded
Open this post in threaded view
|

[jira] Resolved: (TIKA-606) NumberFormatException when parsing an mp3-file

Hudson (Jira)
In reply to this post by Hudson (Jira)

     [ https://issues.apache.org/jira/browse/TIKA-606?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Nick Burch resolved TIKA-606.
-----------------------------

       Resolution: Fixed
    Fix Version/s: 1.0

> NumberFormatException when parsing an mp3-file
> ----------------------------------------------
>
>                 Key: TIKA-606
>                 URL: https://issues.apache.org/jira/browse/TIKA-606
>             Project: Tika
>          Issue Type: Bug
>          Components: parser
>    Affects Versions: 0.9
>            Reporter: Erna de Groot
>             Fix For: 1.0
>
>         Attachments: winnipeg is a frozen shithole.mp3
>
>
> When parsing an mp3-file with lyrics in the metadata I get the following error:
> org.apache.tika.exception.TikaException: Unexpected RuntimeException from org.apache.tika.parser.mp3.Mp3Parser@cf546f8
> at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:199)
> at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:197)
> at org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:137)
> at org.apache.tika.gui.TikaGUI.importStream(TikaGUI.java:186)
> at org.apache.tika.gui.ParsingTransferHandler.importData(ParsingTransferHandler.java:89)
> at javax.swing.TransferHandler.importData(TransferHandler.java:762)
> at javax.swing.TransferHandler$DropHandler.drop(TransferHandler.java:1485)
> at java.awt.dnd.DropTarget.drop(DropTarget.java:446)
> at javax.swing.TransferHandler$SwingDropTarget.drop(TransferHandler.java:1210)
> at sun.awt.dnd.SunDropTargetContextPeer.processDropMessage(SunDropTargetContextPeer.java:517)
> at sun.awt.X11.XDropTargetContextPeer.processDropMessage(XDropTargetContextPeer.java:183)
> at sun.awt.dnd.SunDropTargetContextPeer$EventDispatcher.dispatchDropEvent(SunDropTargetContextPeer.java:830)
> at sun.awt.dnd.SunDropTargetContextPeer$EventDispatcher.dispatchEvent(SunDropTargetContextPeer.java:754)
> at sun.awt.dnd.SunDropTargetEvent.dispatch(SunDropTargetEvent.java:48)
> at java.awt.Component.dispatchEventImpl(Component.java:4324)
> at java.awt.Container.dispatchEventImpl(Container.java:2163)
> at java.awt.Component.dispatchEvent(Component.java:4295)
> at java.awt.LightweightDispatcher.retargetMouseEvent(Container.java:4461)
> at java.awt.LightweightDispatcher.processDropTargetEvent(Container.java:4196)
> at java.awt.LightweightDispatcher.dispatchEvent(Container.java:4050)
> at java.awt.Container.dispatchEventImpl(Container.java:2149)
> at java.awt.Window.dispatchEventImpl(Window.java:2478)
> at java.awt.Component.dispatchEvent(Component.java:4295)
> at java.awt.EventQueue.dispatchEvent(EventQueue.java:604)
> at java.awt.EventDispatchThread.pumpOneEventForFilters(EventDispatchThread.java:275)
> at java.awt.EventDispatchThread.pumpEventsForFilter(EventDispatchThread.java:200)
> at java.awt.EventDispatchThread.pumpEventsForHierarchy(EventDispatchThread.java:190)
> at java.awt.EventDispatchThread.pumpEvents(EventDispatchThread.java:185)
> at java.awt.EventDispatchThread.pumpEvents(EventDispatchThread.java:177)
> at java.awt.EventDispatchThread.run(EventDispatchThread.java:138)
> Caused by: java.lang.NumberFormatException: For input string: "peg is"
> at java.lang.NumberFormatException.forInputString(NumberFormatException.java:65)
> at java.lang.Integer.parseInt(Integer.java:481)
> at java.lang.Integer.parseInt(Integer.java:514)
> at org.apache.tika.parser.mp3.LyricsHandler.<init>(LyricsHandler.java:92)
> at org.apache.tika.parser.mp3.LyricsHandler.<init>(LyricsHandler.java:42)
> at org.apache.tika.parser.mp3.Mp3Parser.getAllTagHandlers(Mp3Parser.java:151)
> at org.apache.tika.parser.mp3.Mp3Parser.parse(Mp3Parser.java:64)
> at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:197)
> ... 29 more
> I tried to do it with tika version 0.8 and 0.9. They had the same result.
> I can send you the mp3's if necessary.

--
This message is automatically generated by JIRA.
-
For more information on JIRA, see: http://www.atlassian.com/software/jira