[jira] [Created] (TIKA-857) Tika TrueTypeParser add metadata from Naming tables

classic Classic list List threaded Threaded
5 messages Options
Reply | Threaded
Open this post in threaded view
|

[jira] [Created] (TIKA-857) Tika TrueTypeParser add metadata from Naming tables

Hudson (Jira)
Tika TrueTypeParser add metadata from Naming tables
---------------------------------------------------

                 Key: TIKA-857
                 URL: https://issues.apache.org/jira/browse/TIKA-857
             Project: Tika
          Issue Type: Improvement
          Components: parser
    Affects Versions: 0.10
            Reporter: Craig Stires


The current parsing of TrueType fonts (.ttf) parses the tables from the file, but does not assign to the metadata attributes returned to the calling application.

The patch included is just a mapping from the NamingTables to the appropriate metadata categories.

This allows the calling application to get things like the font name, description, author, and copyright information back as metadata attributes.


--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

       
Reply | Threaded
Open this post in threaded view
|

[jira] [Updated] (TIKA-857) Tika TrueTypeParser add metadata from Naming tables

Hudson (Jira)

     [ https://issues.apache.org/jira/browse/TIKA-857?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Craig Stires updated TIKA-857:
------------------------------

    Attachment: TrueTypeParser_AddMetadata.patch

this is the patch against org.apache.tika.parser.font.TrueTypeParser.java
               

> Tika TrueTypeParser add metadata from Naming tables
> ---------------------------------------------------
>
>                 Key: TIKA-857
>                 URL: https://issues.apache.org/jira/browse/TIKA-857
>             Project: Tika
>          Issue Type: Improvement
>          Components: parser
>    Affects Versions: 0.10
>            Reporter: Craig Stires
>         Attachments: TrueTypeParser_AddMetadata.patch
>
>
> The current parsing of TrueType fonts (.ttf) parses the tables from the file, but does not assign to the metadata attributes returned to the calling application.
> The patch included is just a mapping from the NamingTables to the appropriate metadata categories.
> This allows the calling application to get things like the font name, description, author, and copyright information back as metadata attributes.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

       
Reply | Threaded
Open this post in threaded view
|

[jira] [Resolved] (TIKA-857) Tika TrueTypeParser add metadata from Naming tables

Hudson (Jira)
In reply to this post by Hudson (Jira)

     [ https://issues.apache.org/jira/browse/TIKA-857?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Craig Stires resolved TIKA-857.
-------------------------------

       Resolution: Fixed
    Fix Version/s: 1.1
   

> Tika TrueTypeParser add metadata from Naming tables
> ---------------------------------------------------
>
>                 Key: TIKA-857
>                 URL: https://issues.apache.org/jira/browse/TIKA-857
>             Project: Tika
>          Issue Type: Improvement
>          Components: parser
>    Affects Versions: 0.10
>            Reporter: Craig Stires
>             Fix For: 1.1
>
>         Attachments: TrueTypeParser_AddMetadata.patch
>
>
> The current parsing of TrueType fonts (.ttf) parses the tables from the file, but does not assign to the metadata attributes returned to the calling application.
> The patch included is just a mapping from the NamingTables to the appropriate metadata categories.
> This allows the calling application to get things like the font name, description, author, and copyright information back as metadata attributes.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

       
Reply | Threaded
Open this post in threaded view
|

[jira] [Commented] (TIKA-857) Tika TrueTypeParser add metadata from Naming tables

Hudson (Jira)
In reply to this post by Hudson (Jira)

    [ https://issues.apache.org/jira/browse/TIKA-857?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13202928#comment-13202928 ]

Nick Burch commented on TIKA-857:
---------------------------------

Not sure that this issue should have been resolved, as the patch hasn't been applied yet. Could whoever resolved it please clarify why?
               

> Tika TrueTypeParser add metadata from Naming tables
> ---------------------------------------------------
>
>                 Key: TIKA-857
>                 URL: https://issues.apache.org/jira/browse/TIKA-857
>             Project: Tika
>          Issue Type: Improvement
>          Components: parser
>    Affects Versions: 0.10
>            Reporter: Craig Stires
>             Fix For: 1.1
>
>         Attachments: TrueTypeParser_AddMetadata.patch
>
>
> The current parsing of TrueType fonts (.ttf) parses the tables from the file, but does not assign to the metadata attributes returned to the calling application.
> The patch included is just a mapping from the NamingTables to the appropriate metadata categories.
> This allows the calling application to get things like the font name, description, author, and copyright information back as metadata attributes.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

       
Reply | Threaded
Open this post in threaded view
|

[jira] [Commented] (TIKA-857) Tika TrueTypeParser add metadata from Naming tables

Hudson (Jira)
In reply to this post by Hudson (Jira)

    [ https://issues.apache.org/jira/browse/TIKA-857?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13202962#comment-13202962 ]

Nick Burch commented on TIKA-857:
---------------------------------

Looking at the patch, my only comment is wondering if we need to bring in the list of TTF constants ourselves. Could we not instead just use the existing constants in FontBox, such as something like the ones in org.apache.fontbox.ttf.NameRecord?
               

> Tika TrueTypeParser add metadata from Naming tables
> ---------------------------------------------------
>
>                 Key: TIKA-857
>                 URL: https://issues.apache.org/jira/browse/TIKA-857
>             Project: Tika
>          Issue Type: Improvement
>          Components: parser
>    Affects Versions: 0.10
>            Reporter: Craig Stires
>             Fix For: 1.1
>
>         Attachments: TrueTypeParser_AddMetadata.patch
>
>
> The current parsing of TrueType fonts (.ttf) parses the tables from the file, but does not assign to the metadata attributes returned to the calling application.
> The patch included is just a mapping from the NamingTables to the appropriate metadata categories.
> This allows the calling application to get things like the font name, description, author, and copyright information back as metadata attributes.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira