[jira] [Created] (TIKA-3042) Date format extraction problem in XLS/XLSX


Hudson (Jira)
Zoltan Farago created TIKA-3042:
-----------------------------------

             Summary: Date format extraction problem in XLS/XLSX
                 Key: TIKA-3042
                 URL: https://issues.apache.org/jira/browse/TIKA-3042
             Project: Tika
          Issue Type: Task
            Reporter: Zoltan Farago


Currently, Tika/ManifoldCF 2.10 extracts dates from the attached file this way:

2018.05.10 -> 10/05/18
2002.02.02 -> 2/2/2

We need this format:

2018.05.10 -> 2018-05-10

2002.02.02 -> 2002-02-02

This occurs only when the field type is date. When the field type is text, the output is fine.
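
For reference, the format we need is the ISO 8601 calendar date (yyyy-MM-dd). The mapping can be sketched in plain java.time, independent of Tika, POI, or ManifoldCF. The class and method names are hypothetical, and the sketch assumes the value is displayed as year.month.day, as in the examples above. It only illustrates the expected output and is not a proposed fix for the extraction itself.

```java
import java.time.LocalDate;
import java.time.format.DateTimeFormatter;

// Hypothetical helper for illustration only; not part of Tika or ManifoldCF.
final class IsoDateSketch {

    // Assumption: the displayed form is year.month.day, e.g. "2018.05.10".
    private static final DateTimeFormatter DISPLAYED =
            DateTimeFormatter.ofPattern("uuuu.MM.dd");

    // Parses the displayed value and returns it as an ISO 8601 date string.
    static String toIso(String displayed) {
        // LocalDate.toString() yields the ISO form uuuu-MM-dd.
        return LocalDate.parse(displayed, DISPLAYED).toString();
    }

    public static void main(String[] args) {
        System.out.println(toIso("2018.05.10")); // prints 2018-05-10
        System.out.println(toIso("2002.02.02")); // prints 2002-02-02
    }
}
```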

 

Please recommend any settings in the pipeline (Tika configuration, Excel settings, OS locale settings, etc.) that would produce this format, or provide a fix.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)