[jira] [Commented] (TIKA-2267) Add common tokens files for tika-eval

JIRA jira@apache.org

    [ https://issues.apache.org/jira/browse/TIKA-2267?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15872088#comment-15872088 ]

Hudson commented on TIKA-2267:
------------------------------

SUCCESS: Integrated in Jenkins build tika-2.x #219 (See [https://builds.apache.org/job/tika-2.x/219/])
TIKA-2267 -- add common tokens for some languages into tika-eval (tallison: rev 544ba97520f8b141972f5d70e95e094460267a13)
* (add) tika-eval/src/main/resources/common_tokens/vi
* (add) tika-eval/src/main/resources/common_tokens/fr
* (add) tika-eval/src/main/resources/common_tokens/nl
* (add) tika-eval/src/main/resources/common_tokens/ar
* (add) tika-eval/src/main/resources/common_tokens/de
* (edit) tika-eval/src/main/resources/common_tokens/es
* (add) tika-eval/src/main/resources/common_tokens/he
* (add) tika-eval/src/main/resources/common_tokens/zh-cn
* (add) tika-eval/src/main/resources/common_tokens/zh-tw
* (add) tika-eval/src/main/resources/common_tokens/it
* (add) tika-eval/src/main/resources/common_tokens/el
* (edit) tika-eval/src/main/resources/common_tokens/en
* (add) tika-eval/src/main/resources/common_tokens/fa
* (add) tika-eval/src/main/resources/common_tokens/pt
* (add) tika-eval/src/main/resources/common_tokens/id
* (add) tika-eval/src/main/resources/common_tokens/ko
* (add) tika-eval/src/main/resources/common_tokens/hi
* (add) tika-eval/src/main/resources/common_tokens/ja
* (add) tika-eval/src/main/resources/common_tokens/ur
* (add) tika-eval/src/main/resources/common_tokens/ru


> Add common tokens files for tika-eval
> -------------------------------------
>
>                 Key: TIKA-2267
>                 URL: https://issues.apache.org/jira/browse/TIKA-2267
>             Project: Tika
>          Issue Type: Improvement
>          Components: tika-eval
>            Reporter: Tim Allison
>            Assignee: Tim Allison
>            Priority: Minor
>             Fix For: 2.0, 1.15
>
>
> We should add some common tokens files for popular languages for tika-eval so that users don't have to generate their own.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)