Quantcast

[jira] [Commented] (TIKA-2267) Add common tokens files for tika-eval

Previous Topic Next Topic
 
classic Classic list List threaded Threaded
1 message Options
Reply | Threaded
Open this post in threaded view
|  
Report Content as Inappropriate

[jira] [Commented] (TIKA-2267) Add common tokens files for tika-eval

JIRA jira@apache.org

    [ https://issues.apache.org/jira/browse/TIKA-2267?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15872096#comment-15872096 ]

Hudson commented on TIKA-2267:
------------------------------

SUCCESS: Integrated in Jenkins build Tika-trunk #1203 (See [https://builds.apache.org/job/Tika-trunk/1203/])
TIKA-2267 -- add common tokens for some languages into tika-eval (tallison: rev 9cf8258975582200985b33e563978b3b7962cdc6)
* (edit) tika-eval/src/main/resources/common_tokens/es
* (add) tika-eval/src/main/resources/common_tokens/vi
* (add) tika-eval/src/main/resources/common_tokens/id
* (add) tika-eval/src/main/resources/common_tokens/it
* (add) tika-eval/src/main/resources/common_tokens/ru
* (add) tika-eval/src/main/resources/common_tokens/ur
* (add) tika-eval/src/main/resources/common_tokens/hi
* (add) tika-eval/src/main/resources/common_tokens/fa
* (add) tika-eval/src/main/resources/common_tokens/ko
* (add) tika-eval/src/main/resources/common_tokens/pt
* (add) tika-eval/src/main/resources/common_tokens/ar
* (add) tika-eval/src/main/resources/common_tokens/ja
* (edit) tika-eval/src/main/resources/common_tokens/en
* (add) tika-eval/src/main/resources/common_tokens/nl
* (add) tika-eval/src/main/resources/common_tokens/zh-tw
* (add) tika-eval/src/main/resources/common_tokens/fr
* (add) tika-eval/src/main/resources/common_tokens/zh-cn
* (add) tika-eval/src/main/resources/common_tokens/de
* (add) tika-eval/src/main/resources/common_tokens/el
* (add) tika-eval/src/main/resources/common_tokens/he


> Add common tokens files for tika-eval
> -------------------------------------
>
>                 Key: TIKA-2267
>                 URL: https://issues.apache.org/jira/browse/TIKA-2267
>             Project: Tika
>          Issue Type: Improvement
>          Components: tika-eval
>            Reporter: Tim Allison
>            Assignee: Tim Allison
>            Priority: Minor
>             Fix For: 2.0, 1.15
>
>
> We should add some common tokens files for popular languages for tika-eval so that users don't have to generate their own.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)
Loading...