Apache Tika - Development

This forum is an archive for the mailing list tika-dev@lucene.apache.org (more options) Messages posted here will be sent to this mailing list.
Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.
1234567 ... 726
Topics (25381)
Replies Last Post Views
[jira] [Commented] (TIKA-3260) Update rotation.py to work with python3 and a more modern matplotlib by Bilahari T H (Jira)
0
by Bilahari T H (Jira)
[jira] [Commented] (TIKA-3260) Update rotation.py to work with python3 and a more modern matplotlib by Bilahari T H (Jira)
0
by Bilahari T H (Jira)
[jira] [Created] (TIKA-3265) Tika 2.0.0 -- improvements to image preprocessing in TesseractOCRParser by Bilahari T H (Jira)
0
by Bilahari T H (Jira)
[jira] [Commented] (TIKA-3260) Update rotation.py to work with python3 and a more modern matplotlib by Bilahari T H (Jira)
0
by Bilahari T H (Jira)
[jira] [Commented] (TIKA-3258) Run OCR on PDFs with 'auto' mode as default in Tika 2.0.0 by Bilahari T H (Jira)
0
by Bilahari T H (Jira)
[jira] [Comment Edited] (TIKA-3258) Run OCR on PDFs with 'auto' mode as default in Tika 2.0.0 by Bilahari T H (Jira)
0
by Bilahari T H (Jira)
[jira] [Created] (TIKA-3264) Improve the per page OCR heuristics for AUTO mode by Bilahari T H (Jira)
0
by Bilahari T H (Jira)
[jira] [Commented] (TIKA-3258) Run OCR on PDFs with 'auto' mode as default in Tika 2.0.0 by Bilahari T H (Jira)
0
by Bilahari T H (Jira)
[jira] [Commented] (TIKA-3258) Run OCR on PDFs with 'auto' mode as default in Tika 2.0.0 by Bilahari T H (Jira)
0
by Bilahari T H (Jira)
[jira] [Commented] (TIKA-3258) Run OCR on PDFs with 'auto' mode as default in Tika 2.0.0 by Bilahari T H (Jira)
0
by Bilahari T H (Jira)
[jira] [Comment Edited] (TIKA-3258) Run OCR on PDFs with 'auto' mode as default in Tika 2.0.0 by Bilahari T H (Jira)
0
by Bilahari T H (Jira)
[jira] [Comment Edited] (TIKA-3258) Run OCR on PDFs with 'auto' mode as default in Tika 2.0.0 by Bilahari T H (Jira)
0
by Bilahari T H (Jira)
[jira] [Commented] (TIKA-3258) Run OCR on PDFs with 'auto' mode as default in Tika 2.0.0 by Bilahari T H (Jira)
0
by Bilahari T H (Jira)
[jira] [Commented] (TIKA-3258) Run OCR on PDFs with 'auto' mode as default in Tika 2.0.0 by Bilahari T H (Jira)
0
by Bilahari T H (Jira)
[jira] [Commented] (TIKA-3258) Run OCR on PDFs with 'auto' mode as default in Tika 2.0.0 by Bilahari T H (Jira)
0
by Bilahari T H (Jira)
[jira] [Comment Edited] (TIKA-3258) Run OCR on PDFs with 'auto' mode as default in Tika 2.0.0 by Bilahari T H (Jira)
0
by Bilahari T H (Jira)
[jira] [Commented] (TIKA-3258) Run OCR on PDFs with 'auto' mode as default in Tika 2.0.0 by Bilahari T H (Jira)
0
by Bilahari T H (Jira)
[jira] [Commented] (TIKA-3263) WriteLimitReachedException is not public by Bilahari T H (Jira)
0
by Bilahari T H (Jira)
[jira] [Commented] (TIKA-3260) Update rotation.py to work with python3 and a more modern matplotlib by Bilahari T H (Jira)
0
by Bilahari T H (Jira)
[jira] [Comment Edited] (TIKA-3260) Update rotation.py to work with python3 and a more modern matplotlib by Bilahari T H (Jira)
0
by Bilahari T H (Jira)
[jira] [Commented] (TIKA-3260) Update rotation.py to work with python3 and a more modern matplotlib by Bilahari T H (Jira)
0
by Bilahari T H (Jira)
[jira] [Updated] (TIKA-3263) WriteLimitReachedException is not public by Bilahari T H (Jira)
0
by Bilahari T H (Jira)
[jira] [Updated] (TIKA-3263) WriteLimitReachedException is not public by Bilahari T H (Jira)
0
by Bilahari T H (Jira)
[jira] [Commented] (TIKA-3263) WriteLimitReachedException is not public by Bilahari T H (Jira)
0
by Bilahari T H (Jira)
[jira] [Commented] (TIKA-3262) Undo reverse ClassLoader sort in Tika 2.0.0 by Bilahari T H (Jira)
0
by Bilahari T H (Jira)
[jira] [Commented] (TIKA-3259) Improve logging for TesseractOCRParser by Bilahari T H (Jira)
0
by Bilahari T H (Jira)
[jira] [Commented] (TIKA-3260) Update rotation.py to work with python3 and a more modern matplotlib by Bilahari T H (Jira)
0
by Bilahari T H (Jira)
jira->dev list down? by Tim Allison
0
by Tim Allison
[jira] [Commented] (TIKA-3260) Update rotation.py to work with python3 and a more modern matplotlib by Bilahari T H (Jira)
0
by Bilahari T H (Jira)
[jira] [Commented] (TIKA-3263) WriteLimitReachedException is not public by Bilahari T H (Jira)
0
by Bilahari T H (Jira)
[jira] [Created] (TIKA-3263) WriteLimitReachedException is not public by Bilahari T H (Jira)
0
by Bilahari T H (Jira)
[jira] [Resolved] (TIKA-3259) Improve logging for TesseractOCRParser by Bilahari T H (Jira)
0
by Bilahari T H (Jira)
[jira] [Commented] (TIKA-3260) Update rotation.py to work with python3 and a more modern matplotlib by Bilahari T H (Jira)
0
by Bilahari T H (Jira)
[jira] [Resolved] (TIKA-2548) Add Python Path configuration to TesseractOCRParser by Bilahari T H (Jira)
0
by Bilahari T H (Jira)
[jira] [Created] (TIKA-3262) Undo reverse ClassLoader sort in Tika 2.0.0 by Bilahari T H (Jira)
0
by Bilahari T H (Jira)
1234567 ... 726