Apache Tika - Development

This forum is an archive for the mailing list tika-dev@lucene.apache.org. Messages posted here will be sent to this mailing list.
Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.
Topics (25381)
Replies Last Post Views
[jira] [Commented] (TIKA-3266) Generalize OCRParser so that users can service load custom ocr parsers by Bilahari T H (Jira)
0
by Bilahari T H (Jira)
[GitHub] [tika] tballison opened a new pull request #396: TIKA-3266 by GitBox
1
by GitBox
[jira] [Commented] (TIKA-3266) Generalize OCRParser so that users can service load custom ocr parsers by Bilahari T H (Jira)
0
by Bilahari T H (Jira)
[jira] [Commented] (TIKA-3244) General upgrades for 1.26 by Bilahari T H (Jira)
0
by Bilahari T H (Jira)
[jira] [Commented] (TIKA-3244) General upgrades for 1.26 by Bilahari T H (Jira)
0
by Bilahari T H (Jira)
[jira] [Commented] (TIKA-1735) Unsupported AutoCAD drawing version: AC1027 by Bilahari T H (Jira)
0
by Bilahari T H (Jira)
[jira] [Commented] (TIKA-1735) Unsupported AutoCAD drawing version: AC1027 by Bilahari T H (Jira)
0
by Bilahari T H (Jira)
[GitHub] [tika] nddipiazza opened a new pull request #395: TIKA-1735 - ac2017 and add ability to use dwgread if it is installed. by GitBox
0
by GitBox
[jira] [Commented] (TIKA-3266) Generalize OCRParser so that users can service load custom ocr parsers by Bilahari T H (Jira)
0
by Bilahari T H (Jira)
[jira] [Commented] (TIKA-1735) Unsupported AutoCAD drawing version: AC1027 by Bilahari T H (Jira)
0
by Bilahari T H (Jira)
[jira] [Commented] (TIKA-3258) Run OCR on PDFs with 'auto' mode as default in Tika 2.0.0 by Bilahari T H (Jira)
0
by Bilahari T H (Jira)
[jira] [Commented] (TIKA-3268) TikaConfig -- throw exception if exclude parser can't be loaded by Bilahari T H (Jira)
0
by Bilahari T H (Jira)
[jira] [Comment Edited] (TIKA-3258) Run OCR on PDFs with 'auto' mode as default in Tika 2.0.0 by Bilahari T H (Jira)
0
by Bilahari T H (Jira)
[jira] [Commented] (TIKA-3258) Run OCR on PDFs with 'auto' mode as default in Tika 2.0.0 by Bilahari T H (Jira)
0
by Bilahari T H (Jira)
[jira] [Created] (TIKA-3268) TikaConfig -- throw exception if exclude parser can't be loaded by Bilahari T H (Jira)
0
by Bilahari T H (Jira)
[jira] [Commented] (TIKA-3267) Method getEnableImageProcessing() in TesseractOCRConfig should be renamed by Bilahari T H (Jira)
0
by Bilahari T H (Jira)
[jira] [Commented] (TIKA-3267) Method getEnableImageProcessing() in TesseractOCRConfig should be renamed by Bilahari T H (Jira)
0
by Bilahari T H (Jira)
[jira] [Commented] (TIKA-3258) Run OCR on PDFs with 'auto' mode as default in Tika 2.0.0 by Bilahari T H (Jira)
0
by Bilahari T H (Jira)
[jira] [Created] (TIKA-3267) Method getEnableImageProcessing() in TesseractOCRConfig should be renamed by Bilahari T H (Jira)
0
by Bilahari T H (Jira)
[jira] [Comment Edited] (TIKA-3258) Run OCR on PDFs with 'auto' mode as default in Tika 2.0.0 by Bilahari T H (Jira)
0
by Bilahari T H (Jira)
[jira] [Comment Edited] (TIKA-3258) Run OCR on PDFs with 'auto' mode as default in Tika 2.0.0 by Bilahari T H (Jira)
0
by Bilahari T H (Jira)
[jira] [Commented] (TIKA-3258) Run OCR on PDFs with 'auto' mode as default in Tika 2.0.0 by Bilahari T H (Jira)
0
by Bilahari T H (Jira)
[jira] [Comment Edited] (TIKA-3258) Run OCR on PDFs with 'auto' mode as default in Tika 2.0.0 by Bilahari T H (Jira)
0
by Bilahari T H (Jira)
[jira] [Comment Edited] (TIKA-3258) Run OCR on PDFs with 'auto' mode as default in Tika 2.0.0 by Bilahari T H (Jira)
0
by Bilahari T H (Jira)
[jira] [Comment Edited] (TIKA-3258) Run OCR on PDFs with 'auto' mode as default in Tika 2.0.0 by Bilahari T H (Jira)
0
by Bilahari T H (Jira)
[jira] [Comment Edited] (TIKA-3258) Run OCR on PDFs with 'auto' mode as default in Tika 2.0.0 by Bilahari T H (Jira)
0
by Bilahari T H (Jira)
[jira] [Commented] (TIKA-3258) Run OCR on PDFs with 'auto' mode as default in Tika 2.0.0 by Bilahari T H (Jira)
0
by Bilahari T H (Jira)
[jira] [Commented] (TIKA-3258) Run OCR on PDFs with 'auto' mode as default in Tika 2.0.0 by Bilahari T H (Jira)
0
by Bilahari T H (Jira)
[jira] [Commented] (TIKA-3260) Update rotation.py to work with python3 and a more modern matplotlib by Bilahari T H (Jira)
0
by Bilahari T H (Jira)
[jira] [Commented] (TIKA-3260) Update rotation.py to work with python3 and a more modern matplotlib by Bilahari T H (Jira)
0
by Bilahari T H (Jira)
[jira] [Commented] (TIKA-3265) Tika 2.0.0 -- improvements to image preprocessing in TesseractOCRParser by Bilahari T H (Jira)
0
by Bilahari T H (Jira)
[jira] [Commented] (TIKA-3258) Run OCR on PDFs with 'auto' mode as default in Tika 2.0.0 by Bilahari T H (Jira)
0
by Bilahari T H (Jira)
[jira] [Created] (TIKA-3266) Generalize OCRParser so that users can service load custom ocr parsers by Bilahari T H (Jira)
0
by Bilahari T H (Jira)
[jira] [Commented] (TIKA-3258) Run OCR on PDFs with 'auto' mode as default in Tika 2.0.0 by Bilahari T H (Jira)
0
by Bilahari T H (Jira)
[jira] [Commented] (TIKA-3258) Run OCR on PDFs with 'auto' mode as default in Tika 2.0.0 by Bilahari T H (Jira)
0
by Bilahari T H (Jira)