Apache Tika - Development

This forum is an archive for the mailing list tika-dev@lucene.apache.org. Messages posted here will be sent to this mailing list.
Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.
Topics (25381)
Topic | Replies | Last Post
droste.zip by Tim Allison | 1 | by Tim Allison
OCR testing by Peter Kronenberg | 1 | by Tim Allison
OCR Testing by Peter Kronenberg | 0 | by Peter Kronenberg
[jira] [Commented] (TIKA-3273) Further metadata cleanup for TIka 2.0.0 by Bilahari T H (Jira) | 0 | by Bilahari T H (Jira)
[jira] [Commented] (TIKA-3270) Render non-text in PDFs for OCR by Bilahari T H (Jira) | 0 | by Bilahari T H (Jira)
[jira] [Resolved] (TIKA-3273) Further metadata cleanup for TIka 2.0.0 by Bilahari T H (Jira) | 0 | by Bilahari T H (Jira)
[jira] [Created] (TIKA-3274) Tika 2.0.0 -- Move parser specific metadata out of tika-core to parser modules by Bilahari T H (Jira) | 0 | by Bilahari T H (Jira)
[jira] [Updated] (TIKA-3273) Further metadata cleanup for TIka 2.0.0 by Bilahari T H (Jira) | 0 | by Bilahari T H (Jira)
[jira] [Created] (TIKA-3273) Further metadat cleanup for TIka 2.0.0 by Bilahari T H (Jira) | 0 | by Bilahari T H (Jira)
[jira] [Created] (TIKA-3272) Improve Rotation handling by Bilahari T H (Jira) | 0 | by Bilahari T H (Jira)
[jira] [Commented] (TIKA-3267) Method getEnableImageProcessing() in TesseractOCRConfig should be renamed by Bilahari T H (Jira) | 0 | by Bilahari T H (Jira)
[jira] [Resolved] (TIKA-3267) Method getEnableImageProcessing() in TesseractOCRConfig should be renamed by Bilahari T H (Jira) | 0 | by Bilahari T H (Jira)
[jira] [Commented] (TIKA-3267) Method getEnableImageProcessing() in TesseractOCRConfig should be renamed by Bilahari T H (Jira) | 0 | by Bilahari T H (Jira)
[jira] [Commented] (TIKA-3270) Render non-text in PDFs for OCR by Bilahari T H (Jira) | 0 | by Bilahari T H (Jira)
[jira] [Commented] (TIKA-3270) Render non-text in PDFs for OCR by Bilahari T H (Jira) | 0 | by Bilahari T H (Jira)
[jira] [Commented] (TIKA-3270) Render non-text in PDFs for OCR by Bilahari T H (Jira) | 0 | by Bilahari T H (Jira)
[jira] [Updated] (TIKA-3270) Render non-text in PDFs for OCR by Bilahari T H (Jira) | 0 | by Bilahari T H (Jira)
[jira] [Commented] (TIKA-3270) Render non-text in PDFs for OCR by Bilahari T H (Jira) | 0 | by Bilahari T H (Jira)
[jira] [Commented] (TIKA-3270) Render non-text in PDFs for OCR by Bilahari T H (Jira) | 0 | by Bilahari T H (Jira)
[jira] [Commented] (TIKA-3271) Change default image resize size in TesseractParser's pre-processing step by Bilahari T H (Jira) | 0 | by Bilahari T H (Jira)
[jira] [Created] (TIKA-3271) Change default image resize size in TesseractParser's pre-processing step by Bilahari T H (Jira) | 0 | by Bilahari T H (Jira)
[jira] [Commented] (TIKA-3258) Run OCR on PDFs with 'auto' mode as default in Tika 2.0.0 by Bilahari T H (Jira) | 0 | by Bilahari T H (Jira)
[jira] [Created] (TIKA-3270) Render non-text in PDFs for OCR by Bilahari T H (Jira) | 0 | by Bilahari T H (Jira)
[jira] [Commented] (TIKA-3258) Run OCR on PDFs with 'auto' mode as default in Tika 2.0.0 by Bilahari T H (Jira) | 0 | by Bilahari T H (Jira)
[jira] [Commented] (TIKA-3269) Update artifact releases for 2.0.0 by Bilahari T H (Jira) | 0 | by Bilahari T H (Jira)
[jira] [Commented] (TIKA-3258) Run OCR on PDFs with 'auto' mode as default in Tika 2.0.0 by Bilahari T H (Jira) | 0 | by Bilahari T H (Jira)
[jira] [Comment Edited] (TIKA-3258) Run OCR on PDFs with 'auto' mode as default in Tika 2.0.0 by Bilahari T H (Jira) | 0 | by Bilahari T H (Jira)
[jira] [Commented] (TIKA-3258) Run OCR on PDFs with 'auto' mode as default in Tika 2.0.0 by Bilahari T H (Jira) | 0 | by Bilahari T H (Jira)
[jira] [Commented] (TIKA-3269) Update artifact releases for 2.0.0 by Bilahari T H (Jira) | 0 | by Bilahari T H (Jira)
[jira] [Created] (TIKA-3269) Update artifact releases for 2.0.0 by Bilahari T H (Jira) | 0 | by Bilahari T H (Jira)
[jira] [Resolved] (TIKA-3178) Tika 2.0.0 -- Add back OSGi bundles for Tika parsers by Bilahari T H (Jira) | 0 | by Bilahari T H (Jira)
[jira] [Commented] (TIKA-3266) Generalize OCRParser so that users can service load custom ocr parsers by Bilahari T H (Jira) | 0 | by Bilahari T H (Jira)
2.0.0-ALPHA? by Tim Allison | 0 | by Tim Allison
[jira] [Resolved] (TIKA-3266) Generalize OCRParser so that users can service load custom ocr parsers by Bilahari T H (Jira) | 0 | by Bilahari T H (Jira)
[jira] [Commented] (TIKA-3266) Generalize OCRParser so that users can service load custom ocr parsers by Bilahari T H (Jira) | 0 | by Bilahari T H (Jira)