Tim Allison created TIKA-3271:
---------------------------------
Summary: Change default image resize size in TesseractParser's pre-processing step
Key: TIKA-3271
URL:
https://issues.apache.org/jira/browse/TIKA-3271 Project: Tika
Issue Type: Improvement
Reporter: Tim Allison
If users have ImageMagick installed and they select image preprocessing, one of the things we are currently doing is telling ImageMagick to expand the image by 900%. This _may_ make sense for small images..tbd...however, this can lead to massive files and dramatic increases in processing time.
At some point, we should probably increase the image size based on the initial image size, e.g. dynamic resizing.
Until then, for Tika 2.0.0, I propose that we change the default to 200%. This value is completely heuristic and not based on much data aside from Peter Kronenberg's work:
https://lists.apache.org/thread.html/rb1dece05760d10f1b165b03b97fef8b609dc40c4cd06bdb8cc36469d%40%3Cuser.tika.apache.org%3E--
This message was sent by Atlassian Jira
(v8.3.4#803005)