---------- Forwarded message ----------
From: Jukka Zitting <[hidden email]>
Date: Wed, Dec 2, 2009 at 7:28 PM
Subject: Re: a 'lite' version of ooxml-schemas jar
To: POI Developers List <[hidden email]>
On Tue, Nov 24, 2009 at 11:02 AM, Yegor Kozlov <[hidden email]> wrote:
> For Maven this change is transparent - POM for the poi-ooxml module depends
> on poi-ooxml-schemas instead of ooxml-schemas, this means Maven users will
> only need to update the version of POI from 3.5-FINAL to 3.6, the rest will
> be handled by Maven automatically.
I just had a chance to test this with Tika, and it works beautifully.
After upgrading to a POI 3.6-beta1-20091202 snapshot the size of the
tika-app jar dropped from 25MB to 15MB. That's a major improvement,
thanks! I can't wait for the next POI release.
The only odd thing about the upgrade was that I needed to comment out
a piece of Tika extraction code that uses the
class as returned from XWPFParagraph.getCTP().getBookmarkStartArray().
It looks like that class is not included in the poi-ooxml-schemas jar
even though the CTP class with the getBookmarkStartArray() method is