In ApacheCon US I made a Fast Feather Track presentation on Tika, and
I'd like to do the same again in ApacheCon EU next week. Unless anyone
else wants to take the stage, I'll propose a Tika presentation to the
FFT crew later today.
> On Mon, Mar 31, 2008 at 9:54 AM, Jukka Zitting <[hidden email]>
>> ....Unless anyone
>> else wants to take the stage, I'll propose a Tika presentation to the
>> FFT crew later today....
> Please do!
Chris Mattmann, Ph.D.
[hidden email] Cognizant Development Engineer
Early Detection Research Network Project
Jet Propulsion Laboratory Pasadena, CA
Office: 171-266B Mailstop: 171-246
Disclaimer: The opinions presented within are my own and do not reflect
those of either NASA, JPL, or the California Institute of Technology.
On undefined, Jukka Zitting <[hidden email]> wrote:
> In ApacheCon US I made a Fast Feather Track presentation on Tika, and
> I'd like to do the same again in ApacheCon EU next week.
My proposal (see below) was accepted and is initially scheduled for
MIME Magic with Apache Tika
Apache Tika is the modern version of the Unix file(1) and strings(1)
commands; a toolkit that can extract the MIME type, full text content,
and various metadata from unidentified octet streams. An alternative
to dealing with multiple incompatible parsers APIs, Tika provides a
simple and unified interface to handling a number of file formats and
MIME content types. This presentation is an entry-level introduction to
the (currently incubating) Apache Tika toolkit.