zip in solr

Jörg Agatz
Hello,
I don't know how to index ZIP documents. Rich text, PDF, and Office
documents work pretty well, but from ZIP files I only get the names of the
zipped documents, not their content.
Maybe I have to do something differently when indexing ZIP files, but I have
read that Tika can read ZIP, JAR, and so on.

My configuration is:

One PC has Solr and Tika installed. Another PC acts as the crawler and sends
documents with curl, like this:
curl "http://192.168.105.66:8983/solr/update/extract?literal.id=zip&uprefix=attr_&commit=true" -F "myfile=@file.zip"
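
To make the request parameters easier to read, here is a minimal shell sketch of the same upload. The helper names build_extract_url and post_file are hypothetical and only for illustration; the host, handler path, and parameters are the ones from the command above. It assumes the id contains no characters that need URL encoding.

```shell
#!/bin/sh
# Base URL of the extract handler, taken from the command above.
SOLR_EXTRACT_URL="http://192.168.105.66:8983/solr/update/extract"

# build_extract_url ID (hypothetical helper)
# Prints the request URL with the three parameters separated by "&".
# Assumption: ID needs no URL encoding.
build_extract_url() {
    printf '%s?literal.id=%s&uprefix=attr_&commit=true\n' "$SOLR_EXTRACT_URL" "$1"
}

# post_file ID FILE (hypothetical helper)
# Uploads FILE as a multipart form field named "myfile".
post_file() {
    curl "$(build_extract_url "$1")" -F "myfile=@$2"
}

# Usage (not executed here):
#   post_file zip file.zip
```

Keeping the URL in one quoted string matters, because an unquoted "&" would be interpreted by the shell instead of being passed to Solr.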