Passing Metadata from an RTF-file via TIKA to SOLR ...
I am quite new to Lucene/Solr/Tika, etc., so I would appreciate your help
concerning the following matter.
I have an RTF document that I want to index in Solr, using Tika.
The RTF-indexing works in general, but since I changed the Solr-schema,
the indexer complains about missing mandatory fields, like "module-id".
I generate the RTF file myself, and I added the metadata fields to the
"userprops" section of the RTF file (see below), so Tika should be able
to read them and provide them.
The problem is: I don't know HOW or WHERE Tika provides this metadata, so
I don't know how to access it. As a result, I don't know how to map it
to the respective Solr fields, like "module-id", that are mandatory in my
schema.
Can someone give me a hint, please?
I am running out of ideas here ... :-/
Re: Metadata passed with CURL (via literal) is not recognized by SOLR ...?
Ok, I found the solution myself.
The reason for this behaviour was the "lowernames = true" setting of the
Tika request handler, which transformed "module-id" to "module_id".
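
To illustrate the renaming, here is a minimal Python sketch of what "lowernames = true" appears to do to a field name: letters are lowercased and every character that is not a letter or digit becomes an underscore. This is only an approximation written for illustration (the function name `lowernames` is mine, not part of Solr), so check the manual for the exact rules of your Solr version.

```python
def lowernames(field_name: str) -> str:
    """Approximate the lowernames=true mapping (illustrative, not Solr code).

    Letters and digits are kept in lowercase; any other character,
    such as '-', is replaced by '_'.
    """
    return "".join(ch.lower() if ch.isalnum() else "_" for ch in field_name)


if __name__ == "__main__":
    for name in ("module-id", "Content-Type"):
        print(name, "->", lowernames(name))
    # module-id -> module_id
    # Content-Type -> content_type
```

So a schema field called "module-id" never receives a value, because the value arrives under the name "module_id".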
I added a fitting copyField to my schema and it seems to work now.
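
As a possible alternative to the copyField (not what I actually did), the extract handler also accepts "fmap.<source>=<target>" request parameters, which are documented as being applied after the lowernames step. The sketch below only builds such a request URL; the base URL, the document id and the value "M42" are made-up placeholders, and whether the mapping also applies to literal values in your Solr version is an assumption you should verify.

```python
from urllib.parse import urlencode


def build_extract_url(base_url: str, doc_id: str, module_id: str) -> str:
    """Build an /update/extract URL (sketch; all values are placeholders)."""
    params = {
        "literal.id": doc_id,            # unique key passed as a literal
        "literal.module-id": module_id,  # mandatory field passed as a literal
        "lowernames": "true",            # would turn "module-id" into "module_id"
        "fmap.module_id": "module-id",   # assumption: maps the name back again
        "commit": "true",
    }
    return f"{base_url}/update/extract?{urlencode(params)}"


if __name__ == "__main__":
    # Hypothetical local Solr instance; adjust to your own setup.
    print(build_extract_url("http://localhost:8983/solr", "doc1", "M42"))
```

The resulting URL can then be used with curl, sending the RTF file as the request body.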
Maybe this information is useful for someone ... of course, it is
mentioned in the manual, but finding it is the problem if you don't know
what you are looking for. ;)
With kind regards
Systems Engineering Cluster Instruments
Continental Automotive GmbH
ID S3 RM
VDO-Strasse 1, 64832 Babenhausen, Germany