Using Lucene 2.3.0 with PDFBox

Naman Gupta
Hey

I am having a problem using PDFBox to parse PDF files and convert
them to Lucene Documents with the following statement.

Document doc = LucenePDFDocument.getDocument( file );

PDFBox calls a static method of the 'Field' class that is available
in Lucene 1.4.3:
                 Field.UnIndexed("path", file.getPath())
In later versions it has been removed. Please suggest a way to
integrate PDFBox with Lucene 2.3.0.

Thanks

Naman K Gupta

---------------------------------------------------------------------
To unsubscribe, e-mail: [hidden email]
For additional commands, e-mail: [hidden email]

Re: Using Lucene 2.3.0 with PDFBox

Jan Peter Stotz
Naman Gupta wrote:

> PDF Box uses a particular function of the Object 'Field' which is only
> there in the lucene 1.4.3.
>                  *Field.UnIndexed("path", file.getPath() )

This statement should be a good replacement:

new Field("path", file.getPath(), Field.Store.YES,
          Field.Index.UN_TOKENIZED);
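
For context, here is a minimal sketch of how the replacement could be used, assuming Lucene 2.3.0 is on the classpath. The class name PathFieldExample and the helpers storedOnly and storedKeyword are hypothetical names chosen for illustration; they are not part of Lucene or PDFBox. As far as I recall, the old Field.UnIndexed stored the value without indexing it, so Field.Index.NO should be the closest match, while Field.Index.UN_TOKENIZED also indexes the path as a single term so it can be searched.

```java
import java.io.File;

import org.apache.lucene.document.Document;
import org.apache.lucene.document.Field;

// Hypothetical example class; only Document and Field come from Lucene.
public class PathFieldExample {

    // Stored but not indexed: the value is returned with hits but cannot
    // be searched. Assumed to be the closest match to Field.UnIndexed.
    static Field storedOnly(String name, String value) {
        return new Field(name, value, Field.Store.YES, Field.Index.NO);
    }

    // Stored and indexed as one untokenized term: the value is returned
    // with hits and can also be matched exactly in a query.
    static Field storedKeyword(String name, String value) {
        return new Field(name, value, Field.Store.YES, Field.Index.UN_TOKENIZED);
    }

    public static void main(String[] args) {
        // The default file name is a placeholder for illustration.
        File file = new File(args.length > 0 ? args[0] : "example.pdf");

        Document doc = new Document();
        doc.add(storedOnly("path", file.getPath()));

        // Stored fields can be read back from the Document by name.
        System.out.println(doc.get("path"));
    }
}
```

Either variant keeps the path retrievable from search results. Which one fits depends on whether you need to look documents up by their path later.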

Jan
