search pdf

classic Classic list List threaded Threaded
9 messages Options
Reply | Threaded
Open this post in threaded view
|

search pdf

Shajahan
Hi,
can i use Lucene for searching text in PDF.

Reply | Threaded
Open this post in threaded view
|

Re: search pdf

Malcolm Clark

Hi,
You have to parse/index the PDF files and then you can search  the index
with Lucene.
Have a look at Lucene in Action and the source code which comes with
it.There is a good demo which parses common formats such as PDF,Word XML
etc.
Cheers,
MC


---------------------------------------------------------------------
To unsubscribe, e-mail: [hidden email]
For additional commands, e-mail: [hidden email]

Reply | Threaded
Open this post in threaded view
|

Re: search pdf

Erik Hatcher
In reply to this post by Shajahan

On Apr 16, 2006, at 10:04 AM, Shajahan wrote:
> can i use Lucene for searching text in PDF.
>

Yes, indirectly.  The PDF must be parsed into the text to be indexed  
first.  PDFBox does this nicely.  Check the Lucene in Action codebase  
for examples.

        Erik


---------------------------------------------------------------------
To unsubscribe, e-mail: [hidden email]
For additional commands, e-mail: [hidden email]

Reply | Threaded
Open this post in threaded view
|

Re: search pdf

Shajahan
In reply to this post by Malcolm Clark
Hi,
Thank you for your replay.i am new to this Lucene & PDF. if u dontmin please tell me where i can get the demo file. please give me the URL.

and please tell me the Instaletion of that Lucene

Thankingyor,
Shajahan
Reply | Threaded
Open this post in threaded view
|

Re: search pdf

Karl Wettin-3

16 apr 2006 kl. 17.03 skrev Shajahan:

> Thank you for your replay.i am new to this Lucene & PDF. if u  
> dontmin please
> tell me where i can get the demo file. please give me the URL.

http://lucenebook.com/

> and please tell me the Instaletion of that Lucene

I'm not sure I understand your question. How to install Lucene?  
Lucene is not something you install and run. It is an API and you  
have to do a bit of coding in order for Lucene to do something.  
Perhaps Egothor (another open search engine written in Java) suits  
better if you just want it working out of the box.

---------------------------------------------------------------------
To unsubscribe, e-mail: [hidden email]
For additional commands, e-mail: [hidden email]

Reply | Threaded
Open this post in threaded view
|

Re: search pdf

Malcolm Clark
In reply to this post by Shajahan
URL for all the source code:

http://www.lucenebook.com/LuceneInAction.zip

---------------------------------------------------------------------
To unsubscribe, e-mail: [hidden email]
For additional commands, e-mail: [hidden email]

Reply | Threaded
Open this post in threaded view
|

Re: search pdf

Shajahan
Hi,
thankyou for your replay.
i am very sorry for asking again, but i am new to this Lucene. please tell me how to run this code. i downloaded this LuceneInAction zip file. and i didnot find any readme file for instructions. and i am also downloaded the lucene-1.4.3 also.

so please tell me how to run this code.

thanking you,
Shajahan
Reply | Threaded
Open this post in threaded view
|

RE: search pdf

Aditya Liviandi-2

Please take a moment to learn java and how to use java APIs.

After that, re-read the emails you just sent us, and answer your own
question.

-----Original Message-----
From: Shajahan [mailto:[hidden email]]
Sent: Monday, April 17, 2006 2:22 PM
To: [hidden email]
Subject: Re: search pdf


Hi,
thankyou for your replay.
i am very sorry for asking again, but i am new to this Lucene. please
tell
me how to run this code. i downloaded this LuceneInAction zip file. and
i
didnot find any readme file for instructions. and i am also downloaded
the
lucene-1.4.3 also.

so please tell me how to run this code.

thanking you,
Shajahan
--
View this message in context:
http://www.nabble.com/search-pdf-t1457831.html#a3946467
Sent from the Lucene - Java Users forum at Nabble.com.


---------------------------------------------------------------------
To unsubscribe, e-mail: [hidden email]
For additional commands, e-mail: [hidden email]


------------ Institute For Infocomm Research - Disclaimer -------------
This email is confidential and may be privileged.  If you are not the intended recipient, please delete it and notify us immediately. Please do not copy or use it for any purpose, or disclose its contents to any other person. Thank you.
--------------------------------------------------------

---------------------------------------------------------------------
To unsubscribe, e-mail: [hidden email]
For additional commands, e-mail: [hidden email]

Reply | Threaded
Open this post in threaded view
|

Re: search pdf

Erik Hatcher
In reply to this post by Shajahan
There _is_ a README file at the root of the unzipped Lucene In Action  
code.   It does require some basic Java and Ant know-how.

        Erik


On Apr 17, 2006, at 2:21 AM, Shajahan wrote:

>
> Hi,
> thankyou for your replay.
> i am very sorry for asking again, but i am new to this Lucene.  
> please tell
> me how to run this code. i downloaded this LuceneInAction zip file.  
> and i
> didnot find any readme file for instructions. and i am also  
> downloaded the
> lucene-1.4.3 also.
>
> so please tell me how to run this code.
>
> thanking you,
> Shajahan
> --
> View this message in context: http://www.nabble.com/search-pdf- 
> t1457831.html#a3946467
> Sent from the Lucene - Java Users forum at Nabble.com.
>
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: [hidden email]
> For additional commands, e-mail: [hidden email]


---------------------------------------------------------------------
To unsubscribe, e-mail: [hidden email]
For additional commands, e-mail: [hidden email]