What is the best choice: nutch/lucene or nutch/solr?

classic Classic list List threaded Threaded
2 messages Options
Reply | Threaded
Open this post in threaded view
|

What is the best choice: nutch/lucene or nutch/solr?

clusterboy
I am going over mailing list and still didn't find an answer.

For a project, I need to crawl the web, index it and merge that content with
another site's content which is stored inside the key-value storage system.

What is the best approach to merge these two contents in to a lucene index,
solr index or keep the index separate but merge during the search query
results?
Reply | Threaded
Open this post in threaded view
|

Re: What is the best choice: nutch/lucene or nutch/solr?

Otis Gospodnetic-2-2
Sounds like Nutch for crawling to gather the data, custom tools to read the gathered data, call the KV store, construct SolrInputDocuments, and index those to Solr.  If you want Solr and not Lucene, which is a bigger question that I can't answer without knowing the details.

 Otis
--
Sematext -- http://sematext.com/ -- Solr - Lucene - Nutch



----- Original Message ----

> From: Mr Hadoop <[hidden email]>
> To: [hidden email]
> Sent: Fri, December 4, 2009 2:51:47 PM
> Subject: What is the best choice: nutch/lucene or nutch/solr?
>
> I am going over mailing list and still didn't find an answer.
>
> For a project, I need to crawl the web, index it and merge that content with
> another site's content which is stored inside the key-value storage system.
>
> What is the best approach to merge these two contents in to a lucene index,
> solr index or keep the index separate but merge during the search query
> results?