Databases


manjula wijewickrema
Hi,

Normally, when I build my index directory, I simply keep the files to be
indexed in a directory called 'filesToIndex'. So in this case, I do not use
any standard database management system such as MySQL.

1) Is it possible to use MySQL or another database to manage the indexed
documents in Lucene?

2) Is it necessary to follow that kind of approach with Lucene?

3) If we do not use a database management system, will there be any
disadvantages with a large number of indexed files?

I would appreciate any reply.
Thanks,
Manjula.

Re: Databases

Glen Newton
LuSql is a tool designed specifically for extracting data from
JDBC-accessible databases and indexing the contents in Lucene.
You can find it here:
 http://lab.cisti-icist.nrc-cnrc.gc.ca/cistilabswiki/index.php/LuSql
User manual:
 http://cuvier.cisti.nrc.ca/~gnewton/lusql/v0.9/lusqlManual.pdf.html

A new version is coming out in the next month, but the existing one
should be fine for what you have described.
If you have any questions, just let me know.

Note that if you are interested in using Solr for your application,
the data import handler (DIH) is a very flexible way of doing what you
are describing, in a Solr context.
http://wiki.apache.org/solr/DataImportHandler

Thanks,
-Glen Newton
LuSql author
http://zzzoot.blogspot.com/

In reply to manjula wijewickrema's post of 23 July 2010, 15:46.

---------------------------------------------------------------------
To unsubscribe, e-mail: [hidden email]
For additional commands, e-mail: [hidden email]


Re: Databases

chrislusf
In reply to this post by manjula wijewickrema
3) It sounds like you want to use Lucene for storage, without a database
such as MySQL. That may work, but it makes later data management harder.
1) and 2) You can use MySQL as the main storage and pull data out of it to
create Lucene indexes. Pay attention to incremental changes: it is a
continuous process, not a one-time data import. Otherwise, you would have
to put a hook in your program to write new content to the index. Either
way, you can get it to work, but it may not be as simple as you expected.
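
The "pull incremental changes" idea can be sketched roughly as below. This is only an illustration under stated assumptions, not how any particular tool does it: sqlite3 stands in for MySQL so the sketch runs with no server, the `TinyIndex` class is a toy stand-in for a Lucene index, and the `docs` table with its `version` and `deleted` columns is a hypothetical schema. With real Lucene, the update step would more likely go through something like `IndexWriter.updateDocument`.

```python
import sqlite3
from collections import defaultdict


class TinyIndex:
    """Toy inverted index standing in for a Lucene index (hypothetical helper)."""

    def __init__(self):
        self.postings = defaultdict(set)  # term -> set of document ids
        self.docs = {}                    # document id -> text currently indexed

    def delete(self, doc_id):
        # Remove every posting that the previously indexed text produced.
        old = self.docs.pop(doc_id, None)
        if old is None:
            return
        for term in old.lower().split():
            self.postings[term].discard(doc_id)

    def update(self, doc_id, text):
        # Delete-then-add, so a changed row never leaves stale terms behind.
        self.delete(doc_id)
        self.docs[doc_id] = text
        for term in text.lower().split():
            self.postings[term].add(doc_id)

    def search(self, term):
        return sorted(self.postings.get(term.lower(), ()))


def sync(conn, index, last_version):
    """Apply rows changed since last_version; return the new high-water mark."""
    rows = conn.execute(
        "SELECT id, body, deleted, version FROM docs "
        "WHERE version > ? ORDER BY version",
        (last_version,),
    ).fetchall()
    for doc_id, body, deleted, version in rows:
        if deleted:
            index.delete(doc_id)
        else:
            index.update(doc_id, body)
        last_version = version
    return last_version


# Assumed schema: each write bumps `version`; deletes are soft (flag only),
# so the indexer can still see that a row went away.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE docs (id INTEGER PRIMARY KEY, body TEXT, "
    "deleted INTEGER DEFAULT 0, version INTEGER)"
)
conn.execute("INSERT INTO docs (id, body, version) VALUES (1, 'lucene index basics', 1)")
conn.execute("INSERT INTO docs (id, body, version) VALUES (2, 'mysql storage notes', 2)")

index = TinyIndex()
mark = sync(conn, index, 0)  # first run behaves like a full import

# Later changes in the database: one row edited, one row soft-deleted.
conn.execute("UPDATE docs SET body = 'lucene incremental updates', version = 3 WHERE id = 1")
conn.execute("UPDATE docs SET deleted = 1, version = 4 WHERE id = 2")
mark = sync(conn, index, mark)  # second run only touches the two changed rows
```

The database stays the source of truth and the index only remembers a high-water mark, so the same `sync` call can be run on a schedule. A version counter is used here instead of timestamps simply to avoid clock issues in the sketch.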

--
Chris Lu
-------------------------
Instant Scalable Full-Text Search On Any Database/Application
site: http://www.dbsight.net
demo: http://search.dbsight.com
Lucene Database Search in 3 minutes:
http://wiki.dbsight.com/index.php?title=Create_Lucene_Database_Search_in_3_minutes 

DBSight customer, a shopping comparison site, (anonymous per request)
got 2.6 Million Euro funding!




Re: Databases

tarunsapra
In reply to this post by manjula wijewickrema
You can use Hibernate Search to keep the Lucene index synchronized with
the MySQL RDBMS.

--
Thanks & Regards
Tarun Sapra

Re: Databases

manjula wijewickrema
Hi,

Thanks a lot for your information.

Regards,
Manjula.

In reply to tarun sapra's post of Fri, Jul 23, 2010, 12:48 PM.