CLOB Hadoop and Solr

Jon Morisi
Hi,
I'm doing some preliminary investigation and am wondering if anyone can provide guidance.
I have a lot of CLOB data in an Oracle database.  I also have a Hadoop cluster and am planning to install Solr (HDP Search).

What would be the best way to use Solr for indexing this data?  Sqoop it into Hive and index that?  Dump the CLOBs as individual text files and index those?

There seem to be a lot of options.  I'd like to avoid using the ClobTransformer directly against the Oracle DB.  I'd rather move the data to Hadoop and manage my full-text indexing there, so that the indexing doesn't put load on the DB.

Thanks,
Jon

Re: CLOB Hadoop and Solr

spoonerk
I keep trying to unsubscribe but I am still getting endless emails.




Re: CLOB Hadoop and Solr

Malcolm Upayavira Holmes
I will unsubscribe you both now.

Upayavira

On Fri, 9 Mar 2018, at 2:05 PM, Bob Carpenter wrote:

> Me, too.  Can someone fix the unsubscribe mechanism?
>
> Thanks.
>
>
> > On Mar 8, 2018, at 10:23 PM, john spooner <[hidden email]> wrote:
> >
> > I keep trying to unsubscribe but I am still getting endless emails.