How does remote streaming works for xml files?

classic Classic list List threaded Threaded
4 messages Options
Reply | Threaded
Open this post in threaded view
|

How does remote streaming works for xml files?

Leonardo Santagada
With most of the default solrconfig.xml and setting:

   <requestDispatcher handleSelect="true" >
     <!--Make sure your system has some authentication before enabling  
remote streaming!  -->
     <requestParsers enableRemoteStreaming="true"  
multipartUploadLimitInKB="2048" />
   </requestDispatcher>

I think I should be able to ask solr to index a file on the  
filesystem, but how does it work? I tried GET on the update url with  
stream.file parameter and it didn't seem to work. My file is in the  
said place and it has one document, so I don't know what I am doing  
wrong.

Here is my error:

Jan 31, 2008 9:39:01 PM org.apache.solr.core.SolrException log
SEVERE: org.apache.solr.core.SolrException: missing content stream
        at  
org
.apache
.solr
.handler
.XmlUpdateRequestHandler
.handleRequestBody(XmlUpdateRequestHandler.java:75)
        at  
org
.apache
.solr.handler.RequestHandlerBase.handleRequest(RequestHandlerBase.java:
77)
        at org.apache.solr.core.SolrCore.execute(SolrCore.java:658)
        at  
org
.apache
.solr.servlet.SolrDispatchFilter.execute(SolrDispatchFilter.java:191)
        at  
org
.apache
.solr.servlet.SolrDispatchFilter.doFilter(SolrDispatchFilter.java:159)
        at org.mortbay.jetty.servlet.ServletHandler
$CachedChain.doFilter(ServletHandler.java:1089)
        at  
org.mortbay.jetty.servlet.ServletHandler.handle(ServletHandler.java:365)
        at  
org.mortbay.jetty.security.SecurityHandler.handle(SecurityHandler.java:
216)
        at  
org.mortbay.jetty.servlet.SessionHandler.handle(SessionHandler.java:181)
        at  
org.mortbay.jetty.handler.ContextHandler.handle(ContextHandler.java:712)
        at org.mortbay.jetty.webapp.WebAppContext.handle(WebAppContext.java:
405)
        at  
org
.mortbay
.jetty
.handler.ContextHandlerCollection.handle(ContextHandlerCollection.java:
211)
        at  
org
.mortbay.jetty.handler.HandlerCollection.handle(HandlerCollection.java:
114)
        at  
org.mortbay.jetty.handler.HandlerWrapper.handle(HandlerWrapper.java:139)
        at org.mortbay.jetty.Server.handle(Server.java:285)
        at org.mortbay.jetty.HttpConnection.handleRequest(HttpConnection.java:
502)
        at org.mortbay.jetty.HttpConnection
$RequestHandler.headerComplete(HttpConnection.java:821)
        at org.mortbay.jetty.HttpParser.parseNext(HttpParser.java:513)
        at org.mortbay.jetty.HttpParser.parseAvailable(HttpParser.java:208)
        at org.mortbay.jetty.HttpConnection.handle(HttpConnection.java:378)
        at org.mortbay.jetty.bio.SocketConnector
$Connection.run(SocketConnector.java:226)
        at org.mortbay.thread.BoundedThreadPool
$PoolThread.run(BoundedThreadPool.java:442)

Jan 31, 2008 9:39:01 PM org.apache.solr.core.SolrCore execute
INFO: /update stream.filename=/tmp/
commited_1201822625MainThread0_add_file.xml 0 0


thanks in advance,
--
Leonardo Santagada



Reply | Threaded
Open this post in threaded view
|

Re: How does remote streaming works for xml files?

Ryan McKinley
>
> Jan 31, 2008 9:39:01 PM org.apache.solr.core.SolrCore execute
> INFO: /update
> stream.filename=/tmp/commited_1201822625MainThread0_add_file.xml 0 0
>

isn't "stream.file" the parameter name?

ryan

Reply | Threaded
Open this post in threaded view
|

Re: How does remote streaming works for xml files?

Leonardo Santagada

On 31/01/2008, at 22:56, Ryan McKinley wrote:

>> Jan 31, 2008 9:39:01 PM org.apache.solr.core.SolrCore execute
>> INFO: /update stream.filename=/tmp/
>> commited_1201822625MainThread0_add_file.xml 0 0
>
> isn't "stream.file" the parameter name?
>
> ryan
>


Thanks, that was stupid... But there is so little information about  
this. Is the result to the get assynchronous or does it process the  
file before returning data?

--
Leonardo Santagada



Reply | Threaded
Open this post in threaded view
|

Re: How does remote streaming works for xml files?

Ryan McKinley
Leonardo Santagada wrote:

>
> On 31/01/2008, at 22:56, Ryan McKinley wrote:
>
>>> Jan 31, 2008 9:39:01 PM org.apache.solr.core.SolrCore execute
>>> INFO: /update
>>> stream.filename=/tmp/commited_1201822625MainThread0_add_file.xml 0 0
>>
>> isn't "stream.file" the parameter name?
>>
>> ryan
>>
>
>
> Thanks, that was stupid... But there is so little information about
> this. Is the result to the get assynchronous or does it process the file
> before returning data?
>

the stream.file (or stream.body) is just a way to replace what content
is used for the input.  it does not affect how the content is processed.
  The content is processed then data is returned.

ryan