[jira] [Updated] (NUTCH-2751) nutch clean does not work with secured solr cloud

Tim Allison (Jira)

     [ https://issues.apache.org/jira/browse/NUTCH-2751?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Sebastian Nagel updated NUTCH-2751:
-----------------------------------
    Fix Version/s: 1.17

> nutch clean does not work with secured solr cloud
> -------------------------------------------------
>
>                 Key: NUTCH-2751
>                 URL: https://issues.apache.org/jira/browse/NUTCH-2751
>             Project: Nutch
>          Issue Type: Bug
>          Components: indexer
>    Affects Versions: 1.16
>            Reporter: Daniel Hammling
>            Priority: Critical
>             Fix For: 1.17
>
>
> I am calling nutch clean to remove 404 entries from Solr, but it fails with the exception below.
> Adding and updating entries works fine, so the index-writer configuration seems to be correct in general.
> The behaviour is identical in 1.15 and 1.16, although SolrIndexWriter.java has been modified for the delete case.
> I have no more ideas where to look.
>  
> 2019-11-01 14:45:55,664 INFO solr.SolrIndexWriter - SolrIndexer: deleting 14/14 documents
> 2019-11-01 14:45:55,768 ERROR impl.CloudSolrClient - Request to collection [www-int.replaceddomain.de] failed due to (0) org.apache.http.client.NonRepeatableRequestException: Cannot retry request with a non-repeatable request entity.
> , retry? 0
> 2019-11-01 14:45:55,780 ERROR impl.CloudSolrClient - Request to collection [www-int.replaceddomain.de] failed due to (0) org.apache.http.client.NonRepeatableRequestException: Cannot retry request with a non-repeatable request entity.
> , retry? 1
> 2019-11-01 14:45:55,858 ERROR impl.CloudSolrClient - Request to collection [www-int.replaceddomain.de] failed due to (0) org.apache.http.client.NonRepeatableRequestException: Cannot retry request with a non-repeatable request entity.
> , retry? 2
> 2019-11-01 14:45:55,887 ERROR impl.CloudSolrClient - Request to collection [www-int.replaceddomain.de] failed due to (0) org.apache.http.client.NonRepeatableRequestException: Cannot retry request with a non-repeatable request entity.
> , retry? 3
> 2019-11-01 14:45:55,903 ERROR impl.CloudSolrClient - Request to collection [www-int.replaceddomain.de] failed due to (0) org.apache.http.client.NonRepeatableRequestException: Cannot retry request with a non-repeatable request entity.
> , retry? 4
> 2019-11-01 14:45:55,938 ERROR impl.CloudSolrClient - Request to collection [www-int.replaceddomain.de] failed due to (0) org.apache.http.client.NonRepeatableRequestException: Cannot retry request with a non-repeatable request entity.
> , retry? 5
> 2019-11-01 14:45:55,938 DEBUG concurrent.ExecutorHelper - afterExecute in thread: pool-4-thread-1, runnable type: java.util.concurrent.FutureTask
> 2019-11-01 14:45:55,940 INFO mapred.LocalJobRunner - reduce task executor complete.
> 2019-11-01 14:45:55,941 WARN mapred.LocalJobRunner - job_local2086525572_0001
> java.lang.Exception: org.apache.solr.client.solrj.impl.CloudSolrClient$RouteException: IOException occured when talking to server at: http://10.10.0.96:10983/solr/www-int.replaceddomain.de_shard1_replica_n5
>  at org.apache.hadoop.mapred.LocalJobRunner$Job.runTasks(LocalJobRunner.java:491)
>  at org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:558)
> Caused by: org.apache.solr.client.solrj.impl.CloudSolrClient$RouteException: IOException occured when talking to server at: http://10.10.0.96:10983/solr/www-int.replaceddomain.de_shard1_replica_n5
>  at org.apache.solr.client.solrj.impl.CloudSolrClient.directUpdate(CloudSolrClient.java:553)
>  at org.apache.solr.client.solrj.impl.CloudSolrClient.sendRequest(CloudSolrClient.java:1014)
>  at org.apache.solr.client.solrj.impl.CloudSolrClient.requestWithRetryOnStaleState(CloudSolrClient.java:885)
>  at org.apache.solr.client.solrj.impl.CloudSolrClient.requestWithRetryOnStaleState(CloudSolrClient.java:947)
>  at org.apache.solr.client.solrj.impl.CloudSolrClient.requestWithRetryOnStaleState(CloudSolrClient.java:947)
>  at org.apache.solr.client.solrj.impl.CloudSolrClient.requestWithRetryOnStaleState(CloudSolrClient.java:947)
>  at org.apache.solr.client.solrj.impl.CloudSolrClient.requestWithRetryOnStaleState(CloudSolrClient.java:947)
>  at org.apache.solr.client.solrj.impl.CloudSolrClient.requestWithRetryOnStaleState(CloudSolrClient.java:947)
>  at org.apache.solr.client.solrj.impl.CloudSolrClient.request(CloudSolrClient.java:818)
>  at org.apache.solr.client.solrj.SolrClient.request(SolrClient.java:1219)
>  at org.apache.nutch.indexwriter.solr.SolrIndexWriter.push(SolrIndexWriter.java:270)
>  at org.apache.nutch.indexwriter.solr.SolrIndexWriter.commit(SolrIndexWriter.java:214)
>  at org.apache.nutch.indexwriter.solr.SolrIndexWriter.close(SolrIndexWriter.java:205)
>  at org.apache.nutch.indexer.IndexWriters.close(IndexWriters.java:257)
>  at org.apache.nutch.indexer.CleaningJob$DeleterReducer.cleanup(CleaningJob.java:115)
>  at org.apache.hadoop.mapreduce.Reducer.run(Reducer.java:179)
>  at org.apache.hadoop.mapred.ReduceTask.runNewReducer(ReduceTask.java:627)
>  at org.apache.hadoop.mapred.ReduceTask.run(ReduceTask.java:389)
>  at org.apache.hadoop.mapred.LocalJobRunner$Job$ReduceTaskRunnable.run(LocalJobRunner.java:346)
>  at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511)
>  at java.util.concurrent.FutureTask.run(FutureTask.java:266)
>  at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
>  at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
>  at java.lang.Thread.run(Thread.java:748)
> Caused by: org.apache.solr.client.solrj.SolrServerException: IOException occured when talking to server at: http://10.10.0.96:10983/solr/www-int.replaceddomain.de_shard1_replica_n5
>  at org.apache.solr.client.solrj.impl.HttpSolrClient.executeMethod(HttpSolrClient.java:657)
>  at org.apache.solr.client.solrj.impl.HttpSolrClient.request(HttpSolrClient.java:255)
>  at org.apache.solr.client.solrj.impl.HttpSolrClient.request(HttpSolrClient.java:244)
>  at org.apache.solr.client.solrj.impl.LBHttpSolrClient.doRequest(LBHttpSolrClient.java:483)
>  at org.apache.solr.client.solrj.impl.LBHttpSolrClient.request(LBHttpSolrClient.java:413)
>  at org.apache.solr.client.solrj.impl.CloudSolrClient.lambda$directUpdate$0(CloudSolrClient.java:528)
>  at java.util.concurrent.FutureTask.run(FutureTask.java:266)
>  at org.apache.solr.common.util.ExecutorUtil$MDCAwareThreadPoolExecutor.lambda$execute$0(ExecutorUtil.java:188)
>  ... 3 more
> Caused by: org.apache.http.client.ClientProtocolException
>  at org.apache.http.impl.client.InternalHttpClient.doExecute(InternalHttpClient.java:187)
>  at org.apache.http.impl.client.CloseableHttpClient.execute(CloseableHttpClient.java:83)
>  at org.apache.http.impl.client.CloseableHttpClient.execute(CloseableHttpClient.java:56)
>  at org.apache.solr.client.solrj.impl.HttpSolrClient.executeMethod(HttpSolrClient.java:542)
>  ... 10 more
> Caused by: org.apache.http.client.NonRepeatableRequestException: Cannot retry request with a non-repeatable request entity.
>  at org.apache.http.impl.execchain.MainClientExec.execute(MainClientExec.java:226)
>  at org.apache.http.impl.execchain.ProtocolExec.execute(ProtocolExec.java:185)
>  at org.apache.http.impl.execchain.RetryExec.execute(RetryExec.java:89)
>  at org.apache.http.impl.execchain.RedirectExec.execute(RedirectExec.java:111)
>  at org.apache.http.impl.client.InternalHttpClient.doExecute(InternalHttpClient.java:185)
>  ... 13 more
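
A note on the root cause in the trace above: the innermost exception is thrown from MainClientExec.execute, which is consistent with HttpClient trying to send the request a second time (for example after an authentication challenge from a secured server) when the request body can only be streamed once. One common way to avoid the second attempt is to send credentials preemptively with the first request. The following is a minimal, stdlib-only Java sketch of building an HTTP Basic Authorization header value (RFC 7617). The class name BasicAuthHeader and the credentials are hypothetical, and this message does not say whether the fix planned for 1.17 works this way.

```java
import java.nio.charset.StandardCharsets;
import java.util.Base64;

// Hypothetical helper for illustration; not part of Nutch or SolrJ.
final class BasicAuthHeader {

  private BasicAuthHeader() {}

  /** Returns the value of an "Authorization" header for HTTP Basic auth. */
  static String build(String user, String password) {
    // RFC 7617: base64("user:password"), prefixed with the scheme name.
    String pair = user + ":" + password;
    String encoded = Base64.getEncoder()
        .encodeToString(pair.getBytes(StandardCharsets.UTF_8));
    return "Basic " + encoded;
  }

  public static void main(String[] args) {
    // Placeholder credentials, assumed for this example only.
    System.out.println("Authorization: " + build("user", "pass"));
  }
}
```

In SolrJ, credentials can also be attached to a single request with SolrRequest.setBasicAuthCredentials(user, password). Since adds work and deletes do not, it may be worth checking whether the delete request built in SolrIndexWriter.push carries credentials in the same way as the add request.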



--
This message was sent by Atlassian Jira
(v8.3.4#803005)