[jira] [Created] (NUTCH-2751) nutch clean does not work with secured solr cloud

classic Classic list List threaded Threaded
1 message Options
Reply | Threaded
Open this post in threaded view
|

[jira] [Created] (NUTCH-2751) nutch clean does not work with secured solr cloud

Tim Allison (Jira)
Daniel Hammling created NUTCH-2751:
--------------------------------------

             Summary: nutch clean does not work with secured solr cloud
                 Key: NUTCH-2751
                 URL: https://issues.apache.org/jira/browse/NUTCH-2751
             Project: Nutch
          Issue Type: Bug
          Components: indexer
    Affects Versions: 1.16
            Reporter: Daniel Hammling


I am calling nutch clean to remove 404 entries from Solr, but fail with exception below.

Adding and updating entries is working fine. Hence, index-writer config seems to be correct in general.

Identical behaviour in 1.15 and 1.16, although SolrIndexWriter.java has been modified for delete case.

No more ideas, where to look at....

 

2019-11-01 14:45:55,664 INFO solr.SolrIndexWriter - SolrIndexer: deleting 14/14 documents
2019-11-01 14:45:55,768 ERROR impl.CloudSolrClient - Request to collection [www-int.replaceddomain.de] failed due to (0) org.apache.http.client.NonRepeatableRequestException: Cannot retry request with a non-repeatable request entity.
, retry? 0
2019-11-01 14:45:55,780 ERROR impl.CloudSolrClient - Request to collection [www-int.replaceddomain.de] failed due to (0) org.apache.http.client.NonRepeatableRequestException: Cannot retry request with a non-repeatable request entity.
, retry? 1
2019-11-01 14:45:55,858 ERROR impl.CloudSolrClient - Request to collection [www-int.replaceddomain.de] failed due to (0) org.apache.http.client.NonRepeatableRequestException: Cannot retry request with a non-repeatable request entity.
, retry? 2
2019-11-01 14:45:55,887 ERROR impl.CloudSolrClient - Request to collection [www-int.replaceddomain.de] failed due to (0) org.apache.http.client.NonRepeatableRequestException: Cannot retry request with a non-repeatable request entity.
, retry? 3
2019-11-01 14:45:55,903 ERROR impl.CloudSolrClient - Request to collection [www-int.replaceddomain.de] failed due to (0) org.apache.http.client.NonRepeatableRequestException: Cannot retry request with a non-repeatable request entity.
, retry? 4
2019-11-01 14:45:55,938 ERROR impl.CloudSolrClient - Request to collection [www-int.replaceddomain.de] failed due to (0) org.apache.http.client.NonRepeatableRequestException: Cannot retry request with a non-repeatable request entity.
, retry? 5
2019-11-01 14:45:55,938 DEBUG concurrent.ExecutorHelper - afterExecute in thread: pool-4-thread-1, runnable type: java.util.concurrent.FutureTask
2019-11-01 14:45:55,940 INFO mapred.LocalJobRunner - reduce task executor complete.
2019-11-01 14:45:55,941 WARN mapred.LocalJobRunner - job_local2086525572_0001
java.lang.Exception: org.apache.solr.client.solrj.impl.CloudSolrClient$RouteException: IOException occured when talking to server at: http://10.10.0.96:10983/solr/www-int.replaceddomain.de_shard1_replica_n5
 at org.apache.hadoop.mapred.LocalJobRunner$Job.runTasks(LocalJobRunner.java:491)
 at org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:558)
Caused by: org.apache.solr.client.solrj.impl.CloudSolrClient$RouteException: IOException occured when talking to server at: http://10.10.0.96:10983/solr/www-int.replaceddomain.de_shard1_replica_n5
 at org.apache.solr.client.solrj.impl.CloudSolrClient.directUpdate(CloudSolrClient.java:553)
 at org.apache.solr.client.solrj.impl.CloudSolrClient.sendRequest(CloudSolrClient.java:1014)
 at org.apache.solr.client.solrj.impl.CloudSolrClient.requestWithRetryOnStaleState(CloudSolrClient.java:885)
 at org.apache.solr.client.solrj.impl.CloudSolrClient.requestWithRetryOnStaleState(CloudSolrClient.java:947)
 at org.apache.solr.client.solrj.impl.CloudSolrClient.requestWithRetryOnStaleState(CloudSolrClient.java:947)
 at org.apache.solr.client.solrj.impl.CloudSolrClient.requestWithRetryOnStaleState(CloudSolrClient.java:947)
 at org.apache.solr.client.solrj.impl.CloudSolrClient.requestWithRetryOnStaleState(CloudSolrClient.java:947)
 at org.apache.solr.client.solrj.impl.CloudSolrClient.requestWithRetryOnStaleState(CloudSolrClient.java:947)
 at org.apache.solr.client.solrj.impl.CloudSolrClient.request(CloudSolrClient.java:818)
 at org.apache.solr.client.solrj.SolrClient.request(SolrClient.java:1219)
 at org.apache.nutch.indexwriter.solr.SolrIndexWriter.push(SolrIndexWriter.java:270)
 at org.apache.nutch.indexwriter.solr.SolrIndexWriter.commit(SolrIndexWriter.java:214)
 at org.apache.nutch.indexwriter.solr.SolrIndexWriter.close(SolrIndexWriter.java:205)
 at org.apache.nutch.indexer.IndexWriters.close(IndexWriters.java:257)
 at org.apache.nutch.indexer.CleaningJob$DeleterReducer.cleanup(CleaningJob.java:115)
 at org.apache.hadoop.mapreduce.Reducer.run(Reducer.java:179)
 at org.apache.hadoop.mapred.ReduceTask.runNewReducer(ReduceTask.java:627)
 at org.apache.hadoop.mapred.ReduceTask.run(ReduceTask.java:389)
 at org.apache.hadoop.mapred.LocalJobRunner$Job$ReduceTaskRunnable.run(LocalJobRunner.java:346)
 at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511)
 at java.util.concurrent.FutureTask.run(FutureTask.java:266)
 at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
 at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
 at java.lang.Thread.run(Thread.java:748)
Caused by: org.apache.solr.client.solrj.SolrServerException: IOException occured when talking to server at: http://10.10.0.96:10983/solr/www-int.replaceddomain.de_shard1_replica_n5
 at org.apache.solr.client.solrj.impl.HttpSolrClient.executeMethod(HttpSolrClient.java:657)
 at org.apache.solr.client.solrj.impl.HttpSolrClient.request(HttpSolrClient.java:255)
 at org.apache.solr.client.solrj.impl.HttpSolrClient.request(HttpSolrClient.java:244)
 at org.apache.solr.client.solrj.impl.LBHttpSolrClient.doRequest(LBHttpSolrClient.java:483)
 at org.apache.solr.client.solrj.impl.LBHttpSolrClient.request(LBHttpSolrClient.java:413)
 at org.apache.solr.client.solrj.impl.CloudSolrClient.lambda$directUpdate$0(CloudSolrClient.java:528)
 at java.util.concurrent.FutureTask.run(FutureTask.java:266)
 at org.apache.solr.common.util.ExecutorUtil$MDCAwareThreadPoolExecutor.lambda$execute$0(ExecutorUtil.java:188)
 ... 3 more
Caused by: org.apache.http.client.ClientProtocolException
 at org.apache.http.impl.client.InternalHttpClient.doExecute(InternalHttpClient.java:187)
 at org.apache.http.impl.client.CloseableHttpClient.execute(CloseableHttpClient.java:83)
 at org.apache.http.impl.client.CloseableHttpClient.execute(CloseableHttpClient.java:56)
 at org.apache.solr.client.solrj.impl.HttpSolrClient.executeMethod(HttpSolrClient.java:542)
 ... 10 more
Caused by: org.apache.http.client.NonRepeatableRequestException: Cannot retry request with a non-repeatable request entity.
 at org.apache.http.impl.execchain.MainClientExec.execute(MainClientExec.java:226)
 at org.apache.http.impl.execchain.ProtocolExec.execute(ProtocolExec.java:185)
 at org.apache.http.impl.execchain.RetryExec.execute(RetryExec.java:89)
 at org.apache.http.impl.execchain.RedirectExec.execute(RedirectExec.java:111)
 at org.apache.http.impl.client.InternalHttpClient.doExecute(InternalHttpClient.java:185)
 ... 13 more



--
This message was sent by Atlassian Jira
(v8.3.4#803005)