[jira] [Commented] (NUTCH-2447) Work-around SSLProtocolException: handshake alert: unrecognized_name

Previous Topic Next Topic
 
classic Classic list List threaded Threaded
1 message Options
Reply | Threaded
Open this post in threaded view
|

[jira] [Commented] (NUTCH-2447) Work-around SSLProtocolException: handshake alert: unrecognized_name

JIRA jira@apache.org

    [ https://issues.apache.org/jira/browse/NUTCH-2447?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16351909#comment-16351909 ]

Sebastian Nagel commented on NUTCH-2447:
----------------------------------------

Hi [~markus17], problem and solution confirmed. Two questions:
 * both patch files are identical, according to your comment the second/newer one should point to this issue, right?
 * why are errors during the reconnect ignored? (try-catch inside the catch block) The connection cannot be used, if the reconnect fails as well?

> Work-around SSLProtocolException: handshake alert: unrecognized_name
> --------------------------------------------------------------------
>
>                 Key: NUTCH-2447
>                 URL: https://issues.apache.org/jira/browse/NUTCH-2447
>             Project: Nutch
>          Issue Type: Bug
>          Components: protocol
>    Affects Versions: 1.13
>            Reporter: Markus Jelsma
>            Assignee: Markus Jelsma
>            Priority: Critical
>             Fix For: 1.15
>
>         Attachments: NUTCH-2447.patch, NUTCH-2447.patch
>
>
> Nutch is unable to crawl some websites, regardless of protocol plugin you are using. The work-around you frequently find (-Djsse.enableSNIExtension=false) does not work at all, so the internet is clearly lying to us!
> {code}
> 2017-10-23 12:43:52,911 INFO  api.HttpRobotRulesParser - Couldn't get robots.txt for https://www.eidsiva.net/: javax.net.ssl.SSLProtocolException: handshake alert:  unrecognized_name
> 2017-10-23 12:43:53,011 ERROR http.Http - Failed to get protocol output
> javax.net.ssl.SSLProtocolException: handshake alert:  unrecognized_name
>         at sun.security.ssl.ClientHandshaker.handshakeAlert(ClientHandshaker.java:1446)
>         at sun.security.ssl.SSLSocketImpl.recvAlert(SSLSocketImpl.java:2016)
>         at sun.security.ssl.SSLSocketImpl.readRecord(SSLSocketImpl.java:1125)
>         at sun.security.ssl.SSLSocketImpl.performInitialHandshake(SSLSocketImpl.java:1375)
>         at sun.security.ssl.SSLSocketImpl.startHandshake(SSLSocketImpl.java:1403)
>         at sun.security.ssl.SSLSocketImpl.startHandshake(SSLSocketImpl.java:1387)
>         at org.apache.nutch.protocol.http.HttpResponse.<init>(HttpResponse.java:152)
>         at org.apache.nutch.protocol.http.Http.getResponse(Http.java:72)
>         at org.apache.nutch.protocol.http.api.HttpBase.getProtocolOutput(HttpBase.java:271)
>         at org.apache.nutch.fetcher.FetcherThread.run(FetcherThread.java:327)
> {code}



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)