[jira] Created: (NUTCH-539) HttpClient plugin does not work with BasicAuthentication

classic Classic list List threaded Threaded
5 messages Options
Reply | Threaded
Open this post in threaded view
|

[jira] Created: (NUTCH-539) HttpClient plugin does not work with BasicAuthentication

JIRA jira@apache.org
HttpClient plugin does not work with BasicAuthentication
--------------------------------------------------------

                 Key: NUTCH-539
                 URL: https://issues.apache.org/jira/browse/NUTCH-539
             Project: Nutch
          Issue Type: Bug
          Components: fetcher
    Affects Versions: 0.8
            Reporter: Ravi Chintakunta
            Priority: Minor


For Nutch to fetch pages with basic authentication, the HttpClient should be configured with the username and password credentials.

For this to work:

1. Add the username and password credentials to nutch-site.xml as below:

<property>
  <name>http.auth.basic.username</name>
  <value>myusername</value>
  <description>
        username for http basic auth
  </description>
</property>

<property>
  <name>http.auth.basic.password</name>
  <value>mypassword</value>
  <description>
        password for http basic auth
  </description>
</property>

2. Configure httpclient with these credentials by applying the attached patch to nutch/src/plugin/protocol-httpclient/src/java/org/apache/nutch/protocol/httpclient/Http.java




--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply | Threaded
Open this post in threaded view
|

[jira] Updated: (NUTCH-539) HttpClient plugin does not work with BasicAuthentication

JIRA jira@apache.org

     [ https://issues.apache.org/jira/browse/NUTCH-539?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Ravi Chintakunta updated NUTCH-539:
-----------------------------------

    Attachment: httpclient_http.patch

Patch for nutch/src/plugin/protocol-httpclient/src/java/org/apache/nutch/protocol/httpclient/Http.java to fix this issue

> HttpClient plugin does not work with BasicAuthentication
> --------------------------------------------------------
>
>                 Key: NUTCH-539
>                 URL: https://issues.apache.org/jira/browse/NUTCH-539
>             Project: Nutch
>          Issue Type: Bug
>          Components: fetcher
>    Affects Versions: 0.8
>            Reporter: Ravi Chintakunta
>            Priority: Minor
>         Attachments: httpclient_http.patch
>
>
> For Nutch to fetch pages with basic authentication, the HttpClient should be configured with the username and password credentials.
> For this to work:
> 1. Add the username and password credentials to nutch-site.xml as below:
> <property>
>   <name>http.auth.basic.username</name>
>   <value>myusername</value>
>   <description>
> username for http basic auth
>   </description>
> </property>
> <property>
>   <name>http.auth.basic.password</name>
>   <value>mypassword</value>
>   <description>
> password for http basic auth
>   </description>
> </property>
> 2. Configure httpclient with these credentials by applying the attached patch to nutch/src/plugin/protocol-httpclient/src/java/org/apache/nutch/protocol/httpclient/Http.java

--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply | Threaded
Open this post in threaded view
|

[jira] Commented: (NUTCH-539) HttpClient plugin does not work with BasicAuthentication

JIRA jira@apache.org
In reply to this post by JIRA jira@apache.org

    [ https://issues.apache.org/jira/browse/NUTCH-539?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12530165 ]

Alexis Votta commented on NUTCH-539:
------------------------------------

This patch doesn't work for me. I don't get any useful debug logs too.

> HttpClient plugin does not work with BasicAuthentication
> --------------------------------------------------------
>
>                 Key: NUTCH-539
>                 URL: https://issues.apache.org/jira/browse/NUTCH-539
>             Project: Nutch
>          Issue Type: Bug
>          Components: fetcher
>    Affects Versions: 0.8
>            Reporter: Ravi Chintakunta
>            Priority: Minor
>         Attachments: httpclient_http.patch
>
>
> For Nutch to fetch pages with basic authentication, the HttpClient should be configured with the username and password credentials.
> For this to work:
> 1. Add the username and password credentials to nutch-site.xml as below:
> <property>
>   <name>http.auth.basic.username</name>
>   <value>myusername</value>
>   <description>
> username for http basic auth
>   </description>
> </property>
> <property>
>   <name>http.auth.basic.password</name>
>   <value>mypassword</value>
>   <description>
> password for http basic auth
>   </description>
> </property>
> 2. Configure httpclient with these credentials by applying the attached patch to nutch/src/plugin/protocol-httpclient/src/java/org/apache/nutch/protocol/httpclient/Http.java

--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply | Threaded
Open this post in threaded view
|

[jira] Issue Comment Edited: (NUTCH-539) HttpClient plugin does not work with BasicAuthentication

JIRA jira@apache.org
In reply to this post by JIRA jira@apache.org

    [ https://issues.apache.org/jira/browse/NUTCH-539?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12530175 ]

susam edited comment on NUTCH-539 at 9/25/07 10:54 AM:
-----------------------------------------------------------

1. There is a bug in the patch. The domain name is hardcoded in the AuthScope constructor. It should be the domain name extracted from the URL being fetched.
2. This patch removes NTLM authentication scheme support from protocol-httpclient. Both NTLM and digest authentication schemes should be supported.

These two issues have been fixed in NUTCH-559.

      was (Author: susam):
    1. There is a bug in the patch. The domain name is hardcoded in the AuthScope constructor. It should be the domain name extracted from the URL being fetched.
2. This patch removes NTLM authentication scheme support from protocol-httpclient. Both NTLM and digest authentication schemes should be supported.

The following two issues have been fixed in NUTCH-559.
 

> HttpClient plugin does not work with BasicAuthentication
> --------------------------------------------------------
>
>                 Key: NUTCH-539
>                 URL: https://issues.apache.org/jira/browse/NUTCH-539
>             Project: Nutch
>          Issue Type: Bug
>          Components: fetcher
>    Affects Versions: 0.8
>            Reporter: Ravi Chintakunta
>            Priority: Minor
>         Attachments: httpclient_http.patch
>
>
> For Nutch to fetch pages with basic authentication, the HttpClient should be configured with the username and password credentials.
> For this to work:
> 1. Add the username and password credentials to nutch-site.xml as below:
> <property>
>   <name>http.auth.basic.username</name>
>   <value>myusername</value>
>   <description>
> username for http basic auth
>   </description>
> </property>
> <property>
>   <name>http.auth.basic.password</name>
>   <value>mypassword</value>
>   <description>
> password for http basic auth
>   </description>
> </property>
> 2. Configure httpclient with these credentials by applying the attached patch to nutch/src/plugin/protocol-httpclient/src/java/org/apache/nutch/protocol/httpclient/Http.java

--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply | Threaded
Open this post in threaded view
|

[jira] Closed: (NUTCH-539) HttpClient plugin does not work with BasicAuthentication

JIRA jira@apache.org
In reply to this post by JIRA jira@apache.org

     [ https://issues.apache.org/jira/browse/NUTCH-539?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Doğacan Güney closed NUTCH-539.
-------------------------------

       Resolution: Fixed
    Fix Version/s: 1.0.0
         Assignee: Doğacan Güney

Fixed as part of NUTCH-559.

> HttpClient plugin does not work with BasicAuthentication
> --------------------------------------------------------
>
>                 Key: NUTCH-539
>                 URL: https://issues.apache.org/jira/browse/NUTCH-539
>             Project: Nutch
>          Issue Type: Bug
>          Components: fetcher
>    Affects Versions: 0.8
>            Reporter: Ravi Chintakunta
>            Assignee: Doğacan Güney
>            Priority: Minor
>             Fix For: 1.0.0
>
>         Attachments: httpclient_http.patch
>
>
> For Nutch to fetch pages with basic authentication, the HttpClient should be configured with the username and password credentials.
> For this to work:
> 1. Add the username and password credentials to nutch-site.xml as below:
> <property>
>   <name>http.auth.basic.username</name>
>   <value>myusername</value>
>   <description>
> username for http basic auth
>   </description>
> </property>
> <property>
>   <name>http.auth.basic.password</name>
>   <value>mypassword</value>
>   <description>
> password for http basic auth
>   </description>
> </property>
> 2. Configure httpclient with these credentials by applying the attached patch to nutch/src/plugin/protocol-httpclient/src/java/org/apache/nutch/protocol/httpclient/Http.java

--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.