[jira] [Created] (NUTCH-2619) protocol-okhttp: allow to keep partially fetched docs as truncated

Previous Topic Next Topic
 
classic Classic list List threaded Threaded
1 message Options
Reply | Threaded
Open this post in threaded view
|

[jira] [Created] (NUTCH-2619) protocol-okhttp: allow to keep partially fetched docs as truncated

JIRA jira@apache.org
Sebastian Nagel created NUTCH-2619:
--------------------------------------

             Summary: protocol-okhttp: allow to keep partially fetched docs as truncated
                 Key: NUTCH-2619
                 URL: https://issues.apache.org/jira/browse/NUTCH-2619
             Project: Nutch
          Issue Type: Improvement
          Components: protocol
    Affects Versions: 1.15
            Reporter: Sebastian Nagel
             Fix For: 1.16


Sometimes fetching a larger document times out after some content has already been downloaded. For some use cases it may be better to save this partially fetched document and mark it as truncated, instead of retrying the fetch later (may fail for the same reason again).



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)