[jira] [Created] (NUTCH-2510) Crawl script modification. HostDb: generate, optional usage and description


JIRA jira@apache.org
Semyon Semyonov created NUTCH-2510:
--------------------------------------

             Summary: Crawl script modification. HostDb: generate, optional usage and description
                 Key: NUTCH-2510
                 URL: https://issues.apache.org/jira/browse/NUTCH-2510
             Project: Nutch
          Issue Type: Improvement
          Components: bin
    Affects Versions: 1.15
            Reporter: Semyon Semyonov
             Fix For: 1.14


The crawl script now runs a HostDb update as part of each crawl cycle, but:
1) The generate step has no hostdb parameter.

2) The HostDb update is not optional, so it runs in every cycle without asking the user. It should be controlled by an optional parameter.

3) The script's usage description should document the options from 1 and 2.
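
A rough sketch of what points 1 and 2 could look like in the script, not the actual patch. The variable names HOSTDB_UPDATE and HOSTDB_GENERATE, the two helper functions, and the crawl directory layout are hypothetical. It also assumes that "bin/nutch generate" accepts a -hostdb argument, as point 1 implies. The functions only print the commands and do not run Nutch.

```shell
#!/usr/bin/env bash
# Sketch only: switch names and directory layout are assumptions, not the real crawl script.

# Hypothetical switches. Both default to off, so HostDb work is opt-in.
HOSTDB_UPDATE=false
HOSTDB_GENERATE=false

# Print the updatehostdb command for a crawl directory, or nothing when disabled.
hostdb_update_cmd() {
  local crawl_dir="$1"
  if [ "$HOSTDB_UPDATE" = true ]; then
    echo "bin/nutch updatehostdb -crawldb $crawl_dir/crawldb -hostdb $crawl_dir/hostdb"
  fi
}

# Print the generate command, appending -hostdb only when requested.
generate_cmd() {
  local crawl_dir="$1"
  local cmd="bin/nutch generate $crawl_dir/crawldb $crawl_dir/segments"
  if [ "$HOSTDB_GENERATE" = true ]; then
    cmd="$cmd -hostdb $crawl_dir/hostdb"
  fi
  echo "$cmd"
}
```

With both switches left at false, the cycle behaves as it did before the HostDb step was added. Setting HOSTDB_UPDATE=true enables the update step, and setting HOSTDB_GENERATE=true passes the HostDb to generate.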



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)