[jira] Created: (NUTCH-176) Using -dir: creates an error, when the directory already exists

classic Classic list List threaded Threaded
2 messages Options
Reply | Threaded
Open this post in threaded view
|

[jira] Created: (NUTCH-176) Using -dir: creates an error, when the directory already exists

JIRA jira@apache.org
Using -dir: creates an error, when the directory already exists
---------------------------------------------------------------

         Key: NUTCH-176
         URL: http://issues.apache.org/jira/browse/NUTCH-176
     Project: Nutch
        Type: Bug
    Versions: 0.7.1    
 Environment: SUSE Linux 9.3
    Reporter: Matthias Günter
    Priority: Minor


In my opinion -dir should work even, when the directory already exists.

The error message is:
guenter@deimos:~/workspace/lucene/nutch-0.7.1/bin> sh ./nutch crawl ../../urllist.txt  -dir tmpdir
060115 140500 parsing file:/home/guenter/workspace/lucene/nutch-0.7.1/conf/nutch-default.xml
060115 140500 parsing file:/home/guenter/workspace/lucene/nutch-0.7.1/conf/crawl-tool.xml
060115 140500 parsing file:/home/guenter/workspace/lucene/nutch-0.7.1/conf/nutch-site.xml
060115 140500 No FS indicated, using default:local
Exception in thread "main" java.lang.RuntimeException: tmpdir already exists.
        at org.apache.nutch.tools.CrawlTool.main(CrawlTool.java:121)


--
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators:
   http://issues.apache.org/jira/secure/Administrators.jspa
-
For more information on JIRA, see:
   http://www.atlassian.com/software/jira

Reply | Threaded
Open this post in threaded view
|

[jira] Resolved: (NUTCH-176) Using -dir: creates an error, when the directory already exists

JIRA jira@apache.org
     [ http://issues.apache.org/jira/browse/NUTCH-176?page=all ]
     
Doug Cutting resolved NUTCH-176:
--------------------------------

    Resolution: Won't Fix

This check is intentionally made to prevent folks from accidentally overwriting crawls.

> Using -dir: creates an error, when the directory already exists
> ---------------------------------------------------------------
>
>          Key: NUTCH-176
>          URL: http://issues.apache.org/jira/browse/NUTCH-176
>      Project: Nutch
>         Type: Bug
>     Versions: 0.7.1
>  Environment: SUSE Linux 9.3
>     Reporter: Matthias Günter
>     Priority: Minor

>
> In my opinion -dir should work even, when the directory already exists.
> The error message is:
> guenter@deimos:~/workspace/lucene/nutch-0.7.1/bin> sh ./nutch crawl ../../urllist.txt  -dir tmpdir
> 060115 140500 parsing file:/home/guenter/workspace/lucene/nutch-0.7.1/conf/nutch-default.xml
> 060115 140500 parsing file:/home/guenter/workspace/lucene/nutch-0.7.1/conf/crawl-tool.xml
> 060115 140500 parsing file:/home/guenter/workspace/lucene/nutch-0.7.1/conf/nutch-site.xml
> 060115 140500 No FS indicated, using default:local
> Exception in thread "main" java.lang.RuntimeException: tmpdir already exists.
>         at org.apache.nutch.tools.CrawlTool.main(CrawlTool.java:121)

--
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators:
   http://issues.apache.org/jira/secure/Administrators.jspa
-
For more information on JIRA, see:
   http://www.atlassian.com/software/jira