[jira] [Commented] (NUTCH-2691) Improve logging from scoring-depth plugin

JIRA jira@apache.org

    [ https://issues.apache.org/jira/browse/NUTCH-2691?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16748867#comment-16748867 ]

ASF GitHub Bot commented on NUTCH-2691:
---------------------------------------

YossiTamari commented on pull request #434: NUTCH-2691: Improve logging from scoring-depth plugin
URL: https://github.com/apache/nutch/pull/434
 
 
   Exit distributeScoreToOutlinks immediately if there are no outlinks. This is a very small performance improvement, but more importantly it stops the plugin from emitting a "Missing depth, removing all outlinks from url" warning for every page that failed parsing.
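
   For illustration only, here is a minimal, self-contained sketch of the early-exit idea. It is not the code from the pull request and does not use the Nutch API: the class DepthScoringSketch, the DEPTH_KEY constant, the simplified method signature, and the use of plain maps for metadata are all assumptions made so the example can run on its own.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

// Hypothetical stand-in for a depth-scoring filter; not the Nutch class.
final class DepthScoringSketch {
    // Assumed metadata key under which the page depth is stored.
    static final String DEPTH_KEY = "_depth_";

    // Collected warnings, standing in for LOG.warn(...) calls.
    final List<String> warnings = new ArrayList<>();

    /**
     * Propagates depth + 1 to every outlink and returns the number of
     * outlinks kept. Simplified signature: the real method takes more
     * parameters and Nutch-specific types.
     */
    int distributeScoreToOutlinks(String fromUrl,
                                  Map<String, String> parseMeta,
                                  List<Map<String, String>> targets) {
        // Early exit: a page that failed parsing has no outlinks, so there
        // is nothing to remove and nothing worth warning about.
        if (targets.isEmpty()) {
            return 0;
        }
        String depth = parseMeta.get(DEPTH_KEY);
        if (depth == null) {
            // Only reached when outlinks actually exist.
            warnings.add("Missing depth, removing all outlinks from url " + fromUrl);
            targets.clear();
            return 0;
        }
        int nextDepth = Integer.parseInt(depth) + 1;
        for (Map<String, String> target : targets) {
            target.put(DEPTH_KEY, Integer.toString(nextDepth));
        }
        return targets.size();
    }

    public static void main(String[] args) {
        DepthScoringSketch sketch = new DepthScoringSketch();

        // Page that failed parsing: no depth metadata and no outlinks.
        sketch.distributeScoreToOutlinks(
            "http://example.com/failed", new HashMap<>(), new ArrayList<>());

        // Page with an outlink but no depth: the warning is still emitted.
        List<Map<String, String>> targets = new ArrayList<>();
        targets.add(new HashMap<>());
        sketch.distributeScoreToOutlinks(
            "http://example.com/no-depth", new HashMap<>(), targets);

        System.out.println("warnings: " + sketch.warnings.size());
    }
}
```

   With the guard at the top, the failed page in main produces no warning, while a page that has outlinks but no depth still does, so the warning remains useful where it signals a real problem.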
   
 
----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
[hidden email]


> Improve logging from scoring-depth plugin
> -----------------------------------------
>
>                 Key: NUTCH-2691
>                 URL: https://issues.apache.org/jira/browse/NUTCH-2691
>             Project: Nutch
>          Issue Type: Improvement
>          Components: scoring
>    Affects Versions: 1.15
>            Reporter: Yossi Tamari
>            Priority: Minor
>             Fix For: 1.16
>
>
> Currently the scoring-depth plugin emits a "Missing depth, removing all outlinks from url" log message for every page that failed parsing (and does not have outlinks anyway).
> I will provide a patch that exits immediately when there are no outlinks.
>  



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)