[jira] [Commented] (NUTCH-2481) HostDatum deltas(previous step statistics) and Metadata expressions

Previous Topic Next Topic
 
classic Classic list List threaded Threaded
1 message Options
Reply | Threaded
Open this post in threaded view
|

[jira] [Commented] (NUTCH-2481) HostDatum deltas(previous step statistics) and Metadata expressions

JIRA jira@apache.org

    [ https://issues.apache.org/jira/browse/NUTCH-2481?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16361054#comment-16361054 ]

ASF GitHub Bot commented on NUTCH-2481:
---------------------------------------

okedoki commented on issue #278: NUTCH-2481
URL: https://github.com/apache/nutch/pull/278#issuecomment-364985404
 
 
   @YossiTamari
   Refactored according to your suggestion. It is quite bad that we have a utility for it and it wasnt used.

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
[hidden email]


> HostDatum deltas(previous step statistics) and Metadata expressions
> -------------------------------------------------------------------
>
>                 Key: NUTCH-2481
>                 URL: https://issues.apache.org/jira/browse/NUTCH-2481
>             Project: Nutch
>          Issue Type: Improvement
>          Components: hostdb
>            Reporter: Semyon Semyonov
>            Priority: Minor
>
> To allow the usage of previous step statistics(deltas of fetched,unfetced etc) in hostdb. The motivation is usage of this statistics in generate with maxCount expressions.
>  
> The solution allows to fill in metadata of hostdatum based on custom JEXL expression using two hostdatum: before update(previousHostDatum) and after update(currentHostDatum)..
> For example to fill in difference in quantity of fetched at round t and t-1 we can use the following expression
> <property>
>  <name>hostdb.deltaExpression</name>
>  <value>\{return new ("javafx.util.Pair","FetchedDelta", currentHostDatum.fetched - previousHostDatum.fetched);}</value>
> </property>
> A pull request will be provided shortly.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)