Updated: (HADOOP-492) Global counters


Hudson (Jira)

     [ https://issues.apache.org/jira/browse/HADOOP-492?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

David Bowen updated HADOOP-492:

    Attachment: counters2.patch

Here is the updated patch with the fixes: (1) Reporter is an interface again, (2) Statistics is renamed Counters, and (3) a commented-out method has been removed.

> Global counters
> ---------------
>                 Key: HADOOP-492
>                 URL: https://issues.apache.org/jira/browse/HADOOP-492
>             Project: Hadoop
>          Issue Type: New Feature
>          Components: mapred
>            Reporter: arkady borkovsky
>         Assigned To: David Bowen
>         Attachments: counters1.patch, counters2.patch
> It would be nice to have a map/reduce job keep aggregated counts for arbitrary events occurring in its tasks -- the number of records processed, the number of exceptions of a specific type, the number of sentences in passive voice, whatever the job finds useful.
> This can be implemented by tasks periodically sending <name, value> pairs to the jobtracker (in some implementations such messages are piggy-backed on the heartbeats), so that the jobtracker stores the latest values from each task and aggregates them on request.  It should also make the aggregated values available at the job end.  The value for a task would be flushed when the task fails.
> #491 and #490 may be related to this one.
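
To make the scheme in the description concrete, here is a minimal sketch of the bookkeeping on the jobtracker side. It is not the code in counters2.patch: the class name CounterAggregator and its methods are hypothetical, and it assumes that each report carries a task's cumulative counts (so a new report replaces the old one) and that "flushed" means the failed task's counts are discarded.

```java
import java.util.HashMap;
import java.util.Map;
import java.util.TreeMap;

// Hypothetical illustration only; not a class from Hadoop or from the attached patch.
class CounterAggregator {
    // Latest <name, value> snapshot reported by each task, keyed by task id.
    private final Map<String, Map<String, Long>> latestByTask = new HashMap<>();

    // Called when a task's periodic report arrives (for example on a heartbeat).
    // Assumption: values are cumulative per task, so the new snapshot replaces the old one.
    synchronized void report(String taskId, Map<String, Long> counters) {
        latestByTask.put(taskId, new HashMap<>(counters));
    }

    // Called when a task fails. Assumption: "flushed" means its counts are dropped,
    // so a re-executed task does not get counted twice.
    synchronized void taskFailed(String taskId) {
        latestByTask.remove(taskId);
    }

    // Sums the latest value of every counter across all known tasks, on request.
    synchronized Map<String, Long> aggregate() {
        Map<String, Long> totals = new TreeMap<>();
        for (Map<String, Long> snapshot : latestByTask.values()) {
            for (Map.Entry<String, Long> entry : snapshot.entrySet()) {
                totals.merge(entry.getKey(), entry.getValue(), Long::sum);
            }
        }
        return totals;
    }
}
```

Keeping one snapshot per task rather than a single running total is what makes it possible to drop a failed task's contribution; the cost is that aggregation walks every task's snapshot each time it is requested.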

This message is automatically generated by JIRA.
You can reply to this email to add a comment to the issue online.