[jira] [Created] (LUCENE-3444) Distinct field value count per group

classic Classic list List threaded Threaded
8 messages Options
Reply | Threaded
Open this post in threaded view
|

[jira] [Created] (LUCENE-3444) Distinct field value count per group

JIRA jira@apache.org
Distinct field value count per group
------------------------------------

                 Key: LUCENE-3444
                 URL: https://issues.apache.org/jira/browse/LUCENE-3444
             Project: Lucene - Java
          Issue Type: New Feature
          Components: modules/grouping
            Reporter: Martijn van Groningen


Support a second pass collector that counts unique field values of a field per group.
This is just one example of group statistics that one might want.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

       

---------------------------------------------------------------------
To unsubscribe, e-mail: [hidden email]
For additional commands, e-mail: [hidden email]

Reply | Threaded
Open this post in threaded view
|

[jira] [Updated] (LUCENE-3444) Distinct field value count per group

JIRA jira@apache.org

     [ https://issues.apache.org/jira/browse/LUCENE-3444?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Martijn van Groningen updated LUCENE-3444:
------------------------------------------

    Attachment: LUCENE-3444.patch

Attached initial version of a second pass collector that count the unique field values per group for a specific field.

> Distinct field value count per group
> ------------------------------------
>
>                 Key: LUCENE-3444
>                 URL: https://issues.apache.org/jira/browse/LUCENE-3444
>             Project: Lucene - Java
>          Issue Type: New Feature
>          Components: modules/grouping
>            Reporter: Martijn van Groningen
>         Attachments: LUCENE-3444.patch
>
>
> Support a second pass collector that counts unique field values of a field per group.
> This is just one example of group statistics that one might want.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

       

---------------------------------------------------------------------
To unsubscribe, e-mail: [hidden email]
For additional commands, e-mail: [hidden email]

Reply | Threaded
Open this post in threaded view
|

[jira] [Issue Comment Edited] (LUCENE-3444) Distinct field value count per group

JIRA jira@apache.org
In reply to this post by JIRA jira@apache.org

    [ https://issues.apache.org/jira/browse/LUCENE-3444?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13109540#comment-13109540 ]

Martijn van Groningen edited comment on LUCENE-3444 at 9/21/11 2:45 PM:
------------------------------------------------------------------------

Attached initial version of a second pass collector that counts the unique field values per group for a specific field.

      was (Author: martijn.v.groningen):
    Attached initial version of a second pass collector that count the unique field values per group for a specific field.
 

> Distinct field value count per group
> ------------------------------------
>
>                 Key: LUCENE-3444
>                 URL: https://issues.apache.org/jira/browse/LUCENE-3444
>             Project: Lucene - Java
>          Issue Type: New Feature
>          Components: modules/grouping
>            Reporter: Martijn van Groningen
>         Attachments: LUCENE-3444.patch
>
>
> Support a second pass collector that counts unique field values of a field per group.
> This is just one example of group statistics that one might want.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

       

---------------------------------------------------------------------
To unsubscribe, e-mail: [hidden email]
For additional commands, e-mail: [hidden email]

Reply | Threaded
Open this post in threaded view
|

[jira] [Updated] (LUCENE-3444) Distinct field value count per group

JIRA jira@apache.org
In reply to this post by JIRA jira@apache.org

     [ https://issues.apache.org/jira/browse/LUCENE-3444?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Martijn van Groningen updated LUCENE-3444:
------------------------------------------

    Attachment: LUCENE-3444.patch

Updated patch. I've split the DistinctCountCollector into abstract base class and a term based implementation. This allows other implementations such as IDV and function based implementations.
               

> Distinct field value count per group
> ------------------------------------
>
>                 Key: LUCENE-3444
>                 URL: https://issues.apache.org/jira/browse/LUCENE-3444
>             Project: Lucene - Java
>          Issue Type: New Feature
>          Components: modules/grouping
>            Reporter: Martijn van Groningen
>         Attachments: LUCENE-3444.patch, LUCENE-3444.patch
>
>
> Support a second pass collector that counts unique field values of a field per group.
> This is just one example of group statistics that one might want.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

       

---------------------------------------------------------------------
To unsubscribe, e-mail: [hidden email]
For additional commands, e-mail: [hidden email]

Reply | Threaded
Open this post in threaded view
|

[jira] [Updated] (LUCENE-3444) Distinct field value count per group

JIRA jira@apache.org
In reply to this post by JIRA jira@apache.org

     [ https://issues.apache.org/jira/browse/LUCENE-3444?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Martijn van Groningen updated LUCENE-3444:
------------------------------------------

    Attachment: LUCENE-3444.patch

Updated the patch and added a docvalues based implementation.

Things to do:
* Add implementation that uses MutableValue.
* Add random tests.
               

> Distinct field value count per group
> ------------------------------------
>
>                 Key: LUCENE-3444
>                 URL: https://issues.apache.org/jira/browse/LUCENE-3444
>             Project: Lucene - Java
>          Issue Type: New Feature
>          Components: modules/grouping
>            Reporter: Martijn van Groningen
>         Attachments: LUCENE-3444.patch, LUCENE-3444.patch, LUCENE-3444.patch
>
>
> Support a second pass collector that counts unique field values of a field per group.
> This is just one example of group statistics that one might want.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

       

---------------------------------------------------------------------
To unsubscribe, e-mail: [hidden email]
For additional commands, e-mail: [hidden email]

Reply | Threaded
Open this post in threaded view
|

[jira] [Updated] (LUCENE-3444) Distinct field value count per group

JIRA jira@apache.org
In reply to this post by JIRA jira@apache.org

     [ https://issues.apache.org/jira/browse/LUCENE-3444?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Martijn van Groningen updated LUCENE-3444:
------------------------------------------

    Fix Version/s: 4.0
   

> Distinct field value count per group
> ------------------------------------
>
>                 Key: LUCENE-3444
>                 URL: https://issues.apache.org/jira/browse/LUCENE-3444
>             Project: Lucene - Java
>          Issue Type: New Feature
>          Components: modules/grouping
>            Reporter: Martijn van Groningen
>             Fix For: 4.0
>
>         Attachments: LUCENE-3444.patch, LUCENE-3444.patch, LUCENE-3444.patch, LUCENE-3444.patch
>
>
> Support a second pass collector that counts unique field values of a field per group.
> This is just one example of group statistics that one might want.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

       

---------------------------------------------------------------------
To unsubscribe, e-mail: [hidden email]
For additional commands, e-mail: [hidden email]

Reply | Threaded
Open this post in threaded view
|

[jira] [Updated] (LUCENE-3444) Distinct field value count per group

JIRA jira@apache.org
In reply to this post by JIRA jira@apache.org

     [ https://issues.apache.org/jira/browse/LUCENE-3444?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Martijn van Groningen updated LUCENE-3444:
------------------------------------------

    Attachment: LUCENE-3444.patch

Added new patch.
* Updated patch to current trunk.
* Added random test. Fails now in some cases.
* Added function (mutable value) based implementation.

It is almost ready to be committed!
               

> Distinct field value count per group
> ------------------------------------
>
>                 Key: LUCENE-3444
>                 URL: https://issues.apache.org/jira/browse/LUCENE-3444
>             Project: Lucene - Java
>          Issue Type: New Feature
>          Components: modules/grouping
>            Reporter: Martijn van Groningen
>             Fix For: 4.0
>
>         Attachments: LUCENE-3444.patch, LUCENE-3444.patch, LUCENE-3444.patch, LUCENE-3444.patch
>
>
> Support a second pass collector that counts unique field values of a field per group.
> This is just one example of group statistics that one might want.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

       

---------------------------------------------------------------------
To unsubscribe, e-mail: [hidden email]
For additional commands, e-mail: [hidden email]

Reply | Threaded
Open this post in threaded view
|

[jira] [Resolved] (LUCENE-3444) Distinct field value count per group

JIRA jira@apache.org
In reply to this post by JIRA jira@apache.org

     [ https://issues.apache.org/jira/browse/LUCENE-3444?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Martijn van Groningen resolved LUCENE-3444.
-------------------------------------------

       Resolution: Fixed
    Lucene Fields:   (was: New)

Committed to trunk.
               

> Distinct field value count per group
> ------------------------------------
>
>                 Key: LUCENE-3444
>                 URL: https://issues.apache.org/jira/browse/LUCENE-3444
>             Project: Lucene - Java
>          Issue Type: New Feature
>          Components: modules/grouping
>            Reporter: Martijn van Groningen
>             Fix For: 4.0
>
>         Attachments: LUCENE-3444.patch, LUCENE-3444.patch, LUCENE-3444.patch, LUCENE-3444.patch
>
>
> Support a second pass collector that counts unique field values of a field per group.
> This is just one example of group statistics that one might want.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

       

---------------------------------------------------------------------
To unsubscribe, e-mail: [hidden email]
For additional commands, e-mail: [hidden email]