[jira] Commented: (SOLR-153) Facet Index

classic Classic list List threaded Threaded
1 message Options
Reply | Threaded
Open this post in threaded view
|

[jira] Commented: (SOLR-153) Facet Index

Hudson (Jira)

    [ https://issues.apache.org/jira/browse/SOLR-153?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12921460#action_12921460 ]

Yonik Seeley commented on SOLR-153:
-----------------------------------

I think this facet algorithm could do well when both the number of unique terms are high, and the number of values per document is high.  That's really the only case where our existing algorithms fall down.

There's more info about how this should work, starting here:
http://search.lucidimagination.com/search/document/6ccbec5e602687ae/facet_optimizing
And then the comments in the code of course.

bq. How much work would it to integrate your work into facets? E.g. to get an idea on real data?

Not sure... it's been a long time, and I was brainstorming in code - I never tried running it, so I guarantee there are tons of bugs.  Cool stuff though - wish I had time to work on it again.

> Facet Index
> -----------
>
>                 Key: SOLR-153
>                 URL: https://issues.apache.org/jira/browse/SOLR-153
>             Project: Solr
>          Issue Type: New Feature
>            Reporter: Yonik Seeley
>         Attachments: facettree.patch, facettree.patch
>
>
> A facet index, initially for non-hierarchical facets.
> Start with all terms, and a set of documents for each term.  Group lower level nodes by taking the union of the sets, but keep track of the largest set going back all the way to the leaves (the max doc-freq for that node).

--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


---------------------------------------------------------------------
To unsubscribe, e-mail: [hidden email]
For additional commands, e-mail: [hidden email]