[jira] Created: (SOLR-247) Allow facet.field=* to facet on all fields (without knowing what they are)

classic Classic list List threaded Threaded
24 messages Options
12
Reply | Threaded
Open this post in threaded view
|

[jira] Created: (SOLR-247) Allow facet.field=* to facet on all fields (without knowing what they are)

Michael Gibney (Jira)
Allow facet.field=* to facet on all fields (without knowing what they are)
--------------------------------------------------------------------------

                 Key: SOLR-247
                 URL: https://issues.apache.org/jira/browse/SOLR-247
             Project: Solr
          Issue Type: Improvement
            Reporter: Ryan McKinley
            Priority: Minor


I don't know if this is a good idea to include -- it is potentially a bad idea to use it, but that can be ok.

This came out of trying to use faceting for the LukeRequestHandler top term collecting.
http://www.nabble.com/Luke-request-handler-issue-tf3762155.html

--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply | Threaded
Open this post in threaded view
|

[jira] Updated: (SOLR-247) Allow facet.field=* to facet on all fields (without knowing what they are)

Michael Gibney (Jira)

     [ https://issues.apache.org/jira/browse/SOLR-247?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Ryan McKinley updated SOLR-247:
-------------------------------

    Attachment: SOLR-247-FacetAllFields.patch

> Allow facet.field=* to facet on all fields (without knowing what they are)
> --------------------------------------------------------------------------
>
>                 Key: SOLR-247
>                 URL: https://issues.apache.org/jira/browse/SOLR-247
>             Project: Solr
>          Issue Type: Improvement
>            Reporter: Ryan McKinley
>            Priority: Minor
>         Attachments: SOLR-247-FacetAllFields.patch
>
>
> I don't know if this is a good idea to include -- it is potentially a bad idea to use it, but that can be ok.
> This came out of trying to use faceting for the LukeRequestHandler top term collecting.
> http://www.nabble.com/Luke-request-handler-issue-tf3762155.html

--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply | Threaded
Open this post in threaded view
|

[jira] Commented: (SOLR-247) Allow facet.field=* to facet on all fields (without knowing what they are)

Michael Gibney (Jira)
In reply to this post by Michael Gibney (Jira)

    [ https://issues.apache.org/jira/browse/SOLR-247?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12498311 ]

Erik Hatcher commented on SOLR-247:
-----------------------------------

I can see value in supporting the dynamicField wildcard syntax, so *_facet would work.   In fact, maybe that'd be a good syntax to support for all fl-like parameters too.

* scares me, and it'd certainly be discouraged for anything but small indexes!   But of course I don't have to use it.   :)

> Allow facet.field=* to facet on all fields (without knowing what they are)
> --------------------------------------------------------------------------
>
>                 Key: SOLR-247
>                 URL: https://issues.apache.org/jira/browse/SOLR-247
>             Project: Solr
>          Issue Type: Improvement
>            Reporter: Ryan McKinley
>            Priority: Minor
>         Attachments: SOLR-247-FacetAllFields.patch
>
>
> I don't know if this is a good idea to include -- it is potentially a bad idea to use it, but that can be ok.
> This came out of trying to use faceting for the LukeRequestHandler top term collecting.
> http://www.nabble.com/Luke-request-handler-issue-tf3762155.html

--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply | Threaded
Open this post in threaded view
|

[jira] Commented: (SOLR-247) Allow facet.field=* to facet on all fields (without knowing what they are)

Michael Gibney (Jira)
In reply to this post by Michael Gibney (Jira)

    [ https://issues.apache.org/jira/browse/SOLR-247?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12498336 ]

Hoss Man commented on SOLR-247:
-------------------------------

I have a really hard time imagining anything but the most trivial use cases for facet.field=* ... it doesn't really sime like a problem in need of a solution.

with somehting like fl=*, we're only talking about stored fields ... storing a field makes no sense unless you plan on returning it in the field list some of the time, so fl=* makes sense as a "return all of hte fields that are possible to return" option.

There are *lots* of reasons why a field might be indexed though, so faceting on every indexed field doesn't seem like it would ever make sense.

in my opinion a "best practice" is not to use fl=* unless you are debugging anyway, otherwise you find yourself getting slammed with large amounts of data you don't want as the index evolves over time ... something like facet.field=* would be worse because it's not just the amount of data getting returned that would increase, but the amount of computation (and time and poor cache performance) that would spike as well.

if we do this, i would think it only makes sense to generalize the use of "*" in both fl and facet.field into a true glob style syntax, so we can at least encourage people who want this type of syntax to use a naming convention to help limit how much they hurt themselves.

(i have no problem giving people enough rope to hang themselves, but we shouldn't tie a noose in the rope before we give it to them)

 

> Allow facet.field=* to facet on all fields (without knowing what they are)
> --------------------------------------------------------------------------
>
>                 Key: SOLR-247
>                 URL: https://issues.apache.org/jira/browse/SOLR-247
>             Project: Solr
>          Issue Type: Improvement
>            Reporter: Ryan McKinley
>            Priority: Minor
>         Attachments: SOLR-247-FacetAllFields.patch
>
>
> I don't know if this is a good idea to include -- it is potentially a bad idea to use it, but that can be ok.
> This came out of trying to use faceting for the LukeRequestHandler top term collecting.
> http://www.nabble.com/Luke-request-handler-issue-tf3762155.html

--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply | Threaded
Open this post in threaded view
|

[jira] Commented: (SOLR-247) Allow facet.field=* to facet on all fields (without knowing what they are)

Michael Gibney (Jira)
In reply to this post by Michael Gibney (Jira)

    [ https://issues.apache.org/jira/browse/SOLR-247?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12498338 ]

Ryan McKinley commented on SOLR-247:
------------------------------------

>
> There are *lots* of reasons why a field might be indexed though, so faceting on every indexed field doesn't seem like it would ever make sense.
>

agreed, but *_facet would be useful

>
> if we do this, i would think it only makes sense to generalize the use of "*" in both fl and facet.field into a true glob style syntax

One issue is that fl=XXX is typically a field list separated with "," or "|", facet.field expects each field as a separate parameter.




> Allow facet.field=* to facet on all fields (without knowing what they are)
> --------------------------------------------------------------------------
>
>                 Key: SOLR-247
>                 URL: https://issues.apache.org/jira/browse/SOLR-247
>             Project: Solr
>          Issue Type: Improvement
>            Reporter: Ryan McKinley
>            Priority: Minor
>         Attachments: SOLR-247-FacetAllFields.patch
>
>
> I don't know if this is a good idea to include -- it is potentially a bad idea to use it, but that can be ok.
> This came out of trying to use faceting for the LukeRequestHandler top term collecting.
> http://www.nabble.com/Luke-request-handler-issue-tf3762155.html

--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply | Threaded
Open this post in threaded view
|

Re: [jira] Commented: (SOLR-247) Allow facet.field=* to facet on all fields (without knowing what they are)

Chris Hostetter-3

: One issue is that fl=XXX is typically a field list separated with "," or
: "|", facet.field expects each field as a separate parameter.

personally, i've never really lke that splitting behavior of fl, i'd
really rather not add it to facet.field.


-Hoss

Reply | Threaded
Open this post in threaded view
|

RE: [jira] Commented: (SOLR-247) Allow facet.field=* to facet on all fields (without knowing what they are)

Will Johnson
In reply to this post by Michael Gibney (Jira)
What about adding an optional parameter to the field definition in the
IndexSchema for defaultFacet="true/false".  This would make solr's
functionality/configuration similar with many of the major search engine
vendors and keep people from having to follow naming conventions for
fields.  Then facet.field=* just turns on those fields with
defaultFacet="true" but still lets you facet on others if you deem
necessary.  If there were a list of default facet fields it might also
let the index warming process pre-cache the results of those filter
queries which would be a nice side benefit.

The *_facet thing scares me because I'm afraid I'll eventually be
'forced' to have field names like:

myfield_facet_vector_stem_morelikethis_highlight.

- will

-----Original Message-----
From: Ryan McKinley (JIRA) [mailto:[hidden email]]
Sent: Wednesday, May 23, 2007 3:38 PM
To: [hidden email]
Subject: [jira] Commented: (SOLR-247) Allow facet.field=* to facet on
all fields (without knowing what they are)


    [
https://issues.apache.org/jira/browse/SOLR-247?page=com.atlassian.jira.p
lugin.system.issuetabpanels:comment-tabpanel#action_12498338 ]

Ryan McKinley commented on SOLR-247:
------------------------------------

>
> There are *lots* of reasons why a field might be indexed though, so
faceting on every indexed field doesn't seem like it would ever make
sense.
>

agreed, but *_facet would be useful

>
> if we do this, i would think it only makes sense to generalize the use
of "*" in both fl and facet.field into a true glob style syntax

One issue is that fl=XXX is typically a field list separated with "," or
"|", facet.field expects each field as a separate parameter.




> Allow facet.field=* to facet on all fields (without knowing what they
are)
>
------------------------------------------------------------------------
--

>
>                 Key: SOLR-247
>                 URL: https://issues.apache.org/jira/browse/SOLR-247
>             Project: Solr
>          Issue Type: Improvement
>            Reporter: Ryan McKinley
>            Priority: Minor
>         Attachments: SOLR-247-FacetAllFields.patch
>
>
> I don't know if this is a good idea to include -- it is potentially a
bad idea to use it, but that can be ok.
> This came out of trying to use faceting for the LukeRequestHandler top
term collecting.
> http://www.nabble.com/Luke-request-handler-issue-tf3762155.html

--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply | Threaded
Open this post in threaded view
|

RE: [jira] Commented: (SOLR-247) Allow facet.field=* to facet on all fields (without knowing what they are)

Chris Hostetter-3

: What about adding an optional parameter to the field definition in the
: IndexSchema for defaultFacet="true/false".  This would make solr's

information should go in schema.xml if the are inherient to the data
and the physical index.  Things should go in the solrconfig.xml if they
relate to how the index is used -- a master might have a differnet
solrconfig then a slave because it doesn't get used for queries, while two
diffenret slaves might have differnet solrconfigs because they get used by
different sets of clients and need differnet cache configs or request
handler configs -- but all three would use the same schema.xml because the
physical index is the same in all cases.

a mechanism already exists to say "by default, i want clients to get
facets on certian fields" in teh solrconfig.xml, it's just a default param
for hte requestHandler ...

  <requestHandler name="standard" class="solr.StandardRequestHandler">
    <!-- default values for query parameters -->
     <lst name="defaults">
       <str name="facet.field">category</str>
       <str name="facet.field">author</str>
       <str name="facet.field">type</str>
       ...

...then the params are defaulted for everyone, and the only thingthe user
needs in the URL is "facet=true" ... or that can be defaulted as well.


-Hoss

Reply | Threaded
Open this post in threaded view
|

RE: [jira] Commented: (SOLR-247) Allow facet.field=* to facet on all fields (without knowing what they are)

Will Johnson
Good point, I was proposing it as an alternative to myfield_facet since
that seems to overload the field name a bit too much.  I agree that
solrconfig + specialized request handlers are a much better location for
that kind of stuff.  

Also, the reason other engines require you to mark the fields in the
index definition is because they actually index the data differently if
it is a facet vs a normal indexed field.  It's cool that solr doesn't
have to do this but there may be a case where it would be a good idea
someday.

- will

-----Original Message-----
From: Chris Hostetter [mailto:[hidden email]]
Sent: Wednesday, May 23, 2007 6:34 PM
To: Solr Dev
Subject: RE: [jira] Commented: (SOLR-247) Allow facet.field=* to facet
on all fields (without knowing what they are)


: What about adding an optional parameter to the field definition in the
: IndexSchema for defaultFacet="true/false".  This would make solr's

information should go in schema.xml if the are inherient to the data
and the physical index.  Things should go in the solrconfig.xml if they
relate to how the index is used -- a master might have a differnet
solrconfig then a slave because it doesn't get used for queries, while
two
diffenret slaves might have differnet solrconfigs because they get used
by
different sets of clients and need differnet cache configs or request
handler configs -- but all three would use the same schema.xml because
the
physical index is the same in all cases.

a mechanism already exists to say "by default, i want clients to get
facets on certian fields" in teh solrconfig.xml, it's just a default
param
for hte requestHandler ...

  <requestHandler name="standard" class="solr.StandardRequestHandler">
    <!-- default values for query parameters -->
     <lst name="defaults">
       <str name="facet.field">category</str>
       <str name="facet.field">author</str>
       <str name="facet.field">type</str>
       ...

...then the params are defaulted for everyone, and the only thingthe
user
needs in the URL is "facet=true" ... or that can be defaulted as well.


-Hoss

Reply | Threaded
Open this post in threaded view
|

RE: [jira] Commented: (SOLR-247) Allow facet.field=* to facet on all fields (without knowing what they are)

Chris Hostetter-3

: Also, the reason other engines require you to mark the fields in the
: index definition is because they actually index the data differently if
: it is a facet vs a normal indexed field.  It's cool that solr doesn't
: have to do this but there may be a case where it would be a good idea
: someday.

right ... if down the road we find a way to imrpvoe faceting (or any other
feature) by storing more data on disk at indexing time, then configuration
to tell you that data was there and how to use it would live in the
schema.xml -- but options that don't matter once the data is already
written (or can be differet for differnet people depending on how they
use the data) can/should live in solrconfig.xml (like the options in
mainIndex right now)

Alternately: if kwe add some custom "facet caching" that doens't require
any new data on disk, but builds new in memory structures, that should
live in the solrconfig.xml as well since it's the kind of thing that would
likely be configured idfferently for masters/slaves.


-Hoss

Reply | Threaded
Open this post in threaded view
|

[jira] Commented: (SOLR-247) Allow facet.field=* to facet on all fields (without knowing what they are)

Michael Gibney (Jira)
In reply to this post by Michael Gibney (Jira)

    [ https://issues.apache.org/jira/browse/SOLR-247?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12498761 ]

Hoss Man commented on SOLR-247:
-------------------------------

see some follow up comments in the mailing lists...

http://www.nabble.com/forum/Search.jtp?forum=14479&local=y&query=SOLR-247

in a nut shell, i think this issue *can* be resolved won't fix ... but i'm not opposed to leaving open if someone wants to work on it.  there are ways for people to configure solr so that all the fields they want to facet on are faceted on by defualt (when configuring the requestHanlder) which is safer then wild carding.

> Allow facet.field=* to facet on all fields (without knowing what they are)
> --------------------------------------------------------------------------
>
>                 Key: SOLR-247
>                 URL: https://issues.apache.org/jira/browse/SOLR-247
>             Project: Solr
>          Issue Type: Improvement
>            Reporter: Ryan McKinley
>            Priority: Minor
>         Attachments: SOLR-247-FacetAllFields.patch
>
>
> I don't know if this is a good idea to include -- it is potentially a bad idea to use it, but that can be ok.
> This came out of trying to use faceting for the LukeRequestHandler top term collecting.
> http://www.nabble.com/Luke-request-handler-issue-tf3762155.html

--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply | Threaded
Open this post in threaded view
|

[jira] Commented: (SOLR-247) Allow facet.field=* to facet on all fields (without knowing what they are)

Michael Gibney (Jira)
In reply to this post by Michael Gibney (Jira)

    [ https://issues.apache.org/jira/browse/SOLR-247?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12522345 ]

Pieter Berkel commented on SOLR-247:
------------------------------------

Some recent discussion on this topic:

http://www.nabble.com/Structured-Lucene-documents-tf4234661.html

I get the impression that general wildcard syntax support for field listing parameters (i.e. the reverse of dynamic fields) as described in the above thread would be far more useful than a simple '*' match-anything syntax (not only in faceting but other cases like hl.fl and perhaps even mlt.fl).

I haven't really considered the performance issues of this approach however, as it would involve checking each field supplied in the parameter for '*' before expanding it into full field names for every query.

Given the above, the fact that it could be used across multiple response handlers and subhandlers like SimpleFacets & Highlighting, and that it would require access to IndexReader to getFieldNames(), where might be the most sensible place to put this code?


> Allow facet.field=* to facet on all fields (without knowing what they are)
> --------------------------------------------------------------------------
>
>                 Key: SOLR-247
>                 URL: https://issues.apache.org/jira/browse/SOLR-247
>             Project: Solr
>          Issue Type: Improvement
>            Reporter: Ryan McKinley
>            Priority: Minor
>         Attachments: SOLR-247-FacetAllFields.patch
>
>
> I don't know if this is a good idea to include -- it is potentially a bad idea to use it, but that can be ok.
> This came out of trying to use faceting for the LukeRequestHandler top term collecting.
> http://www.nabble.com/Luke-request-handler-issue-tf3762155.html

--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply | Threaded
Open this post in threaded view
|

[jira] Issue Comment Edited: (SOLR-247) Allow facet.field=* to facet on all fields (without knowing what they are)

Michael Gibney (Jira)
In reply to this post by Michael Gibney (Jira)

    [ https://issues.apache.org/jira/browse/SOLR-247?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12557719#action_12557719 ]

mruno edited comment on SOLR-247 at 1/10/08 9:45 AM:
------------------------------------------------------------

http://www.nabble.com/Dynamic-fields---Facets-to14739422.html

also provides a use case for this to be fixed. While I'd never do a '*', I'd love to be able to do a 'attribute_*'. It just makes using the dynamic fields so much easier.

      was (Author: mruno):
    http://www.nabble.com/Dynamic-fields---Facets-to14739422.html

also provides a use case for this to be fixed. While I'd never do a "*", I'd love to be able to do a "attribute_*". It just makes using the dynamic fields so much easier.
 

> Allow facet.field=* to facet on all fields (without knowing what they are)
> --------------------------------------------------------------------------
>
>                 Key: SOLR-247
>                 URL: https://issues.apache.org/jira/browse/SOLR-247
>             Project: Solr
>          Issue Type: Improvement
>            Reporter: Ryan McKinley
>            Priority: Minor
>         Attachments: SOLR-247-FacetAllFields.patch
>
>
> I don't know if this is a good idea to include -- it is potentially a bad idea to use it, but that can be ok.
> This came out of trying to use faceting for the LukeRequestHandler top term collecting.
> http://www.nabble.com/Luke-request-handler-issue-tf3762155.html

--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply | Threaded
Open this post in threaded view
|

[jira] Commented: (SOLR-247) Allow facet.field=* to facet on all fields (without knowing what they are)

Michael Gibney (Jira)
In reply to this post by Michael Gibney (Jira)

    [ https://issues.apache.org/jira/browse/SOLR-247?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12557719#action_12557719 ]

Matthew Runo commented on SOLR-247:
-----------------------------------

http://www.nabble.com/Dynamic-fields---Facets-to14739422.html

also provides a use case for this to be fixed. While I'd never do a "*", I'd love to be able to do a "attribute_*". It just makes using the dynamic fields so much easier.

> Allow facet.field=* to facet on all fields (without knowing what they are)
> --------------------------------------------------------------------------
>
>                 Key: SOLR-247
>                 URL: https://issues.apache.org/jira/browse/SOLR-247
>             Project: Solr
>          Issue Type: Improvement
>            Reporter: Ryan McKinley
>            Priority: Minor
>         Attachments: SOLR-247-FacetAllFields.patch
>
>
> I don't know if this is a good idea to include -- it is potentially a bad idea to use it, but that can be ok.
> This came out of trying to use faceting for the LukeRequestHandler top term collecting.
> http://www.nabble.com/Luke-request-handler-issue-tf3762155.html

--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply | Threaded
Open this post in threaded view
|

[jira] Issue Comment Edited: (SOLR-247) Allow facet.field=* to facet on all fields (without knowing what they are)

Michael Gibney (Jira)
In reply to this post by Michael Gibney (Jira)

    [ https://issues.apache.org/jira/browse/SOLR-247?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12557719#action_12557719 ]

mruno edited comment on SOLR-247 at 1/10/08 9:46 AM:
------------------------------------------------------------

http://www.nabble.com/Dynamic-fields---Facets-to14739422.html

also provides a use case for this to be fixed. While I'd never do a facet on *, I'd love to be able to do one on attribute_*. It just makes using the dynamic fields so much easier.

      was (Author: mruno):
    http://www.nabble.com/Dynamic-fields---Facets-to14739422.html

also provides a use case for this to be fixed. While I'd never do a '*', I'd love to be able to do a 'attribute_*'. It just makes using the dynamic fields so much easier.
 

> Allow facet.field=* to facet on all fields (without knowing what they are)
> --------------------------------------------------------------------------
>
>                 Key: SOLR-247
>                 URL: https://issues.apache.org/jira/browse/SOLR-247
>             Project: Solr
>          Issue Type: Improvement
>            Reporter: Ryan McKinley
>            Priority: Minor
>         Attachments: SOLR-247-FacetAllFields.patch
>
>
> I don't know if this is a good idea to include -- it is potentially a bad idea to use it, but that can be ok.
> This came out of trying to use faceting for the LukeRequestHandler top term collecting.
> http://www.nabble.com/Luke-request-handler-issue-tf3762155.html

--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply | Threaded
Open this post in threaded view
|

[jira] Issue Comment Edited: (SOLR-247) Allow facet.field=* to facet on all fields (without knowing what they are)

Michael Gibney (Jira)
In reply to this post by Michael Gibney (Jira)

    [ https://issues.apache.org/jira/browse/SOLR-247?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12557719#action_12557719 ]

mruno edited comment on SOLR-247 at 1/10/08 9:46 AM:
------------------------------------------------------------

http://www.nabble.com/Dynamic-fields---Facets-to14739422.html

also provides a use case for this to be fixed. While I'd never do a facet on the wildcard, I'd love to be able to do one on attribute_<wildcard>. It just makes using the dynamic fields so much easier.

      was (Author: mruno):
    http://www.nabble.com/Dynamic-fields---Facets-to14739422.html

also provides a use case for this to be fixed. While I'd never do a facet on *, I'd love to be able to do one on attribute_*. It just makes using the dynamic fields so much easier.
 

> Allow facet.field=* to facet on all fields (without knowing what they are)
> --------------------------------------------------------------------------
>
>                 Key: SOLR-247
>                 URL: https://issues.apache.org/jira/browse/SOLR-247
>             Project: Solr
>          Issue Type: Improvement
>            Reporter: Ryan McKinley
>            Priority: Minor
>         Attachments: SOLR-247-FacetAllFields.patch
>
>
> I don't know if this is a good idea to include -- it is potentially a bad idea to use it, but that can be ok.
> This came out of trying to use faceting for the LukeRequestHandler top term collecting.
> http://www.nabble.com/Luke-request-handler-issue-tf3762155.html

--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply | Threaded
Open this post in threaded view
|

[jira] Commented: (SOLR-247) Allow facet.field=* to facet on all fields (without knowing what they are)

Michael Gibney (Jira)
In reply to this post by Michael Gibney (Jira)

    [ https://issues.apache.org/jira/browse/SOLR-247?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12557884#action_12557884 ]

Hoss Man commented on SOLR-247:
-------------------------------

i've put soem thoughts on the broader issues of having solr admin control over how field names are dealt with (globs, regexes, aliasing, etc...)  in various contexts on the wiki...

http://wiki.apache.org/solr/FieldAliasesAndGlobsInParams

...it might be best to use that as a whiteboard for a design discussion since the ultimate issues are a little bigger then this issue originally set out to tackle.

> Allow facet.field=* to facet on all fields (without knowing what they are)
> --------------------------------------------------------------------------
>
>                 Key: SOLR-247
>                 URL: https://issues.apache.org/jira/browse/SOLR-247
>             Project: Solr
>          Issue Type: Improvement
>            Reporter: Ryan McKinley
>            Priority: Minor
>         Attachments: SOLR-247-FacetAllFields.patch
>
>
> I don't know if this is a good idea to include -- it is potentially a bad idea to use it, but that can be ok.
> This came out of trying to use faceting for the LukeRequestHandler top term collecting.
> http://www.nabble.com/Luke-request-handler-issue-tf3762155.html

--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply | Threaded
Open this post in threaded view
|

[jira] Issue Comment Edited: (SOLR-247) Allow facet.field=* to facet on all fields (without knowing what they are)

Michael Gibney (Jira)
In reply to this post by Michael Gibney (Jira)

    [ https://issues.apache.org/jira/browse/SOLR-247?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12498336#action_12498336 ]

hossman edited comment on SOLR-247 at 4/15/08 10:34 AM:
---------------------------------------------------------

I have a really hard time imagining anything but the most trivial use cases for facet.field=* ... it doesn't really sime like a problem in need of a solution.

with somehting like {{fl=\*}}, we're only talking about stored fields ... storing a field makes no sense unless you plan on returning it in the field list some of the time, so {{fl=\*}} makes sense as a "return all of hte fields that are possible to return" option.

There are *lots* of reasons why a field might be indexed though, so faceting on every indexed field doesn't seem like it would ever make sense.

in my opinion a "best practice" is not to use fl=* unless you are debugging anyway, otherwise you find yourself getting slammed with large amounts of data you don't want as the index evolves over time ... something like facet.field=* would be worse because it's not just the amount of data getting returned that would increase, but the amount of computation (and time and poor cache performance) that would spike as well.

if we do this, i would think it only makes sense to generalize the use of "*" in both fl and facet.field into a true glob style syntax, so we can at least encourage people who want this type of syntax to use a naming convention to help limit how much they hurt themselves.

(i have no problem giving people enough rope to hang themselves, but we shouldn't tie a noose in the rope before we give it to them)

 

      was (Author: hossman):
    I have a really hard time imagining anything but the most trivial use cases for facet.field=* ... it doesn't really sime like a problem in need of a solution.

with somehting like fl=*, we're only talking about stored fields ... storing a field makes no sense unless you plan on returning it in the field list some of the time, so fl=* makes sense as a "return all of hte fields that are possible to return" option.

There are *lots* of reasons why a field might be indexed though, so faceting on every indexed field doesn't seem like it would ever make sense.

in my opinion a "best practice" is not to use fl=* unless you are debugging anyway, otherwise you find yourself getting slammed with large amounts of data you don't want as the index evolves over time ... something like facet.field=* would be worse because it's not just the amount of data getting returned that would increase, but the amount of computation (and time and poor cache performance) that would spike as well.

if we do this, i would think it only makes sense to generalize the use of "*" in both fl and facet.field into a true glob style syntax, so we can at least encourage people who want this type of syntax to use a naming convention to help limit how much they hurt themselves.

(i have no problem giving people enough rope to hang themselves, but we shouldn't tie a noose in the rope before we give it to them)

 
 

> Allow facet.field=* to facet on all fields (without knowing what they are)
> --------------------------------------------------------------------------
>
>                 Key: SOLR-247
>                 URL: https://issues.apache.org/jira/browse/SOLR-247
>             Project: Solr
>          Issue Type: Improvement
>            Reporter: Ryan McKinley
>            Priority: Minor
>         Attachments: SOLR-247-FacetAllFields.patch
>
>
> I don't know if this is a good idea to include -- it is potentially a bad idea to use it, but that can be ok.
> This came out of trying to use faceting for the LukeRequestHandler top term collecting.
> http://www.nabble.com/Luke-request-handler-issue-tf3762155.html

--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply | Threaded
Open this post in threaded view
|

[jira] Updated: (SOLR-247) Allow facet.field=* to facet on all fields (without knowing what they are)

Michael Gibney (Jira)
In reply to this post by Michael Gibney (Jira)

     [ https://issues.apache.org/jira/browse/SOLR-247?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Lars Kotthoff updated SOLR-247:
-------------------------------

    Attachment: SOLR-247.patch

Attaching patch which implements support for wildcards in facet field specifications similar to SOLR-540. If the facet field specification contains an asterisk, every indexed field the reader knows about is matched against the corresponding regular expression.

Note that the unit tests part of the patch sort of depends on SOLR-645. When applied to the current trunk it will create the new facets test file with all the old tests plus the new ones. This doesn't cause anything to not work anymore, but duplicates the old tests. I can provide a new patch either against the current trunk or against the trunk with SOLR-645 committed, whichever is required.

> Allow facet.field=* to facet on all fields (without knowing what they are)
> --------------------------------------------------------------------------
>
>                 Key: SOLR-247
>                 URL: https://issues.apache.org/jira/browse/SOLR-247
>             Project: Solr
>          Issue Type: Improvement
>            Reporter: Ryan McKinley
>            Priority: Minor
>         Attachments: SOLR-247-FacetAllFields.patch, SOLR-247.patch
>
>
> I don't know if this is a good idea to include -- it is potentially a bad idea to use it, but that can be ok.
> This came out of trying to use faceting for the LukeRequestHandler top term collecting.
> http://www.nabble.com/Luke-request-handler-issue-tf3762155.html

--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply | Threaded
Open this post in threaded view
|

[jira] Updated: (SOLR-247) Allow facet.field=* to facet on all fields (without knowing what they are)

Michael Gibney (Jira)
In reply to this post by Michael Gibney (Jira)

     [ https://issues.apache.org/jira/browse/SOLR-247?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Lars Kotthoff updated SOLR-247:
-------------------------------

    Attachment: SOLR-247.patch

Attaching new patch which applies to current TRUNK.

> Allow facet.field=* to facet on all fields (without knowing what they are)
> --------------------------------------------------------------------------
>
>                 Key: SOLR-247
>                 URL: https://issues.apache.org/jira/browse/SOLR-247
>             Project: Solr
>          Issue Type: Improvement
>            Reporter: Ryan McKinley
>            Priority: Minor
>         Attachments: SOLR-247-FacetAllFields.patch, SOLR-247.patch, SOLR-247.patch
>
>
> I don't know if this is a good idea to include -- it is potentially a bad idea to use it, but that can be ok.
> This came out of trying to use faceting for the LukeRequestHandler top term collecting.
> http://www.nabble.com/Luke-request-handler-issue-tf3762155.html

--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

12