[jira] [Created] (SOLR-3553) Make solr Plugins (FieldType, Analyser etc) be inheritable

Previous Topic Next Topic
 
classic Classic list List threaded Threaded
8 messages Options
Reply | Threaded
Open this post in threaded view
|

[jira] [Created] (SOLR-3553) Make solr Plugins (FieldType, Analyser etc) be inheritable

JIRA jira@apache.org
Greg Bowyer created SOLR-3553:
---------------------------------

             Summary: Make solr Plugins (FieldType, Analyser etc) be inheritable
                 Key: SOLR-3553
                 URL: https://issues.apache.org/jira/browse/SOLR-3553
             Project: Solr
          Issue Type: New Feature
    Affects Versions: 4.0
            Reporter: Greg Bowyer


During defining an experimental schema for trialling all the combinations of scoring models for lucene 4 it struck me how much duplication is involved in the schema xml

Using the current schema I am working with as an example,

{code:xml}
<fieldType name="text" class="solr.TextField"
    positionIncrementGap="100"
    autoGeneratePhraseQueries="true">
    <analyzer type="index">
        <tokenizer class="solr.WhitespaceTokenizerFactory"/>
       
        <filter class="solr.TrimFilterFactory"/>
        <filter class="solr.LowerCaseFilterFactory"/>
        <filter class="solr.WordDelimiterFilterFactory"
           generateWordParts="1" generateNumberParts="1"
           catenateWords="0" catenateNumbers="1"  catenateAll="0"
           splitOnCaseChange="1" splitOnNumerics="0"
           stemEnglishPossesive="0" preserveOriginal="0" />
        <filter class="solr.ASCIIFoldingFilterFactory"/>

        <filter class="solr.StopFilterFactory"
           ignoreCase="true" words="stopwords.txt"
           enablePositionIncrements="true" />
       

... And so on
{code}

What I would like to be able to say is

{code:xml}
<fieldType name="text_bm25" parent="text">
     <similarity class="solr.BM25SimilarityFactory">
        <float name="k1">1.2</float>
        <float name="b">0.76</float>
    </similarity>
</fieldType>
{code}

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

       

---------------------------------------------------------------------
To unsubscribe, e-mail: [hidden email]
For additional commands, e-mail: [hidden email]

Reply | Threaded
Open this post in threaded view
|

[jira] [Updated] (SOLR-3553) Make solr Plugins (FieldType, Analyser etc) be inheritable

JIRA jira@apache.org

     [ https://issues.apache.org/jira/browse/SOLR-3553?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Greg Bowyer updated SOLR-3553:
------------------------------

    Attachment: SOLR-3553-Inheritable-schema-elements.patch

Rough attempt at making schema elements inheritable, probably broken in ways I cant see yet
               

> Make solr Plugins (FieldType, Analyser etc) be inheritable
> ----------------------------------------------------------
>
>                 Key: SOLR-3553
>                 URL: https://issues.apache.org/jira/browse/SOLR-3553
>             Project: Solr
>          Issue Type: New Feature
>    Affects Versions: 4.0
>            Reporter: Greg Bowyer
>         Attachments: SOLR-3553-Inheritable-schema-elements.patch
>
>
> During defining an experimental schema for trialling all the combinations of scoring models for lucene 4 it struck me how much duplication is involved in the schema xml
> Using the current schema I am working with as an example,
> {code:xml}
> <fieldType name="text" class="solr.TextField"
>     positionIncrementGap="100"
>     autoGeneratePhraseQueries="true">
>     <analyzer type="index">
>         <tokenizer class="solr.WhitespaceTokenizerFactory"/>
>        
>         <filter class="solr.TrimFilterFactory"/>
>         <filter class="solr.LowerCaseFilterFactory"/>
>         <filter class="solr.WordDelimiterFilterFactory"
>            generateWordParts="1" generateNumberParts="1"
>            catenateWords="0" catenateNumbers="1"  catenateAll="0"
>            splitOnCaseChange="1" splitOnNumerics="0"
>            stemEnglishPossesive="0" preserveOriginal="0" />
>         <filter class="solr.ASCIIFoldingFilterFactory"/>
>         <filter class="solr.StopFilterFactory"
>            ignoreCase="true" words="stopwords.txt"
>            enablePositionIncrements="true" />
>        
> ... And so on
> {code}
> What I would like to be able to say is
> {code:xml}
> <fieldType name="text_bm25" parent="text">
>      <similarity class="solr.BM25SimilarityFactory">
>         <float name="k1">1.2</float>
>         <float name="b">0.76</float>
>     </similarity>
> </fieldType>
> {code}

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

       

---------------------------------------------------------------------
To unsubscribe, e-mail: [hidden email]
For additional commands, e-mail: [hidden email]

Reply | Threaded
Open this post in threaded view
|

[jira] [Updated] (SOLR-3553) Make solr Plugins (FieldType, Analyser etc) be inheritable

JIRA jira@apache.org
In reply to this post by JIRA jira@apache.org

     [ https://issues.apache.org/jira/browse/SOLR-3553?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Greg Bowyer updated SOLR-3553:
------------------------------

    Attachment: SOLR-3553-Inheritable-schema-elements.patch

Fixed a mistake making abstract required
               

> Make solr Plugins (FieldType, Analyser etc) be inheritable
> ----------------------------------------------------------
>
>                 Key: SOLR-3553
>                 URL: https://issues.apache.org/jira/browse/SOLR-3553
>             Project: Solr
>          Issue Type: New Feature
>    Affects Versions: 4.0
>            Reporter: Greg Bowyer
>         Attachments: SOLR-3553-Inheritable-schema-elements.patch, SOLR-3553-Inheritable-schema-elements.patch
>
>
> During defining an experimental schema for trialling all the combinations of scoring models for lucene 4 it struck me how much duplication is involved in the schema xml
> Using the current schema I am working with as an example,
> {code:xml}
> <fieldType name="text" class="solr.TextField"
>     positionIncrementGap="100"
>     autoGeneratePhraseQueries="true">
>     <analyzer type="index">
>         <tokenizer class="solr.WhitespaceTokenizerFactory"/>
>        
>         <filter class="solr.TrimFilterFactory"/>
>         <filter class="solr.LowerCaseFilterFactory"/>
>         <filter class="solr.WordDelimiterFilterFactory"
>            generateWordParts="1" generateNumberParts="1"
>            catenateWords="0" catenateNumbers="1"  catenateAll="0"
>            splitOnCaseChange="1" splitOnNumerics="0"
>            stemEnglishPossesive="0" preserveOriginal="0" />
>         <filter class="solr.ASCIIFoldingFilterFactory"/>
>         <filter class="solr.StopFilterFactory"
>            ignoreCase="true" words="stopwords.txt"
>            enablePositionIncrements="true" />
>        
> ... And so on
> {code}
> What I would like to be able to say is
> {code:xml}
> <fieldType name="text_bm25" parent="text">
>      <similarity class="solr.BM25SimilarityFactory">
>         <float name="k1">1.2</float>
>         <float name="b">0.76</float>
>     </similarity>
> </fieldType>
> {code}

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

       

---------------------------------------------------------------------
To unsubscribe, e-mail: [hidden email]
For additional commands, e-mail: [hidden email]

Reply | Threaded
Open this post in threaded view
|

[jira] [Updated] (SOLR-3553) Make solr Plugins (FieldType, Analyser etc) be inheritable

JIRA jira@apache.org
In reply to this post by JIRA jira@apache.org

     [ https://issues.apache.org/jira/browse/SOLR-3553?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Greg Bowyer updated SOLR-3553:
------------------------------

    Attachment: SOLR-3553-Inheritable-schema-elements.patch
   

> Make solr Plugins (FieldType, Analyser etc) be inheritable
> ----------------------------------------------------------
>
>                 Key: SOLR-3553
>                 URL: https://issues.apache.org/jira/browse/SOLR-3553
>             Project: Solr
>          Issue Type: New Feature
>    Affects Versions: 4.0
>            Reporter: Greg Bowyer
>         Attachments: SOLR-3553-Inheritable-schema-elements.patch, SOLR-3553-Inheritable-schema-elements.patch, SOLR-3553-Inheritable-schema-elements.patch
>
>
> During defining an experimental schema for trialling all the combinations of scoring models for lucene 4 it struck me how much duplication is involved in the schema xml
> Using the current schema I am working with as an example,
> {code:xml}
> <fieldType name="text" class="solr.TextField"
>     positionIncrementGap="100"
>     autoGeneratePhraseQueries="true">
>     <analyzer type="index">
>         <tokenizer class="solr.WhitespaceTokenizerFactory"/>
>        
>         <filter class="solr.TrimFilterFactory"/>
>         <filter class="solr.LowerCaseFilterFactory"/>
>         <filter class="solr.WordDelimiterFilterFactory"
>            generateWordParts="1" generateNumberParts="1"
>            catenateWords="0" catenateNumbers="1"  catenateAll="0"
>            splitOnCaseChange="1" splitOnNumerics="0"
>            stemEnglishPossesive="0" preserveOriginal="0" />
>         <filter class="solr.ASCIIFoldingFilterFactory"/>
>         <filter class="solr.StopFilterFactory"
>            ignoreCase="true" words="stopwords.txt"
>            enablePositionIncrements="true" />
>        
> ... And so on
> {code}
> What I would like to be able to say is
> {code:xml}
> <fieldType name="text_bm25" parent="text">
>      <similarity class="solr.BM25SimilarityFactory">
>         <float name="k1">1.2</float>
>         <float name="b">0.76</float>
>     </similarity>
> </fieldType>
> {code}

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

       

---------------------------------------------------------------------
To unsubscribe, e-mail: [hidden email]
For additional commands, e-mail: [hidden email]

Reply | Threaded
Open this post in threaded view
|

[jira] [Updated] (SOLR-3553) Make solr Plugins (FieldType, Analyser etc) be inheritable

JIRA jira@apache.org
In reply to this post by JIRA jira@apache.org

     [ https://issues.apache.org/jira/browse/SOLR-3553?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Greg Bowyer updated SOLR-3553:
------------------------------

    Attachment:     (was: SOLR-3553-Inheritable-schema-elements.patch)
   

> Make solr Plugins (FieldType, Analyser etc) be inheritable
> ----------------------------------------------------------
>
>                 Key: SOLR-3553
>                 URL: https://issues.apache.org/jira/browse/SOLR-3553
>             Project: Solr
>          Issue Type: New Feature
>    Affects Versions: 4.0
>            Reporter: Greg Bowyer
>         Attachments: SOLR-3553-Inheritable-schema-elements.patch
>
>
> During defining an experimental schema for trialling all the combinations of scoring models for lucene 4 it struck me how much duplication is involved in the schema xml
> Using the current schema I am working with as an example,
> {code:xml}
> <fieldType name="text" class="solr.TextField"
>     positionIncrementGap="100"
>     autoGeneratePhraseQueries="true">
>     <analyzer type="index">
>         <tokenizer class="solr.WhitespaceTokenizerFactory"/>
>        
>         <filter class="solr.TrimFilterFactory"/>
>         <filter class="solr.LowerCaseFilterFactory"/>
>         <filter class="solr.WordDelimiterFilterFactory"
>            generateWordParts="1" generateNumberParts="1"
>            catenateWords="0" catenateNumbers="1"  catenateAll="0"
>            splitOnCaseChange="1" splitOnNumerics="0"
>            stemEnglishPossesive="0" preserveOriginal="0" />
>         <filter class="solr.ASCIIFoldingFilterFactory"/>
>         <filter class="solr.StopFilterFactory"
>            ignoreCase="true" words="stopwords.txt"
>            enablePositionIncrements="true" />
>        
> ... And so on
> {code}
> What I would like to be able to say is
> {code:xml}
> <fieldType name="text_bm25" parent="text">
>      <similarity class="solr.BM25SimilarityFactory">
>         <float name="k1">1.2</float>
>         <float name="b">0.76</float>
>     </similarity>
> </fieldType>
> {code}

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

       

---------------------------------------------------------------------
To unsubscribe, e-mail: [hidden email]
For additional commands, e-mail: [hidden email]

Reply | Threaded
Open this post in threaded view
|

[jira] [Updated] (SOLR-3553) Make solr Plugins (FieldType, Analyser etc) be inheritable

JIRA jira@apache.org
In reply to this post by JIRA jira@apache.org

     [ https://issues.apache.org/jira/browse/SOLR-3553?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Greg Bowyer updated SOLR-3553:
------------------------------

    Attachment: SOLR-3553-Inheritable-schema-elements.patch
   

> Make solr Plugins (FieldType, Analyser etc) be inheritable
> ----------------------------------------------------------
>
>                 Key: SOLR-3553
>                 URL: https://issues.apache.org/jira/browse/SOLR-3553
>             Project: Solr
>          Issue Type: New Feature
>    Affects Versions: 4.0
>            Reporter: Greg Bowyer
>         Attachments: SOLR-3553-Inheritable-schema-elements.patch
>
>
> During defining an experimental schema for trialling all the combinations of scoring models for lucene 4 it struck me how much duplication is involved in the schema xml
> Using the current schema I am working with as an example,
> {code:xml}
> <fieldType name="text" class="solr.TextField"
>     positionIncrementGap="100"
>     autoGeneratePhraseQueries="true">
>     <analyzer type="index">
>         <tokenizer class="solr.WhitespaceTokenizerFactory"/>
>        
>         <filter class="solr.TrimFilterFactory"/>
>         <filter class="solr.LowerCaseFilterFactory"/>
>         <filter class="solr.WordDelimiterFilterFactory"
>            generateWordParts="1" generateNumberParts="1"
>            catenateWords="0" catenateNumbers="1"  catenateAll="0"
>            splitOnCaseChange="1" splitOnNumerics="0"
>            stemEnglishPossesive="0" preserveOriginal="0" />
>         <filter class="solr.ASCIIFoldingFilterFactory"/>
>         <filter class="solr.StopFilterFactory"
>            ignoreCase="true" words="stopwords.txt"
>            enablePositionIncrements="true" />
>        
> ... And so on
> {code}
> What I would like to be able to say is
> {code:xml}
> <fieldType name="text_bm25" parent="text">
>      <similarity class="solr.BM25SimilarityFactory">
>         <float name="k1">1.2</float>
>         <float name="b">0.76</float>
>     </similarity>
> </fieldType>
> {code}

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

       

---------------------------------------------------------------------
To unsubscribe, e-mail: [hidden email]
For additional commands, e-mail: [hidden email]

Reply | Threaded
Open this post in threaded view
|

[jira] [Updated] (SOLR-3553) Make solr Plugins (FieldType, Analyser etc) be inheritable

JIRA jira@apache.org
In reply to this post by JIRA jira@apache.org

     [ https://issues.apache.org/jira/browse/SOLR-3553?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Greg Bowyer updated SOLR-3553:
------------------------------

    Attachment:     (was: SOLR-3553-Inheritable-schema-elements.patch)
   

> Make solr Plugins (FieldType, Analyser etc) be inheritable
> ----------------------------------------------------------
>
>                 Key: SOLR-3553
>                 URL: https://issues.apache.org/jira/browse/SOLR-3553
>             Project: Solr
>          Issue Type: New Feature
>    Affects Versions: 4.0
>            Reporter: Greg Bowyer
>         Attachments: SOLR-3553-Inheritable-schema-elements.patch
>
>
> During defining an experimental schema for trialling all the combinations of scoring models for lucene 4 it struck me how much duplication is involved in the schema xml
> Using the current schema I am working with as an example,
> {code:xml}
> <fieldType name="text" class="solr.TextField"
>     positionIncrementGap="100"
>     autoGeneratePhraseQueries="true">
>     <analyzer type="index">
>         <tokenizer class="solr.WhitespaceTokenizerFactory"/>
>        
>         <filter class="solr.TrimFilterFactory"/>
>         <filter class="solr.LowerCaseFilterFactory"/>
>         <filter class="solr.WordDelimiterFilterFactory"
>            generateWordParts="1" generateNumberParts="1"
>            catenateWords="0" catenateNumbers="1"  catenateAll="0"
>            splitOnCaseChange="1" splitOnNumerics="0"
>            stemEnglishPossesive="0" preserveOriginal="0" />
>         <filter class="solr.ASCIIFoldingFilterFactory"/>
>         <filter class="solr.StopFilterFactory"
>            ignoreCase="true" words="stopwords.txt"
>            enablePositionIncrements="true" />
>        
> ... And so on
> {code}
> What I would like to be able to say is
> {code:xml}
> <fieldType name="text_bm25" parent="text">
>      <similarity class="solr.BM25SimilarityFactory">
>         <float name="k1">1.2</float>
>         <float name="b">0.76</float>
>     </similarity>
> </fieldType>
> {code}

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

       

---------------------------------------------------------------------
To unsubscribe, e-mail: [hidden email]
For additional commands, e-mail: [hidden email]

Reply | Threaded
Open this post in threaded view
|

[jira] [Updated] (SOLR-3553) Make solr Plugins (FieldType, Analyser etc) be inheritable

JIRA jira@apache.org
In reply to this post by JIRA jira@apache.org

     [ https://issues.apache.org/jira/browse/SOLR-3553?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Greg Bowyer updated SOLR-3553:
------------------------------

    Attachment:     (was: SOLR-3553-Inheritable-schema-elements.patch)
   

> Make solr Plugins (FieldType, Analyser etc) be inheritable
> ----------------------------------------------------------
>
>                 Key: SOLR-3553
>                 URL: https://issues.apache.org/jira/browse/SOLR-3553
>             Project: Solr
>          Issue Type: New Feature
>    Affects Versions: 4.0
>            Reporter: Greg Bowyer
>         Attachments: SOLR-3553-Inheritable-schema-elements.patch
>
>
> During defining an experimental schema for trialling all the combinations of scoring models for lucene 4 it struck me how much duplication is involved in the schema xml
> Using the current schema I am working with as an example,
> {code:xml}
> <fieldType name="text" class="solr.TextField"
>     positionIncrementGap="100"
>     autoGeneratePhraseQueries="true">
>     <analyzer type="index">
>         <tokenizer class="solr.WhitespaceTokenizerFactory"/>
>        
>         <filter class="solr.TrimFilterFactory"/>
>         <filter class="solr.LowerCaseFilterFactory"/>
>         <filter class="solr.WordDelimiterFilterFactory"
>            generateWordParts="1" generateNumberParts="1"
>            catenateWords="0" catenateNumbers="1"  catenateAll="0"
>            splitOnCaseChange="1" splitOnNumerics="0"
>            stemEnglishPossesive="0" preserveOriginal="0" />
>         <filter class="solr.ASCIIFoldingFilterFactory"/>
>         <filter class="solr.StopFilterFactory"
>            ignoreCase="true" words="stopwords.txt"
>            enablePositionIncrements="true" />
>        
> ... And so on
> {code}
> What I would like to be able to say is
> {code:xml}
> <fieldType name="text_bm25" parent="text">
>      <similarity class="solr.BM25SimilarityFactory">
>         <float name="k1">1.2</float>
>         <float name="b">0.76</float>
>     </similarity>
> </fieldType>
> {code}

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

       

---------------------------------------------------------------------
To unsubscribe, e-mail: [hidden email]
For additional commands, e-mail: [hidden email]