Quantcast

codec="Pulsing" per field broken?

classic Classic list List threaded Threaded
3 messages Options
Reply | Threaded
Open this post in threaded view
|  
Report Content as Inappropriate

codec="Pulsing" per field broken?

eks dev
on the latest trunk, my schema.xml with field type declaration
containing //codec="Pulsing"// does not work any more (throws
exception from FieldType). It used to work wit approx. a month old
trunk version.

I didn't dig deeper, can be that the old schema.xml  was broken and
worked by accident.

----------------------------------------------------------------

org.apache.solr.common.SolrException: Plugin Initializing failure for
[schema.xml] fieldType
        at org.apache.solr.util.plugin.AbstractPluginLoader.load(AbstractPluginLoader.java:183)
        at org.apache.solr.schema.IndexSchema.readSchema(IndexSchema.java:368)
        at org.apache.solr.schema.IndexSchema.<init>(IndexSchema.java:107)
        at org.apache.solr.core.CoreContainer.create(CoreContainer.java:651)
        at org.apache.solr.core.CoreContainer.load(CoreContainer.java:409)
        at org.apache.solr.core.CoreContainer$Initializer.initialize(CoreContainer.java:243)
        at org.apache.solr.servlet.SolrDispatchFilter.init(SolrDispatchFilter.java:93)
        at org.mortbay.jetty.servlet.FilterHolder.doStart(FilterHolder.java:97)
        at org.mortbay.component.AbstractLifeCycle.start(AbstractLifeCycle.java:50)
        at org.mortbay.jetty.servlet.ServletHandler.initialize(ServletHandler.java:713)
        at org.mortbay.jetty.servlet.Context.startContext(Context.java:140)
        at org.mortbay.jetty.webapp.WebAppContext.startContext(WebAppContext.java:1282)
        at org.mortbay.jetty.handler.ContextHandler.doStart(ContextHandler.java:518)
        at org.mortbay.jetty.webapp.WebAppContext.doStart(WebAppContext.java:499)
        at org.mortbay.component.AbstractLifeCycle.start(AbstractLifeCycle.java:50)
        at org.mortbay.jetty.handler.HandlerWrapper.doStart(HandlerWrapper.java:130)
        at org.mortbay.jetty.Server.doStart(Server.java:224)
        at org.mortbay.component.AbstractLifeCycle.start(AbstractLifeCycle.java:50)
        at runjettyrun.Bootstrap.main(Bootstrap.java:86)
Caused by: java.lang.RuntimeException: schema fieldtype
storableCity(XXXXXXXXX.StorableField) invalid
arguments:{codec=Pulsing}
        at org.apache.solr.schema.FieldType.setArgs(FieldType.java:177)
        at org.apache.solr.schema.FieldTypePluginLoader.init(FieldTypePluginLoader.java:127)
        at org.apache.solr.schema.FieldTypePluginLoader.init(FieldTypePluginLoader.java:43)
        at org.apache.solr.util.plugin.AbstractPluginLoader.load(AbstractPluginLoader.java:180)
        ... 18 more
Reply | Threaded
Open this post in threaded view
|  
Report Content as Inappropriate

Re: codec="Pulsing" per field broken?

Robert Muir
On Sun, Dec 11, 2011 at 11:34 AM, eks dev <[hidden email]> wrote:
> on the latest trunk, my schema.xml with field type declaration
> containing //codec="Pulsing"// does not work any more (throws
> exception from FieldType). It used to work wit approx. a month old
> trunk version.
>
> I didn't dig deeper, can be that the old schema.xml  was broken and
> worked by accident.
>

Hi,

The short answer is, you should change this to //postingsFormat="Pulsing40"//
See http://svn.apache.org/repos/asf/lucene/dev/trunk/solr/core/src/test-files/solr/conf/schema_codec.xml

The longer answer is that the Codec API in lucene trunk was extended recently:
https://issues.apache.org/jira/browse/LUCENE-3490

Previously "Codec" only allowed you to customize the format of the
postings lists.
We are working to have it cover the entire index segment (at the
moment nearly everything except deletes and encoding of compound files
can be customized).

For example, look at SimpleText now:
http://svn.apache.org/repos/asf/lucene/dev/trunk/lucene/src/java/org/apache/lucene/index/codecs/simpletext/
As you see, it now implements plain-text stored fields, term vectors,
norms, segments file, fieldinfos, etc.
See Codec.java (http://svn.apache.org/repos/asf/lucene/dev/trunk/lucene/src/java/org/apache/lucene/index/codecs/Codec.java)
or LUCENE-3490 for more details.

Because of this, what you had before is now just "PostingsFormat", as
Pulsing is just a wrapper around a postings implementation that
inlines low frequency terms.
Lucene's default Codec uses a per-field postings setup, so you can
still configure the postings per-field, just differently.

--
lucidimagination.com
Reply | Threaded
Open this post in threaded view
|  
Report Content as Inappropriate

Re: codec="Pulsing" per field broken?

eks dev
Thanks Robert,

I've missed LUCENE-3490... Awesome!

On Sun, Dec 11, 2011 at 6:37 PM, Robert Muir <[hidden email]> wrote:

> On Sun, Dec 11, 2011 at 11:34 AM, eks dev <[hidden email]> wrote:
>> on the latest trunk, my schema.xml with field type declaration
>> containing //codec="Pulsing"// does not work any more (throws
>> exception from FieldType). It used to work wit approx. a month old
>> trunk version.
>>
>> I didn't dig deeper, can be that the old schema.xml  was broken and
>> worked by accident.
>>
>
> Hi,
>
> The short answer is, you should change this to //postingsFormat="Pulsing40"//
> See http://svn.apache.org/repos/asf/lucene/dev/trunk/solr/core/src/test-files/solr/conf/schema_codec.xml
>
> The longer answer is that the Codec API in lucene trunk was extended recently:
> https://issues.apache.org/jira/browse/LUCENE-3490
>
> Previously "Codec" only allowed you to customize the format of the
> postings lists.
> We are working to have it cover the entire index segment (at the
> moment nearly everything except deletes and encoding of compound files
> can be customized).
>
> For example, look at SimpleText now:
> http://svn.apache.org/repos/asf/lucene/dev/trunk/lucene/src/java/org/apache/lucene/index/codecs/simpletext/
> As you see, it now implements plain-text stored fields, term vectors,
> norms, segments file, fieldinfos, etc.
> See Codec.java (http://svn.apache.org/repos/asf/lucene/dev/trunk/lucene/src/java/org/apache/lucene/index/codecs/Codec.java)
> or LUCENE-3490 for more details.
>
> Because of this, what you had before is now just "PostingsFormat", as
> Pulsing is just a wrapper around a postings implementation that
> inlines low frequency terms.
> Lucene's default Codec uses a per-field postings setup, so you can
> still configure the postings per-field, just differently.
>
> --
> lucidimagination.com
Loading...