spellcheck collation

dsteiger
I've noticed a few things with the new spellcheck component that seem  
a little strange.

Here's my document:

<doc>
   <field name="id">5</field>
   <field name="spell">wii blackberry blackjack creative labs zen ipod video nano</field>
</doc>

Some sample queries:

http://localhost:8983/solr/core1/spellCheckCompRH?q=blackberri+wi&spellcheck=true&spellcheck.collate=true

http://localhost:8983/solr/core1/spellCheckCompRH?q=blackberr+wi&spellcheck=true&spellcheck.collate=true

http://localhost:8983/solr/core1/spellCheckCompRH?q=blackber+wi&spellcheck=true&spellcheck.collate=true

When spellchecking 'blackberri wi', the collation returned is  
'blackberry wii'.  When spellchecking 'blackberr wi', the collation  
returned is 'blackberrywii'.  'blackber wi' returns 'blackberrwiiwi'.

Doug
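
For reference, here is a minimal sketch of offset-based collation, the general technique of splicing each suggestion over the span of the token it corrects. It is not Solr's actual code: the `CollationSketch` class, the `Correction` holder, and the `collate` method are hypothetical names used only for illustration. The thread does not confirm what caused the run-together output above; the sketch only shows how applying replacements from right to left keeps the earlier offsets valid.

```java
import java.util.ArrayList;
import java.util.Comparator;
import java.util.List;

public class CollationSketch {

    /** Hypothetical holder: a token's [start, end) span in the query and its suggestion. */
    static final class Correction {
        final int start;
        final int end;
        final String suggestion;

        Correction(int start, int end, String suggestion) {
            this.start = start;
            this.end = end;
            this.suggestion = suggestion;
        }
    }

    /** Builds a collated query by splicing each suggestion over its original span. */
    static String collate(String query, List<Correction> corrections) {
        List<Correction> ordered = new ArrayList<>(corrections);
        // Apply right to left so spans nearer the start are not shifted by later splices.
        ordered.sort(Comparator.comparingInt((Correction c) -> c.start).reversed());
        StringBuilder out = new StringBuilder(query);
        for (Correction c : ordered) {
            out.replace(c.start, c.end, c.suggestion);
        }
        return out.toString();
    }

    public static void main(String[] args) {
        List<Correction> corrections = new ArrayList<>();
        corrections.add(new Correction(0, 9, "blackberry")); // covers "blackberr"
        corrections.add(new Correction(10, 12, "wii"));      // covers "wi"
        System.out.println(collate("blackberr wi", corrections)); // prints "blackberry wii"
    }
}
```

Because the untouched characters between spans are copied through unchanged, the space between the two tokens survives in the collated result.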

Re: spellcheck collation

Guillaume Smet
Hi Doug,

On Wed, Aug 13, 2008 at 8:25 PM, Doug Steigerwald
<[hidden email]> wrote:
> When spellchecking 'blackberri wi', the collation returned is 'blackberry
> wii'.  When spellchecking 'blackberr wi', the collation returned is
> 'blackberrywii'.  'blackber wi' returns 'blackberrwiiwi'.

It looks a lot like the last case reported by Otis in
https://issues.apache.org/jira/browse/SOLR-630 ("united states
oftAmericaa").

--
Guillaume

Re: spellcheck collation

Grant Ingersoll-2
In reply to this post by dsteiger
I believe I just fixed this on SOLR-606 (thanks to Stefan's patch).  
Give it a try and let us know.

-Grant

Re: spellcheck collation

dsteiger
I'd try, but the build is failing from (guessing) Ryan's last commit:

compile:
     [mkdir] Created dir: /Users/dsteiger/Desktop/java/solr/build/core
     [javac] Compiling 337 source files to /Users/dsteiger/Desktop/java/solr/build/core
     [javac] /Users/dsteiger/Desktop/java/solr/client/java/solrj/src/org/apache/solr/client/solrj/embedded/EmbeddedSolrServer.java:129: cannot find symbol
     [javac] symbol  : method isEnabled()
     [javac] location: class org.apache.solr.core.CoreContainer
     [javac]           multicore.isEnabled() ) {

Doug

On Aug 14, 2008, at 2:24 PM, Grant Ingersoll wrote:

> I believe I just fixed this on SOLR-606 (thanks to Stefan's patch).  
> Give it a try and let us know.
>
> -Grant

Re: spellcheck collation

Ryan McKinley
Have you updated recently?

isEnabled() was removed last night...


On Aug 14, 2008, at 2:30 PM, Doug Steigerwald wrote:

> I'd try, but the build is failing from (guessing) Ryan's last commit:
>
> compile:
>    [mkdir] Created dir: /Users/dsteiger/Desktop/java/solr/build/core
>    [javac] Compiling 337 source files to /Users/dsteiger/Desktop/java/solr/build/core
>    [javac] /Users/dsteiger/Desktop/java/solr/client/java/solrj/src/org/apache/solr/client/solrj/embedded/EmbeddedSolrServer.java:129: cannot find symbol
>    [javac] symbol  : method isEnabled()
>    [javac] location: class org.apache.solr.core.CoreContainer
>    [javac]           multicore.isEnabled() ) {
>
> Doug

Re: spellcheck collation

dsteiger
Right before I sent the message.  Did a 'svn up src/; ant clean; ant dist' and it failed.  Seems to work fine now.

On Aug 14, 2008, at 2:38 PM, Ryan McKinley wrote:

> have you updated recently?
>
> isEnabled() was removed last night...