Grant Ingersoll's 2009 blog article- is there a newer version?

classic Classic list List threaded Threaded
12 messages Options
Reply | Threaded
Open this post in threaded view
|

Grant Ingersoll's 2009 blog article- is there a newer version?

baris.kazar
Hi,-
  Is there a newer version of this great article from Mr. Grant Ingersoll?

  https://lucidworks.com/2009/08/05/getting-started-with-payloads/ Thanks

This article is based on Lucene 2.9.
Best regards

---------------------------------------------------------------------
To unsubscribe, e-mail: [hidden email]
For additional commands, e-mail: [hidden email]

Reply | Threaded
Open this post in threaded view
|

Re: Grant Ingersoll's 2009 blog article- is there a newer version?

Ishan Chattopadhyaya
Try these, maybe?

https://lucidworks.com/2017/09/14/solr-payloads/
http://www.textsearch.io/?p=5

On Thu, Jul 5, 2018 at 8:26 PM, <[hidden email]> wrote:

> Hi,-
>  Is there a newer version of this great article from Mr. Grant Ingersoll?
>
>  https://lucidworks.com/2009/08/05/getting-started-with-payloads/ Thanks
>
> This article is based on Lucene 2.9.
> Best regards
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: [hidden email]
> For additional commands, e-mail: [hidden email]
>
>
Reply | Threaded
Open this post in threaded view
|

Re: Grant Ingersoll's 2009 blog article- is there a newer version?

baris.kazar
Thanks i saw these posts but Grant's article is based on Lucene.

i am not using Solr. Many classes in that article does not exist in
latest versions of Lucene like version 6.1.

For instance BoostingTermQuery does not exist in 6.1 and the way docs
are indexed are also different on 6.1.

There is a new class PayloadScoreQuery but there is no examples like
this great article how to put them together.

Best regards


On 7/5/18 11:18 AM, Ishan Chattopadhyaya wrote:

---------------------------------------------------------------------
To unsubscribe, e-mail: [hidden email]
For additional commands, e-mail: [hidden email]

Reply | Threaded
Open this post in threaded view
|

Re: Grant Ingersoll's 2009 blog article- is there a newer version?

Erick Erickson
Maybe look at the Solr payload code to see how to do it in Lucene?

But yeah, that article is quite out of date.

On Thu, Jul 5, 2018 at 8:23 AM,  <[hidden email]> wrote:

> Thanks i saw these posts but Grant's article is based on Lucene.
>
> i am not using Solr. Many classes in that article does not exist in latest
> versions of Lucene like version 6.1.
>
> For instance BoostingTermQuery does not exist in 6.1 and the way docs are
> indexed are also different on 6.1.
>
> There is a new class PayloadScoreQuery but there is no examples like this
> great article how to put them together.
>
> Best regards
>
>
> On 7/5/18 11:18 AM, Ishan Chattopadhyaya wrote:
>>
>> Try these, maybe?
>>
>>
>> https://urldefense.proofpoint.com/v2/url?u=https-3A__lucidworks.com_2017_09_14_solr-2Dpayloads_&d=DwIBaQ&c=RoP1YumCXCgaWHvlZYR8PZh8Bv7qIrMUB65eapI_JnE&r=nlG5z5NcNdIbQAiX-BKNeyLlULCbaezrgocEvPhQkl4&m=Ak4sr1zTaxPibIGJz26XQrj9fM4hZls8OegNbEWu1lI&s=9hxjhLoi6Lnb7KbYaOeb4-SP039x4Zx0XIynF_HzOJk&e=
>>
>> https://urldefense.proofpoint.com/v2/url?u=http-3A__www.textsearch.io_-3Fp-3D5&d=DwIBaQ&c=RoP1YumCXCgaWHvlZYR8PZh8Bv7qIrMUB65eapI_JnE&r=nlG5z5NcNdIbQAiX-BKNeyLlULCbaezrgocEvPhQkl4&m=Ak4sr1zTaxPibIGJz26XQrj9fM4hZls8OegNbEWu1lI&s=elEAMRZBIF2jldvS2kCD9B3r43kZ3hOToKVyR0I4qzo&e=
>>
>> On Thu, Jul 5, 2018 at 8:26 PM, <[hidden email]> wrote:
>>
>>> Hi,-
>>>   Is there a newer version of this great article from Mr. Grant
>>> Ingersoll?
>>>
>>>
>>> https://urldefense.proofpoint.com/v2/url?u=https-3A__lucidworks.com_2009_08_05_getting-2Dstarted-2Dwith-2Dpayloads_&d=DwIBaQ&c=RoP1YumCXCgaWHvlZYR8PZh8Bv7qIrMUB65eapI_JnE&r=nlG5z5NcNdIbQAiX-BKNeyLlULCbaezrgocEvPhQkl4&m=Ak4sr1zTaxPibIGJz26XQrj9fM4hZls8OegNbEWu1lI&s=isAZ026j7ugASeuPdoeXnoi5XfSGfxEgiWECE2ziURo&e=
>>> Thanks
>>>
>>> This article is based on Lucene 2.9.
>>> Best regards
>>>
>>> ---------------------------------------------------------------------
>>> To unsubscribe, e-mail: [hidden email]
>>> For additional commands, e-mail: [hidden email]
>>>
>>>
>
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: [hidden email]
> For additional commands, e-mail: [hidden email]
>

---------------------------------------------------------------------
To unsubscribe, e-mail: [hidden email]
For additional commands, e-mail: [hidden email]

Reply | Threaded
Open this post in threaded view
|

Re: Grant Ingersoll's 2009 blog article- is there a newer version?

baris.kazar
Sure, can You please point me to the location under Lucene Solr?

In Grant's article:

i want to know the need to use BoostingTermQuery (now in latest version
PayloadScoreQuery)

where we already specify payloads in the data in the form
<data>|<payload or score>.

Best regards



On 7/5/18 11:41 AM, Erick Erickson wrote:

> Maybe look at the Solr payload code to see how to do it in Lucene?
>
> But yeah, that article is quite out of date.
>
> On Thu, Jul 5, 2018 at 8:23 AM,  <[hidden email]> wrote:
>> Thanks i saw these posts but Grant's article is based on Lucene.
>>
>> i am not using Solr. Many classes in that article does not exist in latest
>> versions of Lucene like version 6.1.
>>
>> For instance BoostingTermQuery does not exist in 6.1 and the way docs are
>> indexed are also different on 6.1.
>>
>> There is a new class PayloadScoreQuery but there is no examples like this
>> great article how to put them together.
>>
>> Best regards
>>
>>
>> On 7/5/18 11:18 AM, Ishan Chattopadhyaya wrote:
>>> Try these, maybe?
>>>
>>>
>>> https://urldefense.proofpoint.com/v2/url?u=https-3A__lucidworks.com_2017_09_14_solr-2Dpayloads_&d=DwIBaQ&c=RoP1YumCXCgaWHvlZYR8PZh8Bv7qIrMUB65eapI_JnE&r=nlG5z5NcNdIbQAiX-BKNeyLlULCbaezrgocEvPhQkl4&m=Ak4sr1zTaxPibIGJz26XQrj9fM4hZls8OegNbEWu1lI&s=9hxjhLoi6Lnb7KbYaOeb4-SP039x4Zx0XIynF_HzOJk&e=
>>>
>>> https://urldefense.proofpoint.com/v2/url?u=http-3A__www.textsearch.io_-3Fp-3D5&d=DwIBaQ&c=RoP1YumCXCgaWHvlZYR8PZh8Bv7qIrMUB65eapI_JnE&r=nlG5z5NcNdIbQAiX-BKNeyLlULCbaezrgocEvPhQkl4&m=Ak4sr1zTaxPibIGJz26XQrj9fM4hZls8OegNbEWu1lI&s=elEAMRZBIF2jldvS2kCD9B3r43kZ3hOToKVyR0I4qzo&e=
>>>
>>> On Thu, Jul 5, 2018 at 8:26 PM, <[hidden email]> wrote:
>>>
>>>> Hi,-
>>>>    Is there a newer version of this great article from Mr. Grant
>>>> Ingersoll?
>>>>
>>>>
>>>> https://urldefense.proofpoint.com/v2/url?u=https-3A__lucidworks.com_2009_08_05_getting-2Dstarted-2Dwith-2Dpayloads_&d=DwIBaQ&c=RoP1YumCXCgaWHvlZYR8PZh8Bv7qIrMUB65eapI_JnE&r=nlG5z5NcNdIbQAiX-BKNeyLlULCbaezrgocEvPhQkl4&m=Ak4sr1zTaxPibIGJz26XQrj9fM4hZls8OegNbEWu1lI&s=isAZ026j7ugASeuPdoeXnoi5XfSGfxEgiWECE2ziURo&e=
>>>> Thanks
>>>>
>>>> This article is based on Lucene 2.9.
>>>> Best regards
>>>>
>>>> ---------------------------------------------------------------------
>>>> To unsubscribe, e-mail: [hidden email]
>>>> For additional commands, e-mail: [hidden email]
>>>>
>>>>
>>
>> ---------------------------------------------------------------------
>> To unsubscribe, e-mail: [hidden email]
>> For additional commands, e-mail: [hidden email]
>>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: [hidden email]
> For additional commands, e-mail: [hidden email]
>


---------------------------------------------------------------------
To unsubscribe, e-mail: [hidden email]
For additional commands, e-mail: [hidden email]

Reply | Threaded
Open this post in threaded view
|

Re: Grant Ingersoll's 2009 blog article- is there a newer version?

baris.kazar
i mean i know the function of BoostingTermQuery class:

The BoostingTermQuery is very similar to the SpanTermQuery except that
it factors in the value of the payload located at each of the positions
where the Term occurs.

In order to take advantage of this, you must override
Similarity.scorePayload(String, byte[],int,int) which returns 1 by default.

Payload scores are averaged across term occurrences in the document.


what i am asking is as follows:

Does this mean this (BoostingTermQuery in Lucene 2.9 or
PayloadScoerQuery in latest Lucene) needs to be called for ***all the
words*** scored in the format i mentioned <data>|<payload or score> in
the data?

Best regards

On 7/5/18 1:13 PM, [hidden email] wrote:

> Sure, can You please point me to the location under Lucene Solr?
>
> In Grant's article:
>
> i want to know the need to use BoostingTermQuery (now in latest
> version PayloadScoreQuery)
>
> where we already specify payloads in the data in the form
> <data>|<payload or score>.
>
> Best regards
>
>
>
> On 7/5/18 11:41 AM, Erick Erickson wrote:
>> Maybe look at the Solr payload code to see how to do it in Lucene?
>>
>> But yeah, that article is quite out of date.
>>
>> On Thu, Jul 5, 2018 at 8:23 AM,  <[hidden email]> wrote:
>>> Thanks i saw these posts but Grant's article is based on Lucene.
>>>
>>> i am not using Solr. Many classes in that article does not exist in
>>> latest
>>> versions of Lucene like version 6.1.
>>>
>>> For instance BoostingTermQuery does not exist in 6.1 and the way
>>> docs are
>>> indexed are also different on 6.1.
>>>
>>> There is a new class PayloadScoreQuery but there is no examples like
>>> this
>>> great article how to put them together.
>>>
>>> Best regards
>>>
>>>
>>> On 7/5/18 11:18 AM, Ishan Chattopadhyaya wrote:
>>>> Try these, maybe?
>>>>
>>>>
>>>> https://urldefense.proofpoint.com/v2/url?u=https-3A__lucidworks.com_2017_09_14_solr-2Dpayloads_&d=DwIBaQ&c=RoP1YumCXCgaWHvlZYR8PZh8Bv7qIrMUB65eapI_JnE&r=nlG5z5NcNdIbQAiX-BKNeyLlULCbaezrgocEvPhQkl4&m=Ak4sr1zTaxPibIGJz26XQrj9fM4hZls8OegNbEWu1lI&s=9hxjhLoi6Lnb7KbYaOeb4-SP039x4Zx0XIynF_HzOJk&e= 
>>>>
>>>>
>>>> https://urldefense.proofpoint.com/v2/url?u=http-3A__www.textsearch.io_-3Fp-3D5&d=DwIBaQ&c=RoP1YumCXCgaWHvlZYR8PZh8Bv7qIrMUB65eapI_JnE&r=nlG5z5NcNdIbQAiX-BKNeyLlULCbaezrgocEvPhQkl4&m=Ak4sr1zTaxPibIGJz26XQrj9fM4hZls8OegNbEWu1lI&s=elEAMRZBIF2jldvS2kCD9B3r43kZ3hOToKVyR0I4qzo&e= 
>>>>
>>>>
>>>> On Thu, Jul 5, 2018 at 8:26 PM, <[hidden email]> wrote:
>>>>
>>>>> Hi,-
>>>>>    Is there a newer version of this great article from Mr. Grant
>>>>> Ingersoll?
>>>>>
>>>>>
>>>>> https://urldefense.proofpoint.com/v2/url?u=https-3A__lucidworks.com_2009_08_05_getting-2Dstarted-2Dwith-2Dpayloads_&d=DwIBaQ&c=RoP1YumCXCgaWHvlZYR8PZh8Bv7qIrMUB65eapI_JnE&r=nlG5z5NcNdIbQAiX-BKNeyLlULCbaezrgocEvPhQkl4&m=Ak4sr1zTaxPibIGJz26XQrj9fM4hZls8OegNbEWu1lI&s=isAZ026j7ugASeuPdoeXnoi5XfSGfxEgiWECE2ziURo&e= 
>>>>>
>>>>> Thanks
>>>>>
>>>>> This article is based on Lucene 2.9.
>>>>> Best regards
>>>>>
>>>>> ---------------------------------------------------------------------
>>>>> To unsubscribe, e-mail: [hidden email]
>>>>> For additional commands, e-mail: [hidden email]
>>>>>
>>>>>
>>>
>>> ---------------------------------------------------------------------
>>> To unsubscribe, e-mail: [hidden email]
>>> For additional commands, e-mail: [hidden email]
>>>
>> ---------------------------------------------------------------------
>> To unsubscribe, e-mail: [hidden email]
>> For additional commands, e-mail: [hidden email]
>>
>
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: [hidden email]
> For additional commands, e-mail: [hidden email]
>


---------------------------------------------------------------------
To unsubscribe, e-mail: [hidden email]
For additional commands, e-mail: [hidden email]

Reply | Threaded
Open this post in threaded view
|

Re: Grant Ingersoll's 2009 blog article- is there a newer version?

baris.kazar
org.apache.lucene.index.IndexWriter class does not have setSimilarity
method, am i missing something for this?

i checked multiple Lucene versions.


next, i have this problem:

After defining the Analyzer as the PayloadAnalyzer like on the blog
mentioned before,

i declared org.apache.lucene.search.QueryParser (with the analyzer
mentioned above as the parameter) which was then used in declaring the
org.apache.lucene.search.Query object via parse method of parser.

Now, i wonder how i can use PayloadScoreQuery in this scenario.


Best regards



On 7/5/18 1:19 PM, [hidden email] wrote:

> i mean i know the function of BoostingTermQuery class:
>
> The BoostingTermQuery is very similar to the SpanTermQuery except that
> it factors in the value of the payload located at each of the
> positions where the Term occurs.
>
> In order to take advantage of this, you must override
> Similarity.scorePayload(String, byte[],int,int) which returns 1 by
> default.
>
> Payload scores are averaged across term occurrences in the document.
>
>
> what i am asking is as follows:
>
> Does this mean this (BoostingTermQuery in Lucene 2.9 or
> PayloadScoerQuery in latest Lucene) needs to be called for ***all the
> words*** scored in the format i mentioned <data>|<payload or score> in
> the data?
>
> Best regards
>
> On 7/5/18 1:13 PM, [hidden email] wrote:
>> Sure, can You please point me to the location under Lucene Solr?
>>
>> In Grant's article:
>>
>> i want to know the need to use BoostingTermQuery (now in latest
>> version PayloadScoreQuery)
>>
>> where we already specify payloads in the data in the form
>> <data>|<payload or score>.
>>
>> Best regards
>>
>>
>>
>> On 7/5/18 11:41 AM, Erick Erickson wrote:
>>> Maybe look at the Solr payload code to see how to do it in Lucene?
>>>
>>> But yeah, that article is quite out of date.
>>>
>>> On Thu, Jul 5, 2018 at 8:23 AM, <[hidden email]> wrote:
>>>> Thanks i saw these posts but Grant's article is based on Lucene.
>>>>
>>>> i am not using Solr. Many classes in that article does not exist in
>>>> latest
>>>> versions of Lucene like version 6.1.
>>>>
>>>> For instance BoostingTermQuery does not exist in 6.1 and the way
>>>> docs are
>>>> indexed are also different on 6.1.
>>>>
>>>> There is a new class PayloadScoreQuery but there is no examples
>>>> like this
>>>> great article how to put them together.
>>>>
>>>> Best regards
>>>>
>>>>
>>>> On 7/5/18 11:18 AM, Ishan Chattopadhyaya wrote:
>>>>> Try these, maybe?
>>>>>
>>>>>
>>>>> https://urldefense.proofpoint.com/v2/url?u=https-3A__lucidworks.com_2017_09_14_solr-2Dpayloads_&d=DwIBaQ&c=RoP1YumCXCgaWHvlZYR8PZh8Bv7qIrMUB65eapI_JnE&r=nlG5z5NcNdIbQAiX-BKNeyLlULCbaezrgocEvPhQkl4&m=Ak4sr1zTaxPibIGJz26XQrj9fM4hZls8OegNbEWu1lI&s=9hxjhLoi6Lnb7KbYaOeb4-SP039x4Zx0XIynF_HzOJk&e= 
>>>>>
>>>>>
>>>>> https://urldefense.proofpoint.com/v2/url?u=http-3A__www.textsearch.io_-3Fp-3D5&d=DwIBaQ&c=RoP1YumCXCgaWHvlZYR8PZh8Bv7qIrMUB65eapI_JnE&r=nlG5z5NcNdIbQAiX-BKNeyLlULCbaezrgocEvPhQkl4&m=Ak4sr1zTaxPibIGJz26XQrj9fM4hZls8OegNbEWu1lI&s=elEAMRZBIF2jldvS2kCD9B3r43kZ3hOToKVyR0I4qzo&e= 
>>>>>
>>>>>
>>>>> On Thu, Jul 5, 2018 at 8:26 PM, <[hidden email]> wrote:
>>>>>
>>>>>> Hi,-
>>>>>>    Is there a newer version of this great article from Mr. Grant
>>>>>> Ingersoll?
>>>>>>
>>>>>>
>>>>>> https://urldefense.proofpoint.com/v2/url?u=https-3A__lucidworks.com_2009_08_05_getting-2Dstarted-2Dwith-2Dpayloads_&d=DwIBaQ&c=RoP1YumCXCgaWHvlZYR8PZh8Bv7qIrMUB65eapI_JnE&r=nlG5z5NcNdIbQAiX-BKNeyLlULCbaezrgocEvPhQkl4&m=Ak4sr1zTaxPibIGJz26XQrj9fM4hZls8OegNbEWu1lI&s=isAZ026j7ugASeuPdoeXnoi5XfSGfxEgiWECE2ziURo&e= 
>>>>>>
>>>>>> Thanks
>>>>>>
>>>>>> This article is based on Lucene 2.9.
>>>>>> Best regards
>>>>>>
>>>>>> ---------------------------------------------------------------------
>>>>>>
>>>>>> To unsubscribe, e-mail: [hidden email]
>>>>>> For additional commands, e-mail: [hidden email]
>>>>>>
>>>>>>
>>>>
>>>> ---------------------------------------------------------------------
>>>> To unsubscribe, e-mail: [hidden email]
>>>> For additional commands, e-mail: [hidden email]
>>>>
>>> ---------------------------------------------------------------------
>>> To unsubscribe, e-mail: [hidden email]
>>> For additional commands, e-mail: [hidden email]
>>>
>>
>>
>> ---------------------------------------------------------------------
>> To unsubscribe, e-mail: [hidden email]
>> For additional commands, e-mail: [hidden email]
>>
>
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: [hidden email]
> For additional commands, e-mail: [hidden email]
>


---------------------------------------------------------------------
To unsubscribe, e-mail: [hidden email]
For additional commands, e-mail: [hidden email]

Reply | Threaded
Open this post in threaded view
|

Re: Grant Ingersoll's 2009 blog article- is there a newer version?

baris.kazar
One thing i noticed is that org.apache.lucene.index.IndexWriter class
does not have setSimilarity and it is moved to

org.apache.lucene.index.IndexWriterConfig class. thus, i resolved the
first question below.

Best regards


On 7/5/18 3:17 PM, [hidden email] wrote:

> org.apache.lucene.index.IndexWriter class does not have setSimilarity
> method, am i missing something for this?
>
> i checked multiple Lucene versions.
>
>
> next, i have this problem:
>
> After defining the Analyzer as the PayloadAnalyzer like on the blog
> mentioned before,
>
> i declared org.apache.lucene.search.QueryParser (with the analyzer
> mentioned above as the parameter) which was then used in declaring the
> org.apache.lucene.search.Query object via parse method of parser.
>
> Now, i wonder how i can use PayloadScoreQuery in this scenario.
>
>
> Best regards
>
>
>
> On 7/5/18 1:19 PM, [hidden email] wrote:
>> i mean i know the function of BoostingTermQuery class:
>>
>> The BoostingTermQuery is very similar to the SpanTermQuery except
>> that it factors in the value of the payload located at each of the
>> positions where the Term occurs.
>>
>> In order to take advantage of this, you must override
>> Similarity.scorePayload(String, byte[],int,int) which returns 1 by
>> default.
>>
>> Payload scores are averaged across term occurrences in the document.
>>
>>
>> what i am asking is as follows:
>>
>> Does this mean this (BoostingTermQuery in Lucene 2.9 or
>> PayloadScoerQuery in latest Lucene) needs to be called for ***all the
>> words*** scored in the format i mentioned <data>|<payload or score>
>> in the data?
>>
>> Best regards
>>
>> On 7/5/18 1:13 PM, [hidden email] wrote:
>>> Sure, can You please point me to the location under Lucene Solr?
>>>
>>> In Grant's article:
>>>
>>> i want to know the need to use BoostingTermQuery (now in latest
>>> version PayloadScoreQuery)
>>>
>>> where we already specify payloads in the data in the form
>>> <data>|<payload or score>.
>>>
>>> Best regards
>>>
>>>
>>>
>>> On 7/5/18 11:41 AM, Erick Erickson wrote:
>>>> Maybe look at the Solr payload code to see how to do it in Lucene?
>>>>
>>>> But yeah, that article is quite out of date.
>>>>
>>>> On Thu, Jul 5, 2018 at 8:23 AM, <[hidden email]> wrote:
>>>>> Thanks i saw these posts but Grant's article is based on Lucene.
>>>>>
>>>>> i am not using Solr. Many classes in that article does not exist
>>>>> in latest
>>>>> versions of Lucene like version 6.1.
>>>>>
>>>>> For instance BoostingTermQuery does not exist in 6.1 and the way
>>>>> docs are
>>>>> indexed are also different on 6.1.
>>>>>
>>>>> There is a new class PayloadScoreQuery but there is no examples
>>>>> like this
>>>>> great article how to put them together.
>>>>>
>>>>> Best regards
>>>>>
>>>>>
>>>>> On 7/5/18 11:18 AM, Ishan Chattopadhyaya wrote:
>>>>>> Try these, maybe?
>>>>>>
>>>>>>
>>>>>> https://urldefense.proofpoint.com/v2/url?u=https-3A__lucidworks.com_2017_09_14_solr-2Dpayloads_&d=DwIBaQ&c=RoP1YumCXCgaWHvlZYR8PZh8Bv7qIrMUB65eapI_JnE&r=nlG5z5NcNdIbQAiX-BKNeyLlULCbaezrgocEvPhQkl4&m=Ak4sr1zTaxPibIGJz26XQrj9fM4hZls8OegNbEWu1lI&s=9hxjhLoi6Lnb7KbYaOeb4-SP039x4Zx0XIynF_HzOJk&e= 
>>>>>>
>>>>>>
>>>>>> https://urldefense.proofpoint.com/v2/url?u=http-3A__www.textsearch.io_-3Fp-3D5&d=DwIBaQ&c=RoP1YumCXCgaWHvlZYR8PZh8Bv7qIrMUB65eapI_JnE&r=nlG5z5NcNdIbQAiX-BKNeyLlULCbaezrgocEvPhQkl4&m=Ak4sr1zTaxPibIGJz26XQrj9fM4hZls8OegNbEWu1lI&s=elEAMRZBIF2jldvS2kCD9B3r43kZ3hOToKVyR0I4qzo&e= 
>>>>>>
>>>>>>
>>>>>> On Thu, Jul 5, 2018 at 8:26 PM, <[hidden email]> wrote:
>>>>>>
>>>>>>> Hi,-
>>>>>>>    Is there a newer version of this great article from Mr. Grant
>>>>>>> Ingersoll?
>>>>>>>
>>>>>>>
>>>>>>> https://urldefense.proofpoint.com/v2/url?u=https-3A__lucidworks.com_2009_08_05_getting-2Dstarted-2Dwith-2Dpayloads_&d=DwIBaQ&c=RoP1YumCXCgaWHvlZYR8PZh8Bv7qIrMUB65eapI_JnE&r=nlG5z5NcNdIbQAiX-BKNeyLlULCbaezrgocEvPhQkl4&m=Ak4sr1zTaxPibIGJz26XQrj9fM4hZls8OegNbEWu1lI&s=isAZ026j7ugASeuPdoeXnoi5XfSGfxEgiWECE2ziURo&e= 
>>>>>>>
>>>>>>> Thanks
>>>>>>>
>>>>>>> This article is based on Lucene 2.9.
>>>>>>> Best regards
>>>>>>>
>>>>>>> ---------------------------------------------------------------------
>>>>>>>
>>>>>>> To unsubscribe, e-mail: [hidden email]
>>>>>>> For additional commands, e-mail: [hidden email]
>>>>>>>
>>>>>>>
>>>>>
>>>>> ---------------------------------------------------------------------
>>>>> To unsubscribe, e-mail: [hidden email]
>>>>> For additional commands, e-mail: [hidden email]
>>>>>
>>>> ---------------------------------------------------------------------
>>>> To unsubscribe, e-mail: [hidden email]
>>>> For additional commands, e-mail: [hidden email]
>>>>
>>>
>>>
>>> ---------------------------------------------------------------------
>>> To unsubscribe, e-mail: [hidden email]
>>> For additional commands, e-mail: [hidden email]
>>>
>>
>>
>> ---------------------------------------------------------------------
>> To unsubscribe, e-mail: [hidden email]
>> For additional commands, e-mail: [hidden email]
>>
>
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: [hidden email]
> For additional commands, e-mail: [hidden email]
>


---------------------------------------------------------------------
To unsubscribe, e-mail: [hidden email]
For additional commands, e-mail: [hidden email]

Reply | Threaded
Open this post in threaded view
|

Re: Grant Ingersoll's 2009 blog article- is there a newer version?

baris.kazar
Another piece resolved: DefaultSimilarity is now ClassicSimilarity since
LUCENE VERSION 5.4.

Best regards


On 7/5/18 4:30 PM, [hidden email] wrote:

> One thing i noticed is that org.apache.lucene.index.IndexWriter class
> does not have setSimilarity and it is moved to
>
> org.apache.lucene.index.IndexWriterConfig class. thus, i resolved the
> first question below.
>
> Best regards
>
>
> On 7/5/18 3:17 PM, [hidden email] wrote:
>> org.apache.lucene.index.IndexWriter class does not have setSimilarity
>> method, am i missing something for this?
>>
>> i checked multiple Lucene versions.
>>
>>
>> next, i have this problem:
>>
>> After defining the Analyzer as the PayloadAnalyzer like on the blog
>> mentioned before,
>>
>> i declared org.apache.lucene.search.QueryParser (with the analyzer
>> mentioned above as the parameter) which was then used in declaring
>> the org.apache.lucene.search.Query object via parse method of parser.
>>
>> Now, i wonder how i can use PayloadScoreQuery in this scenario.
>>
>>
>> Best regards
>>
>>
>>
>> On 7/5/18 1:19 PM, [hidden email] wrote:
>>> i mean i know the function of BoostingTermQuery class:
>>>
>>> The BoostingTermQuery is very similar to the SpanTermQuery except
>>> that it factors in the value of the payload located at each of the
>>> positions where the Term occurs.
>>>
>>> In order to take advantage of this, you must override
>>> Similarity.scorePayload(String, byte[],int,int) which returns 1 by
>>> default.
>>>
>>> Payload scores are averaged across term occurrences in the document.
>>>
>>>
>>> what i am asking is as follows:
>>>
>>> Does this mean this (BoostingTermQuery in Lucene 2.9 or
>>> PayloadScoerQuery in latest Lucene) needs to be called for ***all
>>> the words*** scored in the format i mentioned <data>|<payload or
>>> score> in the data?
>>>
>>> Best regards
>>>
>>> On 7/5/18 1:13 PM, [hidden email] wrote:
>>>> Sure, can You please point me to the location under Lucene Solr?
>>>>
>>>> In Grant's article:
>>>>
>>>> i want to know the need to use BoostingTermQuery (now in latest
>>>> version PayloadScoreQuery)
>>>>
>>>> where we already specify payloads in the data in the form
>>>> <data>|<payload or score>.
>>>>
>>>> Best regards
>>>>
>>>>
>>>>
>>>> On 7/5/18 11:41 AM, Erick Erickson wrote:
>>>>> Maybe look at the Solr payload code to see how to do it in Lucene?
>>>>>
>>>>> But yeah, that article is quite out of date.
>>>>>
>>>>> On Thu, Jul 5, 2018 at 8:23 AM, <[hidden email]> wrote:
>>>>>> Thanks i saw these posts but Grant's article is based on Lucene.
>>>>>>
>>>>>> i am not using Solr. Many classes in that article does not exist
>>>>>> in latest
>>>>>> versions of Lucene like version 6.1.
>>>>>>
>>>>>> For instance BoostingTermQuery does not exist in 6.1 and the way
>>>>>> docs are
>>>>>> indexed are also different on 6.1.
>>>>>>
>>>>>> There is a new class PayloadScoreQuery but there is no examples
>>>>>> like this
>>>>>> great article how to put them together.
>>>>>>
>>>>>> Best regards
>>>>>>
>>>>>>
>>>>>> On 7/5/18 11:18 AM, Ishan Chattopadhyaya wrote:
>>>>>>> Try these, maybe?
>>>>>>>
>>>>>>>
>>>>>>> https://urldefense.proofpoint.com/v2/url?u=https-3A__lucidworks.com_2017_09_14_solr-2Dpayloads_&d=DwIBaQ&c=RoP1YumCXCgaWHvlZYR8PZh8Bv7qIrMUB65eapI_JnE&r=nlG5z5NcNdIbQAiX-BKNeyLlULCbaezrgocEvPhQkl4&m=Ak4sr1zTaxPibIGJz26XQrj9fM4hZls8OegNbEWu1lI&s=9hxjhLoi6Lnb7KbYaOeb4-SP039x4Zx0XIynF_HzOJk&e= 
>>>>>>>
>>>>>>>
>>>>>>> https://urldefense.proofpoint.com/v2/url?u=http-3A__www.textsearch.io_-3Fp-3D5&d=DwIBaQ&c=RoP1YumCXCgaWHvlZYR8PZh8Bv7qIrMUB65eapI_JnE&r=nlG5z5NcNdIbQAiX-BKNeyLlULCbaezrgocEvPhQkl4&m=Ak4sr1zTaxPibIGJz26XQrj9fM4hZls8OegNbEWu1lI&s=elEAMRZBIF2jldvS2kCD9B3r43kZ3hOToKVyR0I4qzo&e= 
>>>>>>>
>>>>>>>
>>>>>>> On Thu, Jul 5, 2018 at 8:26 PM, <[hidden email]> wrote:
>>>>>>>
>>>>>>>> Hi,-
>>>>>>>>    Is there a newer version of this great article from Mr. Grant
>>>>>>>> Ingersoll?
>>>>>>>>
>>>>>>>>
>>>>>>>> https://urldefense.proofpoint.com/v2/url?u=https-3A__lucidworks.com_2009_08_05_getting-2Dstarted-2Dwith-2Dpayloads_&d=DwIBaQ&c=RoP1YumCXCgaWHvlZYR8PZh8Bv7qIrMUB65eapI_JnE&r=nlG5z5NcNdIbQAiX-BKNeyLlULCbaezrgocEvPhQkl4&m=Ak4sr1zTaxPibIGJz26XQrj9fM4hZls8OegNbEWu1lI&s=isAZ026j7ugASeuPdoeXnoi5XfSGfxEgiWECE2ziURo&e= 
>>>>>>>>
>>>>>>>> Thanks
>>>>>>>>
>>>>>>>> This article is based on Lucene 2.9.
>>>>>>>> Best regards
>>>>>>>>
>>>>>>>> ---------------------------------------------------------------------
>>>>>>>>
>>>>>>>> To unsubscribe, e-mail: [hidden email]
>>>>>>>> For additional commands, e-mail: [hidden email]
>>>>>>>>
>>>>>>>>
>>>>>>
>>>>>> ---------------------------------------------------------------------
>>>>>>
>>>>>> To unsubscribe, e-mail: [hidden email]
>>>>>> For additional commands, e-mail: [hidden email]
>>>>>>
>>>>> ---------------------------------------------------------------------
>>>>> To unsubscribe, e-mail: [hidden email]
>>>>> For additional commands, e-mail: [hidden email]
>>>>>
>>>>
>>>>
>>>> ---------------------------------------------------------------------
>>>> To unsubscribe, e-mail: [hidden email]
>>>> For additional commands, e-mail: [hidden email]
>>>>
>>>
>>>
>>> ---------------------------------------------------------------------
>>> To unsubscribe, e-mail: [hidden email]
>>> For additional commands, e-mail: [hidden email]
>>>
>>
>>
>> ---------------------------------------------------------------------
>> To unsubscribe, e-mail: [hidden email]
>> For additional commands, e-mail: [hidden email]
>>
>
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: [hidden email]
> For additional commands, e-mail: [hidden email]
>


---------------------------------------------------------------------
To unsubscribe, e-mail: [hidden email]
For additional commands, e-mail: [hidden email]

Reply | Threaded
Open this post in threaded view
|

Re: Grant Ingersoll's 2009 blog article- is there a newer version?

baris.kazar
can someone point to any usage of PayloadScoreQuery class from Lucene
please?

Best regards


On 7/5/18 4:49 PM, [hidden email] wrote:

> Another piece resolved: DefaultSimilarity is now ClassicSimilarity
> since LUCENE VERSION 5.4.
>
> Best regards
>
>
> On 7/5/18 4:30 PM, [hidden email] wrote:
>> One thing i noticed is that org.apache.lucene.index.IndexWriter class
>> does not have setSimilarity and it is moved to
>>
>> org.apache.lucene.index.IndexWriterConfig class. thus, i resolved the
>> first question below.
>>
>> Best regards
>>
>>
>> On 7/5/18 3:17 PM, [hidden email] wrote:
>>> org.apache.lucene.index.IndexWriter class does not have
>>> setSimilarity method, am i missing something for this?
>>>
>>> i checked multiple Lucene versions.
>>>
>>>
>>> next, i have this problem:
>>>
>>> After defining the Analyzer as the PayloadAnalyzer like on the blog
>>> mentioned before,
>>>
>>> i declared org.apache.lucene.search.QueryParser (with the analyzer
>>> mentioned above as the parameter) which was then used in declaring
>>> the org.apache.lucene.search.Query object via parse method of parser.
>>>
>>> Now, i wonder how i can use PayloadScoreQuery in this scenario.
>>>
>>>
>>> Best regards
>>>
>>>
>>>
>>> On 7/5/18 1:19 PM, [hidden email] wrote:
>>>> i mean i know the function of BoostingTermQuery class:
>>>>
>>>> The BoostingTermQuery is very similar to the SpanTermQuery except
>>>> that it factors in the value of the payload located at each of the
>>>> positions where the Term occurs.
>>>>
>>>> In order to take advantage of this, you must override
>>>> Similarity.scorePayload(String, byte[],int,int) which returns 1 by
>>>> default.
>>>>
>>>> Payload scores are averaged across term occurrences in the document.
>>>>
>>>>
>>>> what i am asking is as follows:
>>>>
>>>> Does this mean this (BoostingTermQuery in Lucene 2.9 or
>>>> PayloadScoerQuery in latest Lucene) needs to be called for ***all
>>>> the words*** scored in the format i mentioned <data>|<payload or
>>>> score> in the data?
>>>>
>>>> Best regards
>>>>
>>>> On 7/5/18 1:13 PM, [hidden email] wrote:
>>>>> Sure, can You please point me to the location under Lucene Solr?
>>>>>
>>>>> In Grant's article:
>>>>>
>>>>> i want to know the need to use BoostingTermQuery (now in latest
>>>>> version PayloadScoreQuery)
>>>>>
>>>>> where we already specify payloads in the data in the form
>>>>> <data>|<payload or score>.
>>>>>
>>>>> Best regards
>>>>>
>>>>>
>>>>>
>>>>> On 7/5/18 11:41 AM, Erick Erickson wrote:
>>>>>> Maybe look at the Solr payload code to see how to do it in Lucene?
>>>>>>
>>>>>> But yeah, that article is quite out of date.
>>>>>>
>>>>>> On Thu, Jul 5, 2018 at 8:23 AM, <[hidden email]> wrote:
>>>>>>> Thanks i saw these posts but Grant's article is based on Lucene.
>>>>>>>
>>>>>>> i am not using Solr. Many classes in that article does not exist
>>>>>>> in latest
>>>>>>> versions of Lucene like version 6.1.
>>>>>>>
>>>>>>> For instance BoostingTermQuery does not exist in 6.1 and the way
>>>>>>> docs are
>>>>>>> indexed are also different on 6.1.
>>>>>>>
>>>>>>> There is a new class PayloadScoreQuery but there is no examples
>>>>>>> like this
>>>>>>> great article how to put them together.
>>>>>>>
>>>>>>> Best regards
>>>>>>>
>>>>>>>
>>>>>>> On 7/5/18 11:18 AM, Ishan Chattopadhyaya wrote:
>>>>>>>> Try these, maybe?
>>>>>>>>
>>>>>>>>
>>>>>>>> https://urldefense.proofpoint.com/v2/url?u=https-3A__lucidworks.com_2017_09_14_solr-2Dpayloads_&d=DwIBaQ&c=RoP1YumCXCgaWHvlZYR8PZh8Bv7qIrMUB65eapI_JnE&r=nlG5z5NcNdIbQAiX-BKNeyLlULCbaezrgocEvPhQkl4&m=Ak4sr1zTaxPibIGJz26XQrj9fM4hZls8OegNbEWu1lI&s=9hxjhLoi6Lnb7KbYaOeb4-SP039x4Zx0XIynF_HzOJk&e= 
>>>>>>>>
>>>>>>>>
>>>>>>>> https://urldefense.proofpoint.com/v2/url?u=http-3A__www.textsearch.io_-3Fp-3D5&d=DwIBaQ&c=RoP1YumCXCgaWHvlZYR8PZh8Bv7qIrMUB65eapI_JnE&r=nlG5z5NcNdIbQAiX-BKNeyLlULCbaezrgocEvPhQkl4&m=Ak4sr1zTaxPibIGJz26XQrj9fM4hZls8OegNbEWu1lI&s=elEAMRZBIF2jldvS2kCD9B3r43kZ3hOToKVyR0I4qzo&e= 
>>>>>>>>
>>>>>>>>
>>>>>>>> On Thu, Jul 5, 2018 at 8:26 PM, <[hidden email]> wrote:
>>>>>>>>
>>>>>>>>> Hi,-
>>>>>>>>>    Is there a newer version of this great article from Mr. Grant
>>>>>>>>> Ingersoll?
>>>>>>>>>
>>>>>>>>>
>>>>>>>>> https://urldefense.proofpoint.com/v2/url?u=https-3A__lucidworks.com_2009_08_05_getting-2Dstarted-2Dwith-2Dpayloads_&d=DwIBaQ&c=RoP1YumCXCgaWHvlZYR8PZh8Bv7qIrMUB65eapI_JnE&r=nlG5z5NcNdIbQAiX-BKNeyLlULCbaezrgocEvPhQkl4&m=Ak4sr1zTaxPibIGJz26XQrj9fM4hZls8OegNbEWu1lI&s=isAZ026j7ugASeuPdoeXnoi5XfSGfxEgiWECE2ziURo&e= 
>>>>>>>>>
>>>>>>>>> Thanks
>>>>>>>>>
>>>>>>>>> This article is based on Lucene 2.9.
>>>>>>>>> Best regards
>>>>>>>>>
>>>>>>>>> ---------------------------------------------------------------------
>>>>>>>>>
>>>>>>>>> To unsubscribe, e-mail: [hidden email]
>>>>>>>>> For additional commands, e-mail: [hidden email]
>>>>>>>>>
>>>>>>>>>
>>>>>>>
>>>>>>> ---------------------------------------------------------------------
>>>>>>>
>>>>>>> To unsubscribe, e-mail: [hidden email]
>>>>>>> For additional commands, e-mail: [hidden email]
>>>>>>>
>>>>>> ---------------------------------------------------------------------
>>>>>>
>>>>>> To unsubscribe, e-mail: [hidden email]
>>>>>> For additional commands, e-mail: [hidden email]
>>>>>>
>>>>>
>>>>>
>>>>> ---------------------------------------------------------------------
>>>>> To unsubscribe, e-mail: [hidden email]
>>>>> For additional commands, e-mail: [hidden email]
>>>>>
>>>>
>>>>
>>>> ---------------------------------------------------------------------
>>>> To unsubscribe, e-mail: [hidden email]
>>>> For additional commands, e-mail: [hidden email]
>>>>
>>>
>>>
>>> ---------------------------------------------------------------------
>>> To unsubscribe, e-mail: [hidden email]
>>> For additional commands, e-mail: [hidden email]
>>>
>>
>>
>> ---------------------------------------------------------------------
>> To unsubscribe, e-mail: [hidden email]
>> For additional commands, e-mail: [hidden email]
>>
>
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: [hidden email]
> For additional commands, e-mail: [hidden email]
>


---------------------------------------------------------------------
To unsubscribe, e-mail: [hidden email]
For additional commands, e-mail: [hidden email]

Reply | Threaded
Open this post in threaded view
|

Re: Grant Ingersoll's 2009 blog article- is there a newer version?

baris.kazar
In reply to this post by baris.kazar
 it seems with PayloadScoreQuery, QueryParser is also supported right for searching/indexing ***multiple*** terms.
However, there are no examples in the Javadocs.
@Grant Can You please publish new version of Your great blog article?
Thanks very much


----- Original Message -----
From: [hidden email]
To: [hidden email]
Sent: Friday, July 6, 2018 10:51:46 AM GMT -05:00 US/Canada Eastern
Subject: Re: Grant Ingersoll's 2009 blog article- is there a newer version?

can someone point to any usage of PayloadScoreQuery class from Lucene
please?

Best regards


On 7/5/18 4:49 PM, [hidden email] wrote:

> Another piece resolved: DefaultSimilarity is now ClassicSimilarity
> since LUCENE VERSION 5.4.
>
> Best regards
>
>
> On 7/5/18 4:30 PM, [hidden email] wrote:
>> One thing i noticed is that org.apache.lucene.index.IndexWriter class
>> does not have setSimilarity and it is moved to
>>
>> org.apache.lucene.index.IndexWriterConfig class. thus, i resolved the
>> first question below.
>>
>> Best regards
>>
>>
>> On 7/5/18 3:17 PM, [hidden email] wrote:
>>> org.apache.lucene.index.IndexWriter class does not have
>>> setSimilarity method, am i missing something for this?
>>>
>>> i checked multiple Lucene versions.
>>>
>>>
>>> next, i have this problem:
>>>
>>> After defining the Analyzer as the PayloadAnalyzer like on the blog
>>> mentioned before,
>>>
>>> i declared org.apache.lucene.search.QueryParser (with the analyzer
>>> mentioned above as the parameter) which was then used in declaring
>>> the org.apache.lucene.search.Query object via parse method of parser.
>>>
>>> Now, i wonder how i can use PayloadScoreQuery in this scenario.
>>>
>>>
>>> Best regards
>>>
>>>
>>>
>>> On 7/5/18 1:19 PM, [hidden email] wrote:
>>>> i mean i know the function of BoostingTermQuery class:
>>>>
>>>> The BoostingTermQuery is very similar to the SpanTermQuery except
>>>> that it factors in the value of the payload located at each of the
>>>> positions where the Term occurs.
>>>>
>>>> In order to take advantage of this, you must override
>>>> Similarity.scorePayload(String, byte[],int,int) which returns 1 by
>>>> default.
>>>>
>>>> Payload scores are averaged across term occurrences in the document.
>>>>
>>>>
>>>> what i am asking is as follows:
>>>>
>>>> Does this mean this (BoostingTermQuery in Lucene 2.9 or
>>>> PayloadScoerQuery in latest Lucene) needs to be called for ***all
>>>> the words*** scored in the format i mentioned <data>|<payload or
>>>> score> in the data?
>>>>
>>>> Best regards
>>>>
>>>> On 7/5/18 1:13 PM, [hidden email] wrote:
>>>>> Sure, can You please point me to the location under Lucene Solr?
>>>>>
>>>>> In Grant's article:
>>>>>
>>>>> i want to know the need to use BoostingTermQuery (now in latest
>>>>> version PayloadScoreQuery)
>>>>>
>>>>> where we already specify payloads in the data in the form
>>>>> <data>|<payload or score>.
>>>>>
>>>>> Best regards
>>>>>
>>>>>
>>>>>
>>>>> On 7/5/18 11:41 AM, Erick Erickson wrote:
>>>>>> Maybe look at the Solr payload code to see how to do it in Lucene?
>>>>>>
>>>>>> But yeah, that article is quite out of date.
>>>>>>
>>>>>> On Thu, Jul 5, 2018 at 8:23 AM, <[hidden email]> wrote:
>>>>>>> Thanks i saw these posts but Grant's article is based on Lucene.
>>>>>>>
>>>>>>> i am not using Solr. Many classes in that article does not exist
>>>>>>> in latest
>>>>>>> versions of Lucene like version 6.1.
>>>>>>>
>>>>>>> For instance BoostingTermQuery does not exist in 6.1 and the way
>>>>>>> docs are
>>>>>>> indexed are also different on 6.1.
>>>>>>>
>>>>>>> There is a new class PayloadScoreQuery but there is no examples
>>>>>>> like this
>>>>>>> great article how to put them together.
>>>>>>>
>>>>>>> Best regards
>>>>>>>
>>>>>>>
>>>>>>> On 7/5/18 11:18 AM, Ishan Chattopadhyaya wrote:
>>>>>>>> Try these, maybe?
>>>>>>>>
>>>>>>>>
>>>>>>>> https://urldefense.proofpoint.com/v2/url?u=https-3A__lucidworks.com_2017_09_14_solr-2Dpayloads_&d=DwIBaQ&c=RoP1YumCXCgaWHvlZYR8PZh8Bv7qIrMUB65eapI_JnE&r=nlG5z5NcNdIbQAiX-BKNeyLlULCbaezrgocEvPhQkl4&m=Ak4sr1zTaxPibIGJz26XQrj9fM4hZls8OegNbEWu1lI&s=9hxjhLoi6Lnb7KbYaOeb4-SP039x4Zx0XIynF_HzOJk&e= 
>>>>>>>>
>>>>>>>>
>>>>>>>> https://urldefense.proofpoint.com/v2/url?u=http-3A__www.textsearch.io_-3Fp-3D5&d=DwIBaQ&c=RoP1YumCXCgaWHvlZYR8PZh8Bv7qIrMUB65eapI_JnE&r=nlG5z5NcNdIbQAiX-BKNeyLlULCbaezrgocEvPhQkl4&m=Ak4sr1zTaxPibIGJz26XQrj9fM4hZls8OegNbEWu1lI&s=elEAMRZBIF2jldvS2kCD9B3r43kZ3hOToKVyR0I4qzo&e= 
>>>>>>>>
>>>>>>>>
>>>>>>>> On Thu, Jul 5, 2018 at 8:26 PM, <[hidden email]> wrote:
>>>>>>>>
>>>>>>>>> Hi,-
>>>>>>>>>    Is there a newer version of this great article from Mr. Grant
>>>>>>>>> Ingersoll?
>>>>>>>>>
>>>>>>>>>
>>>>>>>>> https://urldefense.proofpoint.com/v2/url?u=https-3A__lucidworks.com_2009_08_05_getting-2Dstarted-2Dwith-2Dpayloads_&d=DwIBaQ&c=RoP1YumCXCgaWHvlZYR8PZh8Bv7qIrMUB65eapI_JnE&r=nlG5z5NcNdIbQAiX-BKNeyLlULCbaezrgocEvPhQkl4&m=Ak4sr1zTaxPibIGJz26XQrj9fM4hZls8OegNbEWu1lI&s=isAZ026j7ugASeuPdoeXnoi5XfSGfxEgiWECE2ziURo&e= 
>>>>>>>>>
>>>>>>>>> Thanks
>>>>>>>>>
>>>>>>>>> This article is based on Lucene 2.9.
>>>>>>>>> Best regards
>>>>>>>>>
>>>>>>>>> ---------------------------------------------------------------------
>>>>>>>>>
>>>>>>>>> To unsubscribe, e-mail: [hidden email]
>>>>>>>>> For additional commands, e-mail: [hidden email]
>>>>>>>>>
>>>>>>>>>
>>>>>>>
>>>>>>> ---------------------------------------------------------------------
>>>>>>>
>>>>>>> To unsubscribe, e-mail: [hidden email]
>>>>>>> For additional commands, e-mail: [hidden email]
>>>>>>>
>>>>>> ---------------------------------------------------------------------
>>>>>>
>>>>>> To unsubscribe, e-mail: [hidden email]
>>>>>> For additional commands, e-mail: [hidden email]
>>>>>>
>>>>>
>>>>>
>>>>> ---------------------------------------------------------------------
>>>>> To unsubscribe, e-mail: [hidden email]
>>>>> For additional commands, e-mail: [hidden email]
>>>>>
>>>>
>>>>
>>>> ---------------------------------------------------------------------
>>>> To unsubscribe, e-mail: [hidden email]
>>>> For additional commands, e-mail: [hidden email]
>>>>
>>>
>>>
>>> ---------------------------------------------------------------------
>>> To unsubscribe, e-mail: [hidden email]
>>> For additional commands, e-mail: [hidden email]
>>>
>>
>>
>> ---------------------------------------------------------------------
>> To unsubscribe, e-mail: [hidden email]
>> For additional commands, e-mail: [hidden email]
>>
>
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: [hidden email]
> For additional commands, e-mail: [hidden email]
>


---------------------------------------------------------------------
To unsubscribe, e-mail: [hidden email]
For additional commands, e-mail: [hidden email]


---------------------------------------------------------------------
To unsubscribe, e-mail: [hidden email]
For additional commands, e-mail: [hidden email]

Reply | Threaded
Open this post in threaded view
|

Re: Grant Ingersoll's 2009 blog article- is there a newer version?

baris.kazar

Since i cant use payloads, i tried another way: i decomposed my text
into two fields and i want the first field to be searched until nothing
is found/matched there.
then i want the second field to be searched.

currently i added field.setBoost but that does not help. i see scores
being modified but the order of results is pretty much the same.

what happens is that when _part of the search string_ is found on both
fields then those entries are hit first since Lucene scoring takes
number of occurrences as dominant in scoring.

But _i want the search string to be fully-matched with the first field
first, then if not successful, do partial matching on both fields._

i also read
https://lucene.apache.org/core/6_0_1/core/org/apache/lucene/search/similarities/TFIDFSimilarity.html

how can i achieve this in lucene?
Best regards

On 7/8/18 4:08 PM, Baris Kazar wrote:

>   it seems with PayloadScoreQuery, QueryParser is also supported right for searching/indexing ***multiple*** terms.
> However, there are no examples in the Javadocs.
> @Grant Can You please publish new version of Your great blog article?
> Thanks very much
>
>
> ----- Original Message -----
> From: [hidden email]
> To: [hidden email]
> Sent: Friday, July 6, 2018 10:51:46 AM GMT -05:00 US/Canada Eastern
> Subject: Re: Grant Ingersoll's 2009 blog article- is there a newer version?
>
> can someone point to any usage of PayloadScoreQuery class from Lucene
> please?
>
> Best regards
>
>
> On 7/5/18 4:49 PM, [hidden email] wrote:
>> Another piece resolved: DefaultSimilarity is now ClassicSimilarity
>> since LUCENE VERSION 5.4.
>>
>> Best regards
>>
>>
>> On 7/5/18 4:30 PM, [hidden email] wrote:
>>> One thing i noticed is that org.apache.lucene.index.IndexWriter class
>>> does not have setSimilarity and it is moved to
>>>
>>> org.apache.lucene.index.IndexWriterConfig class. thus, i resolved the
>>> first question below.
>>>
>>> Best regards
>>>
>>>
>>> On 7/5/18 3:17 PM, [hidden email] wrote:
>>>> org.apache.lucene.index.IndexWriter class does not have
>>>> setSimilarity method, am i missing something for this?
>>>>
>>>> i checked multiple Lucene versions.
>>>>
>>>>
>>>> next, i have this problem:
>>>>
>>>> After defining the Analyzer as the PayloadAnalyzer like on the blog
>>>> mentioned before,
>>>>
>>>> i declared org.apache.lucene.search.QueryParser (with the analyzer
>>>> mentioned above as the parameter) which was then used in declaring
>>>> the org.apache.lucene.search.Query object via parse method of parser.
>>>>
>>>> Now, i wonder how i can use PayloadScoreQuery in this scenario.
>>>>
>>>>
>>>> Best regards
>>>>
>>>>
>>>>
>>>> On 7/5/18 1:19 PM, [hidden email] wrote:
>>>>> i mean i know the function of BoostingTermQuery class:
>>>>>
>>>>> The BoostingTermQuery is very similar to the SpanTermQuery except
>>>>> that it factors in the value of the payload located at each of the
>>>>> positions where the Term occurs.
>>>>>
>>>>> In order to take advantage of this, you must override
>>>>> Similarity.scorePayload(String, byte[],int,int) which returns 1 by
>>>>> default.
>>>>>
>>>>> Payload scores are averaged across term occurrences in the document.
>>>>>
>>>>>
>>>>> what i am asking is as follows:
>>>>>
>>>>> Does this mean this (BoostingTermQuery in Lucene 2.9 or
>>>>> PayloadScoerQuery in latest Lucene) needs to be called for ***all
>>>>> the words*** scored in the format i mentioned <data>|<payload or
>>>>> score> in the data?
>>>>>
>>>>> Best regards
>>>>>
>>>>> On 7/5/18 1:13 PM, [hidden email] wrote:
>>>>>> Sure, can You please point me to the location under Lucene Solr?
>>>>>>
>>>>>> In Grant's article:
>>>>>>
>>>>>> i want to know the need to use BoostingTermQuery (now in latest
>>>>>> version PayloadScoreQuery)
>>>>>>
>>>>>> where we already specify payloads in the data in the form
>>>>>> <data>|<payload or score>.
>>>>>>
>>>>>> Best regards
>>>>>>
>>>>>>
>>>>>>
>>>>>> On 7/5/18 11:41 AM, Erick Erickson wrote:
>>>>>>> Maybe look at the Solr payload code to see how to do it in Lucene?
>>>>>>>
>>>>>>> But yeah, that article is quite out of date.
>>>>>>>
>>>>>>> On Thu, Jul 5, 2018 at 8:23 AM, <[hidden email]> wrote:
>>>>>>>> Thanks i saw these posts but Grant's article is based on Lucene.
>>>>>>>>
>>>>>>>> i am not using Solr. Many classes in that article does not exist
>>>>>>>> in latest
>>>>>>>> versions of Lucene like version 6.1.
>>>>>>>>
>>>>>>>> For instance BoostingTermQuery does not exist in 6.1 and the way
>>>>>>>> docs are
>>>>>>>> indexed are also different on 6.1.
>>>>>>>>
>>>>>>>> There is a new class PayloadScoreQuery but there is no examples
>>>>>>>> like this
>>>>>>>> great article how to put them together.
>>>>>>>>
>>>>>>>> Best regards
>>>>>>>>
>>>>>>>>
>>>>>>>> On 7/5/18 11:18 AM, Ishan Chattopadhyaya wrote:
>>>>>>>>> Try these, maybe?
>>>>>>>>>
>>>>>>>>>
>>>>>>>>> https://urldefense.proofpoint.com/v2/url?u=https-3A__lucidworks.com_2017_09_14_solr-2Dpayloads_&d=DwIBaQ&c=RoP1YumCXCgaWHvlZYR8PZh8Bv7qIrMUB65eapI_JnE&r=nlG5z5NcNdIbQAiX-BKNeyLlULCbaezrgocEvPhQkl4&m=Ak4sr1zTaxPibIGJz26XQrj9fM4hZls8OegNbEWu1lI&s=9hxjhLoi6Lnb7KbYaOeb4-SP039x4Zx0XIynF_HzOJk&e=
>>>>>>>>>
>>>>>>>>>
>>>>>>>>> https://urldefense.proofpoint.com/v2/url?u=http-3A__www.textsearch.io_-3Fp-3D5&d=DwIBaQ&c=RoP1YumCXCgaWHvlZYR8PZh8Bv7qIrMUB65eapI_JnE&r=nlG5z5NcNdIbQAiX-BKNeyLlULCbaezrgocEvPhQkl4&m=Ak4sr1zTaxPibIGJz26XQrj9fM4hZls8OegNbEWu1lI&s=elEAMRZBIF2jldvS2kCD9B3r43kZ3hOToKVyR0I4qzo&e=
>>>>>>>>>
>>>>>>>>>
>>>>>>>>> On Thu, Jul 5, 2018 at 8:26 PM, <[hidden email]> wrote:
>>>>>>>>>
>>>>>>>>>> Hi,-
>>>>>>>>>>     Is there a newer version of this great article from Mr. Grant
>>>>>>>>>> Ingersoll?
>>>>>>>>>>
>>>>>>>>>>
>>>>>>>>>> https://urldefense.proofpoint.com/v2/url?u=https-3A__lucidworks.com_2009_08_05_getting-2Dstarted-2Dwith-2Dpayloads_&d=DwIBaQ&c=RoP1YumCXCgaWHvlZYR8PZh8Bv7qIrMUB65eapI_JnE&r=nlG5z5NcNdIbQAiX-BKNeyLlULCbaezrgocEvPhQkl4&m=Ak4sr1zTaxPibIGJz26XQrj9fM4hZls8OegNbEWu1lI&s=isAZ026j7ugASeuPdoeXnoi5XfSGfxEgiWECE2ziURo&e=
>>>>>>>>>>
>>>>>>>>>> Thanks
>>>>>>>>>>
>>>>>>>>>> This article is based on Lucene 2.9.
>>>>>>>>>> Best regards
>>>>>>>>>>
>>>>>>>>>> ---------------------------------------------------------------------
>>>>>>>>>>
>>>>>>>>>> To unsubscribe, e-mail: [hidden email]
>>>>>>>>>> For additional commands, e-mail: [hidden email]
>>>>>>>>>>
>>>>>>>>>>
>>>>>>>> ---------------------------------------------------------------------
>>>>>>>>
>>>>>>>> To unsubscribe, e-mail: [hidden email]
>>>>>>>> For additional commands, e-mail: [hidden email]
>>>>>>>>
>>>>>>> ---------------------------------------------------------------------
>>>>>>>
>>>>>>> To unsubscribe, e-mail: [hidden email]
>>>>>>> For additional commands, e-mail: [hidden email]
>>>>>>>
>>>>>>
>>>>>> ---------------------------------------------------------------------
>>>>>> To unsubscribe, e-mail: [hidden email]
>>>>>> For additional commands, e-mail: [hidden email]
>>>>>>
>>>>>
>>>>> ---------------------------------------------------------------------
>>>>> To unsubscribe, e-mail: [hidden email]
>>>>> For additional commands, e-mail: [hidden email]
>>>>>
>>>>
>>>> ---------------------------------------------------------------------
>>>> To unsubscribe, e-mail: [hidden email]
>>>> For additional commands, e-mail: [hidden email]
>>>>
>>>
>>> ---------------------------------------------------------------------
>>> To unsubscribe, e-mail: [hidden email]
>>> For additional commands, e-mail: [hidden email]
>>>
>>
>> ---------------------------------------------------------------------
>> To unsubscribe, e-mail: [hidden email]
>> For additional commands, e-mail: [hidden email]
>>
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: [hidden email]
> For additional commands, e-mail: [hidden email]
>
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: [hidden email]
> For additional commands, e-mail: [hidden email]
>