Search returning unexpected matches at the top

classic Classic list List threaded Threaded
10 messages Options
Reply | Threaded
Open this post in threaded view
|

Search returning unexpected matches at the top

rhys J
I have a search box that is just searching every possible core, and every
possible field.

When I enter 'owl-2924-8', I expect the clt_ref_no of OWL-2924-8 to float
to the top, however it is the third result in my list.

Here is the code from the search:

on_data({
  "responseHeader":{
    "status":0,
    "QTime":31,
    "params":{
      "hl":"true",
      "indent":"on",
      "fl":"debt_id, clt_ref_no",
      "start":"0",
      "sort":"score desc, id asc",
      "rows":"500",
      "version":"2.2",
      "q":"clt_ref_no:owl\\-2924\\-8 debt_descr:owl\\-2924\\-8
comments:owl\\-2924\\-8 reference_no:owl\\-2924\\-8 ",
      "core":"debt",
      "json.wrf":"on_data",
      "urlquery":"owl-2924-8",
      "callback":"?",
      "wt":"json"}},
  "response":{"numFound":85675,"start":0,"docs":[
      {
        "clt_ref_no":"2924",
        "debt_id":"574574"},
      {
        "clt_ref_no":"2924",
        "debt_id":"598663"},
      {
        "clt_ref_no":"OWL-2924-8",
        "debt_id":"624401"},
      {
        "clt_ref_no":"OWL-2924-8",
        "debt_id":"628157"},
      {
        "clt_ref_no":"2924",
        "debt_id":"584807"},
      {
        "clt_ref_no":"U615-2924-8",
        "debt_id":"628310"},
      {
        "clt_ref_no":"OWL-2924-8/73847",
        "debt_id":"596713"},
      {
        "clt_ref_no":"OWL-2924-8/73847",
        "debt_id":"624401"},
      {
        "clt_ref_no":"OWL-2924-8/73847",
        "debt_id":"628157"},
      {

I'm not interested in having a specific search with quotes around it,
because this is searching everything, so it's a fuzzy search. But I am
interested in understanding why 'owl-2924-8' doesn't come out on top of the
search.

As you can see, I'm sorting by score and then id, which should take care of
things, but it's not.

Thanks,

Rhys
Reply | Threaded
Open this post in threaded view
|

Re: Search returning unexpected matches at the top

Alexandre Rafalovitch
You can enable debug which will show you what matches and why. Check
the reference guide for parameters:
https://lucene.apache.org/solr/guide/8_1/common-query-parameters.html#debug-parameter

Regards,
   Alex.

On Fri, 6 Dec 2019 at 11:00, rhys J <[hidden email]> wrote:

>
> I have a search box that is just searching every possible core, and every
> possible field.
>
> When I enter 'owl-2924-8', I expect the clt_ref_no of OWL-2924-8 to float
> to the top, however it is the third result in my list.
>
> Here is the code from the search:
>
> on_data({
>   "responseHeader":{
>     "status":0,
>     "QTime":31,
>     "params":{
>       "hl":"true",
>       "indent":"on",
>       "fl":"debt_id, clt_ref_no",
>       "start":"0",
>       "sort":"score desc, id asc",
>       "rows":"500",
>       "version":"2.2",
>       "q":"clt_ref_no:owl\\-2924\\-8 debt_descr:owl\\-2924\\-8
> comments:owl\\-2924\\-8 reference_no:owl\\-2924\\-8 ",
>       "core":"debt",
>       "json.wrf":"on_data",
>       "urlquery":"owl-2924-8",
>       "callback":"?",
>       "wt":"json"}},
>   "response":{"numFound":85675,"start":0,"docs":[
>       {
>         "clt_ref_no":"2924",
>         "debt_id":"574574"},
>       {
>         "clt_ref_no":"2924",
>         "debt_id":"598663"},
>       {
>         "clt_ref_no":"OWL-2924-8",
>         "debt_id":"624401"},
>       {
>         "clt_ref_no":"OWL-2924-8",
>         "debt_id":"628157"},
>       {
>         "clt_ref_no":"2924",
>         "debt_id":"584807"},
>       {
>         "clt_ref_no":"U615-2924-8",
>         "debt_id":"628310"},
>       {
>         "clt_ref_no":"OWL-2924-8/73847",
>         "debt_id":"596713"},
>       {
>         "clt_ref_no":"OWL-2924-8/73847",
>         "debt_id":"624401"},
>       {
>         "clt_ref_no":"OWL-2924-8/73847",
>         "debt_id":"628157"},
>       {
>
> I'm not interested in having a specific search with quotes around it,
> because this is searching everything, so it's a fuzzy search. But I am
> interested in understanding why 'owl-2924-8' doesn't come out on top of the
> search.
>
> As you can see, I'm sorting by score and then id, which should take care of
> things, but it's not.
>
> Thanks,
>
> Rhys
Reply | Threaded
Open this post in threaded view
|

Re: Search returning unexpected matches at the top

dhastings
whats the field type for:
clt_ref_no
*_no isnt a default dynamic character, and owl-2924-8 usually translates into
owl 2924 8




David J. Hastings | Lead Developer
[hidden email] | 716.882.2600 x 176

William S. Hein & Co., Inc.
2350 North Forest Road | Getzville, NY 14068
www.wshein.com/contact-us

________________________________________
From: Alexandre Rafalovitch <[hidden email]>
Sent: Friday, December 6, 2019 11:15 AM
To: solr-user
Subject: Re: Search returning unexpected matches at the top

You can enable debug which will show you what matches and why. Check
the reference guide for parameters:
https://lucene.apache.org/solr/guide/8_1/common-query-parameters.html#debug-parameter

Regards,
   Alex.

On Fri, 6 Dec 2019 at 11:00, rhys J <[hidden email]> wrote:

>
> I have a search box that is just searching every possible core, and every
> possible field.
>
> When I enter 'owl-2924-8', I expect the clt_ref_no of OWL-2924-8 to float
> to the top, however it is the third result in my list.
>
> Here is the code from the search:
>
> on_data({
>   "responseHeader":{
>     "status":0,
>     "QTime":31,
>     "params":{
>       "hl":"true",
>       "indent":"on",
>       "fl":"debt_id, clt_ref_no",
>       "start":"0",
>       "sort":"score desc, id asc",
>       "rows":"500",
>       "version":"2.2",
>       "q":"clt_ref_no:owl\\-2924\\-8 debt_descr:owl\\-2924\\-8
> comments:owl\\-2924\\-8 reference_no:owl\\-2924\\-8 ",
>       "core":"debt",
>       "json.wrf":"on_data",
>       "urlquery":"owl-2924-8",
>       "callback":"?",
>       "wt":"json"}},
>   "response":{"numFound":85675,"start":0,"docs":[
>       {
>         "clt_ref_no":"2924",
>         "debt_id":"574574"},
>       {
>         "clt_ref_no":"2924",
>         "debt_id":"598663"},
>       {
>         "clt_ref_no":"OWL-2924-8",
>         "debt_id":"624401"},
>       {
>         "clt_ref_no":"OWL-2924-8",
>         "debt_id":"628157"},
>       {
>         "clt_ref_no":"2924",
>         "debt_id":"584807"},
>       {
>         "clt_ref_no":"U615-2924-8",
>         "debt_id":"628310"},
>       {
>         "clt_ref_no":"OWL-2924-8/73847",
>         "debt_id":"596713"},
>       {
>         "clt_ref_no":"OWL-2924-8/73847",
>         "debt_id":"624401"},
>       {
>         "clt_ref_no":"OWL-2924-8/73847",
>         "debt_id":"628157"},
>       {
>
> I'm not interested in having a specific search with quotes around it,
> because this is searching everything, so it's a fuzzy search. But I am
> interested in understanding why 'owl-2924-8' doesn't come out on top of the
> search.
>
> As you can see, I'm sorting by score and then id, which should take care of
> things, but it's not.
>
> Thanks,
>
> Rhys
Reply | Threaded
Open this post in threaded view
|

Re: Search returning unexpected matches at the top

rhys J
On Fri, Dec 6, 2019 at 11:21 AM David Hastings <[hidden email]> wrote:

> whats the field type for:
> clt_ref_no
>

It is a text_general field because it can have numbers or alphanumeric
characters.

*_no isnt a default dynamic character, and owl-2924-8 usually translates
> into
> owl 2924 8
>
>
So it's matching on word breaks, am I understanding properly?

It's matching all things that match either 'owl' or '2924' or '8'?

Thanks,

Rhys
Reply | Threaded
Open this post in threaded view
|

Re: Search returning unexpected matches at the top

Erick Erickson
Please look at the admin UI>>collection_or_core>>analysis page. That will tell you exactly how your input is being transformed. Very often WordDelimiter(Graph)FilterFactory is what breaks data up like this, that’s what it’s _designed_ for.

Best,
Erick

> On Dec 6, 2019, at 11:25 AM, rhys J <[hidden email]> wrote:
>
> On Fri, Dec 6, 2019 at 11:21 AM David Hastings <[hidden email]> wrote:
>
>> whats the field type for:
>> clt_ref_no
>>
>
> It is a text_general field because it can have numbers or alphanumeric
> characters.
>
> *_no isnt a default dynamic character, and owl-2924-8 usually translates
>> into
>> owl 2924 8
>>
>>
> So it's matching on word breaks, am I understanding properly?
>
> It's matching all things that match either 'owl' or '2924' or '8'?
>
> Thanks,
>
> Rhys

Reply | Threaded
Open this post in threaded view
|

Re: Search returning unexpected matches at the top

Paras Lehana
Hi Rhys,

Use Solr Query Debugger
<https://chrome.google.com/webstore/detail/solr-query-debugger/gmpkeiamnmccifccnbfljffkcnacmmdl?hl=en>
Chrome
Extension to see what's making up the score for both of them. I guess
fieldNorm should impact but that should not be the only thing - there's
another catch here.

On Fri, 6 Dec 2019 at 22:00, Erick Erickson <[hidden email]> wrote:

> Please look at the admin UI>>collection_or_core>>analysis page. That will
> tell you exactly how your input is being transformed. Very often
> WordDelimiter(Graph)FilterFactory is what breaks data up like this, that’s
> what it’s _designed_ for.
>
> Best,
> Erick
>
> > On Dec 6, 2019, at 11:25 AM, rhys J <[hidden email]> wrote:
> >
> > On Fri, Dec 6, 2019 at 11:21 AM David Hastings <[hidden email]>
> wrote:
> >
> >> whats the field type for:
> >> clt_ref_no
> >>
> >
> > It is a text_general field because it can have numbers or alphanumeric
> > characters.
> >
> > *_no isnt a default dynamic character, and owl-2924-8 usually translates
> >> into
> >> owl 2924 8
> >>
> >>
> > So it's matching on word breaks, am I understanding properly?
> >
> > It's matching all things that match either 'owl' or '2924' or '8'?
> >
> > Thanks,
> >
> > Rhys
>
>

--
--
Regards,

*Paras Lehana* [65871]
Development Engineer, Auto-Suggest,
IndiaMART Intermesh Ltd.

8th Floor, Tower A, Advant-Navis Business Park, Sector 142,
Noida, UP, IN - 201303

Mob.: +91-9560911996
Work: 01203916600 | Extn:  *8173*

--
*
*

 <https://www.facebook.com/IndiaMART/videos/578196442936091/>
Reply | Threaded
Open this post in threaded view
|

Re: Search returning unexpected matches at the top

rhys J
On Mon, Dec 9, 2019 at 12:06 AM Paras Lehana <[hidden email]>
wrote:

> Hi Rhys,
>
> Use Solr Query Debugger
> <
> https://chrome.google.com/webstore/detail/solr-query-debugger/gmpkeiamnmccifccnbfljffkcnacmmdl?hl=en
> >
> Chrome
> Extension to see what's making up the score for both of them. I guess
> fieldNorm should impact but that should not be the only thing - there's
> another catch here.
>

Oh wow, thank you for this!

I figured out that if I added quotes to the terms, and then added ^2 to the
score, that it floated to the top just like I expected.

Thanks,

Rhys
Reply | Threaded
Open this post in threaded view
|

Re: Search returning unexpected matches at the top

Paras Lehana
That's great.

But I also wanted to know why the concerned document was scored lower in
the original query. Anyways, glad that the issue is resolved. :)

On Tue, 10 Dec 2019 at 00:38, rhys J <[hidden email]> wrote:

> On Mon, Dec 9, 2019 at 12:06 AM Paras Lehana <[hidden email]>
> wrote:
>
> > Hi Rhys,
> >
> > Use Solr Query Debugger
> > <
> >
> https://chrome.google.com/webstore/detail/solr-query-debugger/gmpkeiamnmccifccnbfljffkcnacmmdl?hl=en
> > >
> > Chrome
> > Extension to see what's making up the score for both of them. I guess
> > fieldNorm should impact but that should not be the only thing - there's
> > another catch here.
> >
>
> Oh wow, thank you for this!
>
> I figured out that if I added quotes to the terms, and then added ^2 to the
> score, that it floated to the top just like I expected.
>
> Thanks,
>
> Rhys
>


--
--
Regards,

*Paras Lehana* [65871]
Development Engineer, Auto-Suggest,
IndiaMART Intermesh Ltd.

8th Floor, Tower A, Advant-Navis Business Park, Sector 142,
Noida, UP, IN - 201303

Mob.: +91-9560911996
Work: 01203916600 | Extn:  *8173*

--
*
*

 <https://www.facebook.com/IndiaMART/videos/578196442936091/>
Reply | Threaded
Open this post in threaded view
|

Re: Search returning unexpected matches at the top

rhys J
On Tue, Dec 10, 2019 at 12:35 AM Paras Lehana <[hidden email]>
wrote:

> That's great.
>
> But I also wanted to know why the concerned document was scored lower in
> the original query. Anyways, glad that the issue is resolved. :)
>
>
That I need to look into. If I find an answer, I will let you know.

Thanks,

Rhys
Reply | Threaded
Open this post in threaded view
|

Re: Search returning unexpected matches at the top

rhys J
I did the following query with debug=results turned on to get an
explanation about why owl-2924-8 does not float to the top of the search
results.

I'm not good at parsing it all, but it appears that it's doing what we
expected, in that it's treating the - like a word boundary and matching
owl, then 2924, then 8 on fields in the core?

So it appears that I'm just lucky that clt_ref_no: owl-2924-8 is even 3rd
in the results?

Thanks,

Rhys

pasted data:

on_data({
  "responseHeader":{
    "status":0,
    "QTime":741,
    "params":{
      "q":"clt_ref_no: owl-2924-8 debt_descr:owl-2924-8 comments:owl-2924-8
reference_no:owl-2924-8 ",
      "core":"debt",
      "json.wrf":"on_data",
      "debug":"results",
      "urlquery":"owl-2924-8",
      "hl":"true",
      "indent":"on",
      "start":"0",
      "callback":"?",
      "sort":"score desc, id asc",
      "rows":"500",
      "wt":"json"}},


 "response":{"numFound":85675,"start":0,"docs":[
      {
        "id":"574574-3",
        "adjust_int":0.0,
        "adjust_princ":0.0,
        "clt_id":"4523",
        "clt_ref_no":"2924",
        "comments":" ",
        "debt_descr":" ",
        "debt_id":"574574",
        "debt_no":3,
        "debt_type":"COM",
        "delq_date":"2014-09-03T00:00:00Z",
        "internal_adjustment":0,
        "list_date":"2015-01-22T00:00:00Z",
        "orig_clt":"4523",
        "orig_int_amt":0.0,
        "orig_princ_amt":993.84,
        "potential_bad_debt":0,
        "princ_paid":993.84,
        "reference_no":"2924",
        "serv_date":"2014-08-04T00:00:00Z",
        "status_code":520,
        "status_date":"2016-04-02T00:00:00Z",
        "storage_account":0,
        "time_stamp":"2015-01-22T02:47:00Z",
        "_version_":1652199132255748098},
      {
        "id":"598663-1383",
        "adjust_int":0.0,
        "adjust_princ":0.0,
        "clt_id":"4436",
        "clt_ref_no":"2924",
        "comments":" ",
        "debt_descr":" ",
        "debt_id":"598663",
        "debt_no":1383,
        "debt_type":"COM",
        "delq_date":"2016-08-24T00:00:00Z",
        "internal_adjustment":0,
        "list_date":"2016-07-26T00:00:00Z",
        "orig_clt":"4436",
        "orig_int_amt":0.0,
        "orig_princ_amt":263.06,
        "potential_bad_debt":0,
        "princ_paid":263.06,
        "reference_no":"2924",
        "reg_number":"TQ030700D",
        "serv_date":"2016-07-25T00:00:00Z",
        "status_code":520,
        "status_date":"2017-01-01T00:00:00Z",
        "storage_account":0,
        "time_stamp":"2016-07-26T05:01:00Z",
        "_version_":1652200013125648386},
      {
        "id":"624401-64",
        "adjust_int":0.0,
        "adjust_princ":0.0,
        "clt_id":"3026",
        "clt_ref_no":"OWL-2924-8",
        "comments":" ",
        "contract_number":"OWL-2924-8",
        "debt_descr":"PO/XREF: UCT RUN 6/26",
        "debt_id":"624401",
        "debt_no":64,
        "debt_type":"COM",
        "delq_date":"2018-11-17T00:00:00Z",
        "internal_adjustment":0,
        "list_date":"2018-07-25T00:00:00Z",
        "orig_clt":"3026",
        "orig_int_amt":0.0,
        "orig_princ_amt":1937.64,
        "potential_bad_debt":0,
        "princ_paid":2137.64,
        "reference_no":"invoice:OWL-2924-8",
        "salesperson":"Russell Hough",
        "serv_date":"2018-10-18T00:00:00Z",
        "status_code":102,
        "status_date":"2019-08-01T00:00:00Z",
        "storage_account":0,
        "time_stamp":"2018-07-25T11:33:00Z",
        "_version_":1652200105184329729},
      {
        "id":"628157-332",
        "adjust_int":0.0,
        "adjust_princ":0.0,
        "clt_id":"3026",
        "clt_ref_no":"OWL-2924-8",
        "comments":" ",
        "contract_number":"OWL-2924-8",
        "debt_descr":"PO/XREF: UCT RUN 6/26",
        "debt_id":"628157",
        "debt_no":332,
        "debt_type":"COM",
        "delq_date":"2018-11-17T00:00:00Z",
        "internal_adjustment":0,
        "list_date":"2018-10-18T00:00:00Z",
        "orig_clt":"3026",
        "orig_int_amt":0.0,
        "orig_princ_amt":2137.64,
        "potential_bad_debt":0,
        "princ_paid":2137.64,
        "reference_no":"invoice:OWL-2924-8",
        "salesperson":"Russell Hough",
        "serv_date":"2018-10-18T00:00:00Z",
        "status_code":520,
        "status_date":"2018-10-24T00:00:00Z",
        "storage_account":0,
        "time_stamp":"2018-10-18T00:22:00Z",
        "_version_":1652200116586545154},
      {
        "id":"584807-3",
        "adjust_int":0.0,
        "adjust_princ":0.0,
        "clt_id":"9601",
        "clt_ref_no":"2924",
        "comments":" ",
        "debt_descr":" ",
        "debt_id":"584807",
        "debt_no":3,
        "debt_type":"COM",
        "delq_date":"2015-10-08T00:00:00Z",
        "internal_adjustment":0,
        "list_date":"2015-10-08T00:00:00Z",
        "orig_clt":"9601",
        "orig_int_amt":0.0,
        "orig_princ_amt":785.01,
        "potential_bad_debt":0,
        "princ_paid":785.01,
        "reference_no":"IN:2924",
        "serv_date":"2015-10-08T00:00:00Z",
        "status_code":520,
        "status_date":"2015-10-08T00:00:00Z",
        "storage_account":0,
        "time_stamp":"2015-10-08T22:30:00Z",
        "_version_":1652199173748948992},
      {
        "id":"628310-6004",
        "adjust_int":0.0,
        "adjust_princ":0.0,
        "clt_id":"3006",
        "clt_ref_no":"U615-2924-8",
        "comments":" ",
        "contract_number":"U615-2924-8",
        "debt_descr":"PO/XREF: CO-0073794",
        "debt_id":"628310",
        "debt_no":6004,
        "debt_type":"COM",
        "delq_date":"2018-12-30T00:00:00Z",
        "internal_adjustment":0,
        "list_date":"2018-12-08T00:00:00Z",
        "orig_clt":"3006",
        "orig_int_amt":0.0,
        "orig_princ_amt":1389.22,
        "potential_bad_debt":0,
        "princ_paid":0.0,
        "reference_no":"invoice:U615-2924-8",
        "salesperson":"James Hillstead",
        "serv_date":"2018-11-30T00:00:00Z",
        "status_code":134,
        "status_date":"2018-12-08T00:00:00Z",
        "storage_account":0,
        "time_stamp":"2018-12-08T22:52:00Z",
        "_version_":1652200120097177602},



"574574-3":"\n12.540382 = sum of:\n  6.159883 = weight(clt_ref_no:2924 in
272965) [SchemaSimilarity], result of:\n    6.159883 = score(freq=1.0),
product of:\n      11.528743 = idf, computed as log(1 + (N - n + 0.5) / (n
+ 0.5)) from:\n        28 = n, number of documents containing term\n
 2895437 = N, total number of documents with field\n      0.5343066 = tf,
computed as freq / (freq + k1 * (1 - b + b * dl / avgdl)) from:\n 1.0 =
freq, occurrences of term within document\n        1.2 = k1, term
saturation parameter\n        0.75 = b, length normalization parameter\n
     1.0 = dl, length of field\n        1.57457 = avgdl, average length of
field\n  6.3805 = weight(reference_no:2924 in 272965) [SchemaSimilarity],
result of:\n    6.3805 = score(freq=1.0), product of:\n      11.528412 =
idf, computed as log(1 + (N - n + 0.5) / (n + 0.5)) from:\n 28 = n, number
of documents containing term\n        2894478 = N, total number of
documents with field\n 0.5534587 = tf, computed as freq / (freq + k1 * (1 -
b + b * dl / avgdl)) from:\n        1.0 = freq, occurrences of term within
document\n        1.2 = k1, term saturation parameter\n        0.75 = b,
length normalization parameter\n        1.0 = dl, length of field\n
 1.77578 = avgdl, average length of field\n",


"598663-1383":"\n12.540382 = sum of:\n  6.159883 = weight(clt_ref_no:2924
in 87499) [SchemaSimilarity], result of:\n    6.159883 = score(freq=1.0),
product of:\n      11.528743 = idf, computed as log(1 + (N - n + 0.5) / (n
+ 0.5)) from:\n        28 = n, number of documents containing term\n
 2895437 = N, total number of documents with field\n      0.5343066 = tf,
computed as freq / (freq + k1 * (1 - b + b * dl / avgdl)) from:\n
 1.0 = freq, occurrences of term within document\n        1.2 = k1, term
saturation parameter\n        0.75 = b, length normalization parameter\n
     1.0 = dl, length of field\n        1.57457 = avgdl, average length of
field\n  6.3805 = weight(reference_no:2924 in 87499) [SchemaSimilarity],
result of:\n    6.3805 = score(freq=1.0), product of:\n      11.528412 =
idf, computed as log(1 + (N - n + 0.5) / (n + 0.5)) from:\n        28 = n,
number of documents containing term\n        2894478 = N, total number of
documents with field\n      0.5534587 = tf, computed as freq / (freq + k1 *
(1 - b + b * dl / avgdl)) from:\n        1.0 = freq, occurrences of term
within document\n        1.2 = k1, term saturation parameter\n        0.75
= b, length normalization parameter\n        1.0 = dl, length of field\n
     1.77578 = avgdl, average length of field\n",

"624401-64":"\n12.104917 = sum of:\n  1.7596623 = weight(clt_ref_no:owl in
1745) [SchemaSimilarity], result of:\n    1.7596623 = score(freq=1.0),
product of:\n      5.304949 = idf, computed as log(1 + (N - n + 0.5) / (n +
0.5)) from:\n        14381 = n, number of documents containing term\n
 2895437 = N, total number of documents with field\n      0.33170202 = tf,
computed as freq / (freq + k1 * (1 - b + b * dl / avgdl)) from:\n
 1.0 = freq, occurrences of term within document\n        1.2 = k1, term
saturation parameter\n        0.75 = b, length normalization parameter\n
     3.0 = dl, length of field\n        1.57457 = avgdl, average length of
field\n  3.8241074 = weight(clt_ref_no:2924 in 1745) [SchemaSimilarity],
result of:\n    3.8241074 = score(freq=1.0), product of:\n      11.528743 =
idf, computed as log(1 + (N - n + 0.5) / (n + 0.5)) from:\n        28 = n,
number of documents containing term\n        2895437 = N, total number of
documents with field\n      0.33170202 = tf, computed as freq / (freq + k1
* (1 - b + b * dl / avgdl)) from:\n        1.0 = freq, occurrences of term
within document\n        1.2 = k1, term saturation parameter\n        0.75
= b, length normalization parameter\n        3.0 = dl, length of field\n
     1.57457 = avgdl, average length of field\n  1.1763713 =
weight(clt_ref_no:8 in 1745) [SchemaSimilarity], result of:\n    1.1763713
= score(freq=1.0), product of:\n      3.5464702 = idf, computed as log(1 +
(N - n + 0.5) / (n + 0.5)) from:\n        83464 = n, number of documents
containing term\n        2895437 = N, total number of documents with
field\n      0.33170202 = tf, computed as freq / (freq + k1 * (1 - b + b *
dl / avgdl)) from:\n        1.0 = freq, occurrences of term within
document\n        1.2 = k1, term saturation parameter\n        0.75 = b,
length normalization parameter\n        3.0 = dl, length of field\n
 1.57457 = avgdl, average length of field\n  4.0874248 =
weight(reference_no:2924 in 1745) [SchemaSimilarity], result of:\n
 4.0874248 = score(freq=1.0), product of:\n      11.528412 = idf, computed
as log(1 + (N - n + 0.5) / (n + 0.5)) from:\n        28 = n, number of
documents containing term\n        2894478 = N, total number of documents
with field\n      0.35455227 = tf, computed as freq / (freq + k1 * (1 - b +
b * dl / avgdl)) from:\n        1.0 = freq, occurrences of term within
document\n        1.2 = k1, term saturation parameter\n        0.75 = b,
length normalization parameter\n        3.0 = dl, length of field\n
 1.77578 = avgdl, average length of field\n  1.2573512 =
weight(reference_no:8 in 1745) [SchemaSimilarity], result of:\n
 1.2573512 = score(freq=1.0), product of:\n      3.5463068 = idf, computed
as log(1 + (N - n + 0.5) / (n + 0.5)) from:\n        83450 = n, number of
documents containing term\n        2894478 = N, total number of documents
with field\n      0.35455227 = tf, computed as freq / (freq + k1 * (1 - b +
b * dl / avgdl)) from:\n        1.0 = freq, occurrences of term within
document\n        1.2 = k1, term saturation parameter\n        0.75 = b,
length normalization parameter\n        3.0 = dl, length of field\n
 1.77578 = avgdl, average length of field\n",
"628157-332":"\n12.104917 = sum of:\n  1.7596623 = weight(clt_ref_no:owl in
32270) [SchemaSimilarity], result of:\n    1.7596623 = score(freq=1.0),
product of:\n      5.304949 = idf, computed as log(1 + (N - n + 0.5) / (n +
0.5)) from:\n        14381 = n, number of documents containing term\n
 2895437 = N, total number of documents with field\n      0.33170202 = tf,
computed as freq / (freq + k1 * (1 - b + b * dl / avgdl)) from:\n
 1.0 = freq, occurrences of term within document\n        1.2 = k1, term
saturation parameter\n        0.75 = b, length normalization parameter\n
     3.0 = dl, length of field\n        1.57457 = avgdl, average length of
field\n 3.8241074 = weight(clt_ref_no:2924 in 32270) [SchemaSimilarity],
result of:\n    3.8241074 = score(freq=1.0), product of:\n      11.528743 =
idf, computed as log(1 + (N - n + 0.5) / (n + 0.5)) from:\n        28 = n,
number of documents containing term\n        2895437 = N, total number of
documents with field\n      0.33170202 = tf, computed as freq / (freq + k1
* (1 - b + b * dl / avgdl)) from:\n        1.0 = freq, occurrences of term
within document\n        1.2 = k1, term saturation parameter\n        0.75
= b, length normalization parameter\n        3.0 = dl, length of field\n
     1.57457 = avgdl, average length of field\n  1.1763713 =
weight(clt_ref_no:8 in 32270) [SchemaSimilarity], result of:\n    1.1763713
= score(freq=1.0), product of:\n      3.5464702 = idf, computed as log(1 +
(N - n + 0.5) / (n + 0.5)) from:\n        83464 = n, number of documents
containing term\n        2895437 = N, total number of documents with
field\n      0.33170202 = tf, computed as freq / (freq + k1 * (1 - b + b *
dl / avgdl)) from:\n        1.0 = freq, occurrences of term within
document\n        1.2 = k1, term saturation parameter\n 0.75 = b, length
normalization parameter\n        3.0 = dl, length of field\n        1.57457
= avgdl, average length of field\n  4.0874248 = weight(reference_no:2924 in
32270) [SchemaSimilarity], result of:\n    4.0874248 = score(freq=1.0),
product of:\n      11.528412 = idf, computed as log(1 + (N - n + 0.5) / (n
+ 0.5)) from:\n        28 = n, number of documents containing term\n
 2894478 = N, total number of documents with field\n 0.35455227 = tf,
computed as freq / (freq + k1 * (1 - b + b * dl / avgdl)) from:\n
 1.0 = freq, occurrences of term within document\n        1.2 = k1, term
saturation parameter\n        0.75 = b, length normalization parameter\n
     3.0 = dl, length of field\n        1.77578 = avgdl, average length of
field\n  1.2573512 = weight(reference_no:8 in 32270) [SchemaSimilarity],
result of:\n    1.2573512 = score(freq=1.0), product of:\n 3.5463068 = idf,
computed as log(1 + (N - n + 0.5) / (n + 0.5)) from:\n        83450 = n,
number of documents containing term\n        2894478 = N, total number of
documents with field\n      0.35455227 = tf, computed as freq / (freq + k1
* (1 - b + b * dl / avgdl)) from:\n        1.0 = freq, occurrences of term
within document\n        1.2 = k1, term saturation parameter\n        0.75
= b, length normalization parameter\n        3.0 = dl, length of field\n
1.77578 = avgdl, average length of field\n",

"584807-3":"\n11.142687 = sum of:\n  6.159883 = weight(clt_ref_no:2924 in
27502) [SchemaSimilarity], result of:\n    6.159883 = score(freq=1.0),
product of:\n      11.528743 = idf, computed as log(1 + (N - n + 0.5) / (n
+ 0.5)) from:\n        28 = n, number of documents containing term\n
 2895437 = N, total number of documents with field\n      0.5343066 = tf,
computed as freq / (freq + k1 * (1 - b + b * dl / avgdl)) from:\n
 1.0 = freq, occurrences of term within document\n        1.2 = k1, term
saturation parameter\n        0.75 = b, length normalization parameter\n
     1.0 = dl, length of field\n        1.57457 = avgdl, average length of
field\n  4.9828043 = weight(reference_no:2924 in 27502) [SchemaSimilarity],
result of:\n    4.9828043 = score(freq=1.0), product of:\n      11.528412 =
idf, computed as log(1 + (N - n + 0.5) / (n + 0.5)) from:\n        28 = n,
number of documents containing term\n        2894478 = N, total number of
documents with field\n      0.4322195 = tf, computed as freq / (freq + k1 *
(1 - b + b * dl / avgdl)) from:\n        1.0 = freq, occurrences of term
within document\n        1.2 = k1, term saturation parameter\n        0.75
= b, length normalization parameter\n        2.0 = dl, length of field\n
     1.77578 = avgdl, average length of field\n",
      "628310-6004":"\n10.345255 = sum of:\n  3.8241074 =
weight(clt_ref_no:2924 in 2391) [SchemaSimilarity], result of:\n
 3.8241074 = score(freq=1.0), product of:\n      11.528743 = idf, computed
as log(1 + (N - n + 0.5) / (n + 0.5)) from:\n        28 = n, number of
documents containing term\n        2895437 = N, total number of documents
with field\n      0.33170202 = tf, computed as freq / (freq + k1 * (1 - b +
b * dl / avgdl)) from:\n        1.0 = freq, occurrences of term within
document\n        1.2 = k1, term saturation parameter\n        0.75 = b,
length normalization parameter\n        3.0 = dl, length of field\n
 1.57457 = avgdl, average length of field\n  1.1763713 =
weight(clt_ref_no:8 in 2391) [SchemaSimilarity], result of:\n    1.1763713
= score(freq=1.0), product of:\n      3.5464702 = idf, computed as log(1 +
(N - n + 0.5) / (n + 0.5)) from:\n        83464 = n, number of documents
containing term\n        2895437 = N, total number of documents with
field\n      0.33170202 = tf, computed as freq / (freq + k1 * (1 - b + b *
dl / avgdl)) from:\n        1.0 = freq, occurrences of term within
document\n        1.2 = k1, term saturation parameter\n        0.75 = b,
length normalization parameter\n        3.0 = dl, length of field\n
 1.57457 = avgdl, average length of field\n  4.0874248 =
weight(reference_no:2924 in 2391) [SchemaSimilarity], result of:\n
 4.0874248 = score(freq=1.0), product of:\n      11.528412 = idf, computed
as log(1 + (N - n + 0.5) / (n + 0.5)) from:\n        28 = n, number of
documents containing term\n        2894478 = N, total number of documents
with field\n      0.35455227 = tf, computed as freq / (freq + k1 * (1 - b +
b * dl / avgdl)) from:\n        1.0 = freq, occurrences of term within
document\n        1.2 = k1, term saturation parameter\n        0.75 = b,
length normalization parameter\n        3.0 = dl, length of field\n
 1.77578 = avgdl, average length of field\n  1.2573512 =
weight(reference_no:8 in 2391) [SchemaSimilarity], result of:\n
 1.2573512 = score(freq=1.0), product of:\n      3.5463068 = idf, computed
as log(1 + (N - n + 0.5) / (n + 0.5)) from:\n        83450 = n, number of
documents containing term\n        2894478 = N, total number of documents
with field\n      0.35455227 = tf, computed as freq / (freq + k1 * (1 - b +
b * dl / avgdl)) from:\n        1.0 = freq, occurrences of term within
document\n        1.2 = k1, term saturation parameter\n        0.75 = b,
length normalization parameter\n        3.0 = dl, length of field\n
 1.77578 = avgdl, average length of field\n",

On Tue, Dec 10, 2019 at 10:56 AM rhys J <[hidden email]> wrote:

>
>
> On Tue, Dec 10, 2019 at 12:35 AM Paras Lehana <[hidden email]>
> wrote:
>
>> That's great.
>>
>> But I also wanted to know why the concerned document was scored lower in
>> the original query. Anyways, glad that the issue is resolved. :)
>>
>>
> That I need to look into. If I find an answer, I will let you know.
>
> Thanks,
>
> Rhys
>