[jira] [Created] (SOLR-3008) edismax pf clause makes no sense when query already has quoted subphrase

JIRA jira@apache.org
edismax pf clause makes no sense when query already has quoted subphrase
------------------------------------------------------------------------

                 Key: SOLR-3008
                 URL: https://issues.apache.org/jira/browse/SOLR-3008
             Project: Solr
          Issue Type: Bug
            Reporter: Hoss Man


As noted by ldavid2020 on the solr-user mailing list (Tue, 20 Dec 2011), the behavior of edismax makes no sense when the pf param is used and the query string contains quotes around part of the query...

{quote}
For the same query: 2012 "japan airlines" flight status

dismax...

[http://localhost:8983/solr/select?q=2012+"japan+airlines"+flight+status&qf=TTL&pf=TTL&debugQuery=true&defType=dismax]

outputs:

{noformat}
+((DisjunctionMaxQuery((TTL:2012)~0.1)
   DisjunctionMaxQuery((TTL:"japan airlin"~3)~0.1)
   DisjunctionMaxQuery((TTL:flight)~0.1)
   DisjunctionMaxQuery((TTL:status)~0.1)
  )~3)
DisjunctionMaxQuery((TTL:"2012 japan airlin flight status"~3)~0.1)
{noformat}

The parsedquery has DisjunctionMaxQuery((TTL:"2012 japan airlin flight status"~3)~0.1).

While edismax...

[http://localhost:8983/solr/select?q=2012+"japan+airlines"+flight+status&qf=TTL&pf=TTL&debugQuery=true&defType=edismax]

outputs:

{noformat}
+((DisjunctionMaxQuery((TTL:2012)~0.1)
   DisjunctionMaxQuery((TTL:"japan airlin"~3)~0.1)
   DisjunctionMaxQuery((TTL:flight)~0.1)
   DisjunctionMaxQuery((TTL:status)~0.1)
  )~3)
DisjunctionMaxQuery((TTL:"2012 flight status"~3)~0.1)
{noformat}

The parsedquery has DisjunctionMaxQuery((TTL:"2012 flight status"~3)~0.1).

...

So it seems edismax ignores "japan airlines" for the pf matching. This could cause relevancy problems: a document containing the exact phrase "2012 japan airlines flight status" will get the same relevancy score as one in which the two phrases "japan airlines" and "2012 flight status" appear far apart.
{quote}
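
To make the difference concrete, here is a toy Python model of how the pf phrase appears to be assembled in each case. It is only a sketch of the behavior visible in the debug output above, not Solr's actual parser code: the function names are hypothetical, and it skips analysis entirely (so it yields "airlines" where the real output shows the stemmed "airlin").

```python
import re

# Matches either a double-quoted subphrase or a bare whitespace-delimited term.
_TOKEN = re.compile(r'"([^"]*)"|(\S+)')


def pf_phrase_dismax_like(q: str) -> str:
    """Toy model of the dismax output: every term, quoted or not, is kept."""
    terms = []
    for quoted, bare in _TOKEN.findall(q):
        if quoted:
            terms.extend(quoted.split())
        elif bare:
            terms.append(bare)
    return " ".join(terms)


def pf_phrase_edismax_like(q: str) -> str:
    """Toy model of the reported edismax output: quoted subphrases are dropped."""
    return " ".join(bare for _quoted, bare in _TOKEN.findall(q) if bare)


query = '2012 "japan airlines" flight status'
print(pf_phrase_dismax_like(query))
print(pf_phrase_edismax_like(query))
```

With the query from the report, the first function keeps all five terms in one phrase, while the second produces a phrase that no longer contains "japan airlines", which mirrors the two pf clauses shown above.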


       
