[jira] Created: (SOLR-2166) termvector component has strange syntax

classic Classic list List threaded Threaded
3 messages Options
Reply | Threaded
Open this post in threaded view
|

[jira] Created: (SOLR-2166) termvector component has strange syntax

Sebastian Nagel (Jira)
termvector component has strange syntax
---------------------------------------

                 Key: SOLR-2166
                 URL: https://issues.apache.org/jira/browse/SOLR-2166
             Project: Solr
          Issue Type: Improvement
            Reporter: Yonik Seeley


The termvector  response format could really be improved.

--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


---------------------------------------------------------------------
To unsubscribe, e-mail: [hidden email]
For additional commands, e-mail: [hidden email]

Reply | Threaded
Open this post in threaded view
|

[jira] Commented: (SOLR-2166) termvector component has strange syntax

Sebastian Nagel (Jira)

    [ https://issues.apache.org/jira/browse/SOLR-2166?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12921361#action_12921361 ]

Yonik Seeley commented on SOLR-2166:
------------------------------------

Here's one example snippet:

{code}
        "titl":{
          "tf":2,
          "offsets":{
            "start":10,
            "end":15,
            "start":28,
            "end":33},
          "positions":{
            "position":0,
            "position":2},
          "df":1,
          "tf-idf":2.0}},
{code}

Looks like offsets should be an array of int - start/end pairs, and positions should just be an array of int.
Also, the "doc-0" label is strange:
{code}
  "termVectors":{
    "doc-0":{
      "uniqueKey":"0",
      "test_basictv":{
{code}

Perhaps termVectors should just be an array, and the internal lucene docid should be just another key/value (if we really want to keep that).

> termvector component has strange syntax
> ---------------------------------------
>
>                 Key: SOLR-2166
>                 URL: https://issues.apache.org/jira/browse/SOLR-2166
>             Project: Solr
>          Issue Type: Improvement
>            Reporter: Yonik Seeley
>
> The termvector  response format could really be improved.

--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


---------------------------------------------------------------------
To unsubscribe, e-mail: [hidden email]
For additional commands, e-mail: [hidden email]

Reply | Threaded
Open this post in threaded view
|

[jira] Commented: (SOLR-2166) termvector component has strange syntax

Sebastian Nagel (Jira)
In reply to this post by Sebastian Nagel (Jira)

    [ https://issues.apache.org/jira/browse/SOLR-2166?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12921394#action_12921394 ]

Grant Ingersoll commented on SOLR-2166:
---------------------------------------

See SOLR-651 for much discussion around it.  I've always felt it was overly verbose, but I recall the general consensus at the time was to reuse Solr's existing capabilities (I don't recall us being able to just an array, but perhaps I missed it) instead of having a much more compact representation.

At any rate, +1 for a less verbose format.

> termvector component has strange syntax
> ---------------------------------------
>
>                 Key: SOLR-2166
>                 URL: https://issues.apache.org/jira/browse/SOLR-2166
>             Project: Solr
>          Issue Type: Improvement
>            Reporter: Yonik Seeley
>
> The termvector  response format could really be improved.

--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


---------------------------------------------------------------------
To unsubscribe, e-mail: [hidden email]
For additional commands, e-mail: [hidden email]