querying for positions (with term vector component)

Previous Topic Next Topic
 
classic Classic list List threaded Threaded
1 message Options
Reply | Threaded
Open this post in threaded view
|

querying for positions (with term vector component)

Arturas Mazeika
Hi All,

Term vector component allows to store and retrieve quite a bit of very
useful information, e.g., for a term one can get:

"verlag",[
"tf",6,
"positions",["position",5, "position",42, "position",64,
"position",85, "position",106, "position",127],
"offsets",[
  "start",35, "end",41,
  "start",258,"end",264,
  "start",380,"end",386,
  "start",498,"end",504,
  "start",616,"end",622,
  "start",734,"end",740],
"df",190,
"tf-idf",0.031578947368421054]


Is there a way to query for those? Example of queries would be "Hello"
occurred in the 1st 100 positions (bytes), the word "bye" occurred in the
last 100 positions (bytes).

Cheers,
Arturas