multi word synonym

multi word synonym

MS
I have read the previous discussions on multi-word synonyms, as well as
the section on synonym injection in Hatcher's book, but have not been able
to come up with a satisfactory solution. I am indexing text that contains
several multi-word synonyms. Some synonym sets also contain single-word
entries. The individual words of a multi-word phrase should match only a
search for the whole phrase: "chest pain" should match a query for
"chest pain" but not a query for "chest" or for "pain". It should also match
a query for its synonym, "angina". So, in this sentence:
Lab results for alpha 1 antitrypsin level....
I would like to index 'alpha-1-antitrypsin', 'antitrypsin', 'antitrypsin,
alpha 1' and 'A1AT' as synonyms for the phrase alpha-1-antitrypsin in the
sentence. Thanks in advance...
madhu
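
To make the requirement above concrete, here is a minimal sketch in plain Java of the token stream being asked for. It is not Lucene code and not a solution from this thread: the class PhraseSynonymSketch, its Token type, and the hard-coded dictionary are all hypothetical. It assumes whitespace-separated, lower-cased input and does a longest-match lookup, emitting the whole phrase and its synonyms at one position (the equivalent of a position increment of 0 in Lucene) and never emitting the phrase's individual words.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Locale;
import java.util.Map;

/** Hypothetical helper, not part of Lucene: one token per dictionary phrase. */
class PhraseSynonymSketch {

    /** A term and its position; tokens sharing a position are synonyms. */
    static final class Token {
        final String term;
        final int position;
        Token(String term, int position) { this.term = term; this.position = position; }
        @Override public String toString() { return term + "@" + position; }
    }

    // Assumed dictionary: phrase as written in the text -> terms to index there.
    static final Map<String, List<String>> PHRASES = new LinkedHashMap<>();
    static {
        PHRASES.put("alpha 1 antitrypsin", Arrays.asList(
                "alpha-1-antitrypsin", "antitrypsin", "antitrypsin, alpha 1", "a1at"));
        PHRASES.put("chest pain", Arrays.asList("chest pain", "angina"));
    }
    static final int MAX_PHRASE_WORDS = 3; // longest phrase in the dictionary

    static List<Token> tokenize(String text) {
        List<Token> out = new ArrayList<>();
        String trimmed = text.trim().toLowerCase(Locale.ROOT);
        if (trimmed.isEmpty()) {
            return out;
        }
        String[] words = trimmed.split("\\s+");
        int position = 0;
        int i = 0;
        while (i < words.length) {
            int consumed = 0;
            // Try the longest candidate phrase first, down to two words.
            for (int n = Math.min(MAX_PHRASE_WORDS, words.length - i); n >= 2; n--) {
                String candidate = String.join(" ", Arrays.copyOfRange(words, i, i + n));
                List<String> terms = PHRASES.get(candidate);
                if (terms != null) {
                    for (String term : terms) {
                        out.add(new Token(term, position)); // all at the same position
                    }
                    consumed = n;
                    break;
                }
            }
            if (consumed == 0) {
                out.add(new Token(words[i], position)); // ordinary single word
                consumed = 1;
            }
            position++;
            i += consumed;
        }
        return out;
    }
}
```

With this dictionary, tokenize("Lab results for alpha 1 antitrypsin level") yields lab@0, results@1, for@2, then the four synonym terms all at position 3, then level@4. Because "chest" and "pain" are never emitted on their own, a single-term query for either would not match text containing "chest pain". The trade-off is that the query side would have to apply the same phrase lookup.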

Re: multi word synonym

Paul Libbrecht
If I understand correctly, this would be easy as long as you do not need
phrase matches: you could just add a field (with the same name)
for each token.
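
One possible reading of this suggestion, sketched in plain Java rather than with the Lucene API: every synonym is stored whole, as one more value of the same field, and a query matches only an exact value. The class KeywordFieldSketch and its methods are hypothetical names used for illustration, and the in-memory map merely stands in for an index.

```java
import java.util.Collections;
import java.util.HashMap;
import java.util.Locale;
import java.util.Map;
import java.util.Set;
import java.util.TreeSet;

/** Hypothetical stand-in for a multi-valued field whose values are never split. */
class KeywordFieldSketch {

    // Exact field value -> ids of the documents that carry it.
    private final Map<String, Set<Integer>> postings = new HashMap<>();

    /** Adds one more value to the field of the given document, kept whole. */
    void addValue(int docId, String value) {
        postings.computeIfAbsent(normalize(value), k -> new TreeSet<>()).add(docId);
    }

    /** Returns the documents whose field holds exactly this value. */
    Set<Integer> search(String query) {
        return postings.getOrDefault(normalize(query), Collections.<Integer>emptySet());
    }

    private static String normalize(String s) {
        return s.trim().toLowerCase(Locale.ROOT);
    }
}
```

After addValue(1, "chest pain") and addValue(1, "angina"), a search for "chest pain" or for "angina" returns document 1, while a search for "chest" returns nothing. The values carry no position relative to the surrounding text, so this sketch cannot support phrase or span queries that mix a synonym with its neighbouring words.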

I think that if you do want phrase matches (or span matches), then Lucene
can't help you, but I'm quite a newbie on this topic.

Is there any hope that this will change in Lucene 1.9 or 2.0?

My dream would be to have the position increments living in a tree,
you know, an XML tree...

thanks

paul

On 26 Apr 05, at 15:22, Madhu Sasidhar, MD wrote:

> I have found the previous discussions on multi word synonyms as well as
> the section on synonym injection in Hatcher's book, but have not been
> able to come up with a satisfactory solution. I am indexing text that has
> several multi word synonyms. Some of the synonyms may have single words
> as one of the synonym. These words should only match for the appropriate
> multi word phrase searches - in other words, "chest pain", should only
> match a query for "chest pain" and not for "chest" or "pain". In
> addition, it will match angina (synonym). So, in this sentence:
> Lab results for alpha 1 antitrypsin level....
> I would like to index 'alpha-1-antitrypsin', 'antitrypsin',
> 'antitrypsin, alpha 1', 'A1AT' as synonyms for the phrase
> alpha-1-antitrypsin in the sentence. Thanks in advance...
> madhu

