Turkish Analyzer for lucene

classic Classic list List threaded Threaded
2 messages Options
Reply | Threaded
Open this post in threaded view
|

Turkish Analyzer for lucene

Emre Bayram

Hi,

We(www.meteksan.com.tr) are using Lucene at our document management and workflow 
project. And it is really cool.
 I searched alot to find a turkish analyzer for lucene, couldnt find.
At last i coded(http://issues.apache.org/jira/browse/LUCENE-559). I know many Turkish deceloper will search for a 
Turkish analyzer for lucene so i thought it is better to send the 
codes to you.

 Brazillian, German and standard analyzer helped me while writing 
Turkish Analyzer.

We tested turkish analyzer on our projects, and it is working very good(no problem with turkish character set and very good performance).

Anyway, i am sending you TurkishAnalyzer as attachment.I will be VERY happy if you upload these codes to:

http://svn.apache.org/repos/asf/lucene/java/trunk/contrib/analyzers/src/java/org/apache/lucene/analysis/tr

So other turksh developers who use lucene can easily find it.

Thank you,

Emre Bayram.


 

......................................................................................................................................................................

DİKKAT !

Bu e - postanın içerdiği bilgiler (ekleri dahil olmak üzere) gizlidir. Gonderenin onayı olmaksızın üçüncü kişilere açıklanamaz. Bu mesajın gönderilmek istendiği kişi değilseniz, lütfen mesajı sisteminizden derhal siliniz... Gonderen bu mesajın içerdiği bilgilerin doğrulugu veya eksiksiz olduğu konusunda bir garanti vermemektedir. Bu nedenle bilgilerin ne şekilde olursa olsun içeriğinden, iletilmesinden, alınmasından, saklanmasından sorumlu değildir.

 

CAUTION !

The information contained in this e-mail (including any attachments) is confidential. It must not be disclosed to any person without sender's authority. If you are not the intended recipient, please delete it from your system immediately... Sender makes no warranty as to the accuracy or completeness of any information contained in this message and hereby excludes any liability of any kind for the information contained therein or for the information transmission, reception, storage or use of such in any way whatsoever.

......................................................................................................................................................................

This message has been scanned for viruses and dangerous content.


---------------------------------------------------------------------
To unsubscribe, e-mail: [hidden email]
For additional commands, e-mail: [hidden email]
Reply | Threaded
Open this post in threaded view
|

Re: Turkish Analyzer for lucene

Chris Hostetter-3

: Anyway, i am sending you TurkishAnalyzer as attachment.I will be VERY
: happy if you upload these codes to:

Emre, I don't know anything about Turkish -- but It's allways good to have
new analyzers: thanks for the contribution.  Uploading it to Jira was
definitely the best way to submit it.

One thing you can do to help encourage people to commit your code, would
be to provide some UnitTests showing it working the way you expect on some
input text.  Perhaps you could adapt some of the tests from the other
analyzers in contrib...

http://svn.apache.org/viewcvs.cgi/lucene/java/trunk/contrib/analyzers/src/test/org/apache/lucene/analysis/

The ISOLatin1AccentFilter also has a pretty good test case you might want
to use as inspiration...

http://svn.apache.org/viewcvs.cgi/lucene/java/trunk/src/test/org/apache/lucene/analysis/TestISOLatin1AccentFilter.java?rev=347991&view=log

...right now i think most of hte commiters are focusing on changes
neccessary for the 2.0 release -- which is more about bug fixes then new
features, but i'm guessing if you had some test case someone would commit
yournew Analyzer relatively soon.



-Hoss


---------------------------------------------------------------------
To unsubscribe, e-mail: [hidden email]
For additional commands, e-mail: [hidden email]