Quantcast

Limiting terms / field

Previous Topic Next Topic
 
classic Classic list List threaded Threaded
2 messages Options
Reply | Threaded
Open this post in threaded view
|  
Report Content as Inappropriate

Limiting terms / field

Chris Bamford-4

Hello,

We are using Lucene 4.10.3 and are interested in limiting the number of terms per field. In the past this was set by the IndexWriter (maxFieldLength) and the default was 10K; as I understand it this is no longer the case, in fact it is now unlimited by default?

Anyway, what is the best way we can do this? I have found some references to a class called LimitTokenCountFilter, but I believe it is only found in later versions.

Thanks

- Chris


Chris Bamford m: +44 7860 405292 www.mimecast.com
Lead Software Engineer p: +44 207 847 8700 Address click here

 



Disclaimer
The information contained in this communication from
[hidden email] sent at 2017-03-20 12:02:15 is confidential and may be legally privileged. It is intended solely for use by [hidden email] and others authorized to receive it. If you are not [hidden email] you are hereby notified that any disclosure, copying, distribution or taking action in reliance of the contents of this information is strictly prohibited and may be unlawful.

This email message has been scanned for viruses by Mimecast. Mimecast delivers a complete managed email solution from a single web based platform. For more information please visit http://www.mimecast.com



Reply | Threaded
Open this post in threaded view
|  
Report Content as Inappropriate

RE: Limiting terms / field

Uwe Schindler
It is also in 4.10.3 as part of the analysis-common module:

https://lucene.apache.org/core/4_10_3/analyzers-common/org/apache/lucene/analysis/miscellaneous/LimitTokenCountFilter.html

 

Uwe

 

-----

Uwe Schindler

Achterdiek 19, D-28357 Bremen

http://www.thetaphi.de <http://www.thetaphi.de/>

eMail: [hidden email]

 

From: Chris Bamford [mailto:[hidden email]]
Sent: Monday, March 20, 2017 1:02 PM
To: [hidden email]
Subject: Limiting terms / field

 

 


Hello,

We are using Lucene 4.10.3 and are interested in limiting the number of terms per field. In the past this was set by the IndexWriter (maxFieldLength) and the default was 10K; as I understand it this is no longer the case, in fact it is now unlimited by default?

Anyway, what is the best way we can do this? I have found some references to a class called LimitTokenCountFilter, but I believe it is only found in later versions.

Thanks

- Chris





Chris Bamford

m: +44 7860 405292

 <http://www.mimecast.com/> www.mimecast.com


Lead Software Engineer

p: +44 207 847 8700

Address click  <http://www.mimecast.com/About-us/Contact-us/> here


  _____  



 <https://eu-api.mimecast.com/s/click/V5cKV3mUEd00vOSgvXwtbZyPc4HU7YXzH3Q5Ov2IZlOUD5KnfY0Eo__Me97k70LeLdXjnAQzFGO9rpjwJts3InRknkny11ed5T74o6AsRNboAh8dqyqFsq0unf3MyHXrJZy3M1JP91JCAA_brBpnBkIxsGjUljn71poOVL3N1hyJLCqRsucHp-dI8GBbMwFeLX53666oCIpB3mfN4i4LuA>


 


 <https://eu-api.mimecast.com/s/click/1K7xTdhoqgjnB3PEFCIbObk_ZFAnXYKlTiLISV6xNSUBOHgJ34O23NXNATid7364YTyNegMgTtFqBtW54vnckhfn0k-UaDEHtFDzrnfXx8Dpjgv85mz2AnanRV970OKhhoQOsOkKG1l_SYGT0ryVgPfhypOP2MXKfgsbjzlGSXlax271RkXX8mjHRhvuYoZHUWsThNMfSE_TxYg9ZhC-Fg>


 <https://eu-api.mimecast.com/s/click/0ChSNgfhxT33DvPLIaGrHSdBcfZACHC3pDIPU_BoN-kfqbJfezAkQS-MSlF3hLCB6ZVlhiRGR3wIEZlIHEMA3_LGld58ajMxdXLLk9tbO55u1ZecyQu36ZqO8fIxY9q4sSZO-ADUPIfmajCv7yv1VUPEzCN51poTuYrGu4oP3PJPCydVJitflwcdsJM11tr4__5Kprgsvpnc9fHlp2BM5A>


 <https://eu-api.mimecast.com/s/click/NjE9ed9agLcHnu6tdfXIcnD6b0cN73NhGASU7Y7-fBAF3h92PJA8wj2nkLyj7kkcWp7LU4ny37JS_YM1LdyBM4VQArdMfl-tqEm2M0WJoNheY2bxkI-ZKSKKWjfj_z8nmZTbVxfHKPaaHmak2vnDwueGzhDFduwa6BKj3FyGqy-QtlJQ7csd0taeAvhVnpYsylQFZDAlgBPm2se9Vfqssw>


 <https://eu-api.mimecast.com/s/click/v4zOP0KQ-MJlMJOoXyVCSXfOawzU6b7Yl3xFw0ODhytdAbDQ50RRte1KJKbgMViNVkD6fs8BNrBlwT-55EdBK5a4oonpL1ZATKUlP8fjrVpcAdHrVTp4NRc31q1WYWJ0gLhNSCW_kYsoyKBrHYkvgoUmKNvgh-54BQllH7JNn_KJ6jMV5TlrkToHgOmUWKJQUWsThNMfSE_TxYg9ZhC-Fg>


 <https://eu-api.mimecast.com/s/click/XujAZpejvFW2OIhYbUKIG4bbDde5QWSLvav2Zd1T1pHjrccG5l08Ssefc_H8Zr-BkL00127s5rL6kUxtJHwqZ3VPCRBYg4JXq_Wd9owjxjfb3LUf-kNIrJE7XBBExF_k-1-DXs8HNoBxB7OUVIfjBbm80zerQX9iyu2hUqSsBeorOQA5m0DSs02m-WfDE0D8t8DxXx5osyLjtMdIc2MC7g>

 


 


 <https://eu-api.mimecast.com/s/click/gNXpRy8Di3hABeg9FCvOkJyWCtych0vtbAF2YBaOE5exD9teAozt-UCmgN0eOSWeBMLfnjwJKrTgL9QmOD6wAHVPCRBYg4JXq_Wd9owjxjfb3LUf-kNIrJE7XBBExF_k-1-DXs8HNoBxB7OUVIfjBbm80zerQX9iyu2hUqSsBeorOQA5m0DSs02m-WfDE0D8DI8sdYf5O9VTZPl6r-07iA>


Disclaimer
The information contained in this communication from  <mailto:[hidden email]> [hidden email] sent at 2017-03-20 12:02:15 is confidential and may be legally privileged. It is intended solely for use by  <mailto:[hidden email]> [hidden email] and others authorized to receive it. If you are not  <mailto:[hidden email]> [hidden email] you are hereby notified that any disclosure, copying, distribution or taking action in reliance of the contents of this information is strictly prohibited and may be unlawful.

This email message has been scanned for viruses by Mimecast. Mimecast delivers a complete managed email solution from a single web based platform. For more information please visit http://www.mimecast.com 

 






 

Loading...