Re: Max Field Length

2022-09-23 Thread Michael Sokolov
ooh On Fri, Sep 23, 2022 at 11:02 AM Adrien Grand wrote: > > We have a TruncateTokenFilter in lucene/analysis/common. :) > > On Fri, Sep 23, 2022 at 4:39 PM Michael Sokolov wrote: > > > I wonder if it would make sense to provide a TruncationFilter in > > addition to the LengthFilter. That way lo

Re: Max Field Length

2022-09-23 Thread Adrien Grand
We have a TruncateTokenFilter in lucene/analysis/common. :) On Fri, Sep 23, 2022 at 4:39 PM Michael Sokolov wrote: > I wonder if it would make sense to provide a TruncationFilter in > addition to the LengthFilter. That way long tokens in source text > could be better supported, albeit with some

Re: Max Field Length

2022-09-23 Thread Michael Sokolov
I wonder if it would make sense to provide a TruncationFilter in addition to the LengthFilter. That way long tokens in source text could be better supported, albeit with some confusion if they share the same very long prefix... On Fri, Sep 23, 2022 at 9:56 AM Scott Guthery wrote: > > Thanks much,

Re: Max Field Length

2022-09-23 Thread Scott Guthery
Thanks much, Adrian. I hadn't realized that the size limit was on one token in the text as opposed to being a limit on the length of the entire text field. I'm loading patents, so I suspect that the very long word is a DNA sequence. Thanks also for your guidance with regard to setting maximums.

Re: Max Field Length

2022-09-23 Thread Adrien Grand
Hi Scott, There is no way to lift this limit. The assumption is that a user would never type a 32kB keyword in a search bar, so indexing such long keywords is wasteful. Some tokenizers like StandardTokenizer can be configured to limit the length of the tokens that they produce, there is also a Len

Re: Max Field Length

2005-05-06 Thread Bill Tschumy
On May 6, 2005, at 4:42 PM, Ernesto De Santis wrote: Hi Exist a max length for a Field value? I have problems indexing large body files. The bottom isn't indexed. Bye, Ernesto. -- Ernesto De Santis - Colaborativa.net Córdoba 1147 Piso 6 Oficinas 3 y 4 (S2000AWO) Rosario, SF, Argentina. After you

Re: Max Field Length

2005-05-06 Thread Luke Shannon
Hi; I think by default only 10,000 terms will be indexed for a field. You can change this using the maxFieldLength method of IndexWriter. Luke - Original Message - From: "Ernesto De Santis" <[EMAIL PROTECTED]> To: "Lucene Users List" Sent: Friday, May 06, 2005 5:42 PM Subject: Max Fi