ooh
On Fri, Sep 23, 2022 at 11:02 AM Adrien Grand wrote:
>
> We have a TruncateTokenFilter in lucene/analysis/common. :)
>
> On Fri, Sep 23, 2022 at 4:39 PM Michael Sokolov wrote:
>
> > I wonder if it would make sense to provide a TruncationFilter in
> > addition to the LengthFilter. That way lo
We have a TruncateTokenFilter in lucene/analysis/common. :)
On Fri, Sep 23, 2022 at 4:39 PM Michael Sokolov wrote:
> I wonder if it would make sense to provide a TruncationFilter in
> addition to the LengthFilter. That way long tokens in source text
> could be better supported, albeit with some
I wonder if it would make sense to provide a TruncationFilter in
addition to the LengthFilter. That way long tokens in source text
could be better supported, albeit with some confusion if they share
the same very long prefix...
On Fri, Sep 23, 2022 at 9:56 AM Scott Guthery wrote:
>
> Thanks much,
Thanks much, Adrian. I hadn't realized that the size limit was on one
token in the text as opposed to being a limit on the length of the entire
text field. I'm loading patents, so I suspect that the very long word is a
DNA sequence.
Thanks also for your guidance with regard to setting maximums.
Hi Scott,
There is no way to lift this limit. The assumption is that a user would
never type a 32kB keyword in a search bar, so indexing such long keywords
is wasteful. Some tokenizers like StandardTokenizer can be configured to
limit the length of the tokens that they produce, there is also a
Len
On May 6, 2005, at 4:42 PM, Ernesto De Santis wrote:
Hi
Exist a max length for a Field value?
I have problems indexing large body files.
The bottom isn't indexed.
Bye,
Ernesto.
--
Ernesto De Santis - Colaborativa.net
Córdoba 1147 Piso 6 Oficinas 3 y 4
(S2000AWO) Rosario, SF, Argentina.
After you
Hi;
I think by default only 10,000 terms will be indexed for a field.
You can change this using the maxFieldLength method of IndexWriter.
Luke
- Original Message -
From: "Ernesto De Santis" <[EMAIL PROTECTED]>
To: "Lucene Users List"
Sent: Friday, May 06, 2005 5:42 PM
Subject: Max Fi