[ 
https://issues.apache.org/jira/browse/LUCENE-5472?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Hoss Man updated LUCENE-5472:
-----------------------------

    Attachment: LUCENE-5472.patch

Here's a quick pass at trying to fix this along with a test.

at the moment the test fails because i didn't see any immediately obvious way 
to get the fieldname into the exception message, and that seems kind of key to 
making it useful (yes a byte prefix of the term is there, but for most people 
indexing text that's not going to be immediately helpful to them to understand 
where to look for the long term)

I haven't dug down deeper to see if it would be safe/easy to just add the 
fieldname to docState.maxTermPrefix (as a prefix on the prefix) nor have i run 
any other tests to see if throwing an exception here breaks any other existing 
tests that happen to depend on big ass terms being silently ignored.

> Long terms should generate a RuntimeException, not just infoStream
> ------------------------------------------------------------------
>
>                 Key: LUCENE-5472
>                 URL: https://issues.apache.org/jira/browse/LUCENE-5472
>             Project: Lucene - Core
>          Issue Type: Improvement
>            Reporter: Hoss Man
>         Attachments: LUCENE-5472.patch
>
>
> As reported on the solr-user list, when a term is greater then 2^15 bytes it 
> is silently ignored at indexing time -- a message is logged in to infoStream 
> if enabled, but no error is thrown.
> seems like we should change this behavior (if nothing else starting in 5.0) 
> to throw an exception.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to