[
https://issues.apache.org/jira/browse/LUCENE-7279?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15283497#comment-15283497
]
Michael McCandless commented on LUCENE-7279:
--------------------------------------------
I backported this for 6.0.1.
> AIOOBE from JapaneseTokenizer
> -----------------------------
>
> Key: LUCENE-7279
> URL: https://issues.apache.org/jira/browse/LUCENE-7279
> Project: Lucene - Core
> Issue Type: Bug
> Components: modules/analysis
> Reporter: Michael McCandless
> Assignee: Michael McCandless
> Fix For: 6.0.1, master (7.0)
>
> Attachments: LUCENE-7279.patch
>
>
> On certain Japanese input strings you can hit this:
> {noformat}
> java.lang.ArrayIndexOutOfBoundsException: -1
> at
> __randomizedtesting.SeedInfo.seed([C6752A567B924B1:2B195610610ED60]:0)
> at
> org.apache.lucene.analysis.ja.JapaneseTokenizer.backtrace(JapaneseTokenizer.java:1607)
> at
> org.apache.lucene.analysis.ja.JapaneseTokenizer.parse(JapaneseTokenizer.java:902)
> at
> org.apache.lucene.analysis.ja.JapaneseTokenizer.incrementToken(JapaneseTokenizer.java:479)
> at
> org.apache.lucene.analysis.ja.TestJapaneseTokenizer.testBigDocument(TestJapaneseTokenizer.java:837)
> {noformat}
> I have a patch with a test case and fix.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]