[ 
https://issues.apache.org/jira/browse/LUCENE-3990?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Robert Muir updated LUCENE-3990:
--------------------------------

    Attachment: LUCENE-3990.patch

The problem is that we are getting different results the first time we create 
the tokenstream components,
versus after we reset(Reader) with the same text again.

The bug was introduced by Uwe Schindler in r1311358: when the reader-wrapper 
was changed to use CharFilter
instead. because of crazy CharFilter-Reader delegation.

http://svn.apache.org/viewvc?view=revision&revision=1311358

Attached is a patch demonstrating the bug: with a standalone testcase, and 
backing out that change.
Seed now passes (in addition to the test.
                
> TestRandomChains failure
> ------------------------
>
>                 Key: LUCENE-3990
>                 URL: https://issues.apache.org/jira/browse/LUCENE-3990
>             Project: Lucene - Java
>          Issue Type: Bug
>          Components: modules/analysis
>            Reporter: Steven Rowe
>         Attachments: LUCENE-3990.patch, analysis-common.tests-report.txt
>
>
> 100% reproduces for me:
> 2> NOTE: reproduce with: ant test -Dtests.class=*.TestRandomChains 
> -Dtests.method=testRandomChains -Dtests.seed=88CA02C2BB7B1DA 
> -Dargs="-Dfile.encoding=UTF-8"
> Running org.apache.lucene.analysis.core.TestRandomChains
> FAILURE 7.22s | TestRandomChains.testRandomChains
>    > Throwable #1: java.lang.AssertionError: endOffset 1 expected:<7> but 
> was:<8>
>    >    at 
> __randomizedtesting.SeedInfo.seed([88CA02C2BB7B1DA:356D894D6CA5AC1A]:0)
>    >    at org.junit.Assert.fail(Assert.java:93)
>    >    at org.junit.Assert.failNotEquals(Assert.java:647)
>    >    at org.junit.Assert.assertEquals(Assert.java:128)
>    >    at org.junit.Assert.assertEquals(Assert.java:472)
>    >    at 
> org.apache.lucene.analysis.BaseTokenStreamTestCase.assertTokenStreamContents(BaseTokenStreamTestCase.java:165)
>    >    at 
> org.apache.lucene.analysis.BaseTokenStreamTestCase.checkAnalysisConsistency(BaseTokenStreamTestCase.java:662)
>    >    at 
> org.apache.lucene.analysis.BaseTokenStreamTestCase.checkRandomData(BaseTokenStreamTestCase.java:486)
>    >    at 
> org.apache.lucene.analysis.BaseTokenStreamTestCase.checkRandomData(BaseTokenStreamTestCase.java:429)
>    >    at 
> org.apache.lucene.analysis.core.TestRandomChains.testRandomChains(TestRandomChains.java:820)

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to