[ 
https://issues.apache.org/jira/browse/SOLR-17189?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

David Smiley updated SOLR-17189:
--------------------------------
    Description: 
DockMakerTest.testRealisticUnicode fails 1-2% of the time -- 
[link|https://ge.apache.org/scans/tests?search.timeZoneId=America%2FNew_York&tests.container=org.apache.solr.bench.DockMakerTest&tests.test=testRealisticUnicode].

{quote}java.lang.AssertionError: expected:<6> but was:<7>       
at __randomizedtesting.SeedInfo.seed([C5136F274AFF3ADD:95FFEBF499446D74]:0)     
•••
at 
org.apache.solr.bench.DockMakerTest.testRealisticUnicode(DockMakerTest.java:189){quote}

It seems clear it's because it assumes that the "realistic unicode" chars won't 
match the regexp: {{\s}}.  A single space char is used to join the words but 
maybe this or other whitespace chars are in those unicode codepoint blocks.

Additionally, it's frustrating that this particular benchmark framework doesn't 
honor tests.seed in its generation of random data and thus it's hard to 
reproduce the failure.  That ought to be fixed as well.

  was:
DockMakerTest.testRealisticUnicode fails 1-2% of the time -- 
[link|https://ge.apache.org/scans/tests?search.timeZoneId=America%2FNew_York&tests.container=org.apache.solr.bench.DockMakerTest&tests.test=testRealisticUnicode].

{quote}java.lang.AssertionError: expected:<6> but was:<7>       
at __randomizedtesting.SeedInfo.seed([C5136F274AFF3ADD:95FFEBF499446D74]:0)     
•••
at 
org.apache.solr.bench.DockMakerTest.testRealisticUnicode(DockMakerTest.java:189){quote}
It seems clear it's because it assumes that the "realistic unicode" chars won't 
match the regexp: {{\s}} (which is the char used to join multiple unicode 
words).

Additionally, it's frustrating that this particular benchmark framework doesn't 
honor tests.seed in its generation of random data and thus it's hard to 
reproduce the failure.  That ought to be fixed as well.


> DockMakerTest.testRealisticUnicode fails from whitespace assumption
> -------------------------------------------------------------------
>
>                 Key: SOLR-17189
>                 URL: https://issues.apache.org/jira/browse/SOLR-17189
>             Project: Solr
>          Issue Type: Test
>      Security Level: Public(Default Security Level. Issues are Public) 
>          Components: benchmarks
>            Reporter: David Smiley
>            Priority: Major
>
> DockMakerTest.testRealisticUnicode fails 1-2% of the time -- 
> [link|https://ge.apache.org/scans/tests?search.timeZoneId=America%2FNew_York&tests.container=org.apache.solr.bench.DockMakerTest&tests.test=testRealisticUnicode].
> {quote}java.lang.AssertionError: expected:<6> but was:<7>     
> at __randomizedtesting.SeedInfo.seed([C5136F274AFF3ADD:95FFEBF499446D74]:0)   
> •••
> at 
> org.apache.solr.bench.DockMakerTest.testRealisticUnicode(DockMakerTest.java:189){quote}
> It seems clear it's because it assumes that the "realistic unicode" chars 
> won't match the regexp: {{\s}}.  A single space char is used to join the 
> words but maybe this or other whitespace chars are in those unicode codepoint 
> blocks.
> Additionally, it's frustrating that this particular benchmark framework 
> doesn't honor tests.seed in its generation of random data and thus it's hard 
> to reproduce the failure.  That ought to be fixed as well.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@solr.apache.org
For additional commands, e-mail: issues-h...@solr.apache.org

Reply via email to