[ https://issues.apache.org/jira/browse/SOLR-17189?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
David Smiley updated SOLR-17189:
--------------------------------
    Description:
DockMakerTest.testRealisticUnicode fails 1-2% of the time -- [link|https://ge.apache.org/scans/tests?search.timeZoneId=America%2FNew_York&tests.container=org.apache.solr.bench.DockMakerTest&tests.test=testRealisticUnicode].
{quote}java.lang.AssertionError: expected:<6> but was:<7>
  at __randomizedtesting.SeedInfo.seed([C5136F274AFF3ADD:95FFEBF499446D74]:0)
  •••
  at org.apache.solr.bench.DockMakerTest.testRealisticUnicode(DockMakerTest.java:189){quote}
It seems clear it's because it assumes that the "realistic unicode" chars won't match the regexp: {{\s}}. A single space char is used to join the words but maybe this or other whitespace chars are in those unicode codepoint blocks.

Additionally, it's frustrating that this particular benchmark framework doesn't honor tests.seed in its generation of random data and thus it's hard to reproduce the failure. That ought to be fixed as well.

  was:
DockMakerTest.testRealisticUnicode fails 1-2% of the time -- [link|https://ge.apache.org/scans/tests?search.timeZoneId=America%2FNew_York&tests.container=org.apache.solr.bench.DockMakerTest&tests.test=testRealisticUnicode].
{quote}java.lang.AssertionError: expected:<6> but was:<7>
  at __randomizedtesting.SeedInfo.seed([C5136F274AFF3ADD:95FFEBF499446D74]:0)
  •••
  at org.apache.solr.bench.DockMakerTest.testRealisticUnicode(DockMakerTest.java:189){quote}
It seems clear it's because it assumes that the "realistic unicode" chars won't match the regexp: {{\s}} (which is the char used to join multiple unicode words).

Additionally, it's frustrating that this particular benchmark framework doesn't honor tests.seed in its generation of random data and thus it's hard to reproduce the failure. That ought to be fixed as well.
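
The suspected failure mode can be illustrated with a minimal sketch. It assumes the test counts words by splitting the joined text on {{\s}}; the actual assertion at DockMakerTest.java:189 is not shown in this message, so the class and method names below are hypothetical. In Java, {{\s}} by default matches the ASCII whitespace characters (space, tab, newline, vertical tab, form feed, carriage return), all of which sit in the Basic Latin block, so a generated "word" containing any of them would be counted as two.

```java
import java.util.regex.Pattern;

// Hypothetical illustration only; this is not code from DockMakerTest.
public class WhitespaceAssumptionSketch {
  // Java's default \s matches [ \t\n\x0B\f\r].
  private static final Pattern WS = Pattern.compile("\\s");

  /** Counts tokens by splitting on \s, as the test is assumed to do. */
  public static int countTokens(String text) {
    return WS.split(text).length;
  }

  /** Returns true if the word itself contains a character matching \s. */
  public static boolean containsRegexWhitespace(String word) {
    return WS.matcher(word).find();
  }

  public static void main(String[] args) {
    // Six generated "words"; the third one happens to contain a tab.
    String[] words = {"alpha", "beta", "gam\tma", "delta", "epsilon", "zeta"};
    String joined = String.join(" ", words);
    // prints: words=6 tokens=7
    System.out.println("words=" + words.length + " tokens=" + countTokens(joined));
  }
}
```

One possible guard, under the same assumption, is to reject or regenerate any word for which containsRegexWhitespace returns true before joining.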
> DockMakerTest.testRealisticUnicode fails from whitespace assumption
> -------------------------------------------------------------------
>
>                 Key: SOLR-17189
>                 URL: https://issues.apache.org/jira/browse/SOLR-17189
>             Project: Solr
>          Issue Type: Test
>      Security Level: Public (Default Security Level. Issues are Public)
>          Components: benchmarks
>            Reporter: David Smiley
>            Priority: Major
>
> DockMakerTest.testRealisticUnicode fails 1-2% of the time --
> [link|https://ge.apache.org/scans/tests?search.timeZoneId=America%2FNew_York&tests.container=org.apache.solr.bench.DockMakerTest&tests.test=testRealisticUnicode].
> {quote}java.lang.AssertionError: expected:<6> but was:<7>
>   at __randomizedtesting.SeedInfo.seed([C5136F274AFF3ADD:95FFEBF499446D74]:0)
>   •••
>   at org.apache.solr.bench.DockMakerTest.testRealisticUnicode(DockMakerTest.java:189){quote}
> It seems clear it's because it assumes that the "realistic unicode" chars
> won't match the regexp: {{\s}}. A single space char is used to join the
> words but maybe this or other whitespace chars are in those unicode codepoint
> blocks.
> Additionally, it's frustrating that this particular benchmark framework
> doesn't honor tests.seed in its generation of random data and thus it's hard
> to reproduce the failure. That ought to be fixed as well.
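
As a rough sketch of what honoring tests.seed could look like, the example below seeds a java.util.Random from the first hex component of the tests.seed system property. The class, the method names, and the fallback behavior are assumptions made for illustration; they are not the benchmark framework's API, and the way the randomizedtesting framework itself derives per-test randomness from the seed is not reproduced here.

```java
import java.util.Random;

// Hypothetical illustration only; not the benchmark framework's actual API.
public class SeedFromTestsSeedSketch {

  /** Parses the first hex component of a value like "C5136F274AFF3ADD:95FFEBF499446D74". */
  public static long parseMasterSeed(String testsSeed) {
    String first = testsSeed.split(":")[0];
    // Unsigned parse, because 16 hex digits can exceed Long.MAX_VALUE.
    return Long.parseUnsignedLong(first, 16);
  }

  /** Returns a Random seeded from -Dtests.seed if present, else from fallbackSeed. */
  public static Random newRandom(long fallbackSeed) {
    String prop = System.getProperty("tests.seed");
    long seed = (prop == null || prop.isEmpty()) ? fallbackSeed : parseMasterSeed(prop);
    return new Random(seed);
  }

  public static void main(String[] args) {
    String reported = "C5136F274AFF3ADD:95FFEBF499446D74";
    Random a = new Random(parseMasterSeed(reported));
    Random b = new Random(parseMasterSeed(reported));
    // The same seed yields the same sequence, so generated data is repeatable.
    System.out.println(a.nextLong() == b.nextLong()); // prints: true
  }
}
```

With data generation driven this way, rerunning with the seed reported in the failure would regenerate the same random input.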