[ 
https://issues.apache.org/jira/browse/SOLR-6991?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14286477#comment-14286477
 ] 

Steve Rowe commented on SOLR-6991:
----------------------------------

bq. don't we need similar assumes in dataimporthandler-extras tests that use 
TikaEntityProcessor? (i'm not sure why those wouldn't fail with turkish now as 
well)

I ran {{ant test -Dtests.slow=true -Dtests.locale=tr_TR}} in 
{{solr/contrib/dataimporthandler-extras/}}, and got the following failure:

{noformat}
   [junit4] Suite: org.apache.solr.handler.dataimport.TestTikaEntityProcessor
   [junit4]   2> Creating dataDir: 
/Users/sarowe/svn/lucene/dev/trunk2/solr/build/contrib/solr-dataimporthandler-extras/test/J0/temp/solr.handler.dataimport.TestTikaEntityProcessor
 9123B7DE098A1C98-001/init-core-data-001
   [junit4]   2> log4j:WARN No appenders could be found for logger 
(org.apache.solr.SolrTestCaseJ4).
   [junit4]   2> log4j:WARN Please initialize the log4j system properly.
   [junit4]   2> log4j:WARN See 
http://logging.apache.org/log4j/1.2/faq.html#noconfig for more info.
   [junit4]   2> NOTE: reproduce with: ant test  
-Dtestcase=TestTikaEntityProcessor -Dtests.method=testTikaHTMLMapperIdentity 
-Dtests.seed=9123B7DE098A1C98 -Dtests.slow=true -Dtests.locale=tr_TR 
-Dtests.timezone=America/Toronto -Dtests.asserts=true 
-Dtests.file.encoding=US-ASCII
   [junit4] ERROR   0.93s J0 | 
TestTikaEntityProcessor.testTikaHTMLMapperIdentity <<<
   [junit4]    > Throwable #1: java.lang.Error: posix_spawn is not a supported 
process launch mechanism on this platform.
   [junit4]    >        at 
__randomizedtesting.SeedInfo.seed([9123B7DE098A1C98:C15C334FC0BEE965]:0)
   [junit4]    >        at java.lang.UNIXProcess$1.run(UNIXProcess.java:105)
   [junit4]    >        at java.lang.UNIXProcess$1.run(UNIXProcess.java:94)
   [junit4]    >        at java.security.AccessController.doPrivileged(Native 
Method)
   [junit4]    >        at java.lang.UNIXProcess.<clinit>(UNIXProcess.java:92)
   [junit4]    >        at java.lang.ProcessImpl.start(ProcessImpl.java:130)
   [junit4]    >        at 
java.lang.ProcessBuilder.start(ProcessBuilder.java:1029)
   [junit4]    >        at java.lang.Runtime.exec(Runtime.java:620)
   [junit4]    >        at java.lang.Runtime.exec(Runtime.java:485)
   [junit4]    >        at 
org.apache.tika.parser.external.ExternalParser.check(ExternalParser.java:344)
   [junit4]    >        at 
org.apache.tika.parser.ocr.TesseractOCRParser.hasTesseract(TesseractOCRParser.java:117)
   [junit4]    >        at 
org.apache.tika.parser.ocr.TesseractOCRParser.getSupportedTypes(TesseractOCRParser.java:90)
   [junit4]    >        at 
org.apache.tika.parser.CompositeParser.getParsers(CompositeParser.java:81)
   [junit4]    >        at 
org.apache.tika.parser.DefaultParser.getParsers(DefaultParser.java:95)
   [junit4]    >        at 
org.apache.tika.parser.CompositeParser.getSupportedTypes(CompositeParser.java:229)
   [junit4]    >        at 
org.apache.tika.parser.CompositeParser.getParsers(CompositeParser.java:81)
   [junit4]    >        at 
org.apache.tika.parser.CompositeParser.getParser(CompositeParser.java:209)
   [junit4]    >        at 
org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:244)
   [junit4]    >        at 
org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:120)
   [junit4]    >        at 
org.apache.solr.handler.dataimport.TikaEntityProcessor.nextRow(TikaEntityProcessor.java:141)
   [junit4]    >        at 
org.apache.solr.handler.dataimport.EntityProcessorWrapper.nextRow(EntityProcessorWrapper.java:243)
   [junit4]    >        at 
org.apache.solr.handler.dataimport.DocBuilder.buildDocument(DocBuilder.java:476)
   [junit4]    >        at 
org.apache.solr.handler.dataimport.DocBuilder.buildDocument(DocBuilder.java:415)
   [junit4]    >        at 
org.apache.solr.handler.dataimport.DocBuilder.doFullDump(DocBuilder.java:330)
   [junit4]    >        at 
org.apache.solr.handler.dataimport.DocBuilder.execute(DocBuilder.java:232)
   [junit4]    >        at 
org.apache.solr.handler.dataimport.DataImporter.doFullImport(DataImporter.java:416)
   [junit4]    >        at 
org.apache.solr.handler.dataimport.DataImporter.runCmd(DataImporter.java:480)
   [junit4]    >        at 
org.apache.solr.handler.dataimport.DataImportHandler.handleRequestBody(DataImportHandler.java:189)
   [junit4]    >        at 
org.apache.solr.handler.RequestHandlerBase.handleRequest(RequestHandlerBase.java:144)
   [junit4]    >        at 
org.apache.solr.core.SolrCore.execute(SolrCore.java:2006)
   [junit4]    >        at 
org.apache.solr.util.TestHarness.query(TestHarness.java:331)
   [junit4]    >        at 
org.apache.solr.handler.dataimport.AbstractDataImportHandlerTestCase.runFullImport(AbstractDataImportHandlerTestCase.java:86)
   [junit4]    >        at 
org.apache.solr.handler.dataimport.TestTikaEntityProcessor.testTikaHTMLMapperIdentity(TestTikaEntityProcessor.java:99)
   [junit4]    >        at java.lang.Thread.run(Thread.java:745)
{noformat}

> Update to Apache TIKA 1.7
> -------------------------
>
>                 Key: SOLR-6991
>                 URL: https://issues.apache.org/jira/browse/SOLR-6991
>             Project: Solr
>          Issue Type: Improvement
>          Components: contrib - Solr Cell (Tika extraction)
>            Reporter: Uwe Schindler
>            Assignee: Uwe Schindler
>             Fix For: 5.0, Trunk, 5.1
>
>         Attachments: SOLR-6991-forkfix.patch, SOLR-6991.patch, SOLR-6991.patch
>
>
> Apache TIKA 1.7 was released: 
> [https://dist.apache.org/repos/dist/release/tika/CHANGES-1.7.txt]
> This is more or less a dependency update, so replacements. Not sure if we 
> should do this for 5.0. In 5.0 we currently have the previous version, which 
> was not yet released with Solr. If we now bring this into 5.0, we wouldn't 
> have a new release 2 times. I can change the stuff this evening and let it 
> bake in 5.x, so maybe we backport this.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to