janhoy opened a new pull request, #3670: URL: https://github.com/apache/solr/pull/3670
https://issues.apache.org/jira/browse/SOLR-7632 This work builds on the one in #3361 but instead of making a new module, we add it as a capability to the existing extraction handler through specifying `extraction.backend=tikaserver`. This first required refactoring extraction handler to detach it from the Tika-v1 API. There is a new interface `ExtractionBackend` that takes generic `ExtractionRequest` object in and returns an `ExtractionResult` bean, and a new `LocalTikaExtractionBackend` implementation that encapsulates all Tikav1 api handling. This implementation can be deprecated, and in Solr 10, the `tikaserver` one can be made default. All existing tests pass. New tests are added using TestContainers to spin up Tika. Note: Most of the coding was done by JetBrains Junie, so reviewers may want to ensure nothing fancy has slipped into the code. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected] --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
