[
https://issues.apache.org/jira/browse/SOLR-6007?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Eric Pugh resolved SOLR-6007.
-----------------------------
Resolution: Won't Fix
In Solr 10 we are leveraging either Tika Server (running in it's own seperate
server process) or maybe Tika Pipes (again, running in a seperate JVM).
Please revalidate your issue against Solr 10 with one of those options, and if
it is still present need, happy to work with you on a fix using the new
approach for Tika.
> Add param "archive.encoding" for ExtractingRequestHandler
> ---------------------------------------------------------
>
> Key: SOLR-6007
> URL: https://issues.apache.org/jira/browse/SOLR-6007
> Project: Solr
> Issue Type: New Feature
> Components: contrib - Solr Cell (Tika extraction)
> Reporter: Shinichiro Abe
> Priority: Minor
> Attachments: SOLR-6007.patch, japanese-sjis.zip
>
>
> When extracting from the zip files which are zipped at Windows OS(Japanese),
> the file name extracted from zip is garbled(these file names were written by
> CJK language). TIKA-936 allows us to set custom encoding(i.e. SJIS), so I can
> get not-being garbled file name. It would be nice if archive encoding
> parameter in Solr Cell could be specified.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]