[ 
https://issues.apache.org/jira/browse/SOLR-6007?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Eric Pugh resolved SOLR-6007.
-----------------------------
    Resolution: Won't Fix

In Solr 10 we are leveraging either Tika Server (running in it's own seperate 
server process) or maybe Tika Pipes (again, running in a seperate JVM).   
Please revalidate your issue against Solr 10 with one of those options, and if 
it is still present need, happy to work with you on a fix using the new 
approach for Tika.

> Add param "archive.encoding" for ExtractingRequestHandler
> ---------------------------------------------------------
>
>                 Key: SOLR-6007
>                 URL: https://issues.apache.org/jira/browse/SOLR-6007
>             Project: Solr
>          Issue Type: New Feature
>          Components: contrib - Solr Cell (Tika extraction)
>            Reporter: Shinichiro Abe
>            Priority: Minor
>         Attachments: SOLR-6007.patch, japanese-sjis.zip
>
>
> When extracting from the zip files which are zipped at Windows OS(Japanese), 
> the file name extracted from zip is garbled(these file names were written by 
> CJK language). TIKA-936 allows us to set custom encoding(i.e. SJIS), so I can 
> get not-being garbled file name. It would be nice if archive encoding 
> parameter in Solr Cell could be specified.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to