Hi All, We are using Tika as a library and also making use of Fork Parser to launch Tika in a separate process. Things work for me for
We have integrated Tika 3.0.0 and would like to try out the readpst support that was added in it (TIKA-4250). The main reason is to see whether we can get rid of java-libpst, which various scanners mark as EOL and which customers are complaining about.

I used the config example from the commit https://github.com/apache/tika/commit/32baf2345abe1a04d767ea6641a567d5c924587e to disable OutlookPSTParser and enable LibPstParser in its place.

Since the new parser has no config option to specify the path to the readpst binary, I added its location to the system PATH environment variable. readpst is installed via Cygwin on Windows, so the path is something like c:\cygwin64\bin. This works fine, and the new LibPstParser gets launched.

But when we do the same with Fork Parser, we get an error:

[LibPstParser] Couldn't get version of libpst
java.io.IOException: Cannot run program "readpst": CreateProcess error=2, The system cannot find the file specified
    at java.base/java.lang.ProcessBuilder.start(ProcessBuilder.java:1128)
    at java.base/java.lang.ProcessBuilder.start(ProcessBuilder.java:1071)
    at org.apache.tika.utils.ProcessUtils.execute(ProcessUtils.java:94)
    at org.apache.tika.parser.microsoft.libpst.LibPstParser.check(LibPstParser.java:176)
    at org.apache.tika.parser.microsoft.libpst.LibPstParser.initialize(LibPstParser.java:161)

Any help would be appreciated.

Last but not least, the file I am testing with is a plain text file, so I am not sure why the PST parser is being invoked for it.

Regards,
Sandeep Kulkarni