[ https://issues.apache.org/jira/browse/TIKA-623?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13012746#comment-13012746 ]
Uwe Schindler commented on TIKA-623: ------------------------------------ >From looking at the code of this library, it looks that it needs some >improvements/fixes: - It catches all exceptions and instead of simply wrap'n'rethrow or declare the checked exceptions in the methods, it prints the stack trace to System.out. Also messages are printed to System.out. - The RTF compression decoder uses new String(byte[]) without charset -> locale dependent! Other places do this, too. This is broken, as the file format should define the charset. > Add support for Outlook PST > --------------------------- > > Key: TIKA-623 > URL: https://issues.apache.org/jira/browse/TIKA-623 > Project: Tika > Issue Type: New Feature > Components: parser > Reporter: Tran Nam Quang > > Hello everyone, > As you might know, Outlook stores its mails and other stuff in a single PST > file. There's a relatively new Java library called java-libpst for reading > Outlook PST files. It is licensed under the LGPL and available over here: > http://code.google.com/p/java-libpst/ > I have tested the library on Outlook 2000 and Outlook 2003, with good > results. It would be great if the library could be integrated into Tika. > Best regards > Tran Nam Quang -- This message is automatically generated by JIRA. For more information on JIRA, see: http://www.atlassian.com/software/jira