[ https://issues.apache.org/jira/browse/TIKA-3077?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Nicholas DiPiazza updated TIKA-3077:
------------------------------------
    Comment: was deleted

(was: addressing this in https://github.com/apache/tika/pull/314)

> OneNote parser - very inefficient when parsing OneNote <= 2007 files
> --------------------------------------------------------------------
>
>                 Key: TIKA-3077
>                 URL: https://issues.apache.org/jira/browse/TIKA-3077
>             Project: Tika
>          Issue Type: Improvement
>          Components: core
>            Reporter: Nicholas DiPiazza
>            Priority: Major
>
> The code I put in place for OneNote 2007 files is horribly inefficient. I
> hadn't realized that the OneNoteDirectFileResource that I extracted from
> another parser was not buffering the bytes, so every time I set the
> position, it was very expensive.
> The fix is to buffer the bytes into chunks and operate on them instead.

--
This message was sent by Atlassian Jira
(v8.3.4#803005)
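
For illustration only, the fix described in the issue (buffer the bytes into chunks and operate on them) might look roughly like the sketch below. This is not the code from the Tika pull request: ChunkedReader, its chunkLoads counter, and the chunk size are hypothetical names chosen for this example. The assumption is that setting the position should only update a field, and the underlying channel should be touched only when a read falls outside the chunk currently held in memory.

```java
import java.io.IOException;
import java.nio.ByteBuffer;
import java.nio.channels.SeekableByteChannel;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.StandardOpenOption;

/** Hypothetical chunk-buffered reader; not Tika's OneNoteDirectFileResource. */
final class ChunkedReader implements AutoCloseable {
    private final SeekableByteChannel channel;
    private final ByteBuffer chunk;   // holds file bytes [chunkStart, chunkStart + chunkLen)
    private long chunkStart = 0;
    private int chunkLen = 0;
    private long position = 0;
    int chunkLoads = 0;               // counts trips to the channel, for illustration

    ChunkedReader(Path file, int chunkSize) throws IOException {
        if (chunkSize <= 0) {
            throw new IllegalArgumentException("chunkSize must be positive");
        }
        this.channel = Files.newByteChannel(file, StandardOpenOption.READ);
        this.chunk = ByteBuffer.allocate(chunkSize);
    }

    /** Cheap: only records the new logical position, no I/O happens here. */
    void position(long newPosition) {
        if (newPosition < 0) {
            throw new IllegalArgumentException("negative position");
        }
        position = newPosition;
    }

    long position() {
        return position;
    }

    /** Returns the next byte as 0..255, or -1 at end of file. */
    int read() throws IOException {
        boolean inChunk = position >= chunkStart && position < chunkStart + chunkLen;
        if (!inChunk && !loadChunk()) {
            return -1;
        }
        int b = chunk.get((int) (position - chunkStart)) & 0xFF;
        position++;
        return b;
    }

    /** Loads the chunk-aligned block that contains the current position. */
    private boolean loadChunk() throws IOException {
        long start = position - (position % chunk.capacity());
        chunk.clear();
        channel.position(start);
        int total = 0;
        while (chunk.hasRemaining()) {
            int n = channel.read(chunk);
            if (n <= 0) {
                break;                // end of file (or nothing more to read)
            }
            total += n;
        }
        chunkStart = start;
        chunkLen = total;
        chunkLoads++;
        return position < chunkStart + chunkLen;
    }

    @Override
    public void close() throws IOException {
        channel.close();
    }
}
```

With a 4096-byte chunk, jumping between offsets 5000 and 4100 and reading a byte at each would hit the channel once, because both offsets fall in the same chunk. An unbuffered resource would go back to the file on every repositioning.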