[ https://issues.apache.org/jira/browse/TIKA-3077?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17066318#comment-17066318 ]
Nicholas DiPiazza commented on TIKA-3077: ----------------------------------------- addressing this in https://github.com/apache/tika/pull/314 > OneNote parser - very inefficient when parsing OneNote <= 2007 files > -------------------------------------------------------------------- > > Key: TIKA-3077 > URL: https://issues.apache.org/jira/browse/TIKA-3077 > Project: Tika > Issue Type: Improvement > Components: core > Reporter: Nicholas DiPiazza > Priority: Major > > The code I put in place for OneNote 2007 files is horribly inefficient. I > hadn't realized that the OneNoteDirectFileResource that I extracted from > another parser was not buffering the bytes. So every time I did a set > position, it was very expensive. > The fix is to buffer the bytes into chunks and operate them instead. -- This message was sent by Atlassian Jira (v8.3.4#803005)