[ https://issues.apache.org/jira/browse/TIKA-4387?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17930491#comment-17930491 ]
Hudson commented on TIKA-4387:
------------------------------

SUCCESS: Integrated in Jenkins build Tika » tika-branch_3x-jdk11 #1978 (See [https://ci-builds.apache.org/job/Tika/job/tika-branch_3x-jdk11/1978/])
TIKA-4387 -- FilenameUtils should require that file extensions be ascii alphanumerics only. (#2143) (tallison: [https://github.com/apache/tika/commit/aba99e2173b456d113c3d8b4db936d19dbbb3012])
* (edit) tika-core/src/test/java/org/apache/tika/io/FilenameUtilsTest.java
* (edit) tika-core/src/main/java/org/apache/tika/io/FilenameUtils.java


> Improve robustness of file extension parsing
> --------------------------------------------
>
>                 Key: TIKA-4387
>                 URL: https://issues.apache.org/jira/browse/TIKA-4387
>             Project: Tika
>          Issue Type: Task
>            Reporter: Tim Allison
>            Priority: Major
>             Fix For: 4.0.0, 3.1.1, 2.9.4
>
>
> {{FilenameUtils.getSuffixFromPath()}} isn't checking that the extension
> contains only alphanumeric characters.
> If a "file path" derives from an internal path in a pst, like so {{/Début du
> fichier de données Outlook/[WEBINAR] - "Introducing Couchbase Server 2.5"}},
> then the extension is {{.5"}}, which causes problems on Windows.
> The problem happens when TemporaryResources goes to write a temp file and
> tries to maintain the file extension based on the {{resourceName}} in the
> Metadata.
> We should add a check that the extension contains only alphanumerics? Or
> something?



--
This message was sent by Atlassian Jira
(v8.20.10#820010)
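
For illustration only, here is a minimal sketch of the kind of check the issue describes: return a suffix only when the extension consists entirely of ASCII letters and digits. The class name ExtensionSketch and the method safeSuffix are hypothetical and are not taken from the Tika commit. The actual behaviour of {{FilenameUtils.getSuffixFromPath()}} may differ, for example in how it treats long extensions or leading dots.

```java
// Hypothetical sketch; NOT the actual Tika FilenameUtils implementation.
final class ExtensionSketch {

    private ExtensionSketch() {
    }

    /**
     * Returns the extension of the last path segment, including the leading
     * dot, or an empty string if there is no extension or if the extension
     * contains anything other than ASCII letters and digits.
     */
    static String safeSuffix(String path) {
        if (path == null) {
            return "";
        }
        // Look only at the last path segment, for either separator style.
        int lastSep = Math.max(path.lastIndexOf('/'), path.lastIndexOf('\\'));
        String name = path.substring(lastSep + 1);

        int dot = name.lastIndexOf('.');
        // No dot, or the dot is the final character: no extension.
        if (dot < 0 || dot == name.length() - 1) {
            return "";
        }

        String ext = name.substring(dot + 1);
        for (int i = 0; i < ext.length(); i++) {
            char c = ext.charAt(i);
            boolean asciiAlnum = (c >= 'a' && c <= 'z')
                    || (c >= 'A' && c <= 'Z')
                    || (c >= '0' && c <= '9');
            if (!asciiAlnum) {
                // Reject the whole extension rather than trying to repair it.
                return "";
            }
        }
        return "." + ext;
    }
}
```

With this sketch, the pst-derived path quoted in the issue yields an empty suffix, because the candidate extension {{5"}} contains a double quote. A temp-file writer that relies on such a check would then fall back to whatever default suffix it uses, which is an assumption about the caller and not something this notification states.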