Tim Allison created TIKA-4387: --------------------------------- Summary: Improve robustness of file extension parsing Key: TIKA-4387 URL: https://issues.apache.org/jira/browse/TIKA-4387 Project: Tika Issue Type: Task Reporter: Tim Allison
{{FilenameUtils.getSuffixFromPath()}} isn't checking that the extension contains only alphanumeric characters. If a "file path" derives from an internal path in a pst, like so {{/Début du fichier de données Outlook/[WEBINAR] - "Introducing Couchbase Server 2.5"}}, then the extension is {{.5"}}, which causes problems on Windows. -- This message was sent by Atlassian Jira (v8.20.10#820010)