[ https://issues.apache.org/jira/browse/TIKA-1857?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Tim Allison updated TIKA-1857: ------------------------------ Attachment: xfa_in_govdocs1.txt list of PDFs in govdocs1 that have a non-null PDXFAResource object, found with PDFBox 2.0's trunk. > Enhance PDFParser to extract text from XFA forms > ------------------------------------------------ > > Key: TIKA-1857 > URL: https://issues.apache.org/jira/browse/TIKA-1857 > Project: Tika > Issue Type: Improvement > Components: parser > Reporter: Pascal Essiembre > Priority: Trivial > Labels: patch > Fix For: 1.13 > > Attachments: xfa_in_govdocs1.txt > > > Extract text from PDF Forms (XFA). Information about XFA: > https://en.wikipedia.org/wiki/XFA -- This message was sent by Atlassian JIRA (v6.3.4#6332)