On Mon, 3 Oct 2016 12:02:15 -0700 (PDT) John Hardin <jhar...@impsec.org> wrote:
> We need a PDF plugin that will extract text and URLs from PDF
> attachments so that they can be scanned as if they were body text.

We've written something for extracting URLs. I can't release the code,
unfortunately, but you can look at "podofopdfinfo" and use that to
extract URLs. libpodofo-utils ships with Debian.

Regards,

Dianne.
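
A minimal sketch of the approach suggested above, not the unreleased code mentioned in the message. It assumes that podofopdfinfo can be run with just a file name and that its output includes link URIs as plain text; neither is confirmed here, so check against your installed version. The function names and the URL pattern are hypothetical.

```python
import re
import subprocess

# Loose URL pattern, chosen for illustration only; it is not a full URI parser.
URL_RE = re.compile(r"https?://[^\s<>()\"']+", re.IGNORECASE)


def extract_urls(text):
    """Return the unique URLs found in text, in order of first appearance."""
    seen = []
    for match in URL_RE.findall(text):
        url = match.rstrip(".,;:")  # drop trailing sentence punctuation
        if url and url not in seen:
            seen.append(url)
    return seen


def urls_from_pdf(path, tool="podofopdfinfo"):
    """Run the tool on a PDF and scan whatever it prints for URLs.

    Assumption: invoking the tool with only the file name dumps enough
    information (including link annotations) to find URLs in the output.
    """
    result = subprocess.run(
        [tool, path],
        capture_output=True,
        text=True,
        errors="replace",  # PDF metadata may contain undecodable bytes
        check=False,
    )
    return extract_urls(result.stdout)
```

Calling urls_from_pdf("attachment.pdf") would return a list of URL strings that could then be handed to whatever scans body text. Keeping the regex step separate from the subprocess call makes it easy to swap in a different extraction tool.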