On Sat, 14 May 2022 at 00:00, Roy T. Fielding <field...@gbiv.com> wrote: > > > On May 13, 2022, at 3:54 PM, sebb <seb...@gmail.com> wrote: > > > > On Fri, 13 May 2022 at 22:49, Craig Russell <apache....@gmail.com> wrote: > >> > >> Hi, > >> > >> We have a number of documents awaiting processing and all from docusign > >> are failing to parse the pdf. I have no skills to figure out why. > > > > The PDF parser cannot cope with the document format. > > Not a lot we can do about that in the short term; parsing a PDF is hard > > work. > > > >> Anyone available to help figure this out? It will take minutes to file > >> each document. > > > > At least they are not handwritten/illegible/ > > > >> It is not even possible to copy/paste fields from the displayed pdf. > > > > I was able to copy/paste the fields, but selecting them is tricky. > > Starting at the end and dragging left may work. > > Alternatively double-clicking works for me on individual words (but > > does not capture full email addresses). > > Has anyone tried pushing them through > > > https://developer.adobe.com/document-services/docs/overview/pdf-extract-api/ > <https://developer.adobe.com/document-services/docs/overview/pdf-extract-api/> > > I have no idea if that would work with Docusign pdfs, but worth a shot.
The PDFs contain PII so I don't think we can just upload them to a 3rd party. However if anyone wants to create a test ICLA PDF with dummy data using Docusign, we could certainly try that to see if the service is worth further investigation. > ....Roy >