My scripts are in PowerShell. For GhostScript I'm just using CLI: "C:\Program Files\gs\gs9.56.1\bin\gswin64c.exe" -dBATCH -dNOPAUSE -dQUIET -dNoCancel -sDEVICE=txtwrite -sOutputFile=%stdout "InputFile.pdf"
This give you the whole PDF and now just parse the text. For iText7 I am using PSWritePDF module: https://github.com/EvotecIT/PSWritePDF https://www.powershellgallery.com/packages/PSWritePDF/0.0.19 You just use it's Cmdlet ConvertPDFToText: Convert-PDFToText -FilePath "InputFile.pdf" Now just parse the text. On Sat, Aug 6, 2022 at 11:46 AM Tom Browder <tom.brow...@gmail.com> wrote: > On Sat, Aug 6, 2022 at 10:43 AM Glenn Fowler <gfowl...@outlook.com> wrote: > > > > Look into GhostScript and iText7. I have used both to pull text out of > PDF's easily in scripts. > > What program language? Can you show me an example of your code to do that? > > Thanks. > > -Tom > _______________________________________________ gnucash-user mailing list gnucash-user@gnucash.org To update your subscription preferences or to unsubscribe: https://lists.gnucash.org/mailman/listinfo/gnucash-user ----- Please remember to CC this list on all your replies. You can do this by using Reply-To-List or Reply-All.