On 3/23/25 3:40 PM, Matus UHLAR - fantomas wrote:
On Sat, Mar 22, 2025 at 07:51:03PM +0100, Matus UHLAR - fantomas wrote:On 20.03.25 13:52, Alex wrote: >I'm using ExtractText to identify QR codes in PDFs. > ># QR-code decoder >extracttext_external zbar /usr/bin/zbarimg -q -D {} >extracttext_use zbar .jpg .png .pdf .webp >image/(?:jpeg|png) application/pdf >add_header all ExtractText-Uris _EXTRACTTEXTURIS_ > >However, now they're sending them in Word doc/docx format. Any tips on how >to do that? > >This was just a regular Word document with a .docx extension.unfortunately, ExtractText currently (afaik) does not support - conversion between formats (extracting images from doc,pdf etc)On 22.03.25 21:15, Giovanni Bechis wrote:if you have ghostscript installed extracting barcodes from pdf files should work. Giovanniboth text and barcodes? Even QR codes? Can you chare the commands?
I have the same config you have and it can extract data from pdf files. Atm I am running with zbar-0.23.93 and ghostscript-10.03.1. Giovanni
- running multiple programs over the same file (e.g. OCR and QR extracting)
OpenPGP_signature.asc
Description: OpenPGP digital signature