On 20.03.25 13:52, Alex wrote:
I'm using ExtractText to identify QR codes in PDFs.# QR-code decoder extracttext_external zbar /usr/bin/zbarimg -q -D {} extracttext_use zbar .jpg .png .pdf .webp image/(?:jpeg|png) application/pdf add_header all ExtractText-Uris _EXTRACTTEXTURIS_ However, now they're sending them in Word doc/docx format. Any tips on how to do that? This was just a regular Word document with a .docx extension.
unfortunately, ExtractText currently (afaik) does not support - conversion between formats (extracting images from doc,pdf etc) - running multiple programs over the same file (e.g. OCR and QR extracting) -- Matus UHLAR - fantomas, [email protected] ; http://www.fantomas.sk/ Warning: I wish NOT to receive e-mail advertising to this address. Varovanie: na tuto adresu chcem NEDOSTAVAT akukolvek reklamnu postu. It's now safe to throw off your computer.
