On Sat, Mar 22, 2025 at 07:51:03PM +0100, Matus UHLAR - fantomas wrote:
On 20.03.25 13:52, Alex wrote:
>I'm using ExtractText to identify QR codes in PDFs.
>
># QR-code decoder
>extracttext_external zbar /usr/bin/zbarimg -q -D {}
>extracttext_use zbar .jpg .png .pdf .webp image/(?:jpeg|png) application/pdf
>add_header all ExtractText-Uris _EXTRACTTEXTURIS_
>
>However, now they're sending them in Word doc/docx format. Any tips on how
>to do that?
>
>This was just a regular Word document with a .docx extension.

unfortunately, ExtractText currently (afaik) does not support
- conversion between formats (extracting images from doc, pdf etc)
On 22.03.25 21:15, Giovanni Bechis wrote:
if you have ghostscript installed, extracting barcodes from pdf files should work.
 Giovanni
both text and barcodes? Even QR codes? Can you share the commands?
- running multiple programs over the same file (e.g. OCR and QR extracting)
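
For the first point, one possible workaround is a small wrapper that does the conversion itself. The sketch below is untested against real mail and rests on a few assumptions: a .docx file is a zip archive that keeps embedded pictures under word/media/, zbarimg lives at /usr/bin/zbarimg as in the config quoted above, and the function names (embedded_images, decode_docx) are made up for illustration. It does not handle the older binary .doc format.

```python
#!/usr/bin/env python3
"""Hypothetical wrapper: decode QR codes from images embedded in a .docx."""
import subprocess
import sys
import tempfile
import zipfile
from pathlib import Path

# Assumption: only these image types are worth handing to zbarimg.
IMAGE_SUFFIXES = {".png", ".jpg", ".jpeg", ".gif", ".bmp"}


def embedded_images(docx_path):
    """Return names of image members stored under word/media/ in the archive."""
    with zipfile.ZipFile(docx_path) as archive:
        return [
            name for name in archive.namelist()
            if name.startswith("word/media/")
            and Path(name).suffix.lower() in IMAGE_SUFFIXES
        ]


def decode_docx(docx_path, zbarimg="/usr/bin/zbarimg"):
    """Extract each embedded image and collect whatever zbarimg prints."""
    results = []
    with zipfile.ZipFile(docx_path) as archive, \
            tempfile.TemporaryDirectory() as tmp:
        for name in embedded_images(docx_path):
            target = Path(tmp) / Path(name).name
            target.write_bytes(archive.read(name))
            # Same flags as in the quoted config; a non-zero exit status
            # (e.g. no code found) is ignored on purpose.
            proc = subprocess.run(
                [zbarimg, "-q", "-D", str(target)],
                capture_output=True, text=True, check=False,
            )
            results.extend(line for line in proc.stdout.splitlines() if line)
    return results


if __name__ == "__main__" and len(sys.argv) > 1:
    for line in decode_docx(sys.argv[1]):
        print(line)
```

If this were saved under a hypothetical path such as /usr/local/bin/docx-qr.py, it could presumably be registered with extracttext_external and extracttext_use in the same way as zbar above, with {} as the file argument and .docx as the extension. The same wrapper approach might also cover the second point, since one script can call several programs on the same file and print their combined output.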
-- 
Matus UHLAR - fantomas, [email protected] ; http://www.fantomas.sk/
Warning: I wish NOT to receive e-mail advertising to this address.
Varovanie: na tuto adresu chcem NEDOSTAVAT akukolvek reklamnu postu.
"One World. One Web. One Program." - Microsoft promotional advertisement
"Ein Volk, ein Reich, ein Fuhrer!" - Adolf Hitler
