Re: ExtractText and docs?

Matus UHLAR - fantomas Sat, 22 Mar 2025 11:51:21 -0700

On 20.03.25 13:52, Alex wrote:

I'm using ExtractText to identify QR codes in PDFs.


# QR-code decoder
extracttext_external    zbar            /usr/bin/zbarimg -q -D {}
extracttext_use         zbar            .jpg .png .pdf .webp
image/(?:jpeg|png) application/pdf
add_header              all             ExtractText-Uris _EXTRACTTEXTURIS_

However, now they're sending them in Word doc/docx format. Any tips on how
to do that?

This was just a regular Word document with a .docx extension.


unfortunately, ExtractText currently (afaik) does not support
- conversion between formats (extracting images from doc,pdf etc)
- running multiple programs over the same file
  (e.g. OCR and QR extracting)

--
Matus UHLAR - fantomas, [email protected] ; http://www.fantomas.sk/
Warning: I wish NOT to receive e-mail advertising to this address.
Varovanie: na tuto adresu chcem NEDOSTAVAT akukolvek reklamnu postu.
It's now safe to throw off your computer.

Re: ExtractText and docs?

Reply via email to