Re: ExtractText and docs?

Matus UHLAR - fantomas Sun, 23 Mar 2025 07:51:38 -0700

On Sat, Mar 22, 2025 at 07:51:03PM +0100, Matus UHLAR - fantomas wrote:

On 20.03.25 13:52, Alex wrote:
>I'm using ExtractText to identify QR codes in PDFs.
>
># QR-code decoder
>extracttext_external    zbar            /usr/bin/zbarimg -q -D {}
>extracttext_use         zbar            .jpg .png .pdf .webp image/(?:jpeg|png) application/pdf
>add_header              all             ExtractText-Uris _EXTRACTTEXTURIS_
>
>However, now they're sending them in Word doc/docx format. Any tips on how
>to do that?
>
>This was just a regular Word document with a .docx extension.


unfortunately, ExtractText currently (afaik) does not support
- conversion between formats (extracting images from doc,pdf etc)


On 22.03.25 21:15, Giovanni Bechis wrote:

if you have ghostscript installed extracting barcodes from pdf files
should work.
 Giovanni


both text and barcodes? Even QR codes?
Can you share the commands?

- running multiple programs over the same file
   (e.g. OCR and QR extracting)
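One possible workaround for both points is a small wrapper script that does the conversion itself and then calls the external tools. The following is only a minimal sketch, not something ExtractText provides: it assumes the .docx is a normal zip archive with its embedded images under word/media/, and it reuses the zbarimg command line from the config quoted above. The function names (extract_docx_images, run_tools, main) are made up for illustration.

```python
import subprocess
import sys
import tempfile
import zipfile
from pathlib import Path

# Assumption: only these embedded image types are worth scanning.
IMAGE_SUFFIXES = {".png", ".jpg", ".jpeg", ".gif", ".bmp", ".webp"}


def extract_docx_images(docx_path, out_dir):
    """Copy embedded images (word/media/*) out of a .docx into out_dir."""
    out_dir = Path(out_dir)
    extracted = []
    with zipfile.ZipFile(docx_path) as zf:
        for info in zf.infolist():
            if info.is_dir() or not info.filename.startswith("word/media/"):
                continue
            name = Path(info.filename)
            if name.suffix.lower() not in IMAGE_SUFFIXES:
                continue
            # Keep only the base name so a crafted archive
            # cannot write outside out_dir.
            target = out_dir / name.name
            target.write_bytes(zf.read(info))
            extracted.append(target)
    return extracted


def run_tools(path, commands):
    """Run each argv list ('{}' replaced by path) and join their stdout."""
    chunks = []
    for argv in commands:
        cmd = [str(path) if arg == "{}" else arg for arg in argv]
        try:
            result = subprocess.run(
                cmd, capture_output=True, text=True, timeout=30
            )
        except (OSError, subprocess.TimeoutExpired):
            continue  # tool missing or hung: skip it
        # A non-zero exit status is treated as "nothing found".
        if result.returncode == 0 and result.stdout.strip():
            chunks.append(result.stdout.strip())
    return "\n".join(chunks)


def main(argv):
    """Print whatever the tools decode from the images inside argv[1]."""
    if len(argv) != 2:
        print("usage: docx_qr.py FILE.docx", file=sys.stderr)
        return 2
    # Same zbarimg invocation as in the quoted config; more tools
    # (for example an OCR program) could be appended to this list.
    commands = [["/usr/bin/zbarimg", "-q", "-D", "{}"]]
    with tempfile.TemporaryDirectory() as tmp:
        for image in extract_docx_images(argv[1], tmp):
            text = run_tools(image, commands)
            if text:
                print(text)
    return 0
```

Saved as an executable script that ends with sys.exit(main(sys.argv)), something like this could in principle be registered through extracttext_external for .docx files in the same way as zbarimg above. That hookup is untested here, and old binary .doc files are not zip archives, so they would need a different converter.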


--
Matus UHLAR - fantomas, [email protected] ; http://www.fantomas.sk/
Warning: I wish NOT to receive e-mail advertising to this address.
Varovanie: na tuto adresu chcem NEDOSTAVAT akukolvek reklamnu postu.
"One World. One Web. One Program." - Microsoft promotional advertisement
"Ein Volk, ein Reich, ein Fuhrer!" - Adolf Hitler
