[tesseract-ocr] libtesseract skip OCR, just create invisible text layer

lbr Tue, 04 Jul 2023 22:12:57 -0700

I'm trying to create a searchable PDF from a scanned one. I want to use 
Textract as the OCR engine instead of Tesseract. Is there a way to make 
libtesseract skip the OCR step and only create the invisible text layer 
(from the characters Textract extracted), then apply it to the input PDF?


I read that libtesseract is what OCRmyPDF uses to create the invisible text 
layer for searchable PDFs.
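
To make the goal concrete, here is a rough sketch of the input side only. It 
assumes the external OCR results are Amazon Textract-style word entries, each 
with a text string and a bounding box given as fractions of the page size 
(Left, Top, Width, Height). It converts them to minimal hOCR, the word-box 
format Tesseract itself can emit and that tools such as hocr-tools' hocr-pdf 
read. The function name, the "text"/"box" keys, and the sample values are 
made up for illustration; whether libtesseract can take text like this and 
skip recognition is exactly the open question.

```python
from html import escape


def words_to_hocr(words, page_width, page_height):
    """Build a minimal hOCR page from word boxes with normalized coordinates.

    `words` is a hypothetical list of dicts: {"text": str, "box": {...}},
    where "box" holds Left/Top/Width/Height as fractions of the page size.
    """
    spans = []
    for i, word in enumerate(words):
        box = word["box"]
        # Scale the fractional coordinates to pixel coordinates of the scan.
        x0 = round(box["Left"] * page_width)
        y0 = round(box["Top"] * page_height)
        x1 = round((box["Left"] + box["Width"]) * page_width)
        y1 = round((box["Top"] + box["Height"]) * page_height)
        spans.append(
            f"<span class='ocrx_word' id='word_{i}' "
            f"title='bbox {x0} {y0} {x1} {y1}'>{escape(word['text'])}</span>"
        )
    body = "\n".join(spans)
    return (
        f"<div class='ocr_page' title='bbox 0 0 {page_width} {page_height}'>\n"
        f"{body}\n"
        "</div>"
    )


if __name__ == "__main__":
    # Illustrative values only, not real OCR output.
    sample = [
        {"text": "Invoice",
         "box": {"Left": 0.1, "Top": 0.2, "Width": 0.3, "Height": 0.05}},
    ]
    print(words_to_hocr(sample, page_width=1000, page_height=2000))
```

A full hOCR file would also need the surrounding HTML document and line or 
paragraph elements; this only shows how the word boxes map across.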
