I'll look into that Linux option :)  as for the save method, I used the 
show method on the object and it had a path in the temp directory. So I 
asked ChatGPT how the file could be on disk and it told me that the save 
method created it. If you're right then it's just another hallucination by 
the model. I use it to teach me, as I'm learning alone. Thanks for the 
replies 🙂
On Tuesday, February 14, 2023 at 5:49:45 PM UTC-3 Merlijn Wajer wrote:

> Hi,
>
> On 14/02/2023 21:16, Flávio. wrote:
> > Thanks, i'm still trying to figure it out. It seems when the PIL image 
> > is saved, unfortunately it saves a temp file to disk. My goal is to not 
> > write to disk, because this application will read a lot of files and I 
> > want to spare my SSD. My code receives byte data from a Dart program (I 
> > checked it is correct).   So far the py file looks like this but i'm not 
> > getting anything in return.
>
> At this point I maybe ought to reply off list, but you could also save 
> the images to a "tmpfs" on Linux if you don't want to deal with stdin 
> and have the images never hit the disk/ssd: 
> https://www.kernel.org/doc/html/v5.7/filesystems/tmpfs.html
>
> In any case - when you write that PIL 'saves a temp file to disk', what 
> lead you to that conclusion? The code below really shouldn't do that.
>
> Regards,
> Merlijn
>
> > def main():
> >     base64_image = sys.stdin.read()
> >     image_bytes = base64.b64decode(base64_image)
> >     with io.BytesIO(image_bytes) as input:
> >         pil_image = Image.open(input)
> >         with io.BytesIO() as output:
> >             pil_image.save(output, format='PNG', compress=0, 
> > compress_level=0)  # using disk!
> >             output.seek(0)
> > 
> >             env = os.environ.copy()
> >             env['OMP_THREAD_LIMIT'] = '1'
> > 
> >             p = subprocess.Popen([tesseractPath, '-', '-','-l','por'],
> >                                  stdin=subprocess.PIPE,
> >                                  stdout=subprocess.PIPE,
> >                                  stderr=subprocess.PIPE,
> >                                  env=env)
> >             output, stderr = p.communicate(output.read())
> >             stderr = stderr.decode('utf-8')
> > 
> >             if stderr:
> >                 logger.warning('tesseract_baselines stderr: %s', stderr)
> >             else:
> >                 sys.stdout(output.encode('utf-8').strip())
> > 
> > 
> > if __name__ == '__main__':
> >      main()
> > 
> > 
> > On Tuesday, February 14, 2023 at 4:11:13 PM UTC-3 Merlijn Wajer wrote:
> > 
> > 
> > Hi,
> > 
> > On 14/02/2023 19:10, Flávio. wrote:
> > > Sorry, how can I do that?  I'm trying to send image binary data,
> > not a
> > > path. The goal is to not write a file to disk and use only
> > memory. Could
> > > you please write a code that sends the data (binary) to the stdin of
> > > tesseract? it can be in Python, Dart or Java :(  I've tried
> > ChatGPT but
> > > it is wrong and gets lost
> > 
> > Normally I'd say 'left as an exercises to the reader' but I so
> > happen to
> > have a snippet around that ought to give you a general idea.
> > 
> > This uses io.BytesIO in Python 3 to save the image (stream) to, it
> > contains an uncompressed PNG (compression will just slow things down).
> > It assumes that the variable "pil_image" contains a PIL.Image object.
> > 
> > The code to use just one core in Tesseract is of course entirely
> > optional. I didn't *test* this to work (I modified it a bit - it works
> > in another setting), but it should work in theory:
> > 
> > > with io.BytesIO() as output:
> > > pil_image.save(output, format='PNG', compress=0, compress_level=0)
> > > output.seek(0)
> > >
> > > # Let's just use one core in tesseract
> > > env = os.environ.copy()
> > > env['OMP_THREAD_LIMIT'] = '1'
> > >
> > > p = subprocess.Popen(['tesseract', '-', '-'],
> > > stdin=subprocess.PIPE,
> > > stdout=subprocess.PIPE,
> > > stderr=subprocess.PIPE,
> > > env=env)
> > > output, stderr = p.communicate(output.read())
> > > stderr = stderr.decode('utf-8')
> > >
> > > if stderr:
> > > logger.warning('tesseract_baselines stderr: %s', stderr)
> > 
> > 
> > Regards,
> > Merlijn
> > 
> > -- 
> > You received this message because you are subscribed to the Google 
> > Groups "tesseract-ocr" group.
> > To unsubscribe from this group and stop receiving emails from it, send 
> > an email to tesseract-oc...@googlegroups.com 
> > <mailto:tesseract-oc...@googlegroups.com>.
> > To view this discussion on the web visit 
> > 
> https://groups.google.com/d/msgid/tesseract-ocr/ebd9d42d-244c-4a6b-8ab8-c1efd87db501n%40googlegroups.com
>  
> > <
> https://groups.google.com/d/msgid/tesseract-ocr/ebd9d42d-244c-4a6b-8ab8-c1efd87db501n%40googlegroups.com?utm_medium=email&utm_source=footer
> >.
>

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to tesseract-ocr+unsubscr...@googlegroups.com.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/14c7e227-7f84-4e0d-91c6-f5f257a07cd6n%40googlegroups.com.

Reply via email to