Hi. I have no way to check on "Amazon Linux" I checked for: > Debian: ≥ 9 > Fedora: ≥ 29 > Ubuntu: ≥ 16.04 > CentOS ≥ 8 > openSUSE ≥ 42.3 > openSUSE Tumbleweed
пн, 14 сент. 2020 г. в 11:39, mit <kollolmit...@gmail.com>: > Thanks Alex..So does this work only on Ubuntu or it can be used in Amazon > Linux AMI's too? > > On Thursday, September 10, 2020 at 8:39:01 AM UTC+5:30 shree wrote: > >> Thanks, Alex. >> >> I suggest that you also add this to tesseract documentation, tessdoc repo. >> >> On Wed, Sep 9, 2020, 23:30 Александр Поздняков <irc.a...@gmail.com> >> wrote: >> >>> Hi. >>> Alternatively, use AppImage (Ubuntu >= 16.04) >>> 1. Download >>> >>>> wget >>>> https://github.com/AlexanderP/tesseract-appimage/releases/download/v5.0.0-alpha-773-gd33ed/tesseract-5.0.0-alpha-773-gd33ed-x86_64.AppImage >>>> chmod +x tesseract-5.0.0-alpha-773-gd33ed-x86_64.AppImage >>> >>> 2.Run >>> >>>> ./tesseract-5.0.0-alpha-773-gd33ed-x86_64.AppImage --l eng page.jpg - >>> >>> or >>> >>>> ./tesseract-5.0.0-alpha-773-gd33ed-x86_64.AppImage --appimage-extract >>> >>> ./squashfs-root/AppRun --l eng page.jpg - >>> >>> >>> >>> Instructions for creating AppImage: >>> https://github.com/AlexanderP/tesseract-appimage >>> >>> >>> ср, 9 сент. 2020 г. в 08:57, Mobeen Ali <moby...@gmail.com>: >>> >>>> @mit >>>> >>>> I was working on AWS Lambda and i had built the libraries for OCR >>>> functions, just follow the docker part of this post/tutorial by Amine >>>> Tamasna: >>>> >>>> >>>> https://medium.com/analytics-vidhya/build-tesseract-serverless-api-using-aws-lambda-and-docker-in-minutes-dd97a79b589b >>>> >>>> You will be good to go... >>>> On Tuesday, September 8, 2020 at 4:09:56 PM UTC+3 mit wrote: >>>> >>>>> Thanks for the suggestions..Will look through it. I mainly need static >>>>> tesseract to directly use it in AWS Lambda,and hence I will have to find a >>>>> way to build it in linux. >>>>> >>>>> On Tuesday, September 8, 2020 at 6:12:59 PM UTC+5:30 zdenop wrote: >>>>> >>>>>> vcpkg static build is not supported on linux: >>>>>> >>>>>> Error: invalid triplet: x64-linux-static >>>>>> Available architecture triplets >>>>>> VCPKG built-in triplets: >>>>>> x64-osx >>>>>> x64-windows >>>>>> arm-uwp >>>>>> arm64-windows >>>>>> x64-linux >>>>>> x86-windows >>>>>> x64-uwp >>>>>> x64-windows-static >>>>>> >>>>>> VCPKG community triplets: >>>>>> s390x-linux >>>>>> x64-windows-static-md >>>>>> arm64-mingw-dynamic >>>>>> x86-ios >>>>>> x86-windows-static >>>>>> arm-mingw-static >>>>>> wasm32-emscripten >>>>>> arm64-uwp >>>>>> arm64-ios >>>>>> arm-ios >>>>>> x86-windows-static-md >>>>>> x86-uwp >>>>>> x86-mingw-static >>>>>> arm64-linux >>>>>> arm64-mingw-static >>>>>> arm-windows >>>>>> x64-ios >>>>>> x86-windows-v120 >>>>>> x64-osx-dynamic >>>>>> arm-linux >>>>>> x64-mingw-static >>>>>> arm64-osx >>>>>> x86-mingw-dynamic >>>>>> x64-mingw-dynamic >>>>>> arm-mingw-dynamic >>>>>> arm64-windows-static >>>>>> >>>>>> Zdenko >>>>>> >>>>>> >>>>>> ut 8. 9. 2020 o 14:13 Zdenko Podobny <zde...@gmail.com> napísal(a): >>>>>> >>>>>>> I did not try it on linux, but you can try to use Microsoft vcpkg[1]to >>>>>>> build static leptonica[2]... (on windows it works). >>>>>>> >>>>>>> [1] https://github.com/microsoft/vcpkg#quick-start-unix >>>>>>> [2] >>>>>>> https://github.com/charlesw/tesseract/wiki/Compiling-Tesseract-and-Libleptonica-(using-vcpkg) >>>>>>> >>>>>>> Zdenko >>>>>>> >>>>>>> >>>>>>> ut 8. 9. 2020 o 13:56 Zdenko Podobny <zde...@gmail.com> napísal(a): >>>>>>> >>>>>>>> As I mentioned in a previous email: you need to build a static >>>>>>>> leptonica library with all its dependencies (image libraries) as static >>>>>>>> libraries. >>>>>>>> Maybe it would require more tweaking ;-) : >>>>>>>> >>>>>>>> > ldd /usr/bin/tesseract >>>>>>>> linux-vdso.so.1 (0x00007ffdd44e2000) >>>>>>>> libtesseract.so.5 => /usr/lib64/libtesseract.so.5 >>>>>>>> (0x00007faa42f5d000) >>>>>>>> liblept.so.5 => /usr/lib64/liblept.so.5 (0x00007faa42ac9000) >>>>>>>> libpng16.so.16 => /usr/lib64/libpng16.so.16 >>>>>>>> (0x00007faa42886000) >>>>>>>> libjpeg.so.8 => /usr/lib64/libjpeg.so.8 (0x00007faa4261d000) >>>>>>>> libgif.so.7 => /usr/lib64/libgif.so.7 (0x00007faa42414000) >>>>>>>> libtiff.so.5 => /usr/lib64/libtiff.so.5 (0x00007faa4219b000) >>>>>>>> libwebpmux.so.2 => /usr/lib64/libwebpmux.so.2 >>>>>>>> (0x00007faa41f91000) >>>>>>>> libwebp.so.6 => /usr/lib64/libwebp.so.6 (0x00007faa41d33000) >>>>>>>> libopenjp2.so.7 => /usr/lib64/libopenjp2.so.7 >>>>>>>> (0x00007faa41ae1000) >>>>>>>> libz.so.1 => /lib64/libz.so.1 (0x00007faa418ca000) >>>>>>>> libarchive.so.13 => /usr/lib64/libarchive.so.13 >>>>>>>> (0x00007faa41615000) >>>>>>>> libcurl.so.4 => /usr/lib64/libcurl.so.4 (0x00007faa4138c000) >>>>>>>> librt.so.1 => /lib64/librt.so.1 (0x00007faa41184000) >>>>>>>> libpthread.so.0 => /lib64/libpthread.so.0 >>>>>>>> (0x00007faa40f65000) >>>>>>>> libstdc++.so.6 => /usr/lib64/libstdc++.so.6 >>>>>>>> (0x00007faa40b84000) >>>>>>>> libm.so.6 => /lib64/libm.so.6 (0x00007faa4084c000) >>>>>>>> libgomp.so.1 => /usr/lib64/libgomp.so.1 (0x00007faa40614000) >>>>>>>> libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007faa403fc000) >>>>>>>> libc.so.6 => /lib64/libc.so.6 (0x00007faa40041000) >>>>>>>> liblzma.so.5 => /usr/lib64/liblzma.so.5 (0x00007faa3fe07000) >>>>>>>> libjbig.so.2 => /usr/lib64/libjbig.so.2 (0x00007faa3fbfb000) >>>>>>>> libcrypto.so.1.1 => /usr/lib64/libcrypto.so.1.1 >>>>>>>> (0x00007faa3f769000) >>>>>>>> libacl.so.1 => /lib64/libacl.so.1 (0x00007faa3f560000) >>>>>>>> libbz2.so.1 => /usr/lib64/libbz2.so.1 (0x00007faa3f343000) >>>>>>>> libxml2.so.2 => /usr/lib64/libxml2.so.2 (0x00007faa3efdb000) >>>>>>>> libnghttp2.so.14 => /usr/lib64/libnghttp2.so.14 >>>>>>>> (0x00007faa3edb3000) >>>>>>>> libidn2.so.0 => /usr/lib64/libidn2.so.0 (0x00007faa3eb96000) >>>>>>>> libssh.so.4 => /usr/lib64/libssh.so.4 (0x00007faa3e915000) >>>>>>>> libpsl.so.5 => /usr/lib64/libpsl.so.5 (0x00007faa3e705000) >>>>>>>> libssl.so.1.1 => /usr/lib64/libssl.so.1.1 >>>>>>>> (0x00007faa3e499000) >>>>>>>> libgssapi_krb5.so.2 => /usr/lib64/libgssapi_krb5.so.2 >>>>>>>> (0x00007faa3e24d000) >>>>>>>> libldap_r-2.4.so.2 => /usr/lib64/libldap_r-2.4.so.2 >>>>>>>> (0x00007faa3dff9000) >>>>>>>> liblber-2.4.so.2 => /usr/lib64/liblber-2.4.so.2 >>>>>>>> (0x00007faa3ddea000) >>>>>>>> /lib64/ld-linux-x86-64.so.2 (0x00007faa434b4000) >>>>>>>> libdl.so.2 => /lib64/libdl.so.2 (0x00007faa3dbe6000) >>>>>>>> libattr.so.1 => /lib64/libattr.so.1 (0x00007faa3d9e1000) >>>>>>>> libunistring.so.2 => /usr/lib64/libunistring.so.2 >>>>>>>> (0x00007faa3d65f000) >>>>>>>> libkrb5.so.3 => /usr/lib64/libkrb5.so.3 (0x00007faa3d383000) >>>>>>>> libk5crypto.so.3 => /usr/lib64/libk5crypto.so.3 >>>>>>>> (0x00007faa3d151000) >>>>>>>> libcom_err.so.2 => /lib64/libcom_err.so.2 >>>>>>>> (0x00007faa3cf4d000) >>>>>>>> libkrb5support.so.0 => /usr/lib64/libkrb5support.so.0 >>>>>>>> (0x00007faa3cd40000) >>>>>>>> libresolv.so.2 => /lib64/libresolv.so.2 (0x00007faa3cb29000) >>>>>>>> libsasl2.so.3 => /usr/lib64/libsasl2.so.3 >>>>>>>> (0x00007faa3c90c000) >>>>>>>> libkeyutils.so.1 => /usr/lib64/libkeyutils.so.1 >>>>>>>> (0x00007faa3c708000) >>>>>>>> libselinux.so.1 => /lib64/libselinux.so.1 >>>>>>>> (0x00007faa3c4df000) >>>>>>>> libpcre.so.1 => /usr/lib64/libpcre.so.1 (0x00007faa3c252000) >>>>>>>> >>>>>>>> I would suggest building a minimal leptonica and tesseract library, >>>>>>>> to avoid extra work with linking. E.g. if you will open image with >>>>>>>> other >>>>>>>> library (python PIL [1] , opencv [2] ...) you do not any image support >>>>>>>> in >>>>>>>> leptonica - just pass image data to leptonica/tesseract) >>>>>>>> >>>>>>>> [1] >>>>>>>> https://stackoverflow.com/questions/55195932/typeerror-initializer-for-ctype-unsigned-int-must-be-a-cdata-pointer-not-b/57776268#57776268 >>>>>>>> [2] >>>>>>>> https://stackoverflow.com/questions/39293922/convert-between-opencv-mat-and-leptonica-pix >>>>>>>> >>>>>>>> Zdenko >>>>>>>> >>>>>>>> >>>>>>>> ut 8. 9. 2020 o 12:37 mit <kollol...@gmail.com> napísal(a): >>>>>>>> >>>>>>>>> The tesseract binary it creates,still needs leptonica files to run >>>>>>>>> , which in essence breaks the core idea of static linking >>>>>>>>> >>>>>>>>> On Tuesday, September 8, 2020 at 12:38:16 PM UTC+5:30 Mobeen Ali >>>>>>>>> wrote: >>>>>>>>> >>>>>>>>>> @mit heres the link to the github page: >>>>>>>>>> >>>>>>>>>> https://github.com/wingedrhino/static-tesseract >>>>>>>>>> >>>>>>>>>> This guy has a single script written for the whole task. but when >>>>>>>>>> build the static version of tesseract and run it, i get this error: >>>>>>>>>> >>>>>>>>>> pytesseract.pytesseract.TesseractError: (127, >>>>>>>>>> '/home/dpsocr/mysite/tesseract-static/bin/tesseract: symbol lookup >>>>>>>>>> error: >>>>>>>>>> /home/dpsocr/mysite/tesseract-static/bin/tesseract: undefined symbol: >>>>>>>>>> pixaDisplayTiledInColumns') >>>>>>>>>> >>>>>>>>>> Give it a try and share what you get. >>>>>>>>>> Best of Luck! >>>>>>>>>> >>>>>>>>>> On Tuesday, September 8, 2020 at 10:04:00 AM UTC+3 mit wrote: >>>>>>>>>> >>>>>>>>>>> >>>>>>>>>>> No..how have you built a static version of tesseract? Any link? >>>>>>>>>>> >>>>>>>>>>> Thanks >>>>>>>>>>> On Tuesday, September 8, 2020 at 12:27:01 PM UTC+5:30 Mobeen Ali >>>>>>>>>>> wrote: >>>>>>>>>>> >>>>>>>>>>>> I tried using a static version of tesseract but i got this >>>>>>>>>>>> error now, >>>>>>>>>>>> >>>>>>>>>>>> pytesseract.pytesseract.TesseractError: (127, >>>>>>>>>>>> '/home/dpsocr/mysite/tesseract-static/bin/tesseract: symbol lookup >>>>>>>>>>>> error: >>>>>>>>>>>> /home/dpsocr/mysite/tesseract-static/bin/tesseract: undefined >>>>>>>>>>>> symbol: >>>>>>>>>>>> pixaDisplayTiledInColumns') >>>>>>>>>>>> >>>>>>>>>>>> Please help!! >>>>>>>>>>>> >>>>>>>>>>>> On Tuesday, September 8, 2020 at 9:42:08 AM UTC+3 Mobeen Ali >>>>>>>>>>>> wrote: >>>>>>>>>>>> >>>>>>>>>>>>> @mit did you found any solution? >>>>>>>>>>>>> >>>>>>>>>>>>> On Monday, September 7, 2020 at 5:32:56 PM UTC+3 mit wrote: >>>>>>>>>>>>> >>>>>>>>>>>>>> I am also looking for it. The process to create static >>>>>>>>>>>>>> tesseract >>>>>>>>>>>>>> >>>>>>>>>>>>>> On Monday, September 7, 2020 at 7:11:47 PM UTC+5:30 Mobeen >>>>>>>>>>>>>> Ali wrote: >>>>>>>>>>>>>> >>>>>>>>>>>>>>> Thanks @zdenop for your response. Is there any 'How To?' for >>>>>>>>>>>>>>> building the static version of tesseract on macOS or Ubuntu? >>>>>>>>>>>>>>> >>>>>>>>>>>>>>> >>>>>>>>>>>>>>> On Monday, September 7, 2020 at 2:01:29 PM UTC+3 zdenop >>>>>>>>>>>>>>> wrote: >>>>>>>>>>>>>>> >>>>>>>>>>>>>>>> try to build static version of tesseract >>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>> Zdenko >>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>> po 7. 9. 2020 o 10:48 Mobeen Ali <moby...@gmail.com> >>>>>>>>>>>>>>>> napísal(a): >>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>> Hi! >>>>>>>>>>>>>>>>> I was trying to build and upload tesseract online. I've >>>>>>>>>>>>>>>>> somehow found the tesseract but I'm getting this error: >>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>> "pytesseract.pytesseract.TesseractError: (127, >>>>>>>>>>>>>>>>> '/home/dpsocr/mysite/tesseract-ara/bin/tesseract: error while >>>>>>>>>>>>>>>>> loading >>>>>>>>>>>>>>>>> shared libraries: libtesseract.so.4: cannot open shared >>>>>>>>>>>>>>>>> object file: No >>>>>>>>>>>>>>>>> such file or directory')" >>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>> It seems like tesseract is unable to find libtesseract >>>>>>>>>>>>>>>>> library. >>>>>>>>>>>>>>>>> But the libraries are in the 'lib' directory as follows: >>>>>>>>>>>>>>>>> |_mysite >>>>>>>>>>>>>>>>> |_tesseract >>>>>>>>>>>>>>>>> |_bin >>>>>>>>>>>>>>>>> |_lib >>>>>>>>>>>>>>>>> |_tesseract >>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>> *So the Question is that, I have built the tesseract using >>>>>>>>>>>>>>>>> docker now I wanted to know that if there is any way to build >>>>>>>>>>>>>>>>> tesseract and >>>>>>>>>>>>>>>>> it's libraries, and then compress them together to upload >>>>>>>>>>>>>>>>> them to the >>>>>>>>>>>>>>>>> online hosting so that tesseract will know its required >>>>>>>>>>>>>>>>> libraries are along >>>>>>>>>>>>>>>>> with it.* >>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>> Thanks for any advice. >>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>> -- >>>>>>>>>>>>>>>>> You received this message because you are subscribed to >>>>>>>>>>>>>>>>> the Google Groups "tesseract-ocr" group. >>>>>>>>>>>>>>>>> To unsubscribe from this group and stop receiving emails >>>>>>>>>>>>>>>>> from it, send an email to tesseract-oc...@googlegroups.com >>>>>>>>>>>>>>>>> . >>>>>>>>>>>>>>>>> To view this discussion on the web visit >>>>>>>>>>>>>>>>> https://groups.google.com/d/msgid/tesseract-ocr/551aacb8-59ff-4c5f-881d-c8f94aeba0a4n%40googlegroups.com >>>>>>>>>>>>>>>>> <https://groups.google.com/d/msgid/tesseract-ocr/551aacb8-59ff-4c5f-881d-c8f94aeba0a4n%40googlegroups.com?utm_medium=email&utm_source=footer> >>>>>>>>>>>>>>>>> . >>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>> -- >>>>>>>>> You received this message because you are subscribed to the Google >>>>>>>>> Groups "tesseract-ocr" group. >>>>>>>>> To unsubscribe from this group and stop receiving emails from it, >>>>>>>>> send an email to tesseract-oc...@googlegroups.com. >>>>>>>>> To view this discussion on the web visit >>>>>>>>> https://groups.google.com/d/msgid/tesseract-ocr/33fa3388-6703-44a7-ba4f-8ed69930a3e7n%40googlegroups.com >>>>>>>>> <https://groups.google.com/d/msgid/tesseract-ocr/33fa3388-6703-44a7-ba4f-8ed69930a3e7n%40googlegroups.com?utm_medium=email&utm_source=footer> >>>>>>>>> . >>>>>>>>> >>>>>>>> -- >>>> You received this message because you are subscribed to the Google >>>> Groups "tesseract-ocr" group. >>>> To unsubscribe from this group and stop receiving emails from it, send >>>> an email to tesseract-oc...@googlegroups.com. >>>> To view this discussion on the web visit >>>> https://groups.google.com/d/msgid/tesseract-ocr/4255f378-b1b6-4447-a298-900086222536n%40googlegroups.com >>>> <https://groups.google.com/d/msgid/tesseract-ocr/4255f378-b1b6-4447-a298-900086222536n%40googlegroups.com?utm_medium=email&utm_source=footer> >>>> . >>>> >>> -- >>> You received this message because you are subscribed to the Google >>> Groups "tesseract-ocr" group. >>> To unsubscribe from this group and stop receiving emails from it, send >>> an email to tesseract-oc...@googlegroups.com. >>> >> To view this discussion on the web visit >>> https://groups.google.com/d/msgid/tesseract-ocr/CANzBpR0HGwLQDyK55KWCEEX4peiw65RYj0sXqUkmg%2BPR1G%3D3iQ%40mail.gmail.com >>> <https://groups.google.com/d/msgid/tesseract-ocr/CANzBpR0HGwLQDyK55KWCEEX4peiw65RYj0sXqUkmg%2BPR1G%3D3iQ%40mail.gmail.com?utm_medium=email&utm_source=footer> >>> . >>> >> -- > You received this message because you are subscribed to the Google Groups > "tesseract-ocr" group. > To unsubscribe from this group and stop receiving emails from it, send an > email to tesseract-ocr+unsubscr...@googlegroups.com. > To view this discussion on the web visit > https://groups.google.com/d/msgid/tesseract-ocr/943f95fa-057a-4723-b36e-d2b6e32a66f4n%40googlegroups.com > <https://groups.google.com/d/msgid/tesseract-ocr/943f95fa-057a-4723-b36e-d2b6e32a66f4n%40googlegroups.com?utm_medium=email&utm_source=footer> > . > -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-ocr+unsubscr...@googlegroups.com. To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/CANzBpR0QaUp0F3SAoBXvVVjdTEHCDsN3oV35c%3Do%2BeriNBqBAHA%40mail.gmail.com.