Compiling Tesseract under Cygwin

2011-07-01 Thread Simon Eigeldinger
Hello, I want to compile Tesseract from SVN under cygwin. Can someone tell me how to do that? Greetings, Simon -- Simon Eigeldinger Follow me on Twitter: http://www.twitter.com/domasofan/ E-Mail: simon.eigeldin...@vol.at MSN: simon_eigeldin...@hotmail.com ICQ: 121823966 Jabber: domaso

compiling tesseract under cygwin now with more details

2012-02-17 Thread Simon Eigeldinger
Hi all, so. now i had time to document this. please note there is loads of messages and things. tried to get it as detailed as possible. Here we go. I installed the whole cygwin suite and i installed leptonica and webp as stated in the article on http://code.google.com/p/python-tesseract/wik

Re: compiling tesseract under cygwin now with more details

2012-02-19 Thread Simon Eigeldinger
ter.cpp:49:25: error: 'clock' was not declared in this scope'? Zdenko [1] http://en.wikipedia.org/wiki/Portability_(computer_science) [2] http://code.google.com/p/tesseract-ocr/wiki/PlatformStatus On Fri, Feb 17, 2012 at 6:04 PM, Simon Eigeldinger wrote: Hi all, so. now i

Re: compiling tesseract under cygwin now with more details

2012-02-19 Thread Simon Eigeldinger
n.wikipedia.org/wiki/Portability_(computer_science) [2] http://code.google.com/p/tesseract-ocr/wiki/PlatformStatus On Fri, Feb 17, 2012 at 6:04 PM, Simon Eigeldinger wrote: Hi all, so. now i had time to document this. please note there is loads of messages and things. tried to get it as detailed as poss

Re: compiling tesseract under cygwin now with more details

2012-02-20 Thread Simon Eigeldinger
re is another problem - please post issue. Zdenko On Sun, Feb 19, 2012 at 10:54 AM, Simon Eigeldinger< simon.eigeldin...@vol.at> wrote: hi, i forgot something to write about portability. I meant not that portability for many systems i meant the portability to copy tesseract for exam

Re: compiling tesseract under cygwin now with more details

2012-02-20 Thread Simon Eigeldinger
at means there could be problems. Try r676 it could solve your problem. If there is another problem - please post issue. Zdenko On Sun, Feb 19, 2012 at 10:54 AM, Simon Eigeldinger< simon.eigeldin...@vol.at> wrote: hi, i forgot something to write about portability. I meant not that portab

[tesseract-ocr] tesseract data files

2018-03-02 Thread Simon Eigeldinger
Hi all, Just looked at the git commits for tesseract and read that there has been changes to the OCR modes. are the 3 tessdata sets still valid? tessdata_fast and tessdata_best have been updated so i guess those reflect the latest developments but tessdata hasn't an update since september. i

Re: [tesseract-ocr] tesseract data files

2018-03-04 Thread Simon Eigeldinger
s the recommended one to be used for OCR. * ShreeDevi भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com On Sat, Mar 3, 2018 at 2:04 AM, Simon Eigeldinger wrote: Hi all, Just looked at the git commits for tesseract and read that there has

Re: [tesseract-ocr] tesseract data files

2018-03-04 Thread Simon Eigeldinger
Simon Eigeldinger, wrote: Hi ShreeDevi, I have scraped the cygwin builds. i am using now the builds i get from the appveyor builds which just needs me to repackage the resulting stuff. so tessdata_best isn't like the wiki says for better accuracy? greetings, Simon Am 03.03.2018 um

Re: [tesseract-ocr] Re: Automagic Orientation Detection with the new LSTM models?

2018-03-06 Thread Simon Eigeldinger
Hi, like me as a blind i wonder how i might use some of those tools? because you can't see if the pic is good or bad. actually we might need something that does that automatically. any ideas on that? Greetings and thanks, Simon Am 06.03.2018 um 02:42 schrieb Michael Smith: I just do some prep

Re: [tesseract-ocr] Re: compiling tesseract on cygwin

2014-10-01 Thread Simon Eigeldinger
also saw the language data isn't available on the git repo. greetings and thanks, simon -- Simon Eigeldinger simon.ei...@vol.at -- Simon Eigeldinger Follow me on Twitter: http://www.twitter.com/domasofan/ E-Mail: simon.eigeldin...@vol.at MSN: simon_eigeldin...@hotmail.com ICQ: 1218

[tesseract-ocr] issues with pdf

2014-10-06 Thread Simon Eigeldinger
hi all, just tried the following with the provided eurotext.tif in the testing dir of the source package. used current git from this afternoon european time: i get this: $ tesseract eurotext.tif eurotext -l eng pdf Tesseract Open Source OCR Engine v3.04.00 with Leptonica Page 1 Error in fope

[tesseract-ocr] PDFs still broken?

2014-10-08 Thread Simon Eigeldinger
format is 4; unreadable Error during processing. tested with the eurotext.tif file from the testing directory on a windows system. compiled with cygwin. https://dl.dropboxusercontent.com/u/1598766/tesseract-error.7z greetings, simon -- Simon Eigeldinger Follow me on Twitter: http

Re: [tesseract-ocr] Re: PDF output not searchable within SumatraPDF

2014-10-15 Thread Simon Eigeldinger
. greetings, simon Am 15.10.2014 um 18:06 schrieb Chris Cameron: All the files I mention can be found here: https://www.dropbox.com/sh/v5w4zl0c2z1wra1/AACxjmomYL4o-iQEhBrLvNgHa Incidentally, I now see that Chrome's PDF viewer is also unable to search the PDF. Thanks, Chris -- Simon Eigeld

Re: [tesseract-ocr] Poor results with tesseract OCR'ing .tif (as compared to an on-line OCR)

2014-10-24 Thread Simon Eigeldinger
act. Attached is an image file that I used. Thanks. -- Simon Eigeldinger Follow me on Twitter: http://www.twitter.com/domasofan/ E-Mail: simon.eigeldin...@vol.at MSN: simon_eigeldin...@hotmail.com ICQ: 121823966 Jabber: domaso...@andrelouis.com --- Diese E-Mail ist frei von Viren und Malware, denn d

Re: [tesseract-ocr] Poor results with tesseract OCR'ing .tif (as compared to an on-line OCR)

2014-10-24 Thread Simon Eigeldinger
t this group at http://groups.google.com/group/tesseract-ocr. To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/0274edc9-8744-489b-bcf5-0eabc9dbd5c0%40googlegroups.com. For more options, visit https://groups.google.com/d/optout. -- Simon Eigeldinger Follow

[tesseract-ocr] how to automagically convert images that are best for tesseract?

2014-10-24 Thread Simon Eigeldinger
etter idea? thanks. greetings, simon -- Simon Eigeldinger Follow me on Twitter: http://www.twitter.com/domasofan/ E-Mail: simon.eigeldin...@vol.at MSN: simon_eigeldin...@hotmail.com ICQ: 121823966 Jabber: domaso...@andrelouis.com --- Diese E-Mail ist frei von Viren und Malware, denn der avast!

[tesseract-ocr] thanks to robert

2014-10-24 Thread Simon Eigeldinger
btw forgot to say thanks to robert melton for telling me about this script or at least for googling for that. wonder if we can get something awesome out of things. greetings, simon -- Simon Eigeldinger Follow me on Twitter: http://www.twitter.com/domasofan/ E-Mail: simon.eigeldin...@vol.at

[tesseract-ocr] building tesseract on windows using cygwin

2015-07-20 Thread Simon Eigeldinger
Hi all, just tried to compile tesseract on windows using cygwin from the git repo. this is what i got. looks pretty fine until we reach make. though i post the full stuff. i think i haven't forgotten any dependencies. maybe someone could have a look at this? thanks. $ ./autogen.sh --prefix=/ho

Re: [tesseract-ocr] building tesseract on windows using cygwin

2015-07-21 Thread Simon Eigeldinger
ngs, simon Am 20.07.2015 um 20:38 schrieb zdenko podobny: should be fixed - pull updates from git repo... Zdenko On Mon, Jul 20, 2015 at 6:13 PM, Simon Eigeldinger wrote: Hi all, just tried to compile tesseract on windows using cygwin from the git repo. this is what i got. looks pretty fine unti

Re: [tesseract-ocr] building tesseract on windows using cygwin

2015-07-22 Thread Simon Eigeldinger
und 351.7 mb. all the data files for tesseract which it can use at the moment. Let's see if it works. had no time currently to test but will do in the office tomorrow. greetings, simon Am 20.07.2015 um 20:38 schrieb zdenko podobny: should be fixed - pull updates from git repo... Z

[tesseract-ocr] tesseract on cygwin

2015-07-22 Thread Simon Eigeldinger
that. thats why i just dropped the error here. but i guess zdenko was writing to the other guy. greetings, simon -- Simon Eigeldinger Follow me on Twitter: http://www.twitter.com/domasofan/ E-Mail: simon.eigeldin...@vol.at MSN: simon_eigeldin...@hotmail.com ICQ: 121823966 Jabber: domaso...@andreloui

Re: [tesseract-ocr] tesseract on cygwin

2015-07-23 Thread Simon Eigeldinger
s. ShreeDevi भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com On Wed, Jul 22, 2015 at 11:11 PM, Simon Eigeldinger < simon.eigeldin...@vol.at> wrote: Hi, sorry for starting a new thread but i deleted all the other mails. just updated the

[tesseract-ocr] displayed version number of tesseract when compiled from git

2015-07-23 Thread Simon Eigeldinger
do you think about this? so everyone using the program where the version number shows up will see its a git compiled version. greetings, simon -- Simon Eigeldinger Follow me on Twitter: http://www.twitter.com/domasofan/ E-Mail: simon.eigeldin...@vol.at MSN: simon_eigeldin...@hotmail.com ICQ

[tesseract-ocr] tesseract on cygwin: training tools seem not to build

2015-07-23 Thread Simon Eigeldinger
Hi all, this is what make produces when it should make the training tools: sorry for sending all the stuff but maybe it might be interesting. $ make training make[1]: Entering directory '/home/Besitzer/tesseractsrc/training' depbase=`echo boxchar.lo | sed 's|[^/]*$|.deps/&|;s|\.lo$||'`;\ /bin/s

Re: [tesseract-ocr] displayed version number of tesseract when compiled from git

2015-07-23 Thread Simon Eigeldinger
23, 2015 at 6:33 PM, Simon Eigeldinger wrote: Hi all, at the moment when you compile the versions from git you get the version number 3.04.00 when you do a tesseract -v. but i guess when you compile directly from git it should show the version number like in the autogen and configure scripts

Re: [tesseract-ocr] tesseract on cygwin: training tools seem not to build

2015-07-23 Thread Simon Eigeldinger
hi, and i just opened a ticket: https://github.com/tesseract-ocr/tesseract/issues/61 greetings, simon Am 23.07.2015 um 23:23 schrieb Jim O'Regan: On 23 July 2015 at 19:02, Simon Eigeldinger wrote: Hi all, pango_font_info.cpp:223:46: error: 'strcasestr' was not declare

Re: [tesseract-ocr] tesseract on cygwin

2015-07-23 Thread Simon Eigeldinger
ile test/eurotext.tif format is 4; unreadable Error during processing. It looks like leptonica issue. Did you try to build and run leptonica progs (all that has pdf in name)? Zdenko -- Simon Eigeldinger Follow me on Twitter: http://www.twitter.com/domasofan/ E-Mail: simon.eigeldin...@vo

Re: [tesseract-ocr] tesseract on cygwin

2015-07-23 Thread Simon Eigeldinger
(and tesseract 3.03/3.04) and I guess it was not tested on cygwin yet. Zdenko On Fri, Jul 24, 2015 at 8:42 AM, Simon Eigeldinger wrote: Hi, i never tried to give tesseract a pdf as an input. cygwin has leptonica 1.71 or 1.72 by default so i used this for compiling. maybe leptonica doesn&#

Re: [tesseract-ocr] tesseract on cygwin

2015-07-24 Thread Simon Eigeldinger
dobny: it is not about input, but output. pdf output is key feature of leptonica 1.71 release (and tesseract 3.03/3.04) and I guess it was not tested on cygwin yet. Zdenko On Fri, Jul 24, 2015 at 8:42 AM, Simon Eigeldinger wrote: Hi, i never tried to give tesseract a pdf as an input. cygwi

[tesseract-ocr] how to compile tesseract on msys2/mingw?

2016-03-04 Thread Simon Eigeldinger
builds daily builds from the source. greetings, simon -- Simon Eigeldinger Follow me on Twitter: http://www.twitter.com/domasofan/ E-Mail: simon.eigeldin...@vol.at MSN: simon_eigeldin...@hotmail.com ICQ: 121823966 Jabber: domaso...@andrelouis.com --- Diese E-Mail wurde von Avast Antivirus-Software

Re: [tesseract-ocr] how to compile tesseract on msys2/mingw?

2016-03-05 Thread Simon Eigeldinger
भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com On Sat, Mar 5, 2016 at 1:38 AM, Simon Eigeldinger wrote: Hi all, Now i am back having changed from cygwin to msys2/mingw. anyone knows how to compile tesseract best on this platform? i also thought of getting a daily

[tesseract-ocr] how to compile tesseract for windows on a linux machine?

2016-05-07 Thread Simon Eigeldinger
Hi all, I have now a different machine for building stuff based on Arch Linux. Can i compile tesseract on linux for the windows machines i have? Which dependencies do i need? I never have done that and would be grateful for some hand holding. I just compiled stuff on cygwin. Thanks and greeting

[tesseract-ocr] how to compile tesseract for windows on a linux machine?

2016-05-11 Thread Simon Eigeldinger
Hi all, Just resending this message. Maybe it has fallen under the radar of someone who might have some clues for me. sorry to the others who read this already. I have now a different machine for building stuff based on Arch Linux. Can i compile tesseract on linux for the windows machines i h

Re: [tesseract-ocr] how to compile tesseract for windows on a linux machine?

2016-05-14 Thread Simon Eigeldinger
भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com On Wed, May 11, 2016 at 3:31 PM, Simon Eigeldinger wrote: Hi all, Just resending this message. Maybe it has fallen under the radar of someone who might have some clues for me. sorry to

[tesseract-ocr] do i get a performance boost when i compile tesseract as a 64 bit program?

2016-05-15 Thread Simon Eigeldinger
. do i get a performance boost when i compile tesseract with 64 bit? i also don't know if i can install cygwin 32 and 64 bit on the same system or if i just need cygwin 64 bit to also compile 32 bit progams. greetings, simon -- Simon Eigeldinger Follow me on Twitter: http://www.twitte

Re: [tesseract-ocr] do i get a performance boost when i compile tesseract as a 64 bit program?

2016-05-15 Thread Simon Eigeldinger
server for tesseract so i guess i will build my own builds then which i will share with people. i guess i would build 32 and 64 bit versions if i can with one install. greetings, simon Am 15.05.2016 um 13:10 schrieb Marco Atzeri: On 15/05/2016 12:33, Simon Eigeldinger wrote: Hi all, i am t

[tesseract-ocr] opening multi page pdfs using tesseract

2016-09-15 Thread Simon Eigeldinger
Hi all, can i use tesseract to open multipage pdfs directly? we have multi function printers which produce pdfs with images which can be run through ocr. how can i acomplish that for tesseract? do i need a second program for that? greetings, simon --- Diese E-Mail wurde von Avast Antivirus-S

Re: [tesseract-ocr] Re: opening multi page pdfs using tesseract

2016-09-16 Thread Simon Eigeldinger
, Simon Eigeldinger wrote: Hi all, can i use tesseract to open multipage pdfs directly? we have multi function printers which produce pdfs with images which can be run through ocr. how can i acomplish that for tesseract? do i need a second program for that? greetings, simon --- Diese E-Mail

[tesseract-ocr] AppVeyor: add downloadable builds

2016-12-21 Thread Simon Eigeldinger
Hi all, I just looked at the git logs and found basically this message and this is pretty interesting. so we seem to be able to use win32 and win64 binaries from tesseract rebuilt after a git commit. sounds great. i looked at https://ci.appveyor.com/project/zdenop/tesseract/ and found the ar

[tesseract-ocr] thanks for tesseract daily builds

2017-02-06 Thread Simon Eigeldinger
Hi all, i want to thank again for the tesseract daily builds which are compiled on appveyor for windows. that is a good thing so i can use the most recent features. btw the pdf files are pretty readable now using a screen reader though some line breaks or paragraphs are still kind of hanging

Re: [tesseract-ocr] thanks for tesseract daily builds

2017-02-07 Thread Simon Eigeldinger
le to users. It would be helpful if that info is shared/added to wiki. - excuse the brevity, sent from mobile On 07-Feb-2017 11:04 AM, "Simon Eigeldinger" wrote: Hi all, i want to thank again for the tesseract daily builds which are compiled on appveyor for windows. that is a good

Re: [tesseract-ocr] thanks for tesseract daily builds

2017-02-07 Thread Simon Eigeldinger
s, visit https://groups.google.com/d/optout. -- Simon Eigeldinger Follow me on Twitter: http://www.twitter.com/domasofan/ E-Mail: simon.eigeldin...@vol.at MSN: simon_eigeldin...@hotmail.com ICQ: 121823966 Jabber: domaso...@andrelouis.com --- Diese E-Mail wurde von Avast Antivirus-Softwa

[tesseract-ocr] new tessdata repos on github

2017-09-17 Thread &#x27;Simon Eigeldinger' via tesseract-ocr
Hi all, I guess i need some help understanding that. I have seen that there are now 3 repos on github containing .traineddata files. Let's see if i understand them right. Tessdata fast: Fast recognition but lesser accuracy. tessdata best: Slower recognition but higher accuracy. and there i

Re: [tesseract-ocr] new tessdata repos on github

2017-09-17 Thread &#x27;Simon Eigeldinger' via tesseract-ocr
भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com On Sun, Sep 17, 2017 at 2:57 AM, 'Simon Eigeldinger' via tesseract-ocr < tesseract-ocr@googlegroups.com> wrote: Hi all, I guess i need some help understanding that.

Re: [tesseract-ocr] new tessdata repos on github

2017-09-17 Thread &#x27;Simon Eigeldinger' via tesseract-ocr
most suitable On 17-Sep-2017 10:28 PM, "'Simon Eigeldinger' via tesseract-ocr" < tesseract-ocr@googlegroups.com> wrote: Hi ShreeDevi, Thanks for the info. So it seems for blind people who need the best accuracy they should use tessdata_best. Greetings, Simon Am 1

Re: [tesseract-ocr] install Tesseract on window

2017-10-12 Thread &#x27;Simon Eigeldinger' via tesseract-ocr
successfully download the tesseract-ocr-setup.exe file but when i click on it, it send me to the get apps from the store app . I don't know what does the get apps from the store app do? and what do i suppose to do in order to get the tesseract-ocr-setup app installed ? Many thanks, Tom --

Re: [tesseract-ocr] russian-old?

2017-10-18 Thread &#x27;Simon Eigeldinger' via tesseract-ocr
not provide for recognising old orthography and Church Slavonic glyphs? You know, i with dot, theta, yat, etc. Would it be very hard to add the 'rus_old' variant? Or, is it too difficult to roll-your-own the changed rus.tessdata on the local system? -Yury -- Simon Eigeldinger Fo

Re: [tesseract-ocr] russian-old?

2017-10-18 Thread &#x27;Simon Eigeldinger' via tesseract-ocr
I guess i have to correct myself. german fraktur is in the tessdata repo. Am 18.10.2017 um 21:34 schrieb 'Simon Eigeldinger' via tesseract-ocr: Hi Yury, Maybe the same happened to it like the german fraktur data. they seem to have not been updated for a long time and they have bee