Re: Suggestions for tesseract

2022-01-21 Thread Siard
On Thu, 20 Jan 2022, Curt wrote: > On 2022-01-20, Siard wrote: > > Bob Bernstein wrote: > > > Executing 'apt-cache search tesseract' brings up a multitude of > > > packages. > > > > > > My need is simple enough, I think: I like to scan

Re: Suggestions for tesseract

2022-01-20 Thread Curt
On 2022-01-20, Siard wrote: > Bob Bernstein wrote: >> Executing 'apt-cache search tesseract' brings up a multitude of >> packages. >> >> My need is simple enough, I think: I like to scan (using an >> Epson scanner) pages of printed books -- almost on

Re: Suggestions for tesseract

2022-01-20 Thread Siard
Bob Bernstein wrote: > Executing 'apt-cache search tesseract' brings up a multitude of > packages. > > My need is simple enough, I think: I like to scan (using an > Epson scanner) pages of printed books -- almost one hundred per > cent text -- and then use OCR to pr

Suggestions for tesseract

2022-01-20 Thread Bob Bernstein
Executing 'apt-cache search tesseract' brings up a multitude of packages. My need is simple enough, I think: I like to scan (using an Epson scanner) pages of printed books -- almost one hundred per cent text -- and then use OCR to produce pages from which I can copy 'n paste s

Re: xsane & tesseract

2017-08-26 Thread Siard
Joe Pfeiffer wrote: > I scanned the document to ppm files, sent them to tesseract, put the > output of tesseract into a .txt file, and cleaned up from there. You could try gimagereader, a frontend for tesseract, making this process somewhat easier. Among others, it uses a spell checker, so

Re: xsane & tesseract

2017-08-25 Thread Joe Pfeiffer
Doug writes: > On 08/25/2017 08:31 PM, Stephen Grant Brown wrote: > > Hi All, > How do I set up xsane to use the tesseract OCR engine? > I see gocr under preferences->setup->ocr. > Yours Sincerely > Stephen Grant Brown. > > Unless it has been vastly imp

Re: xsane & tesseract

2017-08-25 Thread Doug
On 08/25/2017 08:31 PM, Stephen Grant Brown wrote: Hi All, How do I set up xsane to use the tesseract OCR engine? I see gocr under preferences->setup->ocr. Yours Sincerely Stephen Grant Brown. Unless it has been vastly improved, you might as well copy the document by hand! Finding and

xsane & tesseract

2017-08-25 Thread Stephen Grant Brown
Hi All, How do I set up xsane to use the tesseract OCR engine? I see gocr under preferences->setup->ocr. Yours Sincerely Stephen Grant Brown.

Re: Tesseract...

2011-07-15 Thread Camaleón
On Fri, 15 Jul 2011 22:00:18 +0700, Sthu Deus wrote: >>> Who needs old files when new arrive? :) > >>Well, it can be years of work that now cannot render with the new >>version... you will get very angry birds (oops... sorry, I mean "users", >>angry users) if you update the package to the last ve

Re: Tesseract...

2011-07-15 Thread Sthu Deus
Thank You for Your time and answer, Camaleón: >Did you try to compile tesseract for your Debian version? You can do >that on the virtual machine, just to see how it goes :-? OK, I'll give it a try. >> Who needs old files when new arrive? :) >Well, it can be years of

Re: Tesseract...

2011-07-15 Thread Camaleón
rectories of several repos > like testing and stable, then all the packages are mixed - You do not > know from the dir.s architecture which package belongs to which repo in > the case. Hum... not in Debian and many other distributions, look: http://ftp.de.debian.org/debian/pool/main/t

Re: Tesseract...

2011-07-14 Thread Sthu Deus
esting, sid... >I would not mix Ubuntu packages into a Debian installation. Have you >considered compiling the package from the Tesseract site? I thought to install Ubuntu in KVM and go on in case no luck w/ tesseract 3 in Debian. >Ah, I've seen that you already posted into backport

Re: Tesseract...

2011-07-08 Thread Camaleón
On 2011-07-07 at 13:50 -0700, sthu deus wrote: (resending to the list) > On 07/07/2011, Camaleón wrote: > > On Thu, 07 Jul 2011 17:02:40 +0700, Sthu Deus wrote: > > > >> Here are: > >> > >> https://launchpad.net/~nutznboltz/+archive/tesseract

Re: Tesseract...

2011-07-07 Thread Camaleón
On Thu, 07 Jul 2011 17:02:40 +0700, Sthu Deus wrote: > Here are: > > https://launchpad.net/~nutznboltz/+archive/tesseract/+sourcepub/1729019/+listing-archive-extra > > plenty of missing in Debian 6 language packages w/ the updated program > itself. > > Can't it be

Re: Tesseract...

2011-07-07 Thread Hugo Vanwoerkom
Sthu Deus wrote: Good time of the day. Here are: https://launchpad.net/~nutznboltz/+archive/tesseract/+sourcepub/1729019/+listing-archive-extra plenty of missing in Debian 6 language packages w/ the updated program itself. Can't it be easily backported to D6 once it is in Ubuntu? I

Tesseract...

2011-07-07 Thread Sthu Deus
Good time of the day. Here are: https://launchpad.net/~nutznboltz/+archive/tesseract/+sourcepub/1729019/+listing-archive-extra plenty of missing in Debian 6 language packages w/ the updated program itself. Can't it be easily backported to D6 once it is in Ubuntu? I would like to do it m

Re: tesseract: ocr that works

2008-12-29 Thread Rainer Kluge
Hugo Vanwoerkom wrote: Hi, Recently there was a post mentioning tesseract. Turns out that is an award winning opensource OCR that works! Hugo I use it with the gscan2pdf frontend and it works perfectly (at least for documents in German)

Re: tesseract: ocr that works

2008-12-28 Thread Anthony Campbell
On 28 Dec 2008, andmalc wrote: > On Dec 28, 5:10 am, Anthony Campbell wrote: > > On 21 Dec 2008, Hugo Vanwoerkom wrote: > > > [snip] > > > Yes, tesseract does work well. Here, xsane gives depth 24, but conversion > > to depth 8 is neither possible nor nec

Re: tesseract: ocr that works

2008-12-28 Thread andmalc
On Dec 28, 5:10 am, Anthony Campbell wrote: > On 21 Dec 2008, Hugo Vanwoerkom wrote: > [snip] > Yes, tesseract does work well. Here, xsane gives depth 24, but conversion > to depth 8 is neither possible nor necessary. Following the docs, I did There is an option at the top of the

Re: tesseract: ocr that works

2008-12-28 Thread Anthony Campbell
On 21 Dec 2008, Hugo Vanwoerkom wrote: > Hi, > > Recently there was a post mentioning tesseract. > > Turns out that is an award winning opensource OCR that works! > > I tried it out: > > 1. apt-get install tesseract-ocr > 2. apt-get install tesseract-ocr-eng > 3

Re: tesseract: ocr that works

2008-12-27 Thread Bryan Bishop
. 1200 DPI made things _worse_ not better, > possibly because of noise. This was on Fedora, so maybe it was in fact > tesseract. Back when I first got access to the university scientific publication network, I started to get hungry for an OCR tool to do bibliographies and references,

Re: tesseract: ocr that works

2008-12-27 Thread Dotan Cohen
, so maybe it was in fact tesseract. -- Dotan Cohen http://what-is-what.com http://gibberish.co.il א-ב-ג-ד-ה-ו-ז-ח-ט-י-ך-כ-ל-ם-מ-ן-נ-ס-ע-ף-פ-ץ-צ-ק-ר-ש-ת ا-ب-ت-ث-ج-ح-خ-د-ذ-ر-ز-س-ش-ص-ض-ط-ظ-ع-غ-ف-ق-ك-ل-م-ن-ه‍-و-ي А-Б-В-Г-Д-Е-Ё-Ж-З-И-Й-К-Л-М-Н-О-П-Р-С-Т-У-Ф-Х-Ц-Ч-Ш-Щ-Ъ-Ы-Ь-Э-Ю-Я а-б-в-г-д-е-ё-ж-з-и-

tesseract: ocr that works

2008-12-27 Thread Hugo Vanwoerkom
Hi, Recently there was a post mentioning tesseract. Turns out that is an award winning opensource OCR that works! I tried it out:
1. apt-get install tesseract-ocr
2. apt-get install tesseract-ocr-eng
3. use xsane to scan a page at dpi 300 and save as .tif
4. run: convert foo.tif -depth 8 foo1
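
The steps in the post above might be wrapped in a small shell function along the following lines. This is only a sketch under stated assumptions: the post is cut off in step 4, so the output name used here (`<base>-8bit.tif`), the helper names `run` and `ocr_page`, and the `DRY_RUN` switch are all hypothetical and not taken from the original message. The final `tesseract` call uses the usual `tesseract IMAGE OUTPUTBASE -l LANG` form, which writes its text to `OUTPUTBASE.txt`.

```shell
#!/bin/sh
# Sketch only: names below are illustrative, not from the original post.
# Prerequisites (as root), per steps 1 and 2 of the post:
#   apt-get install tesseract-ocr tesseract-ocr-eng
# 'convert' is provided by ImageMagick.

# Run a command, or just print it when DRY_RUN=1 (hypothetical helper).
run() {
    if [ "${DRY_RUN:-0}" = "1" ]; then
        echo "$*"
    else
        "$@"
    fi
}

# OCR one scanned page (e.g. a .tif saved from xsane at 300 dpi).
ocr_page() {
    in=$1
    base=${in%.*}                                   # strip the extension
    run convert "$in" -depth 8 "${base}-8bit.tif"   # reduce to 8-bit depth
    run tesseract "${base}-8bit.tif" "$base" -l eng # writes ${base}.txt
}
```

After sourcing the file, `ocr_page foo.tif` would convert and recognise one page; setting `DRY_RUN=1` first shows the commands without running them. Replies in the thread suggest the depth conversion is not always needed, so that line may be optional depending on how the scan was saved.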