Package: wnpp
Severity: wishlist

* Package name    : tesseract-ocr
  Version         : 1.0.1
  Upstream Author : Ray Smith <[EMAIL PROTECTED]>
* URL             : http://sourceforge.net/projects/tesseract-ocr/
* License         : Apache License, Version 2.0
  Programming Lang: C++
  Description     : console-based optical character recognition (OCR)

A commercial-quality OCR engine originally developed at HP between 1985
and 1995. In 1995, this engine was among the top 3 evaluated by UNLV. It
was open-sourced by HP and UNLV in 2005.

(Long description taken from the SourceForge project page; something
better will need to be written.)

Whoever packages tesseract needs to be prepared to address a couple of
serious problems:

 * The third-party source found in aspirin/ is non-free; it will either
   need to be stripped out or be relicensed by its author.
 * The tesseract binary must reside in the same directory as the
   tessdata directory, which must be writable.
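
For the first problem, stripping the directory out of the upstream
tarball could look roughly like the Python sketch below. It is only an
illustration: the function name strip_directory is made up, and it
assumes that the non-free code lives entirely under a directory named
aspirin somewhere in the archive, which I have not verified against the
actual tarball layout.

```python
import tarfile


def strip_directory(src_path, dst_path, unwanted="aspirin"):
    """Copy a tar archive, omitting every member whose path contains a
    component named `unwanted`. Returns the names of the omitted members.

    Assumption: the non-free code sits entirely under that directory.
    """
    removed = []
    with tarfile.open(src_path, "r:*") as src, \
            tarfile.open(dst_path, "w:gz") as dst:
        for member in src:
            # Match on whole path components, so "aspirin/" is dropped
            # but a file such as "aspirin_notes.txt" is kept.
            if unwanted in member.name.split("/"):
                removed.append(member.name)
                continue
            # Only regular files carry data; directories and links are
            # copied from their header alone.
            fileobj = src.extractfile(member) if member.isfile() else None
            dst.addfile(member, fileobj)
    return removed
```

Calling strip_directory("upstream.tar.gz", "stripped.tar.gz") (both
file names are placeholders) writes the repacked archive and returns
the list of members that were left out, which is handy for checking
that nothing else was dropped by accident. Whether the rest of the
source still builds without that directory is a separate question.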

Nonetheless, the accuracy of tesseract is better than any of the other
free alternatives I've tried; I think it would make a nice addition to 
the archive.

-- System Information:
Debian Release: testing/unstable
  APT prefers unstable
  APT policy: (700, 'unstable')
Architecture: i386 (i686)
Shell:  /bin/sh linked to /bin/bash
Kernel: Linux 2.6.17-2-686
Locale: LANG=en_US.UTF-8, LC_CTYPE=en_US.UTF-8 (charmap=UTF-8)

-- 
Eric Evans
[EMAIL PROTECTED]
