Package: ocrodjvu
Version: 0.4.2-1
Severity: normal

On Mon, 01 Mar 2010 jsb...@mimuw.edu.pl wrote:
> The input file is temporarily available at
> http://fleksem.klf.uw.edu.pl/~jsbien/tmp/in.djvu.

Now I get:

----------------------------------------------------------------------------------------------
ocrodjvu --render all --engine cuneiform --language pol --clear-text -o out.djvu in.djvu
Processing 'in.djvu':
- Page #1
- Page #2
Exception in thread Thread-2:
Traceback (most recent call last):
  File "/usr/lib/python2.5/threading.py", line 486, in __bootstrap_inner
    self.run()
  File "/usr/lib/python2.5/threading.py", line 446, in run
    self.__target(*self.__args, **self.__kwargs)
  File "/usr/share/ocrodjvu/lib/_ocrodjvu.py", line 443, in page_thread
    result = self.process_page(page)
  File "/usr/share/ocrodjvu/lib/_ocrodjvu.py", line 423, in process_page
    page_size=size
  File "/usr/share/ocrodjvu/lib/hocr.py", line 457, in extract_text
    scan_result = scan(doc.find('/body'), settings)
  File "/usr/share/ocrodjvu/lib/hocr.py", line 419, in scan
    _scan(node, buffer, BBox(), settings)
  File "/usr/share/ocrodjvu/lib/hocr.py", line 394, in _scan
    look_down(result, bbox)
  File "/usr/share/ocrodjvu/lib/hocr.py", line 342, in look_down
    _scan(child, buffer, parent_bbox, settings)
  File "/usr/share/ocrodjvu/lib/hocr.py", line 407, in _scan
    result[:] = _replace_cuneiform08_paragraph(result[:], settings)
  File "/usr/share/ocrodjvu/lib/hocr.py", line 234, in _replace_cuneiform08_paragraph
    raise ValueError
ValueError
----------------------------------------------------------------------------------------------

JSB

-- System Information:
Debian Release: squeeze/sid
  APT prefers testing
  APT policy: (500, 'testing')
Architecture: i386 (i686)

Kernel: Linux 2.6.32-trunk-486
Locale: LANG=en_US.UTF-8, LC_CTYPE=en_US.UTF-8 (charmap=UTF-8)
Shell: /bin/sh linked to /bin/dash

Versions of packages ocrodjvu depends on:
ii  djvulibre-bin    3.5.22-8     Utilities for the DjVu image forma
ii  python           2.5.4-9      An interactive high-level object-o
ii  python-argparse  1.0.1-1      optparse-inspired command-line par
ii  python-djvu      0.1.17-1     Python support for the DjVu image
ii  python-lxml      2.2.4-1+b1   pythonic binding for the libxml2 a
ii  python-support   1.0.6        automated rebuilding support for P

Versions of packages ocrodjvu recommends:
ii  ocropus          0.3.1-2      document analysis and OCR system
ii  python-pyicu     0.9-2        Python extension wrapping the ICU
ii  tesseract-ocr    2.04-2       Command line OCR tool

Versions of packages ocrodjvu suggests:
ii  cuneiform        0.7.0+dfsg-5 multi-language OCR system

-- no debconf information