Package: ocrfeeder Version: 0.8.1-2 Severity: normal Exporting is aborted if tesseract identifies any unicode character. Sample output:
Traceback (most recent call last): File "/usr/lib/python2.7/dist-packages/ocrfeeder/studio/studioBuilder.py", line 298, in exportDialog self.EXPORT_FORMATS[format][1]) File "/usr/lib/python2.7/dist-packages/ocrfeeder/studio/studioBuilder.py", line 281, in exportToFormat name) File "/usr/lib/python2.7/dist-packages/ocrfeeder/studio/widgetModeler.py", line 606, in exportPagesWithGenerator document_generator.save() File "/usr/lib/python2.7/dist-packages/ocrfeeder/feeder/documentGeneration.py", line 221, in save file.write(pages[i]) UnicodeEncodeError: 'ascii' codec can't encode character u'\u2014' in position 1263: ordinal not in range(128) -- System Information: Debian Release: stretch/sid APT prefers unstable APT policy: (990, 'unstable'), (1, 'experimental') Architecture: amd64 (x86_64) Kernel: Linux 4.4.0-rc8-amd64 (SMP w/4 CPU cores) Locale: LANG=en_US.UTF-8, LC_CTYPE=en_US.UTF-8 (charmap=UTF-8) Shell: /bin/sh linked to /bin/bash Init: systemd (via /run/systemd/system) Versions of packages ocrfeeder depends on: ii cuneiform 1.1.0+dfsg-5+b2 ii ghostscript 9.16~dfsg-2 ii gir1.2-goocanvas-2.0 2.0.2-2 ii gir1.2-gtk-3.0 3.18.6-1 ii gir1.2-gtkspell3-3.0 3.0.7-2 ii gocr 0.49-2 ii iso-codes 3.64-1 ii ocrad 0.24-1 ii python 2.7.11-1 ii python-enchant 1.6.6-2 ii python-gi 3.18.2-2 ii python-lxml 3.5.0-1 ii python-pil 3.0.0-1 ii python-reportlab 3.2.0-1 ii python-sane 2.8.2-1+b1 ii tesseract-ocr 3.04.00-5+b1 Versions of packages ocrfeeder recommends: ii unpaper 6.1-1 ii yelp 3.16.1-1 ocrfeeder suggests no packages. -- no debconf information