New submission from roskakori:

Currently Python includes a codec for EBCDIC international (cp500) but seems to 
be missing any further EBCDIC codecs. These encodings are widly used on 
mainframe platforms, popular in finance and insurance.

Descriptions of these codepages are available from IBM: 
<http://www-01.ibm.com/software/globalization/cp/cp_cpgid.html>. These 
descriptions also include mapping files although not in a format that can 
readily be processed by gencodec.py.

So instead I used the codecs included with Java 1.7 to generate mappings for 
gencodec.py. You can find them in the attached ZIP archive. As Java also runs 
on mainframe platforms, IBM should be interested in the Java codecs to be 
correct and complete.

The converter is available from <https://github.com/roskakori/CodecMapper>. To 
build the cp*.txt for EBCDIC, simply run:

$ git clone https://github.com/roskakori/CodecMapper.git
$ cd CodecMapper
$ ant ebcdic

IBM lists a large number of EBCDIC codepages, I only attached the ones listed 
in the German Wikipedia: <http://de.wikipedia.org/wiki/EBCDIC>. This also 
includes cp500 for comparison with your current cp500. And it lacks EDF03DRV 
because even Java does not support it.

Currently Java 1.7 supports 43 variants. To get a list of them, use:

$ ant list | grep -i ' ibm'

This would also fix issue 1097797: Encoding for Code Page 273 used by EBCDIC 
Germany Austria.

----------
components: Unicode
files: cp_ebcdic.zip
messages: 192214
nosy: ezio.melotti, lemburg, roskakori
priority: normal
severity: normal
status: open
title: Additional code pages for EBCDIC
type: enhancement
Added file: http://bugs.python.org/file30751/cp_ebcdic.zip

_______________________________________
Python tracker <rep...@bugs.python.org>
<http://bugs.python.org/issue18348>
_______________________________________
_______________________________________________
Python-bugs-list mailing list
Unsubscribe: 
http://mail.python.org/mailman/options/python-bugs-list/archive%40mail-archive.com

Reply via email to