Package: unicode-data Version: 10.0.0-3 Severity: wishlist http://www.unicode.org/Public/ has a lot of data.
Data under emoji is included in the unicode-data package but data under cldr is not. It will be nice cldr data is also available. Background: I actually needed emoji data and some parts of cldr data under the /usr/share/unocode directory to update ibus program. I understand emoji is mentioned in but cldr is not mentioned in http://www.unicode.org/versions/Unicode10.0.0/ But http://www.unicode.org/ has a prominent link to http://cldr.unicode.org/index so I do expect this data is also included or coordinated with this package. Fedora seems to create another rpm https://github.com/fujiwarat/cldr-emoji-annotation which looks like coming from the unicode cldr data but not all of them. He installs part of idata from the zip file under /usr/share/unocode/cldr/common/annotations /usr/share/unocode/cldr/common/annotationsDerived The data used is http://www.unicode.org/Public/cldr/31.0.1/cldr-common-31.0.1.zip http://www.unicode.org/Public/cldr/31.0.1/core.zip (These seem to be the same file) It will be nice you also include or package these zip files in http://www.unicode.org/Public/cldr/31.0.1/ or its new version. Anyway, this data archive is huge. Unless careful coordination is done, we may end up lots of duplicated data. I am not quite sure how these should be packaged for Debian. I guess unicode-data maintainer has a better idea, So i am filing this bug report. Osamu -- System Information: Debian Release: buster/sid APT prefers testing APT policy: (500, 'testing'), (500, 'stable'), (90, 'unstable') Architecture: amd64 (x86_64) Kernel: Linux 4.12.0-2-amd64 (SMP w/4 CPU cores) Locale: LANG=en_US.UTF-8, LC_CTYPE=en_US.UTF-8 (charmap=UTF-8), LANGUAGE=en_US:en (charmap=UTF-8) Shell: /bin/sh linked to /bin/dash Init: systemd (via /run/systemd/system) -- no debconf information