Re: UCD in XML or in CSV? (was: Re: Unicode Digest, Vol 56, Issue 20)

2018-08-31 Thread Manuel Strehl via Unicode
To handle the UCD XML file a streaming parser like Expat is necessary. For codepoints.net I use that data to stuff everything in a MySQL database. If anyone is interested, the code for that is Open Source: https://github.com/Codepoints/unicode2mysql/ The example for handling the large XML file c

Re: CLDR (was: Private Use areas)

2018-08-31 Thread Manuel Strehl via Unicode
The XML files in these folders: https://unicode.org/repos/cldr/tags/latest/common/ But I agree. I spent an extreme amount of time to get somewhat used to cldr.unicode.org and and the data repo, and still I have no clue, where to find a concrete piece of information without digging into the site.

Re: emoji props in the ucdxml ?

2017-07-05 Thread Manuel Strehl via Unicode
>> but are there any plans to integrate the data in the ucdxml [2] >> (possibly as separate files) ? > > No. Not unless and until they become formally part of the UCD. In this context: Would it be possible for the maintainers of the TR #51 data files to add a symlink "latest" under unicode.org/Pu

Re: Comparing Raw Values of the Age Property

2017-05-23 Thread Manuel Strehl via Unicode
The rising standard in the world of web development (and others) is called »Semantic Versioning« [1], that many projects adhere to or sometimes must actively explain, why they don't. The structure of a »semantic version« string is a set of three integers, MAJOR.MINOR.PATCH, where the »sematics« pa