On Fri, Dec 04, 2009 at 01:03:23AM +0100, Volker Armin Hemmann wrote: > look at my name, ok? > > Just dropping the Umlaut is wrong. No if, but, maybe. It is wrong. Error. > Mistake. Fail. If you can not enter ?, ? or ?, you must transform them to ae, > oe or ue.
I'd like to find a program which would do that! Seriously. But anyway, the purpose of this is not to transform names so our antique ASCII-7 computers can store them, but to eliminate redundant records. For instance, we get data from vendors for all cities and states, geolocation data, which has its own redundancies, such as both FORT WORTH and FT WORTH, or SAINT LOUIS and ST LOUIS. But we have to convert to upper case, get rid of punctuation, get rid of extra white space, etc, and all that is independent of the locale. I want to do the same for unicode. If enough Europeans are in the habit of taking shortcuts and skipping umlauts and accents and cedilla and tildes, then I'd like to standardize the data for lookup. This has nothing to do with converting people's names for storage. We don't even store the transformed place name. -- ... _._. ._ ._. . _._. ._. ___ .__ ._. . .__. ._ .. ._. Felix Finch: scarecrow repairman & rocket surgeon / fe...@crowfix.com GPG = E987 4493 C860 246C 3B1E 6477 7838 76E9 182E 8151 ITAR license #4933 I've found a solution to Fermat's Last Theorem but I see I've run out of room o