DM Smith wrote:
> I have added a -n flag to osis2mod.

I'm going to add it to the other major importers (osis2gbs & imp2*) just 
as soon as I get things into a fairly stable state.

> This flag, to be enabled, requires osis2mod to be compiled with ICU  
> support enabled.
> 
> -n stands for normalized to NFC, the agreed upon UTF-8 encoding
> 
> When should this flag be used?
> 1) When the input is UTF-8
> and
> 2) It is not known to be NFC

First, I feel like there's really no reason NOT to perform 
normalization, provided that the input is UTF-8. Even if the input is 
already in NFC, it won't hurt anything to do it again. It will take 
extra time to compile the module, but I feel like it's better to be safe 
than sorry in this case.

Second, your comment about needing UTF-8 input makes me think we should 
go ahead and add encoding conversion to the importers as well, possibly 
with automatic charset detection.

--Chris

_______________________________________________
sword-devel mailing list: [email protected]
http://www.crosswire.org/mailman/listinfo/sword-devel
Instructions to unsubscribe/change your settings at above page

Reply via email to