Chris, So is the problem that these languages generally have U+002D (hyphen/minus) originating from an easy to type workaround for normal keyboards (even going back to mechanical typewriters!), and that everyone got used to doing this?
If so, what would the SWORD API do with U+2010 (General punctuation block - HYPHEN), if all instances where U+002D had been used as a letter were replaced by U+2010 ? Or is there an even more suitable alternative codepoint somewhere in another block of the BMP ? PS. For parsing booknames, what does the API do with figure dash U+2012 and ndash U+2013 ? David -- View this message in context: http://sword-dev.350566.n4.nabble.com/Hyphens-in-book-names-tp2719769p2720350.html Sent from the SWORD Dev mailing list archive at Nabble.com. _______________________________________________ sword-devel mailing list: sword-devel@crosswire.org http://www.crosswire.org/mailman/listinfo/sword-devel Instructions to unsubscribe/change your settings at above page