On Wed, Oct 20, 2004 at 09:43:40PM +0000, Andreas Vox wrote:
> José Abílio Oliveira Matos <[EMAIL PROTECTED]> writes:
> 
> >    So this means more or less:
> >      + All Letters
> >      + All Marks
> >      + Digits
> >      + Characters ':' and '_' are allowed as name-start characters.
> >      + Characters '-' and '.' are allowed as name characters.
> 
> I read most of that out of the link to the XML declaration you sent earlier :-)
> 
> Only remaining question is: what are marks ???

  For an small introduction:
  http://www.unicode.org/glossary/

  Accents, for example are marks, any diacrytic that you can imagine is a
mark.
  I know that this is an introduction, but the best part for me is:
  
  Spacing Mark. A combining character that is not a nonspacing mark. 
  
  :-D
  I couldn't say it best. :-)

> And is it  ok if I limit myself to  ASCII for the time being?

  Yes, we are safe with ASCII, notice that standard allows more than that.
If we restict our self to ASCII and then when supporting unicode lift this
restriction, we are safe. It is simple and for 1.4 should be OK.

> BTW, I changed the translation scheme to:
> 
> : or comma or semicolon to .
> _ and space to -
> all others dropped
> if any mangling occured, append "-" + unique number
> if no mangling occured but the original ended in a digit, append "."
> 
> That should result in quite readable mangled names which are much
> shorter than the "Morse" code I proposed earlier.
> 
> Also the scheme is very similar to the one LyX already uses for filenames
> (yes, I just copied the good ideas :-) )

  You are learning. :-)
  Note that this is for sgml, for xml the letters + ':.-_' are allowed.
  
> /Andreas
> 

-- 
José Abílio Matos
LyX and docbook a perfect match. :-)

Reply via email to