Tom Gillespie <tgb...@gmail.com> writes:

>> We might probably generalize to
>> PRE = Zs Zl Pc Pd Ps Pi ' "
>> POST = Zs Zl Pc Pd Pe Pf . ; : ! ? ' " \ [
>
> If this works I think it is reasonable. We might want to
> specify what to do in cases where an org implementation
> might not fully support unicode,

Just fall back to ASCII subset? If the implementation does not support
unicode, it probably cannot properly work with UTF-encoded documents
anyway.

> ...and might want to do a
> review of related issues in syntax with respect to ascii
> vs unicode, because iirc there is some ambiguity in
> the current syntax doc.
> For example, I'm pretty sure that I'm mixing and matching
> unicode and ascii whitespace in the tokenizer I have in Racket.

Feel free to open new bug reports about such ambiguities.

-- 
Ihor Radchenko // yantar92,
Org mode contributor,
Learn more about Org mode at <https://orgmode.org/>.
Support Org development at <https://liberapay.com/org-mode>,
or support my work at <https://liberapay.com/yantar92>
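
To make the PRE/POST proposal and the ASCII fallback concrete, here is a
minimal Python sketch. It is only an illustration under stated assumptions,
not how Org itself does the check: the names is_pre, is_post and the
unicode_aware flag are hypothetical, and the category sets are copied from
the proposal quoted above. Beginning and end of line are not handled here.

```python
import unicodedata

# Unicode general categories taken from the proposal quoted in this thread.
PRE_CATEGORIES = {"Zs", "Zl", "Pc", "Pd", "Ps", "Pi"}
POST_CATEGORIES = {"Zs", "Zl", "Pc", "Pd", "Pe", "Pf"}

# Literal characters listed in the proposal in addition to the categories.
PRE_EXTRA = {"'", '"'}
POST_EXTRA = {".", ";", ":", "!", "?", "'", '"', "\\", "["}


def _matches(ch, categories, extra, unicode_aware):
    """Return True if the single character ch belongs to the given set."""
    # Assumed fallback: an implementation without Unicode support only
    # ever recognizes the ASCII subset of each set.
    if not unicode_aware and ord(ch) > 127:
        return False
    return ch in extra or unicodedata.category(ch) in categories


def is_pre(ch, unicode_aware=True):
    """Hypothetical check: may ch precede an emphasis marker?"""
    return _matches(ch, PRE_CATEGORIES, PRE_EXTRA, unicode_aware)


def is_post(ch, unicode_aware=True):
    """Hypothetical check: may ch follow an emphasis marker?"""
    return _matches(ch, POST_CATEGORIES, POST_EXTRA, unicode_aware)
```

With unicode_aware=False a character such as U+00AB (left-pointing double
angle quotation mark, category Pi) is no longer accepted as PRE, while
ASCII characters such as space, "-" and "(" behave the same in both modes.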