On Thursday, April 29, 2021 09:03:59 AM Albretch Mueller wrote: > > What is "alpha-offset format"? > > we, corpora research kinds of folks, need to process thousand of > files as other people process bytes. UTF8 was basically an > Americanizierung of alle alphabets. UTF is great to describe an > alphabet but not for text files. > > UTF8 turned all files into streams not good for questions such as > what is the charatcer/string sequence starting on the nth addressable > unit of a file ... > > Doing that with utF8 is from way too complicated to impossible. Also > alpha offset nicely splits the files segments into its different > parts: ALPHABETICAL text, js, css, ...
Ok, but what does it look like? (What is the format?) A google search shows only links to this thread and some page about its relevance to aiming a telescope -- I strongly suspect that is not relevant to your use case.