Interim progress report.
I downloaded the file Mat_utf8.zip from Cyrille's link and unzipped the
contents to Mat_utf8-odt
I opened the .odt file using 7-Zip from the Windows Explorer context menu, and
extracted the file content.xml
I used Notepad++ plug-in XMLTools to pretty print the XML fil
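
For anyone who wants to repeat the extraction step without 7-Zip and Notepad++, here is a minimal sketch in Python. It relies on the fact that an .odt file is a ZIP package and assumes the main document part uses the standard ODF member name content.xml. The function name and the example path are hypothetical, not taken from the thread.

```python
import zipfile
from xml.dom import minidom


def extract_content_xml(odt_path, member="content.xml"):
    """Return the pretty-printed XML of one member of an ODT package.

    An .odt file is a ZIP archive; "content.xml" is the standard ODF
    name for the main document part (an assumption about this file).
    """
    with zipfile.ZipFile(odt_path) as package:
        raw = package.read(member)
    # minidom re-indents the XML, similar to a "pretty print" command.
    return minidom.parseString(raw).toprettyxml(indent="  ")


if __name__ == "__main__":
    # Hypothetical path; substitute the real location of the .odt file.
    print(extract_content_xml("Mat_utf8.odt")[:500])
```

Pretty printing only changes whitespace between elements, so the result is fine for reading and searching, but the original package should be kept as the source of record.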
Sorry I was occupied!
On 15/05/2019 02:34, Michael H wrote:
> I don't read Burmese and I don't know anyone who does. I was
> suggesting you contact whoever provided you the files and permission,
> and ask them if they can verify the unicode text reads correctly.
>
I did it, and the conversion
On 15/05/2019 11:46, David Haslam wrote:
> Interim progress report.
>
> I downloaded the file Mat_utf8.zip from Cyrille's link and unzipped the
> contents to Mat_utf8-odt
>
> I opened the .odt file using 7-Zip from the Windows Explorer context menu,
> and extracted the file content.xml
>
David, I don't have a Dropbox account, and I don't want to create one. Can you
upload it to https://framadrop.org/ instead? It's totally free, secure, and private.
Thank you.
On 15/05/2019 11:46, David Haslam wrote:
> Interim progress report.
>
> I downloaded the file Mat_utf8.zip from Cyrille's link and unzippe
Cyrille writes:
“My question is, can you do something with the txt file for adding the verse
number?”
Well - yes - that’s my intention.
It was after all an “interim progress report”. ;)
David
Sent from ProtonMail Mobile
On Wed, May 15, 2019 at 11:24, Cyrille wrote:
> On 15/05/2019 11:46, D
Yep - sure - later I can do that.
David
Sent from ProtonMail Mobile
On Wed, May 15, 2019 at 11:26, Cyrille wrote:
> David, I don't have a Dropbox account, and I don't want to create one. Can you
> upload it to https://framadrop.org/ instead? It's totally free, secure, and private.
> Thank you.
>
> On 15/05/2019
Observations: (continued)
4. In addition to the reported instances of the three anomalous characters
(È, Ø, ò) found after the font conversion, there are 6 instances of the string
"m;" that are also probably due to bugs in the converter.
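
A count like this can be reproduced with a few lines of Python. The sketch below is only an illustration: the function name is hypothetical, and it assumes the converted text has already been read into a string. Both the text and the markers are normalized to NFC so that a precomposed È and an E followed by a combining grave accent are counted alike.

```python
import unicodedata

# The three anomalous characters and the "m;" string reported above.
DEFAULT_MARKERS = ("\u00C8", "\u00D8", "\u00F2", "m;")  # È, Ø, ò, m;


def count_markers(text, markers=DEFAULT_MARKERS):
    """Count how often each suspect marker occurs in the converted text."""
    normalized = unicodedata.normalize("NFC", text)
    return {
        marker: normalized.count(unicodedata.normalize("NFC", marker))
        for marker in markers
    }


if __name__ == "__main__":
    # Made-up sample string, not taken from the real file.
    print(count_markers("abc m; def \u00C8 ghi m;"))
```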
Best regards,
David
Sent with [ProtonMail](https://protonmail
Observations: (continued)
5. The string "Kd;" also looks anomalous. It's found only once in
ကိုယ်တော်၏ဦးခေါင်းတော်အပေါ်၌ လည်း ဤသူသည်ကား ဂျူးလူမျ Kd;တို့၏ဘုရင်၊
6. It's evident from the PDF file that the text is paragraphed with indented
first lines. See
https://www.dropbox.com/s/do5e675i19xfomf/
The attachment contains a counted list of Myanmar words containing a font
conversion error.
NB. We need to match these words with what they are in the legacy font.
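
As a rough illustration of how such a counted word list could be produced, here is a minimal Python sketch. It is not the method used for the attachment. It assumes that words are separated by whitespace (which Myanmar text does not guarantee) and that a word is suspect when it mixes Myanmar characters (U+1000 to U+109F) with Latin letters. The function names are hypothetical.

```python
from collections import Counter


def is_myanmar(ch):
    """True for code points in the main Myanmar block (U+1000..U+109F)."""
    return "\u1000" <= ch <= "\u109F"


def is_latin_letter(ch):
    """True for letters in the Basic Latin to Latin Extended-B range."""
    return ch.isalpha() and ord(ch) < 0x0250


def count_suspect_words(text):
    """Count whitespace-separated words that mix Myanmar and Latin letters."""
    suspects = Counter()
    for word in text.split():
        has_myanmar = any(is_myanmar(c) for c in word)
        has_latin = any(is_latin_letter(c) for c in word)
        if has_myanmar and has_latin:
            suspects[word] += 1
    return suspects


if __name__ == "__main__":
    # Made-up sample; replace with the converted UTF-8 text.
    sample = "\u1000\u102C Kd;\u1010\u102D\u102F m;\u1019 Kd;\u1010\u102D\u102F"
    for word, count in count_suspect_words(sample).most_common():
        print(count, word)
```

A residue such as "m;" that is separated from its neighbours by spaces would not be caught by this check and would need a separate search.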
This issue should be discussed with the current maintainer of the SIL TECkit
converter, whoever that may be.
It may be worthwhile a
I've been working long hours and emailing in my break time. David has the
basics of converting to VPL.
I would then put the entire work into one column of a spreadsheet.
Then, in other columns, insert a list of book/chapter/verse references in order.
The BCV and versetext columns should align and can be verifie
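
The same two-column alignment can be sketched in Python instead of a spreadsheet. This is only an illustration under stated assumptions: the "Book chapter:verse text" line layout is one common way to write verse-per-line files, the function names are hypothetical, and the verse counts in the example are placeholders, not the real counts for Matthew.

```python
def build_references(book, verses_per_chapter):
    """List 'Book chapter:verse' references in canonical order."""
    return [
        f"{book} {chapter}:{verse}"
        for chapter, count in verses_per_chapter.items()
        for verse in range(1, count + 1)
    ]


def to_vpl(references, verse_texts):
    """Pair each reference with its verse text, one verse per line."""
    if len(references) != len(verse_texts):
        # A mismatch means a verse was split or merged wrongly upstream.
        raise ValueError(
            f"{len(references)} references but {len(verse_texts)} verse texts"
        )
    return [f"{ref} {text}" for ref, text in zip(references, verse_texts)]


if __name__ == "__main__":
    # Placeholder verse counts and texts, for illustration only.
    refs = build_references("Matt", {1: 2, 2: 1})
    for line in to_vpl(refs, ["first verse", "second verse", "third verse"]):
        print(line)
```

The length check plays the role of the visual verification step: if the two columns do not have the same number of rows, the alignment has gone wrong somewhere and needs a manual look.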
Michael’s description matches how I imagined the method during my waking
moments this morning. :)
David
Sent from ProtonMail Mobile
On Wed, May 15, 2019 at 17:33, Michael H wrote:
> I've been working long hours and emailing in my break time. David has the
> basics of converting to VPL.
>
>
I haven't understood everything yet, but I trust you. If you have the
patience to explain it to me, I'd like to learn :)
What I don't understand is how you can find the marker for each verse and
chapter in the UTF-8 text. What is the marker in question?
On 15/05/2019 19:03, David Haslam wrote
Each of the last 1 or 2 characters of each verse is a regular Myanmar
punctuation mark.
We need to be careful how we apply this. There may well be some exceptions.
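
To make the idea concrete, here is a minimal Python sketch of splitting text at runs of one or two Myanmar punctuation marks. The message does not say which marks end a verse, so the default below (U+104B MYANMAR SIGN SECTION) is an assumption, and the function name is hypothetical. U+104A MYANMAR SIGN LITTLE SECTION can be added to the terminators, but it also occurs inside sentences, which is one source of the exceptions mentioned above.

```python
import re

SECTION = "\u104B"         # MYANMAR SIGN SECTION
LITTLE_SECTION = "\u104A"  # MYANMAR SIGN LITTLE SECTION


def split_candidate_verses(text, terminators=SECTION):
    """Split text after each run of one or two terminator characters.

    The segments are only candidate verses and still need checking.
    """
    char_class = "[" + re.escape(terminators) + "]"
    pattern = re.compile(rf"(.+?{char_class}{{1,2}})\s*", re.DOTALL)
    segments = []
    end = 0
    for match in pattern.finditer(text):
        segments.append(match.group(1).strip())
        end = match.end()
    # Keep any trailing text that has no closing punctuation mark.
    remainder = text[end:].strip()
    if remainder:
        segments.append(remainder)
    return segments


if __name__ == "__main__":
    # Made-up Latin placeholders stand in for the Myanmar verse text.
    sample = "ab" + SECTION + " cd" + LITTLE_SECTION + SECTION + " ef"
    print(split_candidate_verses(sample))
```

Comparing the number of segments with the expected number of verses per chapter is a quick way to spot places where the heuristic fails.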
Windows users should install BabelPad. This free Unicode text editor is highly
recommended.
http://www.babelstone.co.uk/Software/
On 15/05/2019 19:18, David Haslam wrote:
> Each of the last 1 or 2 characters of each verse is a regular Myanmar
> punctuation mark.
>
Do you know which mark?
> We need to be careful how we apply this. There may well be
> some exceptions.
>
> Windows users should install BabelPad. This free