Re: [sword-devel] Bible in Myanmar

2019-05-14 Thread David Haslam
If the language is ordinary Burmese, we can use Google Translate to check that any particular verse has the text we should expect for the initially assigned reference. Machine translation is often adequate for such a limited-purpose task. We can also check against how PageMaker rendered it to

Re: [sword-devel] Bible in Myanmar

2019-05-14 Thread Michael H
If the text doesn't come from an active team with people who read Burmese: I am on another group list which has relationships with people working on minority languages related to Burmese, but not Myanmarese. I can ask if anyone can verify the Unicode conversion works there if that's the only option

Re: [sword-devel] Bible in Myanmar

2019-05-14 Thread Michael H
I don't read Burmese and I don't know anyone who does. I was suggesting you contact whoever provided you the files and permission, and ask them if they can verify the Unicode text reads correctly. To a native speaker, the text will quickly be recognized as readable, or misspelled (letters are out

Re: [sword-devel] Bible in Myanmar

2019-05-14 Thread Cyrille
On 14/05/2019 22:48, Michael H wrote: > You should be able to configure a regex search to find the verse > boundaries. > > Once you have verse boundaries, if you configure the text into Verse > per line it should be possible to assign each row a chapter and verse > number from a reference. T

Re: [sword-devel] Bible in Myanmar

2019-05-14 Thread Cyrille
On 14/05/2019 22:55, Cyrille wrote: > > > On 14/05/2019 22:45, Michael H wrote: >> Cyrille, did you start from the PDF or the PageMaker file? > PMaker >> Either way, you should send a snippet to your source and validate the >> words are still readable. As small as 30 words should be eno

Re: [sword-devel] Bible in Myanmar

2019-05-14 Thread Michael H
You should be able to configure a regex search to find the verse boundaries. Once you have verse boundaries, if you arrange the text one verse per line, it should be possible to assign each row a chapter and verse number from a reference. That is, the 3341st verse in the New Testament is usually J
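
The numbering step described above can be sketched roughly as follows. This is a minimal illustration, not a script used in the thread: it assumes the text has already been split into one verse per line, and the `VERSE_COUNTS` table, the `Matt` book label, and both function names are hypothetical. The table covers only the first three chapters of Matthew; a real run would need the full verse counts for the versification the translation follows.

```python
# Hypothetical, deliberately truncated versification table:
# number of verses in each chapter, in canonical order.
VERSE_COUNTS = {
    "Matt": [25, 23, 17],  # chapters 1-3 only, for illustration
}


def reference_sequence(verse_counts):
    """Yield (book, chapter, verse) tuples in canonical order."""
    for book, chapters in verse_counts.items():
        for chapter_no, n_verses in enumerate(chapters, start=1):
            for verse_no in range(1, n_verses + 1):
                yield (book, chapter_no, verse_no)


def assign_references(lines, verse_counts):
    """Pair each verse-per-line row with its expected reference.

    Raises ValueError when the row count does not match the table,
    which usually means a verse boundary was missed or split twice.
    """
    refs = list(reference_sequence(verse_counts))
    if len(lines) != len(refs):
        raise ValueError(
            f"expected {len(refs)} verses, found {len(lines)} lines"
        )
    return [
        (f"{book} {chapter}:{verse}", text)
        for (book, chapter, verse), text in zip(refs, lines)
    ]


if __name__ == "__main__":
    # Placeholder rows standing in for the converted Burmese text.
    rows = [f"verse text {i}" for i in range(65)]
    for ref, text in assign_references(rows, VERSE_COUNTS)[:3]:
        print(ref, text)
```

Because the pairing is purely positional, a single missed boundary shifts every later reference, so the count check is only a first safeguard; spot-checking a few verses against a known translation is still needed.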

Re: [sword-devel] Bible in Myanmar

2019-05-14 Thread Michael H
Cyrille, did you start from the PDF or the PageMaker file? Either way, you should send a snippet to your source and validate the words are still readable. As small as 30 words should be enough. On Tue, May 14, 2019 at 8:09 AM Cyrille wrote: > I am sending my message again because it was too big. > > Th

Re: [sword-devel] Bible in Myanmar

2019-05-14 Thread Cyrille
On 14/05/2019 22:26, David Haslam wrote: > If Michael’s observations are anything to go by, then maybe I can > script the recovery of chapter & verse tags. > > We shall see > > Even if I’m not immediately successful - valuable lessons can be > learned in the attempt. Very well, I'll w

Re: [sword-devel] Bible in Myanmar

2019-05-14 Thread David Haslam
If Michael’s observations are anything to go by, then maybe I can script the recovery of chapter & verse tags. We shall see Even if I’m not immediately successful - valuable lessons can be learned in the attempt. David Sent from ProtonMail Mobile On Tue, May 14, 2019 at 21:21, Cyrille

Re: [sword-devel] Bible in Myanmar

2019-05-14 Thread Cyrille
OK, thank you! I already have all the text in Unicode, but without the verse numbers and chapters... I began manually... On 14/05/2019 22:17, David Haslam wrote: > Hi Cyrille > > If I can find the time tomorrow or later, I’ll have a look at what > might be feasible. > > Thanks for all these

Re: [sword-devel] Bible in Myanmar

2019-05-14 Thread David Haslam
Hi Cyrille If I can find the time tomorrow or later, I’ll have a look at what might be feasible. Thanks for all these useful links. David Sent from ProtonMail Mobile On Tue, May 14, 2019 at 14:08, Cyrille wrote: > I am sending my message again because it was too big. > > The conversion to UTF-8 is

Re: [sword-devel] Bible in Myanmar

2019-05-14 Thread David Haslam
The ThanLwinSoft software was indeed developed by Keith Stribley (1976-2011). Screenshot posted to my Facebook timeline. https://m.facebook.com/story.php?story_fbid=10213794210749822&id=1243443528 We had exchanged emails during the year before he died. Best regards David Sent from ProtonMail

Re: [sword-devel] Bible in Myanmar

2019-05-14 Thread Cyrille
I am sending my message again because it was too big. The conversion to UTF-8 is 99% solved!! I used an online converter: https://thanlwinsoft.github.io/www.thanlwinsoft.org/ThanLwinSoft/MyanmarUnicode/Conversion/myanmarConverter.html or: http://burglish.my-mm.org/latest/trunk/web/fontconv.htm See the re

Re: [sword-devel] Bible in Myanmar

2019-05-14 Thread Michael H
Cyrille, (Peter), Maybe further discussion on this belongs in Gitlab as issues. Can I get added to this project? Here are the first few lines of Matthew copied from the PDF: -- &Sifrmaw;OD; {0Ha*vdusrf; The Gospel According to Matthew ed'gef; usr;f ûyy*k Kd¾v f &iS rf maw;O;D \b0rwS wf r;f u

[sword-devel] Commentary module -- no text output (RESOLVED)

2019-05-14 Thread Christopher Adams
I'm not sure how, but I seem to have missed the e-mail of the response to my last post. So I apologize that this post isn't connected with it, and is a bit late. Troy, and David, thank you for your responses; they were very helpful! Also, just for completeness, I'll add that I wasn't sure what fi

Re: [sword-devel] Bible in Myanmar

2019-05-14 Thread Cyrille
Yesterday I thought: if a PDF tool makes it possible to cut the PDF down the middle, then a raw conversion to text becomes possible, and then we only need to convert it to UTF-8. Any ideas? On 13/05/2019 17:40, Michael H wrote: > I unzipped the PageMaker file, and when I open NT_Proverb/Pagemaker >