If the language is ordinary Burmese, we can use Google Translate to check that
any particular verse has text that we should expect for the initially assigned
reference. Machine translation is often adequate for such a limited purpose
task.
We can also check against how PageMaker rendered it to
If the text doesn't come from an active team with people who read Burmese:
I am on another group list which has relationships with people working on
minority languages related to Burmese, but not Myanmarese. I can ask if
anyone can verify the unicode conversion works there if that's the only
option
I don't read Burmese and I don't know anyone who does. I was suggesting you
contact whoever provided you the files and permission, and ask them if they
can verify the unicode text reads correctly.
To a native speaker, the text will quickly be recognized as readable, or
misspelled (letters are out
Il 14/05/2019 22:48, Michael H ha scritto:
> You should be able to configure a regex search to find the verse
> boundaries.
>
> Once you have verse boundaries, if you configure the text into Verse
> per line it should be possible to assign each row a chapter and verse
> number from a reference. T
Il 14/05/2019 22:55, Cyrille ha scritto:
>
>
> Il 14/05/2019 22:45, Michael H ha scritto:
>> Cyrille, did you start from the PDF or the pagemaker file?
> PMaker
>> Either way, you should send a snippet to your source and validate the
>> words are still readable. As small as 30 words should be eno
You should be able to configure a regex search to find the verse
boundaries.
Once you have verse boundaries, if you configure the text into Verse per
line it should be possible to assign each row a chapter and verse number
from a reference. That is, the 3341 verse in the New Testament is usually
J
Cyrille, did you start from the PDF or the pagemaker file? Either way, you
should send a snippet to your source and validate the words are still
readable. As small as 30 words should be enough.
On Tue, May 14, 2019 at 8:09 AM Cyrille wrote:
> I send my message again because it was bigger.
>
> Th
Il 14/05/2019 22:26, David Haslam ha scritto:
> If Michael’s observations are anything to go by, then maybe I can
> script the recovery of chapter & verse tags.
>
> We shall see
>
> Even if I’m not immediately successful - valuable lessons can be
> learned in the attempt.
Very, well, I'll w
If Michael’s observations are anything to go by, then maybe I can script the
recovery of chapter & verse tags.
We shall see
Even if I’m not immediately successful - valuable lessons can be learned in the
attempt.
David
Sent from ProtonMail Mobile
On Tue, May 14, 2019 at 21:21, Cyrille
Ok thank you! I have already all the text in unicode but without the
verse numbers and chapters... I begun manually...
Il 14/05/2019 22:17, David Haslam ha scritto:
> Hi Cyrille
>
> If I can find the time tomorrow or later, I’ll have a look at what
> might be feasible.
>
> Thanks for all these
Hi Cyrille
If I can find the time tomorrow or later, I’ll have a look at what might be
feasible.
Thanks for all these useful links.
David
Sent from ProtonMail Mobile
On Tue, May 14, 2019 at 14:08, Cyrille wrote:
> I send my message again because it was bigger.
>
> The conversion to UTF-8 is
The ThanLwinSoft software was indeed developed by Keith Stribley (1976-2011).
Screenshot posted to my Facebook timeline.
https://m.facebook.com/story.php?story_fbid=10213794210749822&id=1243443528
We had exchanged emails during the year before he died.
Best regards
David
Sent from ProtonMail
I send my message again because it was bigger.
The conversion to UTF-8 is 99% solved!! I used a online converter:
https://thanlwinsoft.github.io/www.thanlwinsoft.org/ThanLwinSoft/MyanmarUnicode/Conversion/myanmarConverter.html
or:
http://burglish.my-mm.org/latest/trunk/web/fontconv.htm
See the re
Cyrille, (Peter),
Maybe further discussion on this belongs in Gitlab as issues. Can I get
added to this project?
Here are the first few lines of Matthew copied from the PDF:
--
&Sifrmaw;OD; {0Ha*vdusrf;
The Gospel According to Matthew
ed'gef;
usr;f ûyy*k Kd¾v f &iS rf maw;O;D \b0rwS wf r;f
u
I'm not sure how, but I seem to have missed the e-mail of the response to
my last post. So I apologize that this post isn't connected with it, and is
a bit late.
Troy, and David, thank-you for your responses; they were very helpful!
Also, just for completeness, I'll add that I wasn't sure what fi
Yesterday I thought, if a pdf tool give the possibility to cut the pdf
in the middle, then the raw conversion to txt can be possible, the we
only need to convert it to UTF8.
Any idea?
Il 13/05/2019 17:40, Michael H ha scritto:
> I unzipped the pagemaker file, and when I open NT_Proverb/Pagemaker
>
16 matches
Mail list logo