That sounds like a big
task. Assuming the Thai is Unicode, you somehow have to get the
documents into an XML file encoded in UTF-8. For the Bible, obviously,
USFM is the preferred option as an intermediate step because you can
use USFM editors like Bibledit, SIL FieldWorks, etc., to edit USFM in a
more or less user-friendly way (you are probably familiar with these,
but if not, search for them online). OSIS is good when you want to go to
SWORD, but it's not as easy to learn and work with. Bibledit offers the
option of exporting to a SWORD module, which is very attractive since
it gives you immediate feedback and a hopefully gratifying result. Once
your USFM files are really good, then you can use some other converter
to convert them to OSIS.

I once took some RTF files of a Bible and did some search-and-replace
operations on the kind of markup you describe to convert it to USFM. It
was a bit laborious and not necessarily the best way to go about it, but
you have to start somewhere. For italics, bold, etc., Word seems to work
well. Use wildcards to search for the markup, and add USFM markers when
you replace things. This assumes that their markup is consistent,
though. In my case it helped that there were styles named after some
older translation program whose name is escaping me, so I used
OpenOffice to go the rest of the way because you can search and replace
by style.

For the book, you should consider using OpenOffice, since you can open
Word files in it easily, save them as ODT, and export as XML. Basically,
for a book you just need to distinguish Parts, Chapters, Sections, and
so on. Scripture references, tables, and a few other things may factor
in as well. The filter I worked on for OpenOffice (exporting to OSIS for
Genbooks) worked well with version 2.4, but I'm not sure about version
3. I need to work on it sometime. You can find it at
http://sites.google.com/site/danielowensstuff/. There are instructions
in English.

Daniel

Adrian Korten wrote:
Good day,
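
For what it's worth, the search-and-replace step described in the reply
above (turning consistent inline markup into USFM markers) could also be
scripted rather than done by hand in Word. Below is a minimal sketch in
Python. The input markup is purely an assumption for illustration
(<i>...</i> for italics, <b>...</b> for bold, "Chapter N" on its own
line, and verses beginning with a number), since the actual markup in
the source files is not shown here. The name to_usfm is hypothetical.
Only the USFM markers themselves (\c, \v, \it ...\it*, \bd ...\bd*) are
standard.

```python
import re

# ASSUMED input markup (hypothetical, adjust to whatever the files really use):
#   <i>...</i>   italics
#   <b>...</b>   bold
#   "Chapter N"  on a line by itself
#   "N text"     a verse number at the start of a line
RULES = [
    # Inline character styles become paired USFM character markers.
    (re.compile(r"<i>(.*?)</i>", re.DOTALL), r"\\it \1\\it*"),
    (re.compile(r"<b>(.*?)</b>", re.DOTALL), r"\\bd \1\\bd*"),
    # Chapter headings become \c markers.
    (re.compile(r"^Chapter[ \t]+(\d+)[ \t]*$", re.MULTILINE), r"\\c \1"),
    # A leading verse number becomes a \v marker.
    (re.compile(r"^(\d+)[ \t]+", re.MULTILINE), r"\\v \1 "),
]


def to_usfm(text: str) -> str:
    """Apply each search-and-replace rule in order and return the result."""
    for pattern, replacement in RULES:
        text = pattern.sub(replacement, text)
    return text


if __name__ == "__main__":
    sample = (
        "Chapter 1\n"
        "1 In the <i>beginning</i> God created.\n"
        "2 And the earth was <b>without form</b>.\n"
    )
    print(to_usfm(sample))
```

As with doing it by hand, this only works if the markup is consistent.
Python 3 strings are Unicode, so Thai text passes through unchanged as
long as the files are read and written as UTF-8.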
_______________________________________________ sword-devel mailing list: sword-devel@crosswire.org http://www.crosswire.org/mailman/listinfo/sword-devel Instructions to unsubscribe/change your settings at above page