That sounds like a big task. Assuming the Thai is unicode, somehow you have to get the documents into an xml file encoded in utf8. For the Bible obviously USFM is the preferred option as an intermediate step because you can use USFM editors like Bibedit, SIL FieldWorks, etc., to edit USFM more or less in a user friendly way (You are probably familiar with these, but if not search for them online). OSIS is good when you want to go to SWORD, but it's not as easy to learn and work with. Bibledit offers the option of exporting to a SWORD module, which is very attractive since it gives you immediate feedback and a hopefully gratifying result. Once your USFM files are really good, then you can use some other converter to convert it to OSIS.

I once took some RTF files of a Bible and did some search and replace operations using the kind of markup you describe to convert it to USFM. It was a bit laborious and not necessarily the best way to go about it, but you have to start somewhere. For italics, bold, etc., Word seems to work well. Use wildcards and search for the markup, adding USFM markers when you replace things. This assumes that their markup is consistent, though.
In my case it helped that there were styles named after some older translation program whose name is escaping me, so I used OpenOffice to go the rest of the way because you can search and replace by style.

For the book you should consider using OpenOffice since you can open Word files in it easily, save it as ODT, and export as XML. Basically for a book you just need to distinguish Parts, Chapters, Sections, and so on. Scripture references, tables, and a few other things may factor in as well. The filter I worked on for OpenOffice (exporting to OSIS for Genbooks) worked well with version 2.4, but I'm not sure about version 3. I need to work on it sometime. You can find it at http://sites.google.com/site/danielowensstuff/. There are instructions in English.

Daniel

Adrian Korten wrote:
Good day,

By inline markup, I mean that it is bolded, centred, super-scripted, etc. (Sorry, I'm not sure of the right terminology but may display markup would have been a better term.) But no standard format markup. I would either recommend that they markup with USFM or OSIS.

ak


----- Original Message -----
*From:* Daniel Owens <dhow...@pmbx.net>
*To:* "SWORD Developers' Collaboration Forum" <sword-devel@crosswire.org>
*Sent:* 02/26/2009 4:28:05 PM +0700
*Subject:* [sword-devel] OSIS editor


Adrian,

What does the inline markup look like? I mean, is it like standard format markup (USFM, MDF, etc.), or am I way off the mark?

Daniel

Peter von Kaehne wrote:
Adrian Korten wrote:
Good day,

I'm advising a team of Thai people who would like to prepare a Bible and book for import to Sword. They currently have both texts in Word with in-line mark-up (no styles). I assume that they would need to save as raw text and then start adding the markup. Could someone advise on a good editor for this? Or a strategy for doing this?

There is a XSLT style sheet for OpenOffice Export to Genbook format. This would be an easy way to create markup for the book at least.

Peter

_______________________________________________
sword-devel mailing list: sword-devel@crosswire.org <mailto:sword-devel@crosswire.org>
http://www.crosswire.org/mailman/listinfo/sword-devel
Instructions to unsubscribe/change your settings at above page

------------------------------------------------------------------------

_______________________________________________
sword-devel mailing list: sword-devel@crosswire.org
http://www.crosswire.org/mailman/listinfo/sword-devel
Instructions to unsubscribe/change your settings at above page

_______________________________________________
sword-devel mailing list: sword-devel@crosswire.org
http://www.crosswire.org/mailman/listinfo/sword-devel
Instructions to unsubscribe/change your settings at above page

_______________________________________________
sword-devel mailing list: sword-devel@crosswire.org
http://www.crosswire.org/mailman/listinfo/sword-devel
Instructions to unsubscribe/change your settings at above page

Reply via email to