Kirk Lowery is probably the best person to answer how to encode morphology in OSIS since he is working on the Linguistic Annotation module, using his Leningrad Codex as a partial basis.

On Tue, 31 May 2005, Martin Gruner wrote:

Hi,

I have a question. The WLC source text I work from includes morphological
separation markers, which is _very_ helpful when reading the Old Testament.

a) How do I encode them in OSIS? With <seg>some</seg><seg>thing</seg>?

<seg> is the tag you would use in OSIS 2.x as it is currently. But you'll need to add some kind of type information, e.g. <seg type="x-morpheme">. It works fine even if you have segs embedded within <w> elements.

b) Should we add an OptionFilter (OSISMorphSegmentation) to turn this on and
off in the display, as well as a corresponding .conf file entry?

Sounds good to me. What do you have in mind to differentiate display/hiding of morphological segmentation? OLB has two BHS modules, one with morphemes divided by spaces. As a result, there's no way to differentiate between morpheme and word breaks. Pipes, dashes, etc. might be better to express morpheme breaks. (Dashes happen to be the linguistic standard.)

If you intend to put other grammatical information in the morpheme tags, it would probably be better to go with <w> instead of <seg>. It isn't really the intended purpose of <w>, but <w> does have attributes for lemma & morphological data, which <seg> doesn't. Kirk might be able to advise better on this particular point.

--Chris
_______________________________________________
sword-devel mailing list: sword-devel@crosswire.org
http://www.crosswire.org/mailman/listinfo/sword-devel
Instructions to unsubscribe/change your settings at above page

Reply via email to