Hi,

I have 1 thought:

1. It would probably be possible to write some code to effectively
remove all content outside of two arbitary markers in an XML document.
E.G.

<osis>
    blah
    <verseStart/>
    <q>
        blah
        <verseEnd/>
        blah
    </q>
    blah
</osis>

Becomes:

<osis>
    <verseStart/>
    <q>
        blah
        <verseEnd/>
    </q>
</osis>

So you start with a full OSIS document and do that process for each
and every verse, storing the result in a SWORD module.

Joe.

On Thu, 19 Aug 2004 09:30:28 +1000, Kahunapule Michael P. Johnson
<[EMAIL PROTECTED]> wrote:
> At 03:17 19-08-04, DM Smith wrote:
> >Looking past a JSword 1.0 release, I was studying the OSIS 2.0 schema
> >and it looks like it may be tough to handle well. Specifically, there
> >are elements that can be either a marker or a container. With regard to
> >Bibles specifically a  verse may start smack dab in the middle of one of
> >these other elements. Or one of these elements may end in a verse. And
> >it might not be just one element that is split by a verse, it may be
> >several.
> >...
> >Does anyone know of a best practice for OSIS, or any other XML field?
> 
> I believe that the recommendation in the XSEM documentation is valid for OSIS as 
> well. Basically, it gives lots of good reasons for making the natural poetry and 
> prose structure of the document primary, and the chapter/verse structure secondary. 
> It then goes on to say that for use in applications where verse priority is required 
> (i. e. a Bible search engine), it would make sense to use XSLT to transform the XML 
> to prioritize the chapters and verses as containers, and make the other elements 
> milestones. This makes sense to me. The sword engine is entirely verse-oriented, as 
> are most Bible search engines, but it is highly desirable to preserve poetry and 
> prose formatting within each displayed verse range.
> 
> Of course, the details of what the chapter and verse priority version of OSIS, XSEM, 
> or other similar XML standard isn't fully clear, but I would think that if you start 
> with OSIS, you would want to keep it as close to OSIS as you can while still making 
> every verse well-formed XML, and while making the other stuff into milestones. OSIS 
> already allows verses to be containers, for example, and it has a milestone 
> mechanism that can be extended to the containers for anything that can cross verse 
> boundaries, such as paragraphs. Perhaps using the milestone element just after every 
> verse open tag to put in redundant reminders of what kind of paragraph you are in 
> would make sense; likewise for character styles, quotations being continued, etc.
> 
> Thoughts?
> 
> 
> 
> 
> _______________________________________________
> sword-devel mailing list
> [EMAIL PROTECTED]
> http://www.crosswire.org/mailman/listinfo/sword-devel
>
_______________________________________________
sword-devel mailing list
[EMAIL PROTECTED]
http://www.crosswire.org/mailman/listinfo/sword-devel

Reply via email to