Teus has since added all the missing \toc# markers to the Shona repo
<https://github.com/teusbenschop/shona>.

After the last commit, the USFM tag statistics were as follows:

Count   SFM tag   Description (updated for USFM 3.0)
-----   -------   ---------------------------------------------------
04948   \add      Translator's added words begin
04948   \add*     Translator's added words end
01189   \c        Chapter
00066   \h        Running header (h=h1)
00066   \id       Identification
00065   \mt       Major title (mt=mt1)
00001   \mt1      Major title (portion 1)
00031   \mt2      Major title (portion 2)
00009   \nb       No break with previous paragraph
06445   \p        Paragraph
00066   \rem      Remark
01774   \s        Section heading (s=s1)
00066   \toc1     Table of contents 1 (Long  table of contents text)
00066   \toc2     Table of contents 2 (Short table of contents text)
00066   \toc3     Table of contents 3 (Book abbreviation)
31102   \v        Verse[s]
15739   \x        Cross reference element begin
15739   \x*       Cross reference element end
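
For anyone who wants to produce a similar tally, here is a minimal
Python sketch. It is not the tool used for the table above. The regular
expression is a simplification that I am assuming is good enough for
this text (it ignores nested "+" markers and milestone markers), and the
function names are made up for illustration.

```python
import re
from collections import Counter

# Simplified marker pattern (an assumption, not the full USFM grammar):
# a backslash, lowercase letters, optional digits, optional closing "*".
MARKER = re.compile(r"\\([a-z]+[0-9]*\*?)")


def count_usfm_tags(text):
    """Return a Counter mapping each marker (with its backslash) to its count."""
    return Counter("\\" + name for name in MARKER.findall(text))


def format_report(counts):
    """Format the counts as zero-padded 'count   tag' lines, sorted by tag."""
    return "\n".join(f"{counts[tag]:05d}   {tag}" for tag in sorted(counts))
```

Opening and closing markers such as \add and \add* are counted
separately, so a mismatch between the two counts points to an unclosed
element.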

Observation:
The data structure in the GitHub repository is not one USFM file per book,
but one USFM data file per chapter, each in a suitably numbered directory,
plus a separate data file (in directory 0) for the USFM header lines.

In order to convert the text to OSIS, some preprocessing would be required
to get the source text to one USFM file per book (as used by ParaTExt).
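
The preprocessing could be as simple as concatenating the chapter files
in numeric order. Below is a rough Python sketch, assuming each book has
its own directory containing the numbered chapter directories, and that
the file inside each one is named "data". Both the layout details and
the file name are my assumptions and should be checked against the
repository; merge_book is a hypothetical helper name.

```python
from pathlib import Path


def merge_book(book_dir, data_name="data"):
    """Join the per-chapter files of one book into a single USFM string.

    Assumed layout (not verified): <book_dir>/<chapter number>/<data_name>,
    with directory 0 holding the USFM header lines.
    """
    book_dir = Path(book_dir)
    # Sort numerically so that chapter 10 follows 9 rather than 1.
    chapter_dirs = sorted(
        (d for d in book_dir.iterdir() if d.is_dir() and d.name.isdigit()),
        key=lambda d: int(d.name),
    )
    parts = []
    for chapter_dir in chapter_dirs:
        data_file = chapter_dir / data_name
        if data_file.is_file():
            # Strip trailing newlines so chapters are joined by exactly one.
            parts.append(data_file.read_text(encoding="utf-8").rstrip("\n"))
    return "\n".join(parts) + "\n" if parts else ""
```

Because directory 0 sorts first, the header lines (\id, \h, \toc1 and so
on) end up at the top of the merged file, ahead of chapter 1.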

Best regards,

David




