Thanks, Erik, but I'm developing this system from scratch because it has
specific use cases, including handling multiple languages and
multiple forms of one minority language (Irish).
I'm going to look at XTF anyway just to see how they managed it!
Thanks,
A.
> Have you looked at XTF
Dear users,
I have a question about approaches to indexing TEI XML or similarly
sectioned/subsectioned files.
I'm indexing TEI P4 XML files using Lucene 2.x.
Currently, each TEI XML file corresponds to a Lucene document.
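A rough sketch of what such a one-file-per-document mapping could look like, using only the JDK's XPath support. The class name, field names, and XPath expressions below are placeholders for illustration (they assume an un-namespaced TEI P4 file with a TEI.2 root element), not the actual code or expressions used in this system:

```java
import java.io.StringReader;
import java.util.LinkedHashMap;
import java.util.Map;
import javax.xml.xpath.XPath;
import javax.xml.xpath.XPathExpressionException;
import javax.xml.xpath.XPathFactory;
import org.xml.sax.InputSource;

class TeiFieldExtractor {

    // Hypothetical field-name / XPath pairs; the real ones depend on the corpus.
    static final String[][] FIELD_XPATHS = {
        {"title", "/TEI.2/teiHeader/fileDesc/titleStmt/title"},
        {"body", "/TEI.2/text/body"},
    };

    // Returns one field map per XML file; each entry would become one field
    // of the single Lucene document that represents the file.
    static Map<String, String> extractFields(String xml) throws XPathExpressionException {
        XPath xpath = XPathFactory.newInstance().newXPath();
        Map<String, String> fields = new LinkedHashMap<>();
        for (String[] entry : FIELD_XPATHS) {
            // evaluate(String, InputSource) yields the string value of the first
            // matching node, i.e. all of its descendant text concatenated.
            // The XML is re-parsed for every expression, which keeps the sketch
            // short but is not efficient for large files.
            String value = xpath.evaluate(entry[1], new InputSource(new StringReader(xml)));
            fields.put(entry[0], value.trim().replaceAll("\\s+", " "));
        }
        return fields;
    }
}
```

Because the body expression flattens every section and subsection into one string, the section boundaries are no longer visible in the resulting field.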
I extract the data from each XML file using XPath expressions e.g. for the
body text: "/TEI