Re: Indexing sections of TEI XML files

2008-08-13 Thread ao1
Thanks, Erik, but I'm developing this system from scratch because it has specific use cases, including handling multiple languages and multiple forms of one minority language (Irish). I'm going to look at XTF anyway, just to see how they managed it! Thanks, A. > Have you looked at XTF

Indexing sections of TEI XML files

2008-08-13 Thread ao1
Dear users, I have a question about approaches to indexing TEI XML or similar files that are divided into sections and subsections. I'm indexing TEI P4 XML files using Lucene 2.x. Currently, each TEI XML file corresponds to one Lucene document. I extract the data from each XML file using XPath expressions, e.g. for the body text: "/TEI
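
For readers following the thread, here is a minimal sketch of one possible approach to the question above: splitting a file into one indexable unit per section instead of one per file. It is not the poster's code and not Lucene API. The sample markup, the `.//div1` path, the `id` attribute, and the names `split_sections` and `SAMPLE` are all assumptions made for illustration; the poster's own XPath expression is cut off in the archive and is not reproduced here.

```python
import xml.etree.ElementTree as ET

# Hypothetical sample file. The element names and ids are assumptions
# for illustration, not taken from the poster's data.
SAMPLE = """<TEI.2>
  <text>
    <body>
      <div1 id="s1"><head>First</head><p>Alpha text.</p></div1>
      <div1 id="s2"><head>Second</head><p>Beta <hi>bold</hi> text.</p></div1>
    </body>
  </text>
</TEI.2>"""


def split_sections(xml_text, section_path=".//div1"):
    """Return one dict per section, each a candidate for its own index document.

    section_path is an assumed ElementTree path; adjust it to the real markup.
    """
    root = ET.fromstring(xml_text)
    units = []
    for pos, section in enumerate(root.iterfind(section_path)):
        # Join all text nodes with spaces, then collapse whitespace runs.
        # Caveat: this also inserts a space where inline markup splits a word.
        text = " ".join(" ".join(section.itertext()).split())
        units.append({
            # Fall back to the section's position when no id attribute exists.
            "section_id": section.get("id", str(pos)),
            "text": text,
        })
    return units


units = split_sections(SAMPLE)
for unit in units:
    print(unit["section_id"], "|", unit["text"])
```

Each returned dict could then be mapped to a separate index document, with the file name stored alongside the section id so that hits can be traced back to their source file. Whether per-section documents suit a given collection depends on how results should be grouped and ranked.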