Not sure if this is what you are after, but there is a project called File2XLIFF4j which converts a number of file formats to XLIFF (an XML-based format) using OpenOffice.org. And if I am not mistaken, Lucene has code available for indexing XML. The project is located at http://file2xliff4j.sourceforge.net.
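
In case it helps, here is a rough sketch of how the text could be pulled out of a converted XLIFF file before handing it to an indexer. It uses only the XML parser that ships with the JDK and assumes the converter puts the document text in <source> elements, as standard XLIFF does. The class name XliffTextExtractor and the sample document are made up for illustration and are not part of File2XLIFF4j; the Lucene indexing step is left out because that API differs between Lucene versions.

```java
import java.io.StringReader;
import java.util.ArrayList;
import java.util.List;
import javax.xml.parsers.DocumentBuilder;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Document;
import org.w3c.dom.NodeList;
import org.xml.sax.InputSource;

// Hypothetical helper, not part of File2XLIFF4j or Lucene.
public class XliffTextExtractor {

    /** Returns the trimmed text of every non-empty <source> element. */
    public static List<String> extractSourceText(String xliff) throws Exception {
        DocumentBuilderFactory factory = DocumentBuilderFactory.newInstance();
        // Namespace-aware so <source> is found whether or not a namespace is declared.
        factory.setNamespaceAware(true);
        DocumentBuilder builder = factory.newDocumentBuilder();
        Document doc = builder.parse(new InputSource(new StringReader(xliff)));

        // "*" matches any namespace, so the XLIFF version does not matter here.
        NodeList sources = doc.getElementsByTagNameNS("*", "source");
        List<String> segments = new ArrayList<>();
        for (int i = 0; i < sources.getLength(); i++) {
            // getTextContent() also flattens any inline markup inside <source>.
            String text = sources.item(i).getTextContent().trim();
            if (!text.isEmpty()) {
                segments.add(text);
            }
        }
        return segments;
    }

    public static void main(String[] args) throws Exception {
        // Made-up sample; a real file would come from the converter.
        String sample =
            "<xliff version=\"1.2\" xmlns=\"urn:oasis:names:tc:xliff:document:1.2\">"
          + "<file original=\"report.odt\" source-language=\"en\" datatype=\"plaintext\">"
          + "<body>"
          + "<trans-unit id=\"1\"><source>First paragraph.</source></trans-unit>"
          + "<trans-unit id=\"2\"><source>Second paragraph.</source></trans-unit>"
          + "</body></file></xliff>";

        for (String segment : extractSourceText(sample)) {
            System.out.println(segment);
        }
    }
}
```

Each returned string could then be added as the content field of whatever document object your Lucene version expects.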

John Haxby wrote:
Hello All,

In LIA, Erik and Otis mention using the openoffice.org API for converting from various formats to something that can be used for indexing.

Does anyone have any examples of doing this that they'd be willing to share?

jch

---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

