Not sure if this is what you are after, but there is a projet call
File2XLIFF4j which converts a number of file formats to XLIFF (an XML
structure) using OpenOffice.org. And if I am not mistaken, Lucene has
code available for indexing XML. The project is located at
http://file2xliff4j.sourceforge.net.
John Haxby wrote:
Hello All,
In LIA, Erik and Otis mention using the openoffice.org API for
converting from various formats to something that can be used for
indexing.
Does anyone have any examples of doing this that they'd be willing to
share?
jch
---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]
---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]