Hi, On Fri, Jun 3, 2011 at 10:29 PM, <robert_w...@us.ibm.com> wrote: > Are there any other Apache projects where there might be an interesting > relationship? Anything jump out?
Apache Tika (http://tika.apache.org/) is a generic toolkit for extracting text and metadata from various file formats. Improving ODF support with tools from OOo is an obvious area of interest for Tika. Apache PDFBox (http://pdfbox.apache.org/) is a Java library for working with PDF documents. If not direct code sharing over the Java / C++ divide, then at least sharing of PDF know-how and perhaps things like test cases between these projects would be great. BR, Jukka Zitting --------------------------------------------------------------------- To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org For additional commands, e-mail: general-h...@incubator.apache.org