Hi,

On Fri, Jun 3, 2011 at 10:29 PM,  <robert_w...@us.ibm.com> wrote:
> Are there any other Apache projects where there might be an interesting
> relationship?   Anything jump out?

Apache Tika (http://tika.apache.org/) is a generic toolkit for
extracting text and metadata from various file formats. Improving ODF
support with tools from OOo is an obvious area of interest for Tika.

Apache PDFBox (http://pdfbox.apache.org/) is a Java library for
working with PDF documents. Even if direct code sharing across the
Java / C++ divide is impractical, sharing PDF know-how and perhaps
test cases between these projects would be great.

BR,

Jukka Zitting

