you may want to have a look at http://poi.apache.org/ as tmextractors seem to duplicate functionality
/Johan Florian Richter wrote: > Package: wnpp > Severity: wishlist > Owner: Florian Richter <florian_rich...@gmx.de> > > * Package name : tmextractors > Version : 1.0 > Upstream Author : Benryan Software Inc. > * URL : http://www.textmining.org/ > * License : LGPL > Programming Lang: Java > Description : A pure java library for extracting text from Word > documents > > This is a pure java library for extracting text from Word documents. > > -- System Information: > Debian Release: 5.0 > APT prefers testing > APT policy: (500, 'testing') > Architecture: i386 (i686) > > > > -- -- ------------------------------------------------ Johan Henriksson MSc Engineering PhD student, Karolinska Institutet http://mahogny.areta.org http://www.endrov.net -- To UNSUBSCRIBE, email to debian-wnpp-requ...@lists.debian.org with a subject of "unsubscribe". Trouble? Contact listmas...@lists.debian.org