You might also consider asking on the Tika (a Lucene subproject currently in incubation) and Aperture (http://aperture.sourceforge.net) project sites. I'm not sure you will have any luck, but both projects focus on the extraction problem and may have thought more about it.

-Grant

On Nov 8, 2007, at 8:37 AM, Michael Prichard wrote:

Hello,

I know this has gone around a bit, but has anyone had any success pulling text from Office 2007 files? Any recommendations?

Thanks,
Michael

---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]


--------------------------
Grant Ingersoll
http://lucene.grantingersoll.com

Lucene Boot Camp Training:
ApacheCon Atlanta, Nov. 12, 2007.  Sign up now!  http://www.apachecon.com

Lucene Helpful Hints:
http://wiki.apache.org/lucene-java/BasicsOfPerformance
http://wiki.apache.org/lucene-java/LuceneFAQ



