Re: AW: Parsing MSWord

2008-11-12 Thread Otis Gospodnetic
: java-user@lucene.apache.org Sent: Wednesday, November 12, 2008 8:25:43 AM Subject: Re: AW: Parsing MSWord Check out POI; that's what I use http://poi.apache.org/ "Sertic Mirko, Bedag" <[EMAIL PROTECTED]> wrote on 11/12/2008 03:25:47 AM: > Hi > > You can also use a tool cal

Re: AW: Parsing MSWord

2008-11-12 Thread Donna L Gresh
Check out POI; that's what I use http://poi.apache.org/ "Sertic Mirko, Bedag" <[EMAIL PROTECTED]> wrote on 11/12/2008 03:25:47 AM: > Hi > > You can also use a tool called "antiword" to extract the text from a > .doc file, and then > give the text to lucene. > > See here : http://en.wikipedia

AW: Parsing MSWord

2008-11-12 Thread Sertic Mirko, Bedag
Hi You can also use a tool called "antiword" to extract the text from a .doc file, and then give the text to lucene. See here : http://en.wikipedia.org/wiki/Antiword Regards Mirko -Ursprüngliche Nachricht- Von: dipesh [mailto:[EMAIL PROTECTED] Gesendet: Mittwoch, 12. November 2008 04: