Re: AW: Parsing MSWord

2008-11-12 Thread Otis Gospodnetic
: java-user@lucene.apache.org Sent: Wednesday, November 12, 2008 8:25:43 AM Subject: Re: AW: Parsing MSWord Check out POI; that's what I use http://poi.apache.org/ "Sertic Mirko, Bedag" <[EMAIL PROTECTED]> wrote on 11/12/2008 03:25:47 AM: > Hi > > You can also use a tool cal

Re: AW: Parsing MSWord

2008-11-12 Thread Donna L Gresh
Check out POI; that's what I use http://poi.apache.org/ "Sertic Mirko, Bedag" <[EMAIL PROTECTED]> wrote on 11/12/2008 03:25:47 AM: > Hi > > You can also use a tool called "antiword" to extract the text from a > .doc file, and then > give the text to lucene. > > See here : http://en.wikipedia