For reading word document as text, you can try AntiWord. I have written a simplified Lucene that does Max words match.
For example, if you are searching for aa, bb, cc, then, the document that contains all words (aa, bb, cc) will be definitely ranked higher than documents containing either aa, bb or aa, cc or bb, cc. I am going to put up the code as open source. If you are interested, you can email me directly. Jian On 2/9/06, P. Alex. Salamanca R. <[EMAIL PROTECTED]> wrote: > > On the other hand, if you want be the most cheapest, why don't give a > chance > to google search appliance? > >