jim shirreffs wrote:
Thanks for the advice, I just don't see where in the Lucene code I
should plug OOParcer into Lucene.
I've walked the code in LIUS and Nutch (moving on to Solr) trying to
find common objects. If I can find common objects in Lucene and Nutch
I'll know where to plug in.
Yo
code up a Reader the just spites out "Here I am" a few
hundred times and see what happens. LOL.
thank you for the reply and advice.
jim s
- Original Message -
From: "Andrzej Bialecki" <[EMAIL PROTECTED]>
To:
Sent: Friday, May 25, 2007 1:10 PM
Subject: R
jim shirreffs wrote:
Thanks to all that try to help me out
Jim S
P.S. If I get it working I will be happy to email post the code.
If you looked at the code in Nutch, you can take most of the parse-oo
plugin verbatim, because all this plugin does is it extracts the text
content and metadata
I've been working on this for a while, I am trying to get the demo code that
comes with Lucene to index OpenOffice documentss. I've looked at LIUS code
and at Nutch code. But can't find an easy way. So I am digging into the
code.
I wrote a KcmiDocument class that returns a Document. In it I