Re: Indexing help needed

2007-05-25 Thread Andrzej Bialecki
jim shirreffs wrote: Thanks for the advice, I just don't see where in the Lucene code I should plug in OOParcer. I've walked the code in LIUS and Nutch (moving on to Solr) trying to find common objects. If I can find common objects in Lucene and Nutch I'll know where to plug in. Yo

Re: Indexing help needed

2007-05-25 Thread jim shirreffs
code up a Reader that just spits out "Here I am" a few hundred times and see what happens. LOL. Thank you for the reply and advice. jim s - Original Message - From: "Andrzej Bialecki" <[EMAIL PROTECTED]> To: Sent: Friday, May 25, 2007 1:10 PM Subject: R

Re: Indexing help needed

2007-05-25 Thread Andrzej Bialecki
jim shirreffs wrote: Thanks to all that try to help me out. Jim S. P.S. If I get it working I will be happy to email post the code. If you looked at the code in Nutch, you can take most of the parse-oo plugin verbatim, because all this plugin does is extract the text content and metadata

Indexing help needed

2007-05-25 Thread jim shirreffs
I've been working on this for a while. I am trying to get the demo code that comes with Lucene to index OpenOffice documents. I've looked at the LIUS code and at the Nutch code, but can't find an easy way, so I am digging into the code. I wrote a KcmiDocument class that returns a Document. In it I