to do to begin finding/evaluating
questions and answers.
Otis
--
Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch
- Original Message
> From: Amin Mohammed-Coleman
> To: java-user@lucene.apache.org
> Sent: Thursday, March 26, 2009 3:54:59 AM
> Subject: Re: question a
Hi
I was wondering if soemthing like LingPipe or Gate (for text extraction)
might be an idea? I've started looking at it and I'm just thinking it may
be applicable (I maybe wrong).
Cheers
Amin
On Wed, Mar 25, 2009 at 4:18 PM, Grant Ingersoll wrote:
> Hi MFM,
>
> This comes down to a preprocess
Hi MFM,
This comes down to a preprocessing step that you would have to do
before putting into Lucene, although I suppose you might be able to
identify it during analysis and use the TeeTokenFilter and the
SinkTokenizer. Once you do this, then you can add them as fields on a
Document. I