Re: question about grouping text

2009-03-26 Thread Otis Gospodnetic
to do to begin finding/evaluating questions and answers. Otis -- Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch - Original Message > From: Amin Mohammed-Coleman > To: java-user@lucene.apache.org > Sent: Thursday, March 26, 2009 3:54:59 AM > Subject: Re: question a

Re: question about grouping text

2009-03-26 Thread Amin Mohammed-Coleman
Hi I was wondering if soemthing like LingPipe or Gate (for text extraction) might be an idea? I've started looking at it and I'm just thinking it may be applicable (I maybe wrong). Cheers Amin On Wed, Mar 25, 2009 at 4:18 PM, Grant Ingersoll wrote: > Hi MFM, > > This comes down to a preprocess

Re: question about grouping text

2009-03-25 Thread Grant Ingersoll
Hi MFM, This comes down to a preprocessing step that you would have to do before putting into Lucene, although I suppose you might be able to identify it during analysis and use the TeeTokenFilter and the SinkTokenizer. Once you do this, then you can add them as fields on a Document. I