Re: question about grouping text

Grant Ingersoll Wed, 25 Mar 2009 18:26:26 -0700

Hi MFM,

This comes down to a preprocessing step that you would have to dobefore putting into Lucene, although I suppose you might be able toidentify it during analysis and use the TeeTokenFilter and theSinkTokenizer. Once you do this, then you can add them as fields on aDocument. I know that's not a great help, but not much Lucene can dob/c it is application specific.


Document/field wise, I would probably have:
Document
   question
   answer

Then, when you search in the question field, you can also retrieve theanswer.


-Grant

On Mar 24, 2009, at 4:04 PM, MFM wrote:

I have been able to successfully index and search text from structured
documents like PDF and MS Word. I am having a real hard time trying to
figure out how to group the index strings together e.g. if mydocument had aquestion and answer in a table, the search will produce the textwith thequestion based on the keyword. How would I group or associate thequestionand answer as part of the indexing ? I have tried using POI to readthru theMS Word file and try and group them, but then it gets really intenseinto
pattern matching.

Thanks
MFM
--
View this message in context: 
http://www.nabble.com/question-about-grouping-text-tp22682433p22682433.html
Sent from the Lucene - Java Users mailing list archive at Nabble.com.


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]



---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Re: question about grouping text

Reply via email to