Yes you could use lucene for this, but it may be overkill for your
requirement.  If I understand you correctly, all you need to is find
documents which match "any" of the words in your list?  Do you need to rank
the results?   If not, it's probably easier just to create your own inverted
index of the documents you need to search.  If you actually need to rank
results, then lucene is probably easier as it does this for you.



On 8/11/06, Eugeny N Dzhurinsky <[EMAIL PROTECTED]> wrote:

Hello!

I have an assigment, which will require to search documents for keywords
or
keyphrases.

For instance, I have a database of keywords/keyphrases, which might
contain
several millions items. Now I need to find if document contains any of the
keywords/phrases listed in that database.

I was thinking on implementing finite-state machine, and use b-trees, so I
will iterate document char by char and go down the tree unless I find some
word or phrase which matches character sequence.

I think Lucene is doing in the same way when performs searching, so may be
I
can use Lucene?

--
Eugene N Dzhurinsky

---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]


Reply via email to