Yes you could use lucene for this, but it may be overkill for your requirement. If I understand you correctly, all you need to is find documents which match "any" of the words in your list? Do you need to rank the results? If not, it's probably easier just to create your own inverted index of the documents you need to search. If you actually need to rank results, then lucene is probably easier as it does this for you.
On 8/11/06, Eugeny N Dzhurinsky <[EMAIL PROTECTED]> wrote:
Hello! I have an assigment, which will require to search documents for keywords or keyphrases. For instance, I have a database of keywords/keyphrases, which might contain several millions items. Now I need to find if document contains any of the keywords/phrases listed in that database. I was thinking on implementing finite-state machine, and use b-trees, so I will iterate document char by char and go down the tree unless I find some word or phrase which matches character sequence. I think Lucene is doing in the same way when performs searching, so may be I can use Lucene? -- Eugene N Dzhurinsky --------------------------------------------------------------------- To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]