Re: Unique Fields

2008-03-13 Thread Ion Badita
My unique is more like synonym. For instance: Brain cancer, Cancer of the brain, Brain neoplasm, are the same, so i need to tokenize the title remove the stop words etc. I have a problem with the indexing... with a new title first i have to search in the index, if the title is not found write

Re: Unique Fields

2008-03-12 Thread Erick Erickson
So, you're tokenizing the title field? If so, I don't understand how you expect this to work. Would the title "this is one order" and "is one order this" be considered identical? Would capitalization matter? Punctuation? Throwing all the terms of a title into a tokenized field and expecting some ma

Re: Unique Fields

2008-03-12 Thread Ion Badita
The "problem" is that my unique field is a title, many terms per field. I want to make an index with titles and i don't want to have duplicates. John Erick Erickson wrote: You can easily find whether a term is in the index with TermEnum/TermDocs (I think TermEnum is all you really need). Exce

Re: Unique Fields

2008-03-11 Thread Erick Erickson
You can easily find whether a term is in the index with TermEnum/TermDocs (I think TermEnum is all you really need). Except, you'll probably also have to keep an internal map of IDs added since the searcher was opened and check against that too. Best Erick On Tue, Mar 11, 2008 at 11:04 AM, Ion B

Unique Fields

2008-03-11 Thread Ion Badita
Hi, I want to create an index with one unique field. Before inserting a document i must be sure that "unique field" is unique. John - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED