Re: Tokenize on another character

2008-03-31 Thread Erick Erickson
Much clearer. Here's what I'd try. Index UN_TOKENIZED as follows: for METAL MAN (bad pseudo-code...) Document doc = new Document(); doc.add("category", "GUITAR", Store.NO, UN_TOKENIZED); doc.add("category", "ROCK", Store.NO, UN_TOKENIZED); doc.add("category", "ROCK AND ROLL" , Store.NO, UN_TOKENIZ

Re: Tokenize on another character

2008-03-31 Thread Fiaz Khan
Thanks Erick Ok,.. I have a track called METAL MAN, this has 4 categories assigned to it like so: GUITAR ROCK ROCK AND ROLL METAL I have another track called NOISE with the following 3 categories: GUITAR ROCK AND ROLL METAL When a user searches using the keyword ROCK, it is finding both w

Re: Tokenize on another character

2008-03-31 Thread Erick Erickson
I'm confused on the use case you're trying to implement, could you add a bit more explanation? In particular, do you ever want ROCK to match ROCK AND ROLL? If you want both, that is some searches match partial keywords and some match entire keywords, I recommend you create a second field in your d