Re: Tokenize on another character

2008-03-31 Thread Erick Erickson
Much clearer. Here's what I'd try. Index UN_TOKENIZED as follows: for METAL MAN (bad pseudo-code...) Document doc = new Document(); doc.add("category", "GUITAR", Store.NO, UN_TOKENIZED); doc.add("category", "ROCK", Store.NO, UN_TOKENIZED); doc.add("category", "ROCK AND ROLL" , Store.NO, UN_TOKENIZ

Re: Tokenize on another character

2008-03-31 Thread Fiaz Khan
Thanks Erick. OK, I have a track called METAL MAN, which has 4 categories assigned to it: GUITAR, ROCK, ROCK AND ROLL, METAL. I have another track called NOISE with the following 3 categories: GUITAR, ROCK AND ROLL, METAL. When a user searches using the keyword ROCK, it is finding both w
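
For readers following the thread, the behaviour described here can be reproduced without Lucene. The sketch below is plain Java, not Lucene API: the class and method names (KeywordMatchSketch, tokenizedTerms, wholeTerms, search) are made up for illustration, and whitespace splitting stands in for whatever analyzer was actually used. It only contrasts splitting each category into words with keeping each category whole as a single term, which is the idea behind the UN_TOKENIZED suggestion.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashSet;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;
import java.util.Set;

public class KeywordMatchSketch {

    // Tokenized view (assumption: plain whitespace splitting). Every category
    // is broken into words, so "ROCK AND ROLL" yields ROCK, AND and ROLL.
    public static Set<String> tokenizedTerms(List<String> categories) {
        Set<String> terms = new HashSet<>();
        for (String category : categories) {
            terms.addAll(Arrays.asList(category.trim().split("\\s+")));
        }
        return terms;
    }

    // Untokenized view: each category is kept whole as one term.
    public static Set<String> wholeTerms(List<String> categories) {
        return new HashSet<>(categories);
    }

    // Returns the names of the tracks whose term set contains the query exactly.
    public static List<String> search(Map<String, List<String>> tracks,
                                      String query, boolean tokenized) {
        List<String> hits = new ArrayList<>();
        for (Map.Entry<String, List<String>> entry : tracks.entrySet()) {
            Set<String> terms = tokenized
                    ? tokenizedTerms(entry.getValue())
                    : wholeTerms(entry.getValue());
            if (terms.contains(query)) {
                hits.add(entry.getKey());
            }
        }
        return hits;
    }

    // The two tracks and their categories as given in the message.
    public static Map<String, List<String>> sampleTracks() {
        Map<String, List<String>> tracks = new LinkedHashMap<>();
        tracks.put("METAL MAN", Arrays.asList("GUITAR", "ROCK", "ROCK AND ROLL", "METAL"));
        tracks.put("NOISE", Arrays.asList("GUITAR", "ROCK AND ROLL", "METAL"));
        return tracks;
    }

    public static void main(String[] args) {
        Map<String, List<String>> tracks = sampleTracks();
        System.out.println(search(tracks, "ROCK", true));           // [METAL MAN, NOISE]
        System.out.println(search(tracks, "ROCK", false));          // [METAL MAN]
        System.out.println(search(tracks, "ROCK AND ROLL", false)); // [METAL MAN, NOISE]
    }
}
```

With whole-category terms, ROCK matches only METAL MAN, while ROCK AND ROLL still matches both tracks. The query has to be kept whole as well, otherwise it is split into words again at search time.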

Re: Tokenize on another character

2008-03-31 Thread Erick Erickson
I'm confused about the use case you're trying to implement. Could you add a bit more explanation? In particular, do you ever want ROCK to match ROCK AND ROLL? If you want both, that is, some searches match partial keywords and some match entire keywords, I recommend you create a second field in your d

Tokenize on another character

2008-03-31 Thread fiaz.khan
Hello, I just joined the list and need some help. I have a database of music tracks. These tracks have been added to an index. They are classified using keywords, so a track can have up to 20 keywords assigned to it. I took the keywords and created a "keyword" FIELD which was not stored and tokeni