Re: Index and search terms containing character "-"

2009-05-31 Thread Erick Erickson
SimpleAnalyzer does two things: it splits text into tokens at non-letter characters and lowercases them. So in your test you've indexed the tokens "jack" and "bauer" for your second document; the hyphen is lost during tokenization, leaving two tokens for that document. Using the term query "ja
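
The tokenization described above can be sketched in plain Java without Lucene on the classpath. This is only an illustration of the split-on-non-letters-and-lowercase behaviour; the class SimpleAnalyzerSketch and its tokenize method are hypothetical names, not Lucene API.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical stand-in that mimics the described behaviour of SimpleAnalyzer.
// It is NOT the Lucene class, only a sketch of the same idea.
class SimpleAnalyzerSketch {

    // Breaks the text at every non-letter character and lowercases each token.
    static List<String> tokenize(String text) {
        List<String> tokens = new ArrayList<>();
        StringBuilder current = new StringBuilder();
        for (int i = 0; i < text.length(); i++) {
            char c = text.charAt(i);
            if (Character.isLetter(c)) {
                current.append(Character.toLowerCase(c));
            } else if (current.length() > 0) {
                // A non-letter (such as '-') ends the current token and is dropped.
                tokens.add(current.toString());
                current.setLength(0);
            }
        }
        if (current.length() > 0) {
            tokens.add(current.toString());
        }
        return tokens;
    }

    public static void main(String[] args) {
        System.out.println(tokenize("jack"));       // prints [jack]
        System.out.println(tokenize("jack-bauer")); // prints [jack, bauer]
    }
}
```

Under this behaviour no token "jack-bauer" ever reaches the index, while "jack" is present in both documents.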

Re: Index and search terms containing character "-"

2009-05-31 Thread balasubramanian sudaakeran
Hi Tom, You are using a SimpleAnalyzer while indexing, which transforms your string before it is indexed. If your analyzer transforms words during indexing, you should use the same or a similar analyzer when querying as well. Try the same with Keywor
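
The index/query mismatch can be illustrated with a toy inverted index in plain Java. Everything here (AnalyzerMismatchSketch, analyze, index, termLookup, matchAllTerms) is a hypothetical name invented for the sketch, not Lucene API. Requiring every analyzed token to match is also an assumption made for simplicity; a real query parser may combine the tokens differently.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.HashMap;
import java.util.List;
import java.util.Locale;
import java.util.Map;
import java.util.Set;
import java.util.TreeSet;

// Toy model of indexing with an analyzer and querying with or without it.
class AnalyzerMismatchSketch {

    // Assumed analysis step: lowercase, then split on runs of non-letters.
    static List<String> analyze(String text) {
        List<String> tokens = new ArrayList<>();
        for (String part : text.toLowerCase(Locale.ROOT).split("[^\\p{L}]+")) {
            if (!part.isEmpty()) {
                tokens.add(part);
            }
        }
        return tokens;
    }

    // Builds token -> document ids, analyzing every caption at index time.
    static Map<String, Set<Integer>> index(List<String> captions) {
        Map<String, Set<Integer>> postings = new HashMap<>();
        for (int docId = 0; docId < captions.size(); docId++) {
            for (String token : analyze(captions.get(docId))) {
                postings.computeIfAbsent(token, k -> new TreeSet<>()).add(docId);
            }
        }
        return postings;
    }

    // Exact lookup of the raw term, with no analysis (like a term query).
    static Set<Integer> termLookup(Map<String, Set<Integer>> postings, String term) {
        return postings.getOrDefault(term, Collections.<Integer>emptySet());
    }

    // Analyzes the query the same way as the index, then requires all tokens.
    static Set<Integer> matchAllTerms(Map<String, Set<Integer>> postings, String query) {
        Set<Integer> result = null;
        for (String token : analyze(query)) {
            Set<Integer> docs = termLookup(postings, token);
            if (result == null) {
                result = new TreeSet<>(docs);
            } else {
                result.retainAll(docs);
            }
        }
        return result == null ? new TreeSet<Integer>() : result;
    }
}
```

With the captions "jack" (document 0) and "jack-bauer" (document 1), termLookup for "jack-bauer" finds nothing and termLookup for "jack" finds both documents, which matches the reported symptoms. Running the query through the same analysis, as in matchAllTerms, finds only document 1.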

Index and search terms containing character "-"

2009-05-31 Thread legrand thomas
Hi, I have a problem using TermQuery and FuzzyQuery for terms containing the character "-". Having indexed "jack" and "jack-bauer" as two tokenized captions, I get no results when searching for "jack-bauer". Moreover, searching for "jack" with a TermQuery returns both captions. What should I do t