Re: exact match ..

Mufaddal Khumri Mon, 20 Feb 2006 10:04:37 -0800

Hi Steve,

If I understand you right, I could use something like the Keywordanalyzer to tokenize the entire stream as a single token and store thatin the index. I could definitely the keyword analyzer while indexingthis particular field "categoryNames".

Now my questions is on how to search and boost this since this is partof a bigger boolean query in my case.


My typical query actually looks like:

+(+content:digit +content:camera) +entity:product +(title:"digitcamera"~2^40.0 ((title:digit title:camera)^10.0) content:"digitcamera"~2^20.0 (content:digit content:camera) categoryNames:"digitcamera"^80.0)

As you can see i was trying to do a phrase query on the categoryNamesfield and boosting it by 80.0.Also I am using the potter stemming filter to stem while searching. (Ido this while indexing as well). If I go with the KeywordAnalyzerapproach I can index the categoryNames field using this analyzer .

Would I be using the QueryParser to create my query and specify thekeyword analyzer to it while searching on categoryNames ? (and then makethat query part of my global boolean query?)


-Thanks.





Steven Rowe wrote:

Mufaddal Khumri wrote:
lets say i do this while indexing:

doc.add(Field.Text("categoryNames", categoryNames));
Now while searching categoryNames, I do a search for "digitalcameras". I only want to match the exact phrase digital cameras withdocuments who have exactly the phrase "digital cameras" in thecategoryNames field. I do not want results that have "digital camerabatteries" part of the result.
Whats the best way to accomplish this?
Hi Mufaddal,
One way to do this is to use the KeywordAnalyzer (in the LuceneSubversion trunk, but not in v1.4.3; will be in forthcoming v1.9) forthe "categoryNames" field. This analyzer does not tokenize fieldcontents, so "digital cameras" would be a single token, and the onlything that would match it would be the exact same single token. Becareful when you search to construct the search tokens similarly.
If you have other fields you want to search, and you want to tokenizetheir contents when you index them, you could use thePerFieldAnalyzerWrapper, so that the KeywordAnalyzer is only used forthe "categoryNames" field.
Steve

---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]



---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Re: exact match ..

Reply via email to