> What do you mean by "first"? Would you want to process a doc thatdid NOT
> have a "Term 3"?
>
> Let's say you have the following:
> doc1: "Term 1"
> doc2: "Term 2"
> doc3: "Term 1" "Term 2"
> doc4: "Term 3"
> doc5: "Term 1" "Term 2" "Term 3"
> doc6: "Term 2" "Term 3"
>
> Which docs do you want to get from your search? And does order really
> matter?


I would want all of those.  What I wouldn't want would be

doc7: "Term 3"

I rank documents more highly based on how many of these terms they contain.
For example, my system ranks doc5 the highest in your example, then doc3,
then doc6. I don't need Term 3, but I need to have Term 1 or Term 2 before I
go looking for Term 3 to further organize my results.  Order doesn't matter,
they are put through a separate scoring system.  I am really just trying to
improve the performance a little bit since "Term 3" will, in my index, hit
on far more documents than Term 1 or Term 2, but I only care about the
documents where Term 1 or Term 2 were found first.

Thanks,
Max

Reply via email to