Hello All.

Say I have website that has articles on clothes, and I allow people to
tag each article. (Unlimited vocabulary). I wish to recognize which
words are being used as synonyms.

For example:
(I make the additional restriction that every person must tag the
article with two tags.)

Art1 (an opinion on shoes in winter)
     WinterSeason, Shoes
    : Shoes, Fashion
    : opinion, Winter, etc

Art2: (Shoes and Gloves)
    : shoes, gloves
   : accessories, gloves
    : footwear gloves, etc...

Art3: (Jackets in winter)
   :Jackets, winter, etc.


Now, if I were to do a search on "footwear", I would come up with
Art2, and not Art1.

Is there any algorithm that will recognise that a search of "footwear"
and "shoes" should return the same set.

I have a feeling that some kind of conditional probability calculation
should be used. i.e. P("footwear")|P("shoes") / P("footwear")

Thoughts, or any direction to go from here?

Regards,
   Timbo

--~--~---------~--~----~------------~-------~--~----~
You received this message because you are subscribed to the Google Groups 
"Algorithm Geeks" group.
To post to this group, send email to [email protected]
To unsubscribe from this group, send email to [EMAIL PROTECTED]
For more options, visit this group at http://groups.google.com/group/algogeeks
-~----------~----~----~----~------~----~------~--~---

Reply via email to