As others have said, there isn't an algorithm that does this. Useful results depend on precise definitions of "context" and "similarity." The waters get deep quickly.
As a clojure exercise, though, there are lots of good starting points. For instance: get a set of words, create all pairs from the set, run a google search on each pair, extract the count of documents from the results, use the counts as a distance between the words as nodes, and throw that in a graph. Something like this would make an interesting topology, and could be enhanced by using a different corpus, and/or swapping in different distance measurements. Though as described it would not tell you anything interesting semantically. For a slightly more sophisticated framing of the problem, look at NLP programming assignments, like "wordnet distance", e.g. http://www.cs.princeton.edu/courses/archive/spr07/cos226/assignments/wordnet.html Hope that helps. On Wed, Jul 28, 2010 at 4:58 PM, Daniel <doubleagen...@gmail.com> wrote: > I want to write a clojure program that searches for similarities of > words in the english language and places them in a graph, where the > distance between nodes indicates their similarity. I don't mean > syntactical similarity. Related contextual meaning is closer to the > mark. > > For instance: "fish" and "reel" don't have much similarity, but in the > context of fishing they do, so the distance in such a graph wouldn't > be very large. > > I'm sure research has been done in this area (I suspect with no small > portion belonging to google), so can anybody point me in the right > direction? > > Thanks. > > -- > You received this message because you are subscribed to the Google > Groups "Clojure" group. > To post to this group, send email to clojure@googlegroups.com > Note that posts from new members are moderated - please be patient with your > first post. > To unsubscribe from this group, send email to > clojure+unsubscr...@googlegroups.com > For more options, visit this group at > http://groups.google.com/group/clojure?hl=en > -- You received this message because you are subscribed to the Google Groups "Clojure" group. To post to this group, send email to clojure@googlegroups.com Note that posts from new members are moderated - please be patient with your first post. To unsubscribe from this group, send email to clojure+unsubscr...@googlegroups.com For more options, visit this group at http://groups.google.com/group/clojure?hl=en