As others have said, there isn't an algorithm that does this. Useful
results depend on precise definitions of "context" and "similarity."
The waters get deep quickly.

As a clojure exercise, though, there are lots of good starting points.
For instance: get a set of words, create all pairs from the set, run a
google search on each pair, extract the count of documents from the
results, use the counts as a distance between the words as nodes, and
throw that in a graph.

Something like this would make an interesting topology, and could be
enhanced by using a different corpus, and/or swapping in different
distance measurements. Though as described it would not tell you
anything interesting semantically.

For a slightly more sophisticated framing of the problem, look at NLP
programming assignments, like "wordnet distance", e.g.

http://www.cs.princeton.edu/courses/archive/spr07/cos226/assignments/wordnet.html

Hope that helps.

On Wed, Jul 28, 2010 at 4:58 PM, Daniel <doubleagen...@gmail.com> wrote:
> I want to write a clojure program that searches for similarities of
> words in the english language and places them in a graph, where the
> distance between nodes indicates their similarity.  I don't mean
> syntactical similarity.  Related contextual meaning is closer to the
> mark.
>
> For instance: "fish" and "reel" don't have much similarity, but in the
> context of fishing they do, so the distance in such a graph wouldn't
> be very large.
>
> I'm sure research has been done in this area (I suspect with no small
> portion belonging to google), so can anybody point me in the right
> direction?
>
> Thanks.
>
> --
> You received this message because you are subscribed to the Google
> Groups "Clojure" group.
> To post to this group, send email to clojure@googlegroups.com
> Note that posts from new members are moderated - please be patient with your 
> first post.
> To unsubscribe from this group, send email to
> clojure+unsubscr...@googlegroups.com
> For more options, visit this group at
> http://groups.google.com/group/clojure?hl=en
>

-- 
You received this message because you are subscribed to the Google
Groups "Clojure" group.
To post to this group, send email to clojure@googlegroups.com
Note that posts from new members are moderated - please be patient with your 
first post.
To unsubscribe from this group, send email to
clojure+unsubscr...@googlegroups.com
For more options, visit this group at
http://groups.google.com/group/clojure?hl=en

Reply via email to