Re: Calculate Term Co-occurrence Matrix

2010-08-21 Thread Ivan Provalov
Ahmed, FYI, I updated the term collocations package I mentioned earlier with a few fixes and changes that make it work with Lucene 3.0.2. This may help with your task. See: https://issues.apache.org/jira/browse/LUCENE-474 Thanks, Ivan Provalov

Re: Calculate Term Co-occurrence Matrix

2010-08-21 Thread Otis Gospodnetic
Ahmed, that's what KPE (link in my previous email, below) will do for you. It's not open source at this time, but that is exactly one of the things it does. I think the Mahout collocations stuff might work for you, too. Otis Sematext :: http://sematext.com/ :: Solr - Lucene - Nutch

Re: Calculate Term Co-occurrence Matrix

2010-08-21 Thread ahmed algohary
Thanks for all your answers! It seems I did not make my question clear. I have a text corpus, and I need to find the pairs of words that occur together in many documents. I need this to measure the semantic proximity between words. This method is expanded here
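
A minimal sketch of the document-level counting described above, assuming whitespace tokenization and a small in-memory corpus. Both are assumptions made for illustration, not something stated in the thread; the function name `cooccurrence_counts` and the toy corpus are hypothetical. For each unordered word pair it counts the number of documents that contain both words.

```python
from collections import Counter
from itertools import combinations


def cooccurrence_counts(documents):
    """Count, for each unordered word pair, how many documents contain both words."""
    pair_counts = Counter()
    for text in documents:
        # Distinct lowercase tokens, so a pair is counted at most once per document.
        # Sorting gives every pair a canonical (alphabetical) order.
        terms = sorted(set(text.lower().split()))
        pair_counts.update(combinations(terms, 2))
    return pair_counts


# Hypothetical toy corpus, for illustration only.
corpus = [
    "lucene indexes text",
    "lucene searches text",
    "mahout clusters text",
]

counts = cooccurrence_counts(corpus)

# Keep only the pairs that appear together in at least two documents.
frequent_pairs = {pair: n for pair, n in counts.items() if n >= 2}
```

The work per document grows with the square of its number of distinct terms, so for a real corpus it would likely make sense to drop stop words and rare terms first, or to compute the counts from an existing index instead of raw text.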