There is also a non-Mahout Key Phrase Extractor for Collocations, SIPs, and a few other things: http://sematext.com/products/key-phrase-extractor/index.html
One of the demos that uses news data is at http://sematext.com/demo/kpe/index.html Otis ---- Sematext :: http://sematext.com/ :: Solr - Lucene - Nutch Lucene ecosystem search :: http://search-lucene.com/ ----- Original Message ---- > From: Grant Ingersoll <gsing...@apache.org> > To: java-user@lucene.apache.org > Sent: Fri, August 20, 2010 8:52:17 AM > Subject: Re: Calculate Term Co-occurrence Matrix > > You might also be interested in Mahout's collocations package: >http://cwiki.apache.org/confluence/display/MAHOUT/Collocations > > -Grant > On Aug 19, 2010, at 11:39 AM, ahmed algohary wrote: > > > Hi all, > > > > I need to know if there is a Lucene plug-in or a Lucene-based API for > > calculating the term co-occurrence matrix for a given text corpus. > > > > Thanks! > > > > -- > > Ahmed > > -------------------------- > Grant Ingersoll > http://www.lucidimagination.com/ > > Search the Lucene ecosystem using Solr/Lucene: >http://www.lucidimagination.com/search > > > --------------------------------------------------------------------- > To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org > For additional commands, e-mail: java-user-h...@lucene.apache.org > > --------------------------------------------------------------------- To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org For additional commands, e-mail: java-user-h...@lucene.apache.org