Re: Keyphrase Extraction (via Lingo)

2007-05-09 Thread Bill Janssen
> Dawid Weiss wrote: > > You could also try splitting the document into paragraphs and use Carrot2's > > Lingo algorithm (www.carrot2.org) on a paragraph-level to extract clusters. > > Labelling routine in Lingo should extract 'key' phrases; this analysis is > > heavily frequency-based, but... y

Re: Keyphrase Extraction

2007-05-08 Thread Bob Carpenter
Mark Miller wrote: The only commercial options that I have seen do not have a web presence (that I know of or can find) and I don't recall the company names (only peripherally involved). Are we talking about Yahoo's buzz index and Amazon's SIPs or CAPs? I actually think the most interesting a

Re: Keyphrase Extraction

2007-05-08 Thread José Ramón Pérez Agüera
here you have a very good tool for Keyphrase Extraction. It is GNU and easy to integrate in Lucene. http://www.paynter.info/academia/Kea.php best jose On 5/8/07, Bill Janssen <[EMAIL PROTECTED]> wrote: Dawid Weiss wrote: > You could also try splitting the document into paragraph

RE: Keyphrase Extraction

2007-05-08 Thread Vishal Shah
Hi Arsen, I've seen another commercial one from a company called Connexor (www.connexor.com) . It has a decent part-of-speech tagger that could be used in keyphrase extraction with some heuristics on top of it. -vishal. -Original Message- From: Mark Miller [mailto:[EMAIL PROT

Re: Keyphrase Extraction

2007-05-08 Thread Bill Janssen
Dawid Weiss wrote: > You could also try splitting the document into paragraphs and use Carrot2's > Lingo algorithm (www.carrot2.org) on a paragraph-level to extract clusters. > Labelling routine in Lingo should extract 'key' phrases; this analysis is > heavily frequency-based, but... you know, y

Re: Keyphrase Extraction

2007-05-07 Thread Mark Miller
nce. Cheers Mark - Original Message From: Otis Gospodnetic <[EMAIL PROTECTED]> To: java-user@lucene.apache.org Sent: Monday, 30 April, 2007 4:11:36 AM Subject: Re: Keyphrase Extraction Av, look at Lucene's JIRA and search for Mark Harwood. I believe he once contributed so

Re: Keyphrase Extraction

2007-05-06 Thread Otis Gospodnetic
t; To: java-user@lucene.apache.org Sent: Sunday, May 6, 2007 10:57:42 PM Subject: Re: Keyphrase Extraction Hi Mark, Do you know of a good paid product that does this? Thanks, Arsen - Original Message From: Mark Miller <[EMAIL PROTECTED]> To: java-user@lucene.apache.org Sent: Wednesday,

Re: Keyphrase Extraction

2007-05-06 Thread [EMAIL PROTECTED]
Hi Mark, Do you know of a good paid product that does this? Thanks, Arsen - Original Message From: Mark Miller <[EMAIL PROTECTED]> To: java-user@lucene.apache.org Sent: Wednesday, May 2, 2007 7:52:36 AM Subject: Re: Keyphrase Extraction >From what I know you generally have

Re: Keyphrase Extraction

2007-05-06 Thread Dawid Weiss
ation of phrase significance. Cheers Mark - Original Message From: Otis Gospodnetic <[EMAIL PROTECTED]> To: java-user@lucene.apache.org Sent: Monday, 30 April, 2007 4:11:36 AM Subject: Re: Keyphrase Extraction Av, look at Lucene's JIRA and search for Mark Harwood. I believe he once

Re: Keyphrase Extraction

2007-05-02 Thread Mark Miller
- Original Message From: Otis Gospodnetic <[EMAIL PROTECTED]> To: java-user@lucene.apache.org Sent: Monday, 30 April, 2007 4:11:36 AM Subject: Re: Keyphrase Extraction Av, look at Lucene's JIRA and search for Mark Harwood. I believe he once contributed something that does this in JI

Re: Keyphrase Extraction

2007-04-30 Thread mark harwood
. Cheers Mark - Original Message From: Otis Gospodnetic <[EMAIL PROTECTED]> To: java-user@lucene.apache.org Sent: Monday, 30 April, 2007 4:11:36 AM Subject: Re: Keyphrase Extraction Av, look at Lucene's JIRA and search for Mark Harwood. I believe he once contributed something tha

Re: Keyphrase Extraction

2007-04-29 Thread Otis Gospodnetic
lting - http://lucene-consulting.com/ - Original Message From: "[EMAIL PROTECTED]" <[EMAIL PROTECTED]> To: java-user@lucene.apache.org Sent: Sunday, April 29, 2007 5:24:17 PM Subject: Keyphrase Extraction Hi, I tried using MoreLikeThis contrib feature to extract "

Keyphrase Extraction

2007-04-29 Thread [EMAIL PROTECTED]
Hi, I tried using MoreLikeThis contrib feature to extract "interesting terms" from a document. This works very well - but only for SINGLE words. I am looking for a way to extra "keyPHRASES" from a document. Is there an easy way to achieve this using Lucene index? Thanks in advance! Av ___