Hi Arsen, I've seen another commercial one from a company called Connexor (www.connexor.com) . It has a decent part-of-speech tagger that could be used in keyphrase extraction with some heuristics on top of it.
-vishal. -----Original Message----- From: Mark Miller [mailto:[EMAIL PROTECTED] Sent: Tuesday, May 08, 2007 4:04 AM To: java-user@lucene.apache.org Subject: Re: Keyphrase Extraction The only commercial options that I have seen do not have a web presence (that I know of or can find) and I don't recall the company names (only peripherally involved). Here is a web page where a guy does a nice writeup on a few options: http://dsanalytics.com/dsblog/the-start-of-the-art-in-keyphrase-extraction_9 9 - Mark [EMAIL PROTECTED] wrote: > Hi Mark, > > Do you know of a good paid product that does this? > > Thanks, > Arsen > > > ----- Original Message ---- > From: Mark Miller <[EMAIL PROTECTED]> > To: java-user@lucene.apache.org > Sent: Wednesday, May 2, 2007 7:52:36 AM > Subject: Re: Keyphrase Extraction > > > >From what I know you generally have to pay if you want something that > does this really well. Or check out http://www.nzdl.org/Kea/ > Unfortunately, the license is GPL. Really too bad; now that it is all > Java, it would make a great combo with Lucene. > > - Mark > > mark harwood wrote: > >> I believe the code Otis is referring to is here: http://issues.apache.org/jira/browse/LUCENE-474 >> >> This is index-level analysis but could be adapted to work for just a single document. >> The implementation is optimised for speed rather than being a thorough examination of phrase significance. >> >> Cheers >> Mark >> >> ----- Original Message ---- >> From: Otis Gospodnetic <[EMAIL PROTECTED]> >> To: java-user@lucene.apache.org >> Sent: Monday, 30 April, 2007 4:11:36 AM >> Subject: Re: Keyphrase Extraction >> >> Av, look at Lucene's JIRA and search for Mark Harwood. I believe he once contributed something that does this in JIRA. If you are interested in a commercial solution, I can recommend LingPipe. >> >> Otis >> . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . >> Lucene Consulting - http://lucene-consulting.com/ >> >> >> ----- Original Message ---- >> From: "[EMAIL PROTECTED]" <[EMAIL PROTECTED]> >> To: java-user@lucene.apache.org >> Sent: Sunday, April 29, 2007 5:24:17 PM >> Subject: Keyphrase Extraction >> >> Hi, >> >> I tried using MoreLikeThis contrib feature to extract "interesting terms" from a document. This works very well - but only for SINGLE words. >> >> I am looking for a way to extra "keyPHRASES" from a document. Is there an easy way to achieve this using Lucene index? >> >> Thanks in advance! >> Av >> >> __________________________________________________ >> Do You Yahoo!? >> Tired of spam? Yahoo! Mail has the best spam protection around >> http://mail.yahoo.com >> >> >> >> --------------------------------------------------------------------- >> To unsubscribe, e-mail: [EMAIL PROTECTED] >> For additional commands, e-mail: [EMAIL PROTECTED] >> >> >> >> >> >> >> ___________________________________________________________ >> Yahoo! Answers - Got a question? Someone out there knows the answer. Try it >> now. >> http://uk.answers.yahoo.com/ >> >> --------------------------------------------------------------------- >> To unsubscribe, e-mail: [EMAIL PROTECTED] >> For additional commands, e-mail: [EMAIL PROTECTED] >> >> >> >> > > --------------------------------------------------------------------- > To unsubscribe, e-mail: [EMAIL PROTECTED] > For additional commands, e-mail: [EMAIL PROTECTED] > > > > ____________________________________________________________________________ ________ > Don't pick lemons. > See all the new 2007 cars at Yahoo! Autos. > http://autos.yahoo.com/new_cars.html > --------------------------------------------------------------------- To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED] --------------------------------------------------------------------- To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]