Lucene with Semantic Vectors (LSA, LSI, LDA)?

2016-07-15 Thread Mila88
Hello, I have a project which indexes and scores documents using Lucene. However, I'd like to do that using semantic indexing (LSI, LSA, LDA or Semantic Vectors). I've read old posts and some people said that Semantic Vectors plays well with Lucene. However, I noticed that its c

Re: LSI

2011-08-29 Thread Grant Ingersoll
It doesn't look too active, but you might check out: https://github.com/algoriffic/lsa4solr Otherwise, Mahout has some of the basis for it but you would have to do some work. On Aug 29, 2011, at 9:55 AM, zarrinkalam wrote: > hi, > > I want to use LSI for clustring ducument

Re: LSI

2011-08-29 Thread Paul Libbrecht
Zarrinkalam, have a look at semanticvectors. paul Le 29 août 2011 à 15:55, zarrinkalam a écrit : > hi, > > I want to use LSI for clustring ducuments indexed with lucene, I dont know > how, plz help me > > thanks, --

LSI

2011-08-29 Thread zarrinkalam
hi, I want to use LSI for clustring ducuments indexed with lucene, I dont know how, plz help me thanks,

Re: lsi as indexing algorithm with lucene

2009-04-27 Thread adasal
> > If it is possible could you post a link where everybody can reach the > > thesis of your student? > > I guess it could be interesting for a couple of people on this list > > and a benefit for your student as well. > > > > simon > > > > > > Joi

Re: lsi as indexing algorithm with lucene

2009-04-26 Thread Dominik Jednoralski
ll. > > simon > > > > Join the semanticVectors' list where the original author also talks. > > > > paul > > > > > > Le 18-mars-09 à 08:34, nitin gopi a écrit : > > > >> hi Paul, I am new to this field of search engine. My aim is t

Re: lsi as indexing algorithm with lucene

2009-03-18 Thread Marvin Humphrey
On Wed, Mar 18, 2009 at 08:09:33AM +0100, Paul Libbrecht wrote: > LSI is patented so it's not been a flurry of implementation attempts. Hasn't the original patent expired? http://mail.python.org/pipermail/python-list/2007-July/621547.html Ma

Re: lsi as indexing algorithm with lucene

2009-03-18 Thread Simon Willnauer
gt; Join the semanticVectors' list where the original author also talks. > > paul > > > Le 18-mars-09 à 08:34, nitin gopi a écrit : > >> hi Paul, I am new to this field of search engine. My aim is to develop >> a semantic search engine. Initially  I was trying

Re: lsi as indexing algorithm with lucene

2009-03-18 Thread Paul Libbrecht
where the original author also talks. paul Le 18-mars-09 à 08:34, nitin gopi a écrit : hi Paul, I am new to this field of search engine. My aim is to develop a semantic search engine. Initially I was trying to develop that by using LSI. But since it is patented that is why there a

Re: lsi as indexing algorithm with lucene

2009-03-18 Thread nitin gopi
hi Paul, I am new to this field of search engine. My aim is to develop a semantic search engine. Initially I was trying to develop that by using LSI. But since it is patented that is why there are no many implementation attempts. I want to ask is it possible to create a search engine using

Re: lsi as indexing algorithm with lucene

2009-03-18 Thread Paul Libbrecht
Nitin, LSI is patented so it's not been a flurry of implementation attempts. However, SemanticVectors is a library that does similar approaches to LSA/LSI for indexing and is based on Lucene's term-vectors. paul Le 18-mars-09 à 07:09, nitin gopi a écrit : hi all , has any bod

lsi as indexing algorithm with lucene

2009-03-17 Thread nitin gopi
hi all , has any body tried to use LSI(latent semantic indexing) for indexing in lucene? - To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org For additional commands, e-mail: java-user-h...@lucene.apache.org

Re: Stable score scaling; LSI again

2008-07-15 Thread Asad Sayeed
cc PM Subject Stable score scaling; LSI again Please respond to

Stable score scaling; LSI again

2008-07-14 Thread Asad Sayeed
from Lucene are being fed to another analyzer is why, so when the "identity" score changes by adding docs to the index, it messes up the rest of the processing. The other question I had was about scoring via Latent Semantic Indexing. I read in the archives of this list from way back

Re: Using Lucene to apply LSI

2007-04-03 Thread Faizan Ahmed
José Ramón Pérez Agüera wrote: you need to use JAMA combined with Lucene, using the vectors that are builded by lucene to compute SVD with JAMA http://math.nist.gov/javanumerics/jama/ Thanks for your help. I am new to Lucene and do not know how to build a vector. I will have my input data s

RE: Using Lucene to apply LSI

2007-03-28 Thread José Ramón Pérez Agüera
you need to use JAMA combined with Lucene, using the vectors that are builded by lucene to compute SVD with JAMA http://math.nist.gov/javanumerics/jama/ Best jose José Ramón Pérez Agüera Dept. de Ingeniería del Software e Inteligencia Artificial Despacho 411 tlf. 913947599 Facultad de Inform

Re: Using Lucene to apply LSI

2007-03-28 Thread José Ramón Pérez Agüera
lement LSI technique. The scenario is something as follows. Search the news sites for the locally event based news. Cluster the similar news items together. For example hurricane in New York city. We want to apply basic LSI as follows -Key word extraction -Filter using stop list -St

Using Lucene to apply LSI

2007-03-28 Thread Mark Stiner
Hi, I have a research project where I want to implement LSI technique. The scenario is something as follows. Search the news sites for the locally event based news. Cluster the similar news items together. For example hurricane in New York city. We want to apply basic LSI

Re: Lucene + LSI

2005-12-20 Thread Chandana
> On 12/13/05, Dave Kor <[EMAIL PROTECTED]> wrote: > > > On 12/13/05, Ian Soboroff <[EMAIL PROTECTED]> wrote: > > > > Paul Libbrecht < [EMAIL PROTECTED]> writes: > > > > > > > > > We're also thinking about implementing something

Re: Lucene + LSI

2005-12-20 Thread Chandana
t; > > > We're also thinking about implementing something similar to LSI > within > > > > ActiveMath which is lucene-powered where both formulae and text > > > > searching would benefit of the latent-semantic-similarity. I've been > > > > ref

Re: Lucene + LSI

2005-12-14 Thread adasal
t; > > > > > We're also thinking about implementing something similar to LSI > within > > > > ActiveMath which is lucene-powered where both formulae and text > > > > searching would benefit of the latent-semantic-similarity. I've been > > &g

Re: Lucene + LSI

2005-12-13 Thread Dave Kor
On 12/13/05, Dave Kor <[EMAIL PROTECTED]> wrote: > On 12/13/05, Ian Soboroff <[EMAIL PROTECTED]> wrote: > > Paul Libbrecht <[EMAIL PROTECTED]> writes: > > > > > We're also thinking about implementing something similar to LSI within > > > Ac

Re: Lucene + LSI

2005-12-13 Thread adasal
: > > There seem to be quite a few alternatives around. I would be interested > in > > comments on the following:- > > The work at NITLE <http://www.nitle.org/tools/semantic/search.htm> > > using Contextual > > Network Search (CNS) a graph-based alternativ

Re: Lucene + LSI

2005-12-13 Thread Sebastian Marius Kirsch
h (CNS) a graph-based alternative to LSI. Hi, perhaps I can say a few words about CNS. I evaluated spreading activation search (which is what CNS is based on) for my master's thesis. I must conclude that SA is a rather fickle method -- if you read Preece's thesis, you will see that it can b

Re: Lucene + LSI

2005-12-13 Thread adasal
There seem to be quite a few alternatives around. I would be interested in comments on the following:- The work at NITLE <http://www.nitle.org/tools/semantic/search.htm> using Contextual Network Search (CNS) a graph-based alternative to LSI. This work *[PDF]* An Introduction to *Random* In

Re: Lucene + LSI

2005-12-12 Thread Dave Kor
On 12/13/05, Ian Soboroff <[EMAIL PROTECTED]> wrote: > Paul Libbrecht <[EMAIL PROTECTED]> writes: > > > We're also thinking about implementing something similar to LSI within > > ActiveMath which is lucene-powered where both formulae and text > > searc

Re: Lucene + LSI

2005-12-12 Thread Ian Soboroff
Paul Libbrecht <[EMAIL PROTECTED]> writes: > We're also thinking about implementing something similar to LSI within > ActiveMath which is lucene-powered where both formulae and text > searching would benefit of the latent-semantic-similarity. I've been > refrained of d

Re: Lucene + LSI

2005-12-01 Thread Paul Libbrecht
We're also thinking about implementing something similar to LSI within ActiveMath which is lucene-powered where both formulae and text searching would benefit of the latent-semantic-similarity. I've been refrained of doing "exactly this" at least since LSI is patented. T

Re: Lucene + LSI

2005-11-30 Thread rrshwrk
Yeah, its Latent Semantic Indexing On 12/1/05, gekkokid <[EMAIL PROTECTED]> wrote: > > sorry have to ask - whats LSI - " latent semantic indexing"? > > _gk > - Original Message - > From: "Lorenzo Viscanti" <[EMAIL PROTECTED]> > To: ; &l

Re: Lucene + LSI

2005-11-30 Thread gekkokid
sorry have to ask - whats LSI - " latent semantic indexing"? _gk - Original Message - From: "Lorenzo Viscanti" <[EMAIL PROTECTED]> To: ; <[EMAIL PROTECTED]> Sent: Thursday, December 01, 2005 12:02 AM Subject: Re: Lucene + LSI It depends on the kind of

Re: Lucene + LSI

2005-11-30 Thread Lorenzo Viscanti
It depends on the kind of implementation you are thinking of. You can use Lucene to create the inputs to the LSI, and then use them in your own system. I've written that code and it works, for searches and clustering. But if you are figuring out an LSI enhanced Lucene search system (based

Lucene + LSI

2005-11-30 Thread Chandana
Have any one implemented LSI in Lucene? Kindly let me know how hard/easy it is. thanks chandana

Re: Regarding Lucene and LSI

2005-10-08 Thread adasal
icle about which has been written by Maciej Ceglowski who is (largely) responsible for the LSI/ContextGraph implementation at NITLE. Article Using Bloom Filters on Perl.com<http://Perl.com>. Bloom filters look a bit like the random index from Sahlgren I have mentioned. Much to do! Adam

Re: Regarding Lucene and LSI

2005-10-07 Thread Lorenzo Viscanti
I use my own LSI implementation based on Lucene for text clustering. I've done some tests, but I do believe that integrating LSI onto the lucene search subsystem (i.e. creating something like LSISimilarity) is not an easy task I start analyzing the documents using Lucene, and then extract

Re: Regarding Lucene and LSI

2005-10-07 Thread Paul Libbrecht
I've met other persons with such needs and we would also be interested. Unfortunately, this seems not to be available. A clear issue might be that LSI, in its original form at least, is covered by an US patent. But maybe someone finds another form which is not. paul Le 5 oct. 05, à

Re: Regarding Lucene and LSI

2005-10-05 Thread adasal
adasal <[EMAIL PROTECTED]> wrote: > > > > There are LSI and LSI like implementations in open source, but I am > > unsure what the state of play is in Java. The most interesting work I know > > of is in Perl available (cvs permitting) from > > NITLE.<http://www

Re: Regarding Lucene and LSI

2005-10-05 Thread adasal
There are LSI and LSI like implementations in open source, but I am unsure what the state of play is in Java. The most interesting work I know of is in Perl available (cvs permitting) from NITLE.<http://www.nitle.org/tools/semantic/search.htm> What I would like to see is an implementat

Regarding Lucene and LSI

2005-10-05 Thread rrshwrk
Hi all, I am looking for LSI implementation i lucene. Is it available. I couldnt find it in the website. I searched in the archives but no help. could some one tell me if it is available or not. Could you tell me where can i see to find if there are any Language processing tools for Indexing and