Hello,
I have a project which indexes and scores documents using Lucene. However,
I'd like to do that using semantic indexing (LSI, LSA, LDA or Semantic
Vectors).
I've read old posts and some people said that Semantic Vectors plays well
with Lucene. However, I noticed that its c
It doesn't look too active, but you might check out:
https://github.com/algoriffic/lsa4solr
Otherwise, Mahout has some of the basis for it but you would have to do some
work.
On Aug 29, 2011, at 9:55 AM, zarrinkalam wrote:
> hi,
>
> I want to use LSI for clustring ducument
Zarrinkalam,
have a look at semanticvectors.
paul
Le 29 août 2011 à 15:55, zarrinkalam a écrit :
> hi,
>
> I want to use LSI for clustring ducuments indexed with lucene, I dont know
> how, plz help me
>
> thanks,
--
hi,
I want to use LSI for clustring ducuments indexed with lucene, I dont know how,
plz help me
thanks,
> > If it is possible could you post a link where everybody can reach the
> > thesis of your student?
> > I guess it could be interesting for a couple of people on this list
> > and a benefit for your student as well.
> >
> > simon
> > >
> > > Joi
ll.
>
> simon
> >
> > Join the semanticVectors' list where the original author also talks.
> >
> > paul
> >
> >
> > Le 18-mars-09 à 08:34, nitin gopi a écrit :
> >
> >> hi Paul, I am new to this field of search engine. My aim is t
On Wed, Mar 18, 2009 at 08:09:33AM +0100, Paul Libbrecht wrote:
> LSI is patented so it's not been a flurry of implementation attempts.
Hasn't the original patent expired?
http://mail.python.org/pipermail/python-list/2007-July/621547.html
Ma
gt; Join the semanticVectors' list where the original author also talks.
>
> paul
>
>
> Le 18-mars-09 à 08:34, nitin gopi a écrit :
>
>> hi Paul, I am new to this field of search engine. My aim is to develop
>> a semantic search engine. Initially I was trying
where the original author also talks.
paul
Le 18-mars-09 à 08:34, nitin gopi a écrit :
hi Paul, I am new to this field of search engine. My aim is to develop
a semantic search engine. Initially I was trying to develop that by
using LSI. But since it is patented that is why there a
hi Paul, I am new to this field of search engine. My aim is to develop
a semantic search engine. Initially I was trying to develop that by
using LSI. But since it is patented that is why there are no many
implementation attempts. I want to ask is it possible to create a
search engine using
Nitin,
LSI is patented so it's not been a flurry of implementation attempts.
However, SemanticVectors is a library that does similar approaches to
LSA/LSI for indexing and is based on Lucene's term-vectors.
paul
Le 18-mars-09 à 07:09, nitin gopi a écrit :
hi all , has any bod
hi all , has any body tried to use LSI(latent semantic indexing) for
indexing in lucene?
-
To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org
For additional commands, e-mail: java-user-h...@lucene.apache.org
cc
PM
Subject
Stable score scaling; LSI again
Please respond to
from
Lucene are being fed to another analyzer is why, so when the "identity"
score changes by adding docs to the index, it messes up the rest of the
processing.
The other question I had was about scoring via Latent Semantic Indexing. I
read in the archives of this list from way back
José Ramón Pérez Agüera wrote:
you need to use JAMA combined with Lucene, using the vectors that are builded
by lucene to compute SVD with JAMA
http://math.nist.gov/javanumerics/jama/
Thanks for your help. I am new to Lucene and do not know how to build a
vector. I will have my input data s
you need to use JAMA combined with Lucene, using the vectors that are builded
by lucene to compute SVD with JAMA
http://math.nist.gov/javanumerics/jama/
Best
jose
José Ramón Pérez Agüera
Dept. de Ingeniería del Software e Inteligencia Artificial
Despacho 411 tlf. 913947599
Facultad de Inform
lement LSI technique. The
scenario is something as follows.
Search
the news sites for the locally event based news. Cluster the similar
news items together. For example hurricane in New York city.
We want to apply basic LSI as follows
-Key word extraction
-Filter using stop list
-St
Hi,
I have a research project where I want to implement LSI technique. The scenario
is something as follows.
Search
the news sites for the locally event based news. Cluster the similar
news items together. For example hurricane in New York city.
We want to apply basic LSI
> On 12/13/05, Dave Kor <[EMAIL PROTECTED]> wrote:
> > > On 12/13/05, Ian Soboroff <[EMAIL PROTECTED]> wrote:
> > > > Paul Libbrecht < [EMAIL PROTECTED]> writes:
> > > >
> > > > > We're also thinking about implementing something
t; > > > We're also thinking about implementing something similar to LSI
> within
> > > > ActiveMath which is lucene-powered where both formulae and text
> > > > searching would benefit of the latent-semantic-similarity. I've been
> > > > ref
t; >
> > > > We're also thinking about implementing something similar to LSI
> within
> > > > ActiveMath which is lucene-powered where both formulae and text
> > > > searching would benefit of the latent-semantic-similarity. I've been
> > &g
On 12/13/05, Dave Kor <[EMAIL PROTECTED]> wrote:
> On 12/13/05, Ian Soboroff <[EMAIL PROTECTED]> wrote:
> > Paul Libbrecht <[EMAIL PROTECTED]> writes:
> >
> > > We're also thinking about implementing something similar to LSI within
> > > Ac
:
> > There seem to be quite a few alternatives around. I would be interested
> in
> > comments on the following:-
> > The work at NITLE <http://www.nitle.org/tools/semantic/search.htm>
> > using Contextual
> > Network Search (CNS) a graph-based alternativ
h (CNS) a graph-based alternative to LSI.
Hi,
perhaps I can say a few words about CNS.
I evaluated spreading activation search (which is what CNS is based
on) for my master's thesis. I must conclude that SA is a rather fickle
method -- if you read Preece's thesis, you will see that it can b
There seem to be quite a few alternatives around. I would be interested in
comments on the following:-
The work at NITLE <http://www.nitle.org/tools/semantic/search.htm>
using Contextual
Network Search (CNS) a graph-based alternative to LSI.
This work *[PDF]* An Introduction to *Random*
In
On 12/13/05, Ian Soboroff <[EMAIL PROTECTED]> wrote:
> Paul Libbrecht <[EMAIL PROTECTED]> writes:
>
> > We're also thinking about implementing something similar to LSI within
> > ActiveMath which is lucene-powered where both formulae and text
> > searc
Paul Libbrecht <[EMAIL PROTECTED]> writes:
> We're also thinking about implementing something similar to LSI within
> ActiveMath which is lucene-powered where both formulae and text
> searching would benefit of the latent-semantic-similarity. I've been
> refrained of d
We're also thinking about implementing something similar to LSI within
ActiveMath which is lucene-powered where both formulae and text
searching would benefit of the latent-semantic-similarity. I've been
refrained of doing "exactly this" at least since LSI is patented. T
Yeah, its Latent Semantic Indexing
On 12/1/05, gekkokid <[EMAIL PROTECTED]> wrote:
>
> sorry have to ask - whats LSI - " latent semantic indexing"?
>
> _gk
> - Original Message -
> From: "Lorenzo Viscanti" <[EMAIL PROTECTED]>
> To: ; &l
sorry have to ask - whats LSI - " latent semantic indexing"?
_gk
- Original Message -
From: "Lorenzo Viscanti" <[EMAIL PROTECTED]>
To: ; <[EMAIL PROTECTED]>
Sent: Thursday, December 01, 2005 12:02 AM
Subject: Re: Lucene + LSI
It depends on the kind of
It depends on the kind of implementation you are thinking of.
You can use Lucene to create the inputs to the LSI, and then use them in
your own system. I've written that code and it works, for searches and
clustering.
But if you are figuring out an LSI enhanced Lucene search system (based
Have any one implemented LSI in Lucene?
Kindly let me know how hard/easy it is.
thanks
chandana
icle about which has been written
by Maciej Ceglowski who is (largely) responsible for the LSI/ContextGraph
implementation at NITLE. Article Using Bloom Filters on
Perl.com<http://Perl.com>.
Bloom filters look a bit like the random index from Sahlgren I have
mentioned.
Much to do!
Adam
I use my own LSI implementation based on Lucene for text clustering.
I've done some tests, but I do believe that integrating LSI onto the lucene
search subsystem (i.e. creating something like LSISimilarity) is not an easy
task
I start analyzing the documents using Lucene, and then extract
I've met other persons with such needs and we would also be interested.
Unfortunately, this seems not to be available.
A clear issue might be that LSI, in its original form at least, is
covered by an US patent. But maybe someone finds another form which is
not.
paul
Le 5 oct. 05, à
adasal <[EMAIL PROTECTED]> wrote:
> >
> > There are LSI and LSI like implementations in open source, but I am
> > unsure what the state of play is in Java. The most interesting work I know
> > of is in Perl available (cvs permitting) from
> > NITLE.<http://www
There are LSI and LSI like implementations in open source, but I am unsure
what the state of play is in Java. The most interesting work I know of is in
Perl available (cvs permitting) from
NITLE.<http://www.nitle.org/tools/semantic/search.htm>
What I would like to see is an implementat
Hi all,
I am looking for LSI implementation i lucene. Is it available. I couldnt
find it in the website. I searched in the archives but no help. could some
one tell me if it is available or not.
Could you tell me where can i see to find if there are any Language
processing tools for Indexing and
38 matches
Mail list logo