every hit.
Your diractory where your indexed docs are saved represents the vector space
model you want to bild. If you want to see how lucene computes the score
result, you can use the class explanation and similarity in lucene Api and you
will see that lucene deals with the documents and querys
On Jun 29, 2009, at 3:10 PM, Amir Hossein Jadidinejad wrote:
Hi,
It's my first experiment with Lucene. Please help me.
I'm going to index a set of documents and create a feature vector
for each of them. This vector contains all terms belong to the
document that weight using TFIDF.
After tha
Hi,
It's my first experiment with Lucene. Please help me.
I'm going to index a set of documents and create a feature vector for each of
them. This vector contains all terms belong to the document that weight using
TFIDF.
After that I want to compute the cosine similarity between all documents and
SM in "ieee-sw-rank" is
more precise in practice by lucene?
If so, it's an useful experiment.
2008/2/27, Dharmalingam <[EMAIL PROTECTED]>:
>
>
> Hi List,
>
> I am pretty new to Lucene. Certainly, it is very exciting. I need to
> implement a new Similarit
You can find those variants of the vector space model in this interesting
article:
http://ieeexplore.ieee.org/iel1/52/12658/00582976.pdf?tp=&isnumber=&arnumber=582976
Now, I got confirmed with you the current nature of Similarity API's will be
not easy to quickly realize t
uld be to focus on creating
the right kinds of queries, analyzers, etc.
-Grant
On Feb 28, 2008, at 3:55 PM, Dharmalingam wrote:
Thanks for your tips. My overall goal is to quickly implement 7
variants of
vector space model using Lucene. You can find these variants in the
updloaded file.
Thanks for your tips. My overall goal is to quickly implement 7 variants of
vector space model using Lucene. You can find these variants in the
updloaded file.
I am doing all these stuffs for a much broader goal: I am trying to recover
traceability links from requirements to source code files. I
On Feb 28, 2008, at 9:00 AM, Dharmalingam wrote:
Thanks for the reply. Sorry if my explanation is not clear. Yes, you
are
correct the model is based on Salton's VSM. However, the
calculation of the
term weight and the doc norm is, in my opinion, different from
Lucene. If
you look at th
y new to Lucene. Certainly, it is very exciting. I need to
>> implement a new Similarity class based on the Term Vector Space
>> Model given
>> in http://www.miislita.com/term-vector/term-vector-3.html
>>
>> Although that model is similar to Lucene’s model
>> (ht
Not sure I am understanding what you are asking, but I will give it a
shot. See below
On Feb 26, 2008, at 3:45 PM, Dharmalingam wrote:
Hi List,
I am pretty new to Lucene. Certainly, it is very exciting. I need to
implement a new Similarity class based on the Term Vector Space
Model
Hi List,
I am pretty new to Lucene. Certainly, it is very exciting. I need to
implement a new Similarity class based on the Term Vector Space Model given
in http://www.miislita.com/term-vector/term-vector-3.html
Although that model is similar to Lucene’s model
(http://hudson.zones.apache.org
Hi,
I am also interested in this problem.
Regards
Jason
On 4/28/06, trupti mulajkar <[EMAIL PROTECTED]> wrote:
>
> hi
>
> i am trying to implement the vector space model for lucene.
> i did find some code for generating the vectors, but can any1 suggest a
> bett
hi
i am trying to implement the vector space model for lucene.
i did find some code for generating the vectors, but can any1 suggest a better
way of creating the IndexReader object as it is the only way that can return
the index created.
cheers,
trupti mulajkar
MSc Advanced Computer Science
this is not directrly Lucene related) what you think about this
article.
Best wishes!
> --- Ursprüngliche Nachricht ---
> Von: "Runde, Kevin" <[EMAIL PROTECTED]>
> An:
> Betreff: RE: Vector Space Model <-> Probabilistic Model
> Datum: Wed, 15 Mar 2006 17:30:06 -
lcolm [mailto:[EMAIL PROTECTED]
Sent: Friday, February 17, 2006 8:47 AM
To: java-user@lucene.apache.org
Subject: Re: Vector Space Model <-> Probabilistic Model
I know of one I used for my Thesis. The REF is:
Fuhr, N. 2001, "Models in information retrieval", , pp. 21-50.
http://portal.ac
I know of one I used for my Thesis. The REF is:
Fuhr, N. 2001, "Models in information retrieval", , pp. 21-50.
http://portal.acm.org/citation.cfm?id=567294
I may have a electronic version. If you need it give me an email address as
this service doesn't allow attachments.
Hope this helps,
Mal
D]>
> An: java-user@lucene.apache.org
> Betreff: Re: Vector Space Model <-> Probabilistic Model
> Datum: Thu, 16 Feb 2006 14:19:02 -0500
>
> You may find some useful reading at:
> http://wiki.apache.org/jakarta-lucene/InformationRetrieval
>
> Karl Koch wrote:
>
You may find some useful reading at:
http://wiki.apache.org/jakarta-lucene/InformationRetrieval
Karl Koch wrote:
I am looking for a comparison between the theoretical Vector Space Model and
the theoretical Probabilistic Model in Information Retrieval. I know that
comcrete implementations do
I am looking for a comparison between the theoretical Vector Space Model and
the theoretical Probabilistic Model in Information Retrieval. I know that
comcrete implementations do differ from that. However, I am looking for
papers that compare the performance of both in particular applications
19 matches
Mail list logo