Re: A simple Vector Space Model and TFIDF usage

2009-07-02 Thread Kamal Najib
every hit. Your diractory where your indexed docs are saved represents the vector space model you want to bild. If you want to see how lucene computes the score result, you can use the class explanation and similarity in lucene Api and you will see that lucene deals with the documents and querys

Re: A simple Vector Space Model and TFIDF usage

2009-06-30 Thread Grant Ingersoll
On Jun 29, 2009, at 3:10 PM, Amir Hossein Jadidinejad wrote: Hi, It's my first experiment with Lucene. Please help me. I'm going to index a set of documents and create a feature vector for each of them. This vector contains all terms belong to the document that weight using TFIDF. After tha

A simple Vector Space Model and TFIDF usage

2009-06-29 Thread Amir Hossein Jadidinejad
Hi, It's my first experiment with Lucene. Please help me. I'm going to index a set of documents and create a feature vector for each of them. This vector contains all terms belong to the document that weight using TFIDF. After that I want to compute the cosine similarity between all documents and

Re: Vector Space Model: New Similarity Implementation Issues

2008-02-28 Thread h t
SM in "ieee-sw-rank" is more precise in practice by lucene? If so, it's an useful experiment. 2008/2/27, Dharmalingam <[EMAIL PROTECTED]>: > > > Hi List, > > I am pretty new to Lucene. Certainly, it is very exciting. I need to > implement a new Similarit

Re: Vector Space Model: New Similarity Implementation Issues

2008-02-28 Thread Dharmalingam
You can find those variants of the vector space model in this interesting article: http://ieeexplore.ieee.org/iel1/52/12658/00582976.pdf?tp=&isnumber=&arnumber=582976 Now, I got confirmed with you the current nature of Similarity API's will be not easy to quickly realize t

Re: Vector Space Model: New Similarity Implementation Issues

2008-02-28 Thread Grant Ingersoll
uld be to focus on creating the right kinds of queries, analyzers, etc. -Grant On Feb 28, 2008, at 3:55 PM, Dharmalingam wrote: Thanks for your tips. My overall goal is to quickly implement 7 variants of vector space model using Lucene. You can find these variants in the updloaded file.

Re: Vector Space Model: New Similarity Implementation Issues

2008-02-28 Thread Dharmalingam
Thanks for your tips. My overall goal is to quickly implement 7 variants of vector space model using Lucene. You can find these variants in the updloaded file. I am doing all these stuffs for a much broader goal: I am trying to recover traceability links from requirements to source code files. I

Re: Vector Space Model: New Similarity Implementation Issues

2008-02-28 Thread Grant Ingersoll
On Feb 28, 2008, at 9:00 AM, Dharmalingam wrote: Thanks for the reply. Sorry if my explanation is not clear. Yes, you are correct the model is based on Salton's VSM. However, the calculation of the term weight and the doc norm is, in my opinion, different from Lucene. If you look at th

Re: Vector Space Model: New Similarity Implementation Issues

2008-02-28 Thread Dharmalingam
y new to Lucene. Certainly, it is very exciting. I need to >> implement a new Similarity class based on the Term Vector Space >> Model given >> in http://www.miislita.com/term-vector/term-vector-3.html >> >> Although that model is similar to Lucene’s model >> (ht

Re: Vector Space Model: New Similarity Implementation Issues

2008-02-28 Thread Grant Ingersoll
Not sure I am understanding what you are asking, but I will give it a shot. See below On Feb 26, 2008, at 3:45 PM, Dharmalingam wrote: Hi List, I am pretty new to Lucene. Certainly, it is very exciting. I need to implement a new Similarity class based on the Term Vector Space Model

Vector Space Model: New Similarity Implementation Issues

2008-02-26 Thread Dharmalingam
Hi List, I am pretty new to Lucene. Certainly, it is very exciting. I need to implement a new Similarity class based on the Term Vector Space Model given in http://www.miislita.com/term-vector/term-vector-3.html Although that model is similar to Lucene’s model (http://hudson.zones.apache.org

Re: Vector space model

2006-04-28 Thread jason
Hi, I am also interested in this problem. Regards Jason On 4/28/06, trupti mulajkar <[EMAIL PROTECTED]> wrote: > > hi > > i am trying to implement the vector space model for lucene. > i did find some code for generating the vectors, but can any1 suggest a > bett

Vector space model

2006-04-28 Thread trupti mulajkar
hi i am trying to implement the vector space model for lucene. i did find some code for generating the vectors, but can any1 suggest a better way of creating the IndexReader object as it is the only way that can return the index created. cheers, trupti mulajkar MSc Advanced Computer Science

RE: Vector Space Model <-> Probabilistic Model

2006-03-16 Thread Karl Koch
this is not directrly Lucene related) what you think about this article. Best wishes! > --- Ursprüngliche Nachricht --- > Von: "Runde, Kevin" <[EMAIL PROTECTED]> > An: > Betreff: RE: Vector Space Model <-> Probabilistic Model > Datum: Wed, 15 Mar 2006 17:30:06 -

RE: Vector Space Model <-> Probabilistic Model

2006-03-15 Thread Runde, Kevin
lcolm [mailto:[EMAIL PROTECTED] Sent: Friday, February 17, 2006 8:47 AM To: java-user@lucene.apache.org Subject: Re: Vector Space Model <-> Probabilistic Model I know of one I used for my Thesis. The REF is: Fuhr, N. 2001, "Models in information retrieval", , pp. 21-50. http://portal.ac

Re: Vector Space Model <-> Probabilistic Model

2006-02-17 Thread Malcolm
I know of one I used for my Thesis. The REF is: Fuhr, N. 2001, "Models in information retrieval", , pp. 21-50. http://portal.acm.org/citation.cfm?id=567294 I may have a electronic version. If you need it give me an email address as this service doesn't allow attachments. Hope this helps, Mal

Re: Vector Space Model <-> Probabilistic Model

2006-02-17 Thread Karl Koch
D]> > An: java-user@lucene.apache.org > Betreff: Re: Vector Space Model <-> Probabilistic Model > Datum: Thu, 16 Feb 2006 14:19:02 -0500 > > You may find some useful reading at: > http://wiki.apache.org/jakarta-lucene/InformationRetrieval > > Karl Koch wrote: >

Re: Vector Space Model <-> Probabilistic Model

2006-02-16 Thread Grant Ingersoll
You may find some useful reading at: http://wiki.apache.org/jakarta-lucene/InformationRetrieval Karl Koch wrote: I am looking for a comparison between the theoretical Vector Space Model and the theoretical Probabilistic Model in Information Retrieval. I know that comcrete implementations do

Vector Space Model <-> Probabilistic Model

2006-02-16 Thread Karl Koch
I am looking for a comparison between the theoretical Vector Space Model and the theoretical Probabilistic Model in Information Retrieval. I know that comcrete implementations do differ from that. However, I am looking for papers that compare the performance of both in particular applications