Re: for check similarity of two sentences

2015-04-02 Thread Robust Links
Hi Heshan one approach could be something like this: 1- vectorize each ngram of each sentence. One vectorization strategy is to use word2vec (the deep learning package). i believe someone has ported word2vec (originally in C) to Lucene. do google search 2- aggregate each word vector (i.e some clu

Re: for check similarity of two sentences

2015-04-02 Thread Gimantha Bandara
Hi Heshan, I think you can achieve what you are looking for. You may read "lucene in Action 2nd edition" about lucene scoring system and FuzzyQuery. Hope this may help. May be someone can suggest much better approach. On Wed, Apr 1, 2015 at 8:14 AM, hesh jay wrote: > hi, > I am second year under