Re: Document Similarity

2012-07-30 Thread in.abdul
I understand your need. You can use k-means clustering in Mahout, which can help in your case. It would be better to post this question on the Mahout user list, where you will get different ideas. I also had a use case like this, which I did as a POC. Still, my suggestion is that you can post this question

Re: Document Similarity

2012-07-30 Thread in.abdul
Hi ELshaimaa, I could not understand what your need is. Can you please explain your use case? If the case is "I need to use Lucene to find the most similar documents from the generated index", then go for the MoreLikeThis[1] component. Based on your use case people can suggest some good wa

Re: Getting terms from unstored fields, doc-wise

2012-07-26 Thread in.abdul
No, it's not possible to get data that is not stored. On Jul 26, 2012 10:27 PM, "Phanindra R [via Lucene]" > Hi, > I've an index to analyze (manually). Unfortunately, I cannot rebuild > the index. Some of the fields are 'unstored'. I was wondering whether > there's any way to get the ter