Problem during evaluation of indexed documents

2005-05-21 Thread Falko Guderian
the correct number of keywords in each case. I can't find my fault. Is there an explanation? I would be pleased if someone could help me. Best regards Falko Guderian - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]

Indexing problem

2005-05-30 Thread Falko Guderian
s.doc() = 0) all term frequencies are greater than 0. But it isn't correct. The first doc doesn't contain all terms. Do you now this problem? How can I get the correct term frequencies in all docs? Best regards Falko Guderian

Indexing problem

2005-05-30 Thread Falko Guderian
requencies are greater than 0. But it isn't correct. The first doc doesn't contain all terms. Do you now this problem? How can I get the correct term frequencies in all docs? Best regards Falko Guderian - To unsub

Re: Lucene search clusters

2005-06-08 Thread Falko Guderian
You can add the WEKA packages http://www.cs.waikato.ac.nz/ml/weka/ . It has an EM clusterer. -Falko Some people just replied, but I forgot the most important thing... I'm thinking of this project as part of the Google's Summer of Code program, so I'm looking for other students. I've sent an e

Re: Lucene search clusters

2005-06-11 Thread Falko Guderian
You have to combine lucene and weka on your own. I don't know an open source implementation or some other tools. Sorry, you have to write a wrapper. -Falko Some people just replied, but I forgot the most important thing... I'm thinking of this project as part of the Google's Summer of Code prog