Terms not being found in query

2006-02-03 Thread kate
hi, i have an index with documents containing n-grams, in fields such as "3gram", "4gram", etc. one 5-gram found in the text is "oswax". using Luke, i can see that a field with this value exists for a particular document. however, searching for "5gram:oswax" produces no results (either usin

disk space

2006-02-03 Thread Azlan Abdul Latiff
i've emptied my index directory but still not much space being freed. Is there another directory whereby the documents might be stored? This e-mail has been sent via JARING webmail at http://www.jaring.my -

number of objects indexed

2006-02-03 Thread Lee_Gary
Hi all, Im just wondering if anyone has done any scalability tests with Lucene. Has anyone been able to index up to the 10 million mark in terms of number of objects indexed in a single index? An estimate of how many objects you have indexed is appreciated. Thanks!! Gary

Re: Frequency Matrix

2006-02-03 Thread Chris Hostetter
take a look at the TermEnum and TermDoc classes. they should give you all the info you need, using psuedo code something like this... foreach Term in TermEnum foreach doc in TermDoc record Term, TermDoc.doc, TermDoc.freq : Date: Fri, 3 Feb 2006 13:31:49 -0500 : From:

Frequency Matrix

2006-02-03 Thread varun sood
Hi, I am impelementing Lucene to index my website. I would like to know if its possible to generate a simple frequency matrix? By frequency matrix I mean, docmuent name on top X-Axis and keywords on left Y-Axis. and the cells of the matrix will contain the frequency of the keyword in a particluar