Re: Norm - please lit it up for me

2007-10-19 Thread Daniel Naber
On Friday 19 October 2007 19:07, Karl Wettin wrote: > doc[0] > doc[1] > > With normalization doc[0] and doc[1] are equally important. Omitting   > normalization makes doc[0] (usually) three times as important as doc[1]. Not quite, as the normalization only refers to the length of the document.

Re: Sample SynonymAnalyzer vs. Lucene 2.2

2007-10-19 Thread Daniel Naber
On Friday 19 October 2007 14:42, Sean Dague wrote: > Ends up only indexing the synonym, but not the base word itself. I cannot reproduce the problem, i.e. I see both the original term and its synonyms in the index. Maybe you can post the analyzer that uses this filter or a test case to reproduc

Re: Sort by date with Lucene 2.2.0 ...

2007-10-19 Thread Daniel Naber
On Thursday 18 October 2007 21:35, Dragon Fly wrote: > I'm am trying to sort a date field in my index but I'm seeing strange > results.  I have searched the Lucene user mail archive for Datetools but > still couldn't figure out the problem. It shouldn't make a difference but does it help if you s

Re: Norm - please lit it up for me

2007-10-19 Thread Karl Wettin
19 okt 2007 kl. 18.39 skrev Dino Korah: Could someone help me understand normalization factors for a field. doc[0] doc[1] With normalization doc[0] and doc[1] are equally important. Omitting normalization makes doc[0] (usually) three times as important as doc[1]. Also please tell me w

Norm - please lit it up for me

2007-10-19 Thread Dino Korah
Hi, Could someone help me understand normalization factors for a field. Also please tell me what are the situations where I should omit normalization factors when adding a document. Many thanks. Dino Korah

Re: Get the preceding and following terms

2007-10-19 Thread mark harwood
See CollocationFinder here: http://issues.apache.org/jira/browse/LUCENE-474 - Original Message From: Fabrice Estiévenart <[EMAIL PROTECTED]> To: java-user@lucene.apache.org Sent: Friday, 19 October, 2007 2:01:11 PM Subject: Get the preceding and following terms Hello, I'm trying to crea

Re: Get the preceding and following terms

2007-10-19 Thread Grant Ingersoll
Have a look at the SpanQuery and it's derivitives. I have some examples in my ApacheCon Europe talk http://cnlp.org/presentations/ slides/AdvancedLuceneEU.pdf You will most likely have to do some post processing. Using Term Vectors and the new TermVectorMapper may help facilitate this. C

Get the preceding and following terms

2007-10-19 Thread Fabrice Estiévenart
Hello, I'm trying to create a suggestion tool based on Lucene. From a term in the index, i'd like to know the terms that often precede and follow this term in the indexed documents (NOT according to the alphabetical order). I know that Lucene stores the term positions but I don't know how to

Sample SynonymAnalyzer vs. Lucene 2.2

2007-10-19 Thread Sean Dague
I've noticed that the sample code to do the SynonymAnalyzer from the book Lucene in Action doesn't work right on Lucene 2.2. In my sample application I am using categorizing food names into food categories using the Synonym patern. Using the excerpted: private void addAliasesToStack(Token toke

RE: Sort by date with Lucene 2.2.0 ...

2007-10-19 Thread Dragon Fly
Thanks for the reply. I expected the results to be sorted in reverse chron (i.e. more recent time stamps first) because my Sort object is: new Sort (DATE_FIELD, true); Using my previous example, I would expect to see the following (because 48 seconds is more recent than 24 seconds): Septembe