On Friday 19 October 2007 19:07, Karl Wettin wrote:
> doc[0]
> doc[1]
>
> With normalization doc[0] and doc[1] are equally important. Omitting
> normalization makes doc[0] (usually) three times as important as doc[1].
Not quite, as the normalization only refers to the length of the document.
On Friday 19 October 2007 14:42, Sean Dague wrote:
> Ends up only indexing the synonym, but not the base word itself.
I cannot reproduce the problem, i.e. I see both the original term and its
synonyms in the index. Maybe you can post the analyzer that uses this
filter or a test case to reproduc
On Thursday 18 October 2007 21:35, Dragon Fly wrote:
> I'm am trying to sort a date field in my index but I'm seeing strange
> results. I have searched the Lucene user mail archive for Datetools but
> still couldn't figure out the problem.
It shouldn't make a difference but does it help if you s
19 okt 2007 kl. 18.39 skrev Dino Korah:
Could someone help me understand normalization factors for a field.
doc[0]
doc[1]
With normalization doc[0] and doc[1] are equally important. Omitting
normalization makes doc[0] (usually) three times as important as doc[1].
Also please tell me w
Hi,
Could someone help me understand normalization factors for a field.
Also please tell me what are the situations where I should omit
normalization factors when adding a document.
Many thanks.
Dino Korah
See CollocationFinder here: http://issues.apache.org/jira/browse/LUCENE-474
- Original Message
From: Fabrice Estiévenart <[EMAIL PROTECTED]>
To: java-user@lucene.apache.org
Sent: Friday, 19 October, 2007 2:01:11 PM
Subject: Get the preceding and following terms
Hello,
I'm trying to crea
Have a look at the SpanQuery and it's derivitives. I have some
examples in my ApacheCon Europe talk http://cnlp.org/presentations/
slides/AdvancedLuceneEU.pdf
You will most likely have to do some post processing. Using Term
Vectors and the new TermVectorMapper may help facilitate this.
C
Hello,
I'm trying to create a suggestion tool based on Lucene. From a term in
the index, i'd like to know the terms that often precede and follow this
term in the indexed documents (NOT according to the alphabetical order).
I know that Lucene stores the term positions but I don't know how to
I've noticed that the sample code to do the SynonymAnalyzer from the
book Lucene in Action doesn't work right on Lucene 2.2. In my sample
application I am using categorizing food names into food categories
using the Synonym patern.
Using the excerpted:
private void addAliasesToStack(Token toke
Thanks for the reply. I expected the results to be sorted in reverse chron
(i.e. more recent time stamps first) because my Sort object is:
new Sort (DATE_FIELD, true);
Using my previous example, I would expect to see the following (because 48
seconds is more recent than 24 seconds):
Septembe
10 matches
Mail list logo