Re: Per-token weighting / attribute data in index

2006-06-03 Thread Andrzej Bialecki
Marvin Humphrey wrote: Enabling individual positions to be boosted is indeed one of the main targets of the current discussion. A slightly easier to understand application would be boosting individual tokens according to relative font size. For instance, we might assume that text between tags i

Re: Per-token weighting / attribute data in index

2006-06-02 Thread Marvin Humphrey
On Fri, Jun 02, 2006 at 03:47:10PM -0700, Chris Hostetter wrote: > You may want to check out the java-dev list ... there's been some talk > among the people who really unerstand the low levels of lucene's file > formats about adding arbitrary "payload" data with each term/doc pair .. a > proposal t

Re: Per-token weighting / attribute data in index

2006-06-02 Thread Scott Davies
Dang, that's what I was afraid of. Good to hear they're actively considering extensions that'd fix the issue, though. In the meantime I guess I'll try limping along without 'em. Thanks! -- Scott On 6/2/06, Chris Hostetter <[EMAIL PROTECTED]> wrote: : : A simple example would be indexing and

Re: Per-token weighting / attribute data in index

2006-06-02 Thread Chris Hostetter
: : A simple example would be indexing and scoring the hyperlink text from : other web pages that point to the page P that I'm indexing/scoring. I : might have some metric saying how much I "trust" each of the pages or : sites with hyperlinks to P, and want to use that metric to increase or Hmmm.

Re: Per-token weighting / attribute data in index

2006-06-02 Thread Scott Davies
ur goal, and where lucene fails to meet it? : Date: Fri, 2 Jun 2006 13:14:41 -0700 : From: Scott Davies <[EMAIL PROTECTED]> : Reply-To: java-user@lucene.apache.org : To: java-user@lucene.apache.org : Subject: Per-token weighting / attribute data in index : : Hi...reasonably experienced we

Re: Per-token weighting / attribute data in index

2006-06-02 Thread Chris Hostetter
-0700 : From: Scott Davies <[EMAIL PROTECTED]> : Reply-To: java-user@lucene.apache.org : To: java-user@lucene.apache.org : Subject: Per-token weighting / attribute data in index : : Hi...reasonably experienced web search programmer but total Lucene newbie here. : : After poking through Lucene f

Per-token weighting / attribute data in index

2006-06-02 Thread Scott Davies
Hi...reasonably experienced web search programmer but total Lucene newbie here. After poking through Lucene for a while, I still haven't figured out a decent way to tweak the scoring based on per-token data. For example, as far as I can tell so far, the only reasonable way to have words in the t