Hello Alessandro,
I was looking for something more light weight, or extend SpanOrQuery to support
this case. The rewrite is part of our custom parser extending
ExtendedDismaxQParser where we add support for payloads by rewriting everything
to a payload equivalent.
This is almost the case with
On Wed, May 23, 2018 at 11:41 AM, Markus Jelsma
wrote:
> Hello,
>
> To support payloads we rewrite SynonymQuery to a pair of SpanTerm queries
> which we then can wrap in the PayloadScoreQuery. This is not the right way
> to do this because if both clauses match, both are also sc
Hello,
To support payloads we rewrite SynonymQuery to a pair of SpanTerm queries which
we then can wrap in the PayloadScoreQuery. This is not the right way to do this
because if both clauses match, both are also scored. We could try to rewrite
SynonymQuery to a SpanOrQuery but i suppose that
d it has the
> > added benefits that it would not require me to meddle into the scoring
> > process, which I'm still a bit terrified of. Thanks for the tip.
> >
> > I guess the question is still valid though? i.e. how would one take into
> > account payloads for
ss, which I'm still a bit terrified of. Thanks for the tip.
>
> I guess the question is still valid though? i.e. how would one take into
> account payloads for scoring entire spans? Does this make sense at all? Any
> links to a more-or-less straightforward example?
>
> O
? i.e. how would one take into
account payloads for scoring entire spans? Does this make sense at all? Any
links to a more-or-less straightforward example?
On the length of payloads: I understood that you have other restrictions,
but payloads take a bytesref as value, so you can encode arbitrary da
gt; Subject: Re: Using POS payloads for chunking
>
> I think it'd be interesting to also investigate using TypeAttribute [1]
> together with TypeTokenFilter [2].
>
> Regards,
> Tommaso
>
> [1] :
> https://lucene.apache.org/core/6_5_0/core/org/apache/lucene/analy
7 23:29
> > To: java-user
> > Subject: Re: Using POS payloads for chunking
> >
> > Markus:
> >
> > I don't believe that payloads are limited in size at all. LUCENE-7705
> > was done in part because there _was_ a hard-coded 256 limit for some
> > of th
-Original message-
> From:Erick Erickson
> Sent: Wednesday 14th June 2017 23:29
> To: java-user
> Subject: Re: Using POS payloads for chunking
>
> Markus:
>
> I don't believe that payloads are limited in size at all. LUCENE-7705
> was done in part because there
Markus:
I don't believe that payloads are limited in size at all. LUCENE-7705
was done in part because there _was_ a hard-coded 256 limit for some
of the tokenizers. The Payload (at least recent versions) just have
some bytes after them, and (with LUCENE-7705) can be arbitrarily long.
Of c
t up
with. Finally a BM25 extension that has, amongst others, a mapping of bitset to
score. Nouns get a bonus, prepositions and other useless pieces get a
punishment etc.
Payloads are really great things to use! We also use it to distinguish between
compounds and their subwords, o.a. we supply
Markus - how are you encoding payloads as bitsets and use them for scoring?
Curious to see how folks are leveraging them.
Erik
> On Jun 14, 2017, at 4:45 PM, Markus Jelsma wrote:
>
> Hello,
>
> We use POS-tagging too, and encode them as payload bitsets for scoring,
Hello,
We use POS-tagging too, and encode them as payload bitsets for scoring, which
is, as far as is know, the only possibility with payloads.
So, instead of encoding them as payloads, why not index your treebanks POS-tags
as tokens on the same position, like synonyms. If you do that, you can
in sequences of payloads.
(trying not to be too pedantic, a regular chunker looks for 'chunks' based
on part-of-speech tags, e.g. noun phrases can be searched for with patterns
like "(DT)?(JJ)*(NN|NP)+", that is, an optional determinant and zero or
more adjectives preceding
re/4_10_2/core/org/apache/lucene/search/spans/SpanNearPayloadCheckQuery.html>
since
it properly handles the fact that payloads aren't ordered by SpanNearQuery
<http://lucene.apache.org/core/4_10_2/core/org/apache/lucene/search/spans/SpanNearQuery.html>."
?
I used SpanNearQuery a
Hi
I can get the payloads for query terms using getPayloadsForQuery from
PayloadSpanUtil.
However this does not support ConstantScore queries. So how do I get the
payloads for queries that get rewritten to ConstantScore query for example
PrefixQuery, WildcardQuery.
Thanks
Puneet
Awesome works well for me!
Thanks
Rohit Banga
http://iamrohitbanga.com/
On Sun, Mar 23, 2014 at 10:06 PM, Manuel Le Normand <
manuel.lenorm...@gmail.com> wrote:
> Hello Rohit,
> We had a similar query time bottleneck when attempting to map lucene's
> internal id's to the uniqueKey, especially a
Hello Rohit,
We had a similar query time bottleneck when attempting to map lucene's
internal id's to the uniqueKey, especially as we generally return only the
uniqueKey to the user we had no other use of the stored field. As you
noted, every internal id --> uniqueKey id requires a disk seek and as
On Sat, Mar 22, 2014 at 5:18 AM, Rohit Banga wrote:
> Awesome BinaryDocValues sounds nice!
> I saw that NumericDocValues did not inherit from a base class hence I
> thought there is no StringDocValues :).
>
> Can I expect that a searcher manager will invoke
> searcherfactory.newSearcher at most o
Awesome BinaryDocValues sounds nice!
I saw that NumericDocValues did not inherit from a base class hence I
thought there is no StringDocValues :).
Can I expect that a searcher manager will invoke
searcherfactory.newSearcher at most once between searcher manager
refreshes? I believe IndexSearcher i
On Fri, Mar 21, 2014 at 10:25 PM, Rohit Banga wrote:
> Thanks Michael for your response.
You're welcome!
> Few questions:
>
> 1. Can I expect better performance when retrieving a single NumericDocValue
> for all hits vs when I retrieve documents for all hits to fetch the field
> value? As far as
/lucenes-searchermanager-simplifies.html),
> but how about managing two data sources Lucene index and HashMap String> with SearcherManager? Is there a way to achieve this using a custom
> SearcherFactory?
>
>
> Thanks
> Rohit Banga
> http://iamrohitbanga.com/
>
>
t; wrote:
> DocValues are better than payloads.
>
> E.g. index a NumericDocValuesField with each doc, holding your id.
>
> Then at search time you can use MultiDocValues.getNumericValues.
>
> Mike McCandless
>
> http://blog.mikemccandless.com
>
>
> On Fri,
DocValues are better than payloads.
E.g. index a NumericDocValuesField with each doc, holding your id.
Then at search time you can use MultiDocValues.getNumericValues.
Mike McCandless
http://blog.mikemccandless.com
On Fri, Mar 21, 2014 at 4:35 PM, Rohit Banga wrote:
> Hi everyone
>
&g
me metadata along with the internal
document id when I did a search. I do not want to read all documents just
to retrieve this metadata.
The best solution I have come across searching on the net is to use
payloads which will be returned by the fast index search query along with
the document ids.
Is
SynonymFilter makes sense.
The planned payloads are indeed not needed. I guess a better solution would
be making out of the boost an attribute during query time that will be
consumed in the queryParser in order to boost these n-gram terms.
Thanks for the hints.
Manuel
On Wed, Mar 12, 2014 at
s. Part of it will be achieved by
> the CommonGram filter in which I put the frequent words.
>
> I think of going a step further and index phrase queries (extracted from my
> query log) entities (from gazetteers). In order to control the boost on
> these N-gram matches I plan adding payl
a step further and index phrase queries (extracted from my
query log) entities (from gazetteers). In order to control the boost on
these N-gram matches I plan adding payloads to these terms.
I'm thinking of two different implementations:
1. Using MappingCharFilter - the mapping.txt wou
Hi,
It's great to see support for payloads in the suggesters - this is really
helpful, and pretty much addresses LUCENE-4516. Are there any plans to also
support them for WFSTs? We have some cases where we don't need the Analyzer's
capabilities (we look up the completion us
Second link is that:
http://digitalpebble.blogspot.com/2010/08/using-payloads-with-dismaxqparser-in.html
2013/12/1 Furkan KAMACI
> Hi;
>
> I use Solr 4.5.1 I have a case: When a user searches for some specific
> keywords some documents should be listed at much more higher th
://sujitpal.blogspot.com/2013/07/porting-payloads-to-solr4.html#! However
that example extends Qparser directly but I want to use capabilities of
*edismax*.
So I found that example:
file:///home/furkan/Desktop/DigitalPebble's%20Blog%20%20Using%20Payloads%20with%20DisMaxQParser%20in%20SOLR.html
Thi
it extend edismax or I should do
that: if a search occurs on a usual field and a spesific field(payload
field) score will be calculated from sum of usual field score and payload
fied score via my custom alternative similarity. I've done per field
similarity function but I got that error:
"Fie
n I do that? Custom payload similarity class or custom function
> query?
>
> I've followed here:
> http://sujitpal.blogspot.com/2013/07/porting-payloads-to-solr4.html#! but
> decodeNormValue if a final method anymore. How about that:
> http://www.solrtutorial.com/custom-solr
Hi;
I use Solr 4.5.1 Due to my question is more related to Lucene I wanted to
ask this question here. Is there any example which uses payloads for
boosting at a custom function query?
Thanks;
Furkan KAMACI
score (payload * normalize
coefficient)
How can I do that? Custom payload similarity class or custom function
query?
I've followed here:
http://sujitpal.blogspot.com/2013/07/porting-payloads-to-solr4.html#! but
decodeNormValue if a final method anymore. How about that:
http://www.solrtuto
That was it. Reversing the order fixed it.
Thanks for wading through the lack of whitespace.
ThanksKyle
> From: luc...@mikemccandless.com
> Date: Tue, 15 Oct 2013 09:24:17 -0400
> Subject: Re: Payloads disabled in 4.5?
> To: java-user@lucene.apache.org
>
> Something catastroph
those two?
Mike McCandless
http://blog.mikemccandless.com
On Tue, Oct 15, 2013 at 8:13 AM, Kyle Judson wrote:
>
>
>
>
>
>
>
>
> Hi All,
>
>
> I'm trying to create an index with payloads using Lucene 4.5 but the payloads
> don't seem to be getting
Hi All,
I'm trying to create an index with payloads using Lucene 4.5 but the payloads
don't seem to be getting into the index.
I looked into the code and it looks like storePayloads argument to
addOrUpdateInternal when called from addOrUpdate in FieldInfos is set to false
Am 23.04.2013 16:17, schrieb Alan Woodward:
> It doesn't sound as though an inverted index is really what you want to be
> querying here, if I'm reading you right. You want to get the payloads for
> spans at a specific position, but you don't particularly care about th
Hi Carsten,
It doesn't sound as though an inverted index is really what you want to be
querying here, if I'm reading you right. You want to get the payloads for
spans at a specific position, but you don't particularly care about the actual
term at that position? You
d on a SpanQuery which depends on a specific
expression to search for. In my use case, I need to retrieve Spans
specified by their offsets only, and then get their payloads and process
them further. Alternatively, I could query for the occurence of certain
string patterns in the payloads and c
g to figure out a way to use a query as Uwe suggested. My
>> scenario is to perform a query and then retrieve some of the payloads
>> upon user request, so there no obvious way to wrap this into a query as
>> I can't know what (terms) to query for.
>
> I wonder: is
Am 23.04.2013 13:47, schrieb Carsten Schnober:
> I'm trying to figure out a way to use a query as Uwe suggested. My
> scenario is to perform a query and then retrieve some of the payloads
> upon user request, so there no obvious way to wrap this into a query as
> I can't know
Am 23.04.2013 13:21, schrieb Michael McCandless:
> Actually, term vectors can store payloads now (LUCENE-1888), so if that
> field was indexed with FieldType.setStoreTermVectorPayloads they should be
> there.
>
> But I suspect the TokenSources.getTokenStream API (which I think u
Actually, term vectors can store payloads now (LUCENE-1888), so if that
field was indexed with FieldType.setStoreTermVectorPayloads they should be
there.
But I suspect the TokenSources.getTokenStream API (which I think un-inverts
the term vectors to recreate the token stream = very slow?) wasn
TermVectors are per-document and do not contain payloads. You are reading the
per-document TermVectors which is a "small index" *stored* for each document as
a binary blob. This blob only contains the terms of this document with its
positions/offsets, but no payloads (offsets are use
Hi,
I'm trying to extract payloads from an index for specific tokens the
following way (inserting sample document number and term):
Terms terms = reader.getTermVector(16504, "term");
TokenStream tokenstream = TokenSources.getTokenStream(terms);
while (tokenstream
ver, when I print the document off and look at
the index in Luke, the field I'm trying to append the payload to has no
terms or payloads associated with it even though I specified both term and
payload attributes in the token stream code. I'm quite confused how these
things are supposed
Hello everybody,
I am currently experimenting with Lucene 4.0 and would like to add payloads.
Payload should only be added once per term on the first position. My current
code looks like this:
public final boolean incrementToken() throws java.io.IOException {
String term
Hello everybody,
I am currently unsure how stored data is written and loaded from index.
I want to store for every term of a document some binary data but only once
and not for every position!
Therefore I am not sure if Payloads or stored Fields are the better solution
(Or the not implemented
first level (Lucene) a document for one
wikipedia article containing all distinct terms of its versions. On the
second level (payloads) I store the frequency information corresponding
to each article version and its terms. If I search now I can find an
article by its term and through the term and
Payloads only make sense in terms of specific positions in the index, so I
don't think there is a way to hack Lucene for it. You could, I suppose, just
store the payload for the first instance of the term.
Also, what's the use case you are trying to solve here? Why store term
freq
pert usages
> of Lucene. From what I can tell, you have sent the same question 3 times in
> a matter of less than a day. Sending more than once in a 2-3 day period is
> just going to make it less likely that you will get help, not more likely.
>
> Some suggestions inline below.
rently using Lucene 3.0.2 with payloads. I store extra information
> in the payloads about the term like frequencies and therefore I don't need
> frequencies and term positions stored normally by Lucene. I would like to
> set f.setOmitTermFreqAndPositions(true) but then I am not able to
Hello everybody,
I am currently using Lucene 3.0.2 with payloads. I store extra information
in the payloads about the term like frequencies and therefore I don't need
frequencies and term positions stored normally by Lucene. I would like to
set f.setOmitTermFreqAndPositions(true) but then
ions inline below.
>
>
> Lately I encountered the new payloads support and it looks its a great
> solution for my project.
>
>
> *The problem:*
>
> The use case is as follows:
>
> I need to support a way to calculate statistics on web pages.
>
> Each pa
Hi Guys,
I've been using Lucene for more than 5 years and it is a great tool -
great job! Thanks for everything...
Lately I encountered the new payloads support and it looks its a great
solution for my project.
*The problem:*
The use case is as follows:
I need to support a w
Hi Guys,
I've been using Lucene for more than 5 years and it is a great tool -
great job! Thanks for everything...
Lately I encountered the new payloads support and it looks its a great
solution for my project.
*The problem:*
The use case is as follows:
I need to support a w
rm translations from the
> subdocuments.
>
> The IMO better alternative is not to split the document and to assign the
> language tags as payloads to the terms. But then I need
>
> (i) a search filter that eliminates docs based on a given language tag and
>
> (ii) a way
document and to assign the
language tags as payloads to the terms. But then I need
(i) a search filter that eliminates docs based on a given language tag and
(ii) a way to access the term payloads from the documents returned by the
searcher
For both I haven't found a solution yet. Can I wr
I see the payload in the token now.
--
View this message in context:
http://n3.nabble.com/How-to-calculate-payloads-in-queries-too-tp712743p713413.html
Sent from the Lucene - Java Users mailing list archive at Nabble.com.
-
To
I am interested in using payloads in the following way. I store
Func(index-term) as a payload at index-term when indexing. When querying I
want to compute Func(query-term) as well. Then my similarity returns some
other function, Gunc(Func(index-term1),Func(query-term)).
As an example, maybe
er@lucene.apache.org
Subject: RE: Payloads
> Let's say I have a document that
> contains the following text:
>
> "Graph Algorithms is one of the most important topics in computer
> science"
>
> And a query "!Graph Algorithms" then the term Graph in the q
.-Meier-Allee 63, D-28213 Bremen
http://www.thetaphi.de
eMail: u...@thetaphi.de
> -Original Message-
> From: Elias Khsheibun [mailto:eli...@gmail.com]
> Sent: Sunday, December 20, 2009 2:51 PM
> To: java-user@lucene.apache.org
> Subject: RE: Payloads
>
>
> I'm t
hits2 = searcher.search(query2, 10).scoreDocs;
for (int i = 0; i < hits2.length; i++) {
Document hitDoc = searcher.doc(hits2[i].doc);
System.out.println(hitDoc.get("title"));
}
}
}
-Original Message-
From: AHMET ARSLAN [mail
What do you mean by a custom one - please explain. I must use a
PayloadTermQuery ?
And for the TermPositionPayloadTokenFilter there is a method that is not
used - incrementToken (only used in the main method) ... I didn't see in the
code the place that examines if the query term is at an even offs
> If I need to override the QueryParser
> to return PayloadTermQuery, what
> function for PayloadFunction should I use in the
> constructor (If you can
> show me an example).
I am not sure about that. Maybe custom one.
> In your code I didn't see an indexer, will this work with
> the regular
> I
oaded ?
-Original Message-
From: AHMET ARSLAN [mailto:iori...@yahoo.com]
Sent: Saturday, December 19, 2009 8:34 PM
To: java-user@lucene.apache.org
Subject: RE: Payloads
> Let's say I have a document that
> contains the following text:
>
> "Graph Algorithms is one of t
to "Graph|2.0 Algorithms" you
can use DelimitedPayloadTokenFilter to set payload of marked term.
Additionally you need to everride QueryParser to return PayloadTermQuery
and scorePayload method of DefaultSimilarity.
By doing so payloads will be included in score calculation.
public
About 60 students I think, if you have given some answers I would be
grateful if you could link me to them or quote them again.
-Original Message-
From: Uwe Schindler [mailto:u...@thetaphi.de]
Sent: Saturday, December 19, 2009 7:00 PM
To: java-user@lucene.apache.org
Subject: RE: Payloads
> Sent: Saturday, December 19, 2009 5:54 PM
> To: java-user@lucene.apache.org
> Subject: RE: Payloads
>
> Let's say I have a document that contains the following text:
>
> "Graph Algorithms is one of the most important topics in computer science"
>
> And
is even) - we apply
this doubling of weight only if a '!' operator precedes the term and if its
offset from the document is even.
-Original Message-
From: AHMET ARSLAN [mailto:iori...@yahoo.com]
Sent: Saturday, December 19, 2009 6:48 PM
To: java-user@lucene.apache.org
Subject:
> I want to override the operator - it
> is for a project purpose.
Can you explain your requirements more? What do you mean by "an even offset of
the document"?
-
To unsubscribe, e-mail: java-user-unsubscr...@lucene.apa
I want to override the operator - it is for a project purpose.
-Original Message-
From: AHMET ARSLAN [mailto:iori...@yahoo.com]
Sent: Saturday, December 19, 2009 6:41 PM
To: java-user@lucene.apache.org
Subject: Re: Payloads
> Hi,
>
> I need to add a query operator '!'
the offset of its first word.
>
> I guess it involves payloads...
>
> Elias.
'!' is already a query operator. It is equivalent of NOT. So you cannot use it.
Why not use carat operator? Like singleterm^2 "some phrase"^2
[Boosti
t involves payloads...
Elias.
-
To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org
For additional commands, e-mail: java-user-h...@lucene.apache.org
ell on a prior span.
>>
>> Mike
>>
>> On Wed, Dec 9, 2009 at 11:25 AM, Jason Rutherglen
>> wrote:
>>> Right we're getting the spans, however it's just the payloads that are
>>> missing, randomly...
>>>
>>> On Wed, Dec 9,
f it already fell on a prior span.
>
> Mike
>
> On Wed, Dec 9, 2009 at 11:25 AM, Jason Rutherglen
> wrote:
>> Right we're getting the spans, however it's just the payloads that are
>> missing, randomly...
>>
>> On Wed, Dec 9, 2009 at 2:23 AM, Michae
it back, if it already fell on a prior span.
Mike
On Wed, Dec 9, 2009 at 11:25 AM, Jason Rutherglen
wrote:
> Right we're getting the spans, however it's just the payloads that are
> missing, randomly...
>
> On Wed, Dec 9, 2009 at 2:23 AM, Michael McCandless
> wrote:
>
Right we're getting the spans, however it's just the payloads that are
missing, randomly...
On Wed, Dec 9, 2009 at 2:23 AM, Michael McCandless
wrote:
> There was a thread a while back about how span queries don't enumerate
> every possible span, but I can't remember
There was a thread a while back about how span queries don't enumerate
every possible span, but I can't remember if that included sometimes
missing payloads...
Mike
On Tue, Dec 8, 2009 at 7:34 PM, Jason Rutherglen
wrote:
> Howdy,
>
> I am wondering
Howdy,
I am wondering if anyone has seen
NearSpansUnordered.getPayload() not return payloads that are
verifiably accessible via IR.termPositions? It's a bit confusing
because most of the time they're returned properly.
I suspect the payload logic gets tripped up in
NearSpansUnordered
I don't mind adding the "positions" of the payloads in them. However,
maybe we can be little more clear in the javadocs what's going on
underneath?
On Wed, Nov 25, 2009 at 5:36 AM, Mark Miller wrote:
> Grant Ingersoll wrote:
>> On Nov 20, 2009, at 6:49 PM, Jason Ru
an NearSpansOrdered in this
>> regard.
>>
>> NearSpansUnordered returns payloads in a hash set that's
>> computed each method call by iterating over the SpanCell as a
>> linked list, whereas NearSpansOrdered stores the payloads in a
>> list (which is ordered) only when co
On Nov 20, 2009, at 6:49 PM, Jason Rutherglen wrote:
> I'm interested in getting the payload information from the
> matching span, however it's unclear from the javadocs why
> NearSpansUnordered is different than NearSpansOrdered in this
> regard.
>
> NearSpansUno
regard.
>
> NearSpansUnordered returns payloads in a hash set that's
> computed each method call by iterating over the SpanCell as a
> linked list, whereas NearSpansOrdered stores the payloads in a
> list (which is ordered) only when collectPayloads is true.
>
> At first
I'm interested in getting the payload information from the
matching span, however it's unclear from the javadocs why
NearSpansUnordered is different than NearSpansOrdered in this
regard.
NearSpansUnordered returns payloads in a hash set that's
computed each method call by ite
ght help you. Otherwise,
> the PayloadTermQuery is still a Query and can be used in a BooleanQuery.
> Beyond that, you may need to write what you need, since it doesn't exist
> yet.
>
>
> On Oct 26, 2009, at 10:25 AM, Mauro Dragoni wrote:
>
>> Hi to everyone,
>>
Hi to everyone,
I started to use payloads in my indexes.
However, I didn't find a class that permits to perform queries using
more than one term.
The only way that works is to use the BoostingTermQuery class, but by
using this class, I can search only one term a time.
Does exist a way to do a
Hi to everyone,
I started to use payloads in my indexes.
However, I didn't find a class that permits to perform queries using
more than one term.
The only way that works is to use the BoostingTermQuery class, but by
using this class, I can search only one term a time.
Does exist a way to
On May 22, 2009, at 12:28 AM, Dmitri Bichko wrote:
Hi,
I may be missing something obvious, but how do I get the payloads for
the specific token positions that were matched by a query?
See SpanQuery.getPayloadSpans() and it's SpanQuery derivatives.
For example, if I have a phrase
Hi,
I may be missing something obvious, but how do I get the payloads for
the specific token positions that were matched by a query?
For example, if I have a phrase query like "A keyword B" that matches
the field "A keyword B A", I can get the payloads for A and B with
Index
gt;> 2009/4/26 Murat Yakici
>>>>
>>>>
>>>>
>>>>> Yes, this is more or less what I had in mind. However, for this
>>>>> approach
>>>>> one requires some *prior knowledge* of the vocabulary of the document
>>>>&g
exWriter.
If I remember it correctly, there is also some intention to add document
payloads functionality. I have the same concerns on this. So I think we
need a clear view on the topic. Where is the payload work moving? How we
can generate a score without duplicating some of the work that
IndexWr
y have an indexing time penalty. If your window size is
> >> the document itself, you will be doing the same job twice (calculating
> >> the
> >> num of times a term occurs in doc X, index time weights etc.).
> >> IndexWriter
> >> already does these somewhere
gt; IndexWriter
>> already does these somewhere down deep.
>>
>>
>> Simply put, I want to add some scores to documents/terms, but I can't
>> generate that score before I observe the document/terms. If I do that I
>> would replicate some of the work that is be
rate that score before I observe the document/terms. If I do that I
> would replicate some of the work that is being already done by
> IndexWriter.
>
> If I remember it correctly, there is also some intention to add document
> payloads functionality. I have the same concerns on thi
ply put, I want to add some scores to documents/terms, but I can't
generate that score before I observe the document/terms. If I do that I
would replicate some of the work that is being already done by
IndexWriter.
If I remember it correctly, there is also some intention to add document
payload
}
public void reset(Reader input) throws IOException
{
tok.reset(input);
}
public void close() throws IOException
{
tok.close();
}
}
**
*Example for the use of payloads:*
PayloadAnalyzer panalyzer = new PayloadAnalyzer();
File ind
Here is what I am doing, not so magical... There are two classes, an
analyzer and an a TokenStream in which I can inject my document dependent
data to be stored as payload.
private PayloadAnalyzer panalyzer = new PayloadAnalyzer();
private class PayloadAnalyzer extends Analyzer {
1 - 100 of 165 matches
Mail list logo