RE: Rewrite SynonymQuery to support payloads

2018-05-24 Thread Markus Jelsma
Hello Alessandro, I was looking for something more light weight, or extend SpanOrQuery to support this case. The rewrite is part of our custom parser extending ExtendedDismaxQParser where we add support for payloads by rewriting everything to a payload equivalent. This is almost the case with

Re: Rewrite SynonymQuery to support payloads

2018-05-23 Thread Alessandro Benedetti
On Wed, May 23, 2018 at 11:41 AM, Markus Jelsma wrote: > Hello, > > To support payloads we rewrite SynonymQuery to a pair of SpanTerm queries > which we then can wrap in the PayloadScoreQuery. This is not the right way > to do this because if both clauses match, both are also sc

Rewrite SynonymQuery to support payloads

2018-05-23 Thread Markus Jelsma
Hello, To support payloads we rewrite SynonymQuery to a pair of SpanTerm queries which we then can wrap in the PayloadScoreQuery. This is not the right way to do this because if both clauses match, both are also scored.  We could try to rewrite SynonymQuery to a SpanOrQuery but i suppose that

Re: Using POS payloads for chunking

2017-06-15 Thread José Tomás Atria
d it has the > > added benefits that it would not require me to meddle into the scoring > > process, which I'm still a bit terrified of. Thanks for the tip. > > > > I guess the question is still valid though? i.e. how would one take into > > account payloads for

Re: Using POS payloads for chunking

2017-06-15 Thread Erick Erickson
ss, which I'm still a bit terrified of. Thanks for the tip. > > I guess the question is still valid though? i.e. how would one take into > account payloads for scoring entire spans? Does this make sense at all? Any > links to a more-or-less straightforward example? > > O

Re: Using POS payloads for chunking

2017-06-15 Thread José Tomás Atria
? i.e. how would one take into account payloads for scoring entire spans? Does this make sense at all? Any links to a more-or-less straightforward example? On the length of payloads: I understood that you have other restrictions, but payloads take a bytesref as value, so you can encode arbitrary da

RE: Using POS payloads for chunking

2017-06-14 Thread Markus Jelsma
gt; Subject: Re: Using POS payloads for chunking > > I think it'd be interesting to also investigate using TypeAttribute [1] > together with TypeTokenFilter [2]. > > Regards, > Tommaso > > [1] : > https://lucene.apache.org/core/6_5_0/core/org/apache/lucene/analy

Re: Using POS payloads for chunking

2017-06-14 Thread Tommaso Teofili
7 23:29 > > To: java-user > > Subject: Re: Using POS payloads for chunking > > > > Markus: > > > > I don't believe that payloads are limited in size at all. LUCENE-7705 > > was done in part because there _was_ a hard-coded 256 limit for some > > of th

RE: Using POS payloads for chunking

2017-06-14 Thread Markus Jelsma
-Original message- > From:Erick Erickson > Sent: Wednesday 14th June 2017 23:29 > To: java-user > Subject: Re: Using POS payloads for chunking > > Markus: > > I don't believe that payloads are limited in size at all. LUCENE-7705 > was done in part because there

Re: Using POS payloads for chunking

2017-06-14 Thread Erick Erickson
Markus: I don't believe that payloads are limited in size at all. LUCENE-7705 was done in part because there _was_ a hard-coded 256 limit for some of the tokenizers. The Payload (at least recent versions) just have some bytes after them, and (with LUCENE-7705) can be arbitrarily long. Of c

RE: Using POS payloads for chunking

2017-06-14 Thread Markus Jelsma
t up with. Finally a BM25 extension that has, amongst others, a mapping of bitset to score. Nouns get a bonus, prepositions and other useless pieces get a punishment etc. Payloads are really great things to use! We also use it to distinguish between compounds and their subwords, o.a. we supply

Re: Using POS payloads for chunking

2017-06-14 Thread Erik Hatcher
Markus - how are you encoding payloads as bitsets and use them for scoring? Curious to see how folks are leveraging them. Erik > On Jun 14, 2017, at 4:45 PM, Markus Jelsma wrote: > > Hello, > > We use POS-tagging too, and encode them as payload bitsets for scoring,

RE: Using POS payloads for chunking

2017-06-14 Thread Markus Jelsma
Hello, We use POS-tagging too, and encode them as payload bitsets for scoring, which is, as far as is know, the only possibility with payloads. So, instead of encoding them as payloads, why not index your treebanks POS-tags as tokens on the same position, like synonyms. If you do that, you can

Using POS payloads for chunking

2017-06-14 Thread José Tomás Atria
in sequences of payloads. (trying not to be too pedantic, a regular chunker looks for 'chunks' based on part-of-speech tags, e.g. noun phrases can be searched for with patterns like "(DT)?(JJ)*(NN|NP)+", that is, an optional determinant and zero or more adjectives preceding

Span near query with payloads

2015-04-14 Thread Shay Hummel
re/4_10_2/core/org/apache/lucene/search/spans/SpanNearPayloadCheckQuery.html> since it properly handles the fact that payloads aren't ordered by SpanNearQuery <http://lucene.apache.org/core/4_10_2/core/org/apache/lucene/search/spans/SpanNearQuery.html>." ? I used SpanNearQuery a

Getting payloads for query terms of constantscore query

2014-05-19 Thread Puneet Pawaia
Hi I can get the payloads for query terms using getPayloadsForQuery from PayloadSpanUtil. However this does not support ConstantScore queries. So how do I get the payloads for queries that get rewritten to ConstantScore query for example PrefixQuery, WildcardQuery. Thanks Puneet

Re: Question about Payloads in Lucene 4.5

2014-03-27 Thread Rohit Banga
Awesome works well for me! Thanks Rohit Banga http://iamrohitbanga.com/ On Sun, Mar 23, 2014 at 10:06 PM, Manuel Le Normand < manuel.lenorm...@gmail.com> wrote: > Hello Rohit, > We had a similar query time bottleneck when attempting to map lucene's > internal id's to the uniqueKey, especially a

Re: Question about Payloads in Lucene 4.5

2014-03-23 Thread Manuel Le Normand
Hello Rohit, We had a similar query time bottleneck when attempting to map lucene's internal id's to the uniqueKey, especially as we generally return only the uniqueKey to the user we had no other use of the stored field. As you noted, every internal id --> uniqueKey id requires a disk seek and as

Re: Question about Payloads in Lucene 4.5

2014-03-22 Thread Michael McCandless
On Sat, Mar 22, 2014 at 5:18 AM, Rohit Banga wrote: > Awesome BinaryDocValues sounds nice! > I saw that NumericDocValues did not inherit from a base class hence I > thought there is no StringDocValues :). > > Can I expect that a searcher manager will invoke > searcherfactory.newSearcher at most o

Re: Question about Payloads in Lucene 4.5

2014-03-22 Thread Rohit Banga
Awesome BinaryDocValues sounds nice! I saw that NumericDocValues did not inherit from a base class hence I thought there is no StringDocValues :). Can I expect that a searcher manager will invoke searcherfactory.newSearcher at most once between searcher manager refreshes? I believe IndexSearcher i

Re: Question about Payloads in Lucene 4.5

2014-03-22 Thread Michael McCandless
On Fri, Mar 21, 2014 at 10:25 PM, Rohit Banga wrote: > Thanks Michael for your response. You're welcome! > Few questions: > > 1. Can I expect better performance when retrieving a single NumericDocValue > for all hits vs when I retrieve documents for all hits to fetch the field > value? As far as

Re: Question about Payloads in Lucene 4.5

2014-03-21 Thread Rohit Banga
/lucenes-searchermanager-simplifies.html), > but how about managing two data sources Lucene index and HashMap String> with SearcherManager? Is there a way to achieve this using a custom > SearcherFactory? > > > Thanks > Rohit Banga > http://iamrohitbanga.com/ > >

Re: Question about Payloads in Lucene 4.5

2014-03-21 Thread Rohit Banga
t; wrote: > DocValues are better than payloads. > > E.g. index a NumericDocValuesField with each doc, holding your id. > > Then at search time you can use MultiDocValues.getNumericValues. > > Mike McCandless > > http://blog.mikemccandless.com > > > On Fri,

Re: Question about Payloads in Lucene 4.5

2014-03-21 Thread Michael McCandless
DocValues are better than payloads. E.g. index a NumericDocValuesField with each doc, holding your id. Then at search time you can use MultiDocValues.getNumericValues. Mike McCandless http://blog.mikemccandless.com On Fri, Mar 21, 2014 at 4:35 PM, Rohit Banga wrote: > Hi everyone > &g

Question about Payloads in Lucene 4.5

2014-03-21 Thread Rohit Banga
me metadata along with the internal document id when I did a search. I do not want to read all documents just to retrieve this metadata. The best solution I have come across searching on the net is to use payloads which will be returned by the fast index search query along with the document ids. Is

Re: Indexing useful N-grams (phrases & entities) and adding payloads

2014-03-12 Thread Manuel Le Normand
SynonymFilter makes sense. The planned payloads are indeed not needed. I guess a better solution would be making out of the boost an attribute during query time that will be consumed in the queryParser in order to boost these n-gram terms. Thanks for the hints. Manuel On Wed, Mar 12, 2014 at

Re: Indexing useful N-grams (phrases & entities) and adding payloads

2014-03-12 Thread Michael McCandless
s. Part of it will be achieved by > the CommonGram filter in which I put the frequent words. > > I think of going a step further and index phrase queries (extracted from my > query log) entities (from gazetteers). In order to control the boost on > these N-gram matches I plan adding payl

Indexing useful N-grams (phrases & entities) and adding payloads

2014-03-12 Thread Manuel Le Normand
a step further and index phrase queries (extracted from my query log) entities (from gazetteers). In order to control the boost on these N-gram matches I plan adding payloads to these terms. I'm thinking of two different implementations: 1. Using MappingCharFilter - the mapping.txt wou

Suggesters: payloads and filter predicates

2014-01-08 Thread Oliver Christ
Hi, It's great to see support for payloads in the suggesters - this is really helpful, and pretty much addresses LUCENE-4516. Are there any plans to also support them for WFSTs? We have some cases where we don't need the Analyzer's capabilities (we look up the completion us

Re: Using Payloads as a Coefficient For Score At a Custom QParser That extends ExtendedDismaxQParser

2013-12-01 Thread Furkan KAMACI
Second link is that: http://digitalpebble.blogspot.com/2010/08/using-payloads-with-dismaxqparser-in.html 2013/12/1 Furkan KAMACI > Hi; > > I use Solr 4.5.1 I have a case: When a user searches for some specific > keywords some documents should be listed at much more higher th

Using Payloads as a Coefficient For Score At a Custom QParser That extends ExtendedDismaxQParser

2013-12-01 Thread Furkan KAMACI
://sujitpal.blogspot.com/2013/07/porting-payloads-to-solr4.html#! However that example extends Qparser directly but I want to use capabilities of *edismax*. So I found that example: file:///home/furkan/Desktop/DigitalPebble's%20Blog%20%20Using%20Payloads%20with%20DisMaxQParser%20in%20SOLR.html Thi

Re: Custom Relevancy Using Field Payloads

2013-11-29 Thread Furkan KAMACI
it extend edismax or I should do that: if a search occurs on a usual field and a spesific field(payload field) score will be calculated from sum of usual field score and payload fied score via my custom alternative similarity. I've done per field similarity function but I got that error: "Fie

Re: Custom Relevancy Using Field Payloads

2013-11-29 Thread Erik Hatcher
n I do that? Custom payload similarity class or custom function > query? > > I've followed here: > http://sujitpal.blogspot.com/2013/07/porting-payloads-to-solr4.html#! but > decodeNormValue if a final method anymore. How about that: > http://www.solrtutorial.com/custom-solr

Using Payloads For Boosting at a Custom Function Query?

2013-11-27 Thread Furkan KAMACI
Hi; I use Solr 4.5.1 Due to my question is more related to Lucene I wanted to ask this question here. Is there any example which uses payloads for boosting at a custom function query? Thanks; Furkan KAMACI

Custom Relevancy Using Field Payloads

2013-11-27 Thread Furkan KAMACI
score (payload * normalize coefficient) How can I do that? Custom payload similarity class or custom function query? I've followed here: http://sujitpal.blogspot.com/2013/07/porting-payloads-to-solr4.html#! but decodeNormValue if a final method anymore. How about that: http://www.solrtuto

RE: Payloads disabled in 4.5?

2013-10-15 Thread Kyle Judson
That was it. Reversing the order fixed it. Thanks for wading through the lack of whitespace. ThanksKyle > From: luc...@mikemccandless.com > Date: Tue, 15 Oct 2013 09:24:17 -0400 > Subject: Re: Payloads disabled in 4.5? > To: java-user@lucene.apache.org > > Something catastroph

Re: Payloads disabled in 4.5?

2013-10-15 Thread Michael McCandless
those two? Mike McCandless http://blog.mikemccandless.com On Tue, Oct 15, 2013 at 8:13 AM, Kyle Judson wrote: > > > > > > > > > Hi All, > > > I'm trying to create an index with payloads using Lucene 4.5 but the payloads > don't seem to be getting

Payloads disabled in 4.5?

2013-10-15 Thread Kyle Judson
Hi All, I'm trying to create an index with payloads using Lucene 4.5 but the payloads don't seem to be getting into the index. I looked into the code and it looks like storePayloads argument to addOrUpdateInternal when called from addOrUpdate in FieldInfos is set to false

Re: Reading Payloads

2013-04-23 Thread Carsten Schnober
Am 23.04.2013 16:17, schrieb Alan Woodward: > It doesn't sound as though an inverted index is really what you want to be > querying here, if I'm reading you right. You want to get the payloads for > spans at a specific position, but you don't particularly care about th

Re: Reading Payloads

2013-04-23 Thread Alan Woodward
Hi Carsten, It doesn't sound as though an inverted index is really what you want to be querying here, if I'm reading you right. You want to get the payloads for spans at a specific position, but you don't particularly care about the actual term at that position? You

Re: Reading Payloads

2013-04-23 Thread Carsten Schnober
d on a SpanQuery which depends on a specific expression to search for. In my use case, I need to retrieve Spans specified by their offsets only, and then get their payloads and process them further. Alternatively, I could query for the occurence of certain string patterns in the payloads and c

Re: Reading Payloads

2013-04-23 Thread Alan Woodward
g to figure out a way to use a query as Uwe suggested. My >> scenario is to perform a query and then retrieve some of the payloads >> upon user request, so there no obvious way to wrap this into a query as >> I can't know what (terms) to query for. > > I wonder: is

Re: Reading Payloads

2013-04-23 Thread Carsten Schnober
Am 23.04.2013 13:47, schrieb Carsten Schnober: > I'm trying to figure out a way to use a query as Uwe suggested. My > scenario is to perform a query and then retrieve some of the payloads > upon user request, so there no obvious way to wrap this into a query as > I can't know

Re: Reading Payloads

2013-04-23 Thread Carsten Schnober
Am 23.04.2013 13:21, schrieb Michael McCandless: > Actually, term vectors can store payloads now (LUCENE-1888), so if that > field was indexed with FieldType.setStoreTermVectorPayloads they should be > there. > > But I suspect the TokenSources.getTokenStream API (which I think u

Re: Reading Payloads

2013-04-23 Thread Michael McCandless
Actually, term vectors can store payloads now (LUCENE-1888), so if that field was indexed with FieldType.setStoreTermVectorPayloads they should be there. But I suspect the TokenSources.getTokenStream API (which I think un-inverts the term vectors to recreate the token stream = very slow?) wasn&#

RE: Reading Payloads

2013-04-23 Thread Uwe Schindler
TermVectors are per-document and do not contain payloads. You are reading the per-document TermVectors which is a "small index" *stored* for each document as a binary blob. This blob only contains the terms of this document with its positions/offsets, but no payloads (offsets are use

Reading Payloads

2013-04-23 Thread Carsten Schnober
Hi, I'm trying to extract payloads from an index for specific tokens the following way (inserting sample document number and term): Terms terms = reader.getTermVector(16504, "term"); TokenStream tokenstream = TokenSources.getTokenStream(terms); while (tokenstream

Lucene 3.5 Payloads

2012-01-31 Thread Stephen Howe
ver, when I print the document off and look at the index in Luke, the field I'm trying to append the payload to has no terms or payloads associated with it even though I specified both term and payload attributes in the token stream code. I'm quite confused how these things are supposed

Lucene 4.0 Payloads

2011-03-17 Thread Alex vB
Hello everybody, I am currently experimenting with Lucene 4.0 and would like to add payloads. Payload should only be added once per term on the first position. My current code looks like this: public final boolean incrementToken() throws java.io.IOException { String term

How are stored Fields/Payloads loaded

2011-02-28 Thread Alex vB
Hello everybody, I am currently unsure how stored data is written and loaded from index. I want to store for every term of a document some binary data but only once and not for every position! Therefore I am not sure if Payloads or stored Fields are the better solution (Or the not implemented

Re: Storing payloads without term-position and frequency

2011-02-03 Thread Alex
first level (Lucene) a document for one wikipedia article containing all distinct terms of its versions. On the second level (payloads) I store the frequency information corresponding to each article version and its terms. If I search now I can find an article by its term and through the term and

Re: Storing payloads without term-position and frequency

2011-02-03 Thread Grant Ingersoll
Payloads only make sense in terms of specific positions in the index, so I don't think there is a way to hack Lucene for it. You could, I suppose, just store the payload for the first instance of the term. Also, what's the use case you are trying to solve here? Why store term freq

Re: Payloads API and support

2011-02-02 Thread Ophir Cohen
pert usages > of Lucene. From what I can tell, you have sent the same question 3 times in > a matter of less than a day. Sending more than once in a 2-3 day period is > just going to make it less likely that you will get help, not more likely. > > Some suggestions inline below.

Re: Storing payloads without term-position and frequency

2011-02-02 Thread Yuhan Zhang
rently using Lucene 3.0.2 with payloads. I store extra information > in the payloads about the term like frequencies and therefore I don't need > frequencies and term positions stored normally by Lucene. I would like to > set f.setOmitTermFreqAndPositions(true) but then I am not able to

Storing payloads without term-position and frequency

2011-02-02 Thread Alex vB
Hello everybody, I am currently using Lucene 3.0.2 with payloads. I store extra information in the payloads about the term like frequencies and therefore I don't need frequencies and term positions stored normally by Lucene. I would like to set f.setOmitTermFreqAndPositions(true) but then

Re: Payloads API and support

2011-02-02 Thread Grant Ingersoll
ions inline below. > > > Lately I encountered the new payloads support and it looks its a great > solution for my project. > > > *The problem:* > > The use case is as follows: > > I need to support a way to calculate statistics on web pages. > > Each pa

Payloads API and support

2011-02-01 Thread Ophir Cohen
Hi Guys, I've been using Lucene for more than 5 years and it is a great tool - great job! Thanks for everything... Lately I encountered the new payloads support and it looks its a great solution for my project. *The problem:* The use case is as follows: I need to support a w

Fwd: Payloads API and support

2011-02-01 Thread Ophir Cohen
Hi Guys, I've been using Lucene for more than 5 years and it is a great tool - great job! Thanks for everything... Lately I encountered the new payloads support and it looks its a great solution for my project. *The problem:* The use case is as follows: I need to support a w

Re: Retrieve term payloads / custom PayloadFilter

2010-07-08 Thread Erick Erickson
rm translations from the > subdocuments. > > The IMO better alternative is not to split the document and to assign the > language tags as payloads to the terms. But then I need > > (i) a search filter that eliminates docs based on a given language tag and > > (ii) a way

Retrieve term payloads / custom PayloadFilter

2010-07-08 Thread Bernhard Haslhofer
document and to assign the language tags as payloads to the terms. But then I need (i) a search filter that eliminates docs based on a given language tag and (ii) a way to access the term payloads from the documents returned by the searcher For both I haven't found a solution yet. Can I wr

Re: How to calculate payloads in queries too

2010-04-12 Thread Mike Schultz
I see the payload in the token now. -- View this message in context: http://n3.nabble.com/How-to-calculate-payloads-in-queries-too-tp712743p713413.html Sent from the Lucene - Java Users mailing list archive at Nabble.com. - To

How to calculate payloads in queries too

2010-04-11 Thread Mike Schultz
I am interested in using payloads in the following way. I store Func(index-term) as a payload at index-term when indexing. When querying I want to compute Func(query-term) as well. Then my similarity returns some other function, Gunc(Func(index-term1),Func(query-term)). As an example, maybe

RE: Payloads

2009-12-21 Thread Elias Khsheibun
er@lucene.apache.org Subject: RE: Payloads > Let's say I have a document that > contains the following text: > > "Graph Algorithms is one of the most important topics in computer > science" > > And a query "!Graph Algorithms" then the term Graph in the q

RE: Payloads

2009-12-20 Thread Uwe Schindler
.-Meier-Allee 63, D-28213 Bremen http://www.thetaphi.de eMail: u...@thetaphi.de > -Original Message- > From: Elias Khsheibun [mailto:eli...@gmail.com] > Sent: Sunday, December 20, 2009 2:51 PM > To: java-user@lucene.apache.org > Subject: RE: Payloads > > > I'm t

RE: Payloads

2009-12-20 Thread Elias Khsheibun
hits2 = searcher.search(query2, 10).scoreDocs; for (int i = 0; i < hits2.length; i++) { Document hitDoc = searcher.doc(hits2[i].doc); System.out.println(hitDoc.get("title")); } } } -Original Message- From: AHMET ARSLAN [mail

RE: Payloads

2009-12-19 Thread Elias Khsheibun
What do you mean by a custom one - please explain. I must use a PayloadTermQuery ? And for the TermPositionPayloadTokenFilter there is a method that is not used - incrementToken (only used in the main method) ... I didn't see in the code the place that examines if the query term is at an even offs

RE: Payloads

2009-12-19 Thread AHMET ARSLAN
> If I need to override the QueryParser > to return PayloadTermQuery, what > function for PayloadFunction should I use in the > constructor (If you can > show me an example). I am not sure about that. Maybe custom one. > In your code I didn't see an indexer, will this work with > the regular > I

RE: Payloads

2009-12-19 Thread Elias Khsheibun
oaded ? -Original Message- From: AHMET ARSLAN [mailto:iori...@yahoo.com] Sent: Saturday, December 19, 2009 8:34 PM To: java-user@lucene.apache.org Subject: RE: Payloads > Let's say I have a document that > contains the following text: > > "Graph Algorithms is one of t

RE: Payloads

2009-12-19 Thread AHMET ARSLAN
to "Graph|2.0 Algorithms" you can use DelimitedPayloadTokenFilter to set payload of marked term. Additionally you need to everride QueryParser to return PayloadTermQuery and scorePayload method of DefaultSimilarity. By doing so payloads will be included in score calculation. public

RE: Payloads

2009-12-19 Thread Elias Khsheibun
About 60 students I think, if you have given some answers I would be grateful if you could link me to them or quote them again. -Original Message- From: Uwe Schindler [mailto:u...@thetaphi.de] Sent: Saturday, December 19, 2009 7:00 PM To: java-user@lucene.apache.org Subject: RE: Payloads

RE: Payloads

2009-12-19 Thread Uwe Schindler
> Sent: Saturday, December 19, 2009 5:54 PM > To: java-user@lucene.apache.org > Subject: RE: Payloads > > Let's say I have a document that contains the following text: > > "Graph Algorithms is one of the most important topics in computer science" > > And

RE: Payloads

2009-12-19 Thread Elias Khsheibun
is even) - we apply this doubling of weight only if a '!' operator precedes the term and if its offset from the document is even. -Original Message- From: AHMET ARSLAN [mailto:iori...@yahoo.com] Sent: Saturday, December 19, 2009 6:48 PM To: java-user@lucene.apache.org Subject:

RE: Payloads

2009-12-19 Thread AHMET ARSLAN
> I want to override the operator - it > is for a project purpose. Can you explain your requirements more? What do you mean by "an even offset of the document"? - To unsubscribe, e-mail: java-user-unsubscr...@lucene.apa

RE: Payloads

2009-12-19 Thread Elias Khsheibun
I want to override the operator - it is for a project purpose. -Original Message- From: AHMET ARSLAN [mailto:iori...@yahoo.com] Sent: Saturday, December 19, 2009 6:41 PM To: java-user@lucene.apache.org Subject: Re: Payloads > Hi, > > I need to add a query operator '!'

Re: Payloads

2009-12-19 Thread AHMET ARSLAN
the offset of its first word. > > I guess it involves payloads... > > Elias. '!' is already a query operator. It is equivalent of NOT. So you cannot use it. Why not use carat operator? Like singleterm^2 "some phrase"^2 [Boosti

Payloads

2009-12-19 Thread Elias Khsheibun
t involves payloads... Elias. - To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org For additional commands, e-mail: java-user-h...@lucene.apache.org

Re: NearSpansUnordered payloads not returning all the time

2009-12-09 Thread Michael McCandless
ell on a prior span. >> >> Mike >> >> On Wed, Dec 9, 2009 at 11:25 AM, Jason Rutherglen >> wrote: >>> Right we're getting the spans, however it's just the payloads that are >>> missing, randomly... >>> >>> On Wed, Dec 9,

Re: NearSpansUnordered payloads not returning all the time

2009-12-09 Thread Jason Rutherglen
f it already fell on a prior span. > > Mike > > On Wed, Dec 9, 2009 at 11:25 AM, Jason Rutherglen > wrote: >> Right we're getting the spans, however it's just the payloads that are >> missing, randomly... >> >> On Wed, Dec 9, 2009 at 2:23 AM, Michae

Re: NearSpansUnordered payloads not returning all the time

2009-12-09 Thread Michael McCandless
it back, if it already fell on a prior span. Mike On Wed, Dec 9, 2009 at 11:25 AM, Jason Rutherglen wrote: > Right we're getting the spans, however it's just the payloads that are > missing, randomly... > > On Wed, Dec 9, 2009 at 2:23 AM, Michael McCandless > wrote: >

Re: NearSpansUnordered payloads not returning all the time

2009-12-09 Thread Jason Rutherglen
Right we're getting the spans, however it's just the payloads that are missing, randomly... On Wed, Dec 9, 2009 at 2:23 AM, Michael McCandless wrote: > There was a thread a while back about how span queries don't enumerate > every possible span, but I can't remember

Re: NearSpansUnordered payloads not returning all the time

2009-12-09 Thread Michael McCandless
There was a thread a while back about how span queries don't enumerate every possible span, but I can't remember if that included sometimes missing payloads... Mike On Tue, Dec 8, 2009 at 7:34 PM, Jason Rutherglen wrote: > Howdy, > > I am wondering

NearSpansUnordered payloads not returning all the time

2009-12-08 Thread Jason Rutherglen
Howdy, I am wondering if anyone has seen NearSpansUnordered.getPayload() not return payloads that are verifiably accessible via IR.termPositions? It's a bit confusing because most of the time they're returned properly. I suspect the payload logic gets tripped up in NearSpansUnordered

Re: NearSpansUnordered payloads

2009-11-25 Thread Jason Rutherglen
I don't mind adding the "positions" of the payloads in them. However, maybe we can be little more clear in the javadocs what's going on underneath? On Wed, Nov 25, 2009 at 5:36 AM, Mark Miller wrote: > Grant Ingersoll wrote: >> On Nov 20, 2009, at 6:49 PM, Jason Ru

Re: NearSpansUnordered payloads

2009-11-25 Thread Mark Miller
an NearSpansOrdered in this >> regard. >> >> NearSpansUnordered returns payloads in a hash set that's >> computed each method call by iterating over the SpanCell as a >> linked list, whereas NearSpansOrdered stores the payloads in a >> list (which is ordered) only when co

Re: NearSpansUnordered payloads

2009-11-25 Thread Grant Ingersoll
On Nov 20, 2009, at 6:49 PM, Jason Rutherglen wrote: > I'm interested in getting the payload information from the > matching span, however it's unclear from the javadocs why > NearSpansUnordered is different than NearSpansOrdered in this > regard. > > NearSpansUno

Re: NearSpansUnordered payloads

2009-11-24 Thread Christopher Tignor
regard. > > NearSpansUnordered returns payloads in a hash set that's > computed each method call by iterating over the SpanCell as a > linked list, whereas NearSpansOrdered stores the payloads in a > list (which is ordered) only when collectPayloads is true. > > At first

NearSpansUnordered payloads

2009-11-20 Thread Jason Rutherglen
I'm interested in getting the payload information from the matching span, however it's unclear from the javadocs why NearSpansUnordered is different than NearSpansOrdered in this regard. NearSpansUnordered returns payloads in a hash set that's computed each method call by ite

Re: Multiterms query and payloads

2009-10-27 Thread Mauro Dragoni
ght help you.  Otherwise, > the PayloadTermQuery is still a Query and can be used in a BooleanQuery. >  Beyond that, you may need to write what you need, since it doesn't exist > yet. > > > On Oct 26, 2009, at 10:25 AM, Mauro Dragoni wrote: > >> Hi to everyone, >>

Re: Multiterms query and payloads

2009-10-26 Thread Grant Ingersoll
Hi to everyone, I started to use payloads in my indexes. However, I didn't find a class that permits to perform queries using more than one term. The only way that works is to use the BoostingTermQuery class, but by using this class, I can search only one term a time. Does exist a way to do a

Multiterms query and payloads

2009-10-26 Thread Mauro Dragoni
Hi to everyone, I started to use payloads in my indexes. However, I didn't find a class that permits to perform queries using more than one term. The only way that works is to use the BoostingTermQuery class, but by using this class, I can search only one term a time. Does exist a way to

Re: Retrieving payloads for terms matched by a query

2009-05-22 Thread Grant Ingersoll
On May 22, 2009, at 12:28 AM, Dmitri Bichko wrote: Hi, I may be missing something obvious, but how do I get the payloads for the specific token positions that were matched by a query? See SpanQuery.getPayloadSpans() and it's SpanQuery derivatives. For example, if I have a phrase

Retrieving payloads for terms matched by a query

2009-05-21 Thread Dmitri Bichko
Hi, I may be missing something obvious, but how do I get the payloads for the specific token positions that were matched by a query? For example, if I have a phrase query like "A keyword B" that matches the field "A keyword B A", I can get the payloads for A and B with Index

Re: Using Payloads

2009-04-27 Thread liat oren
gt;> 2009/4/26 Murat Yakici >>>> >>>> >>>> >>>>> Yes, this is more or less what I had in mind. However, for this >>>>> approach >>>>> one requires some *prior knowledge* of the vocabulary of the document >>>>&g

Re: Using Payloads

2009-04-27 Thread Murat Yakici
exWriter. If I remember it correctly, there is also some intention to add document payloads functionality. I have the same concerns on this. So I think we need a clear view on the topic. Where is the payload work moving? How we can generate a score without duplicating some of the work that IndexWr

Re: Using Payloads

2009-04-27 Thread liat oren
y have an indexing time penalty. If your window size is > >> the document itself, you will be doing the same job twice (calculating > >> the > >> num of times a term occurs in doc X, index time weights etc.). > >> IndexWriter > >> already does these somewhere

Re: Using Payloads

2009-04-26 Thread Murat Yakici
gt; IndexWriter >> already does these somewhere down deep. >> >> >> Simply put, I want to add some scores to documents/terms, but I can't >> generate that score before I observe the document/terms. If I do that I >> would replicate some of the work that is be

Re: Using Payloads

2009-04-26 Thread liat oren
rate that score before I observe the document/terms. If I do that I > would replicate some of the work that is being already done by > IndexWriter. > > If I remember it correctly, there is also some intention to add document > payloads functionality. I have the same concerns on thi

Re: Using Payloads

2009-04-26 Thread Murat Yakici
ply put, I want to add some scores to documents/terms, but I can't generate that score before I observe the document/terms. If I do that I would replicate some of the work that is being already done by IndexWriter. If I remember it correctly, there is also some intention to add document payload

Re: Using Payloads

2009-04-26 Thread liat oren
} public void reset(Reader input) throws IOException { tok.reset(input); } public void close() throws IOException { tok.close(); } } ** *Example for the use of payloads:* PayloadAnalyzer panalyzer = new PayloadAnalyzer(); File ind

Re: Using Payloads

2009-04-25 Thread Murat Yakici
Here is what I am doing, not so magical... There are two classes, an analyzer and an a TokenStream in which I can inject my document dependent data to be stored as payload. private PayloadAnalyzer panalyzer = new PayloadAnalyzer(); private class PayloadAnalyzer extends Analyzer {

  1   2   >