Re: Is anybody using setNorm in Production?

2009-04-19 Thread kenny kim
I have used it to adjust norm values of some fields using a custom normalization function which is mixed up of L1 norm and L2 norm. I traverse all documents again after finishing indexing and set each document's norm value using setNorm. On 2009. 04. 19, at 오후 11:50, Mark Miller wrote: Just

Re: LocalLucene/Lucene Spatial

2009-04-19 Thread patrick o'leary
1) The only reason for ConstantScoreQuery is because it lets me convert the LocalLucene filter to a query. 2) TierRangeQuery- not at the moment, we're not really doing a range search, we're pulling predefined id's (e.g. a shape) from the index using the term enumerator. Average time take for a bo

Re: LocalLucene/Lucene Spatial

2009-04-19 Thread John Wang
Is there a reason the Query build is from a bitset via a ConstantScoreQuery instead a RangeQuery? Seems we would be paying a penalty for loading the bitset, esp the bitset would be rather sparse. Furthermore, is TrieRangeQuery planning to be somehow used in the spatial package? Thanks -John On

Re: Indexing Complex XML

2009-04-19 Thread crackeur
try vtd-xml http://vtd-xml.sf.net it works with any XML regardless of complexity - Original Message - From: "Digy" To: java-user@lucene.apache.org Sent: Saturday, April 18, 2009 12:25:21 PM GMT -08:00 US/Canada Pacific Subject: RE: Indexing Complex XML doc.add(new Field("authors",

Is anybody using setNorm in Production?

2009-04-19 Thread Mark Miller
Just a curiosity poll. This is a question on the java-dev list that came up. Anyone taking advantage of setNorm out there? Care to share how/why? -- - Mark http://www.lucidimagination.com - To unsubscribe, e-mail: java-user

Fwd: Proximity and Percentage match search in Lucene

2009-04-19 Thread Radha Sreedharan
What I need is the following : If my document field is ( ab,bc,cd,ef) and Search tokens are (ab,bc,cd). Given the following : I should get a hit even if all of the search tokens aren't present If the tokens are found they should be found within a distance x of each other ( proximity search)

Proximity and Percentage match search in Lucene

2009-04-19 Thread Radha Sreedharan
What I need is the following : If my document field is ( ab,bc,cd,ef) and Search tokens are (ab,bc,cd). Given the following : I should get a hit even if all of the search tokens aren't present If the tokens are found they should be found within a distance x of each other ( proximity search) >

changing term freq in indexing time

2009-04-19 Thread liat oren
Hi, I would like to be able to set the term freq to differnt values at index time, or at search time. So if a document has the following text: 1 2, the freq of 1 will get 100 and the freq of 2 will get 200. I want to avoid expanding it by writing 1 100 times. I looked at Similarity class and wan