Re: How do I write in 3.x format to an upgradeded index using Lucene 4.10

2017-02-09 Thread kiwi clive
Hi Tx, This is just to close the loop. Thank you very much for your helpful suggestions. This works fine and solves our problem. Much appreciated.Clive From: kiwi clive To: Trejkaz ; Lucene Users Mailing List Sent: Wednesday, February 1, 2017 7:37 PM Subject: Re: How do I write in

Re: How do I write in 3.x format to an upgradeded index using Lucene 4.10

2017-01-31 Thread kiwi clive
From: Trejkaz To: Lucene Users Mailing List ; kiwi clive Sent: Wednesday, February 1, 2017 2:53 PM Subject: Re: How do I write in 3.x format to an upgradeded index using Lucene 4.10 > If we take our old 3.x index and apply IndexUpgrader to it, we end up with a > 4.10 index. >

How do I write in 3.x format to an upgradeded index using Lucene 4.10

2017-01-31 Thread kiwi clive
Hi Guys We have several hundred thousand indexes that have been written in Lucene 3.x format. I am trying to migrate to Lucene 4.10 without the need to reindex and  the process should be transparent to our customers. Reindexing all our legacy data is not an option. The predominant analyzer we c

Lucene Searcher Caching and Performance

2015-08-04 Thread kiwi clive
Hi Guys, We have an index/query server that contains several thousand fairly hefty indexes. Each searcher is shared between many 'user-threads' and once opened we keep the searcher in a cache which is refreshed depending on how often it is used. Due to memory limitations on the server, we need s

Re: Search Performance with NRT

2015-05-27 Thread kiwi clive
Hi Mike, Thanks for the very prompt and clear response. We look forward to using the new (new for us) Lucenene goodies :-) Clive From: Michael McCandless To: Lucene Users ; kiwi clive Sent: Thursday, May 28, 2015 2:34 AM Subject: Re: Search Performance with NRT As long as you

Search Performance with NRT

2015-05-27 Thread kiwi clive
Hi Guys We are considering changing our Lucene indexer / search architecture from 2 separate JVMs to a single one to benefit from the very latest index views NRT readers provide. In the past we cached our IndexSearchers to avoid cold searches every time and reopened them periodically.  In the

Re: Lucene Version Upgrade (3->4) and Java JVM Versions(6->8)

2015-01-27 Thread kiwi clive
;java-user@lucene.apache.org" ; kiwi clive Sent: Tuesday, January 27, 2015 4:01 PM Subject: Re: Lucene Version Upgrade (3->4) and Java JVM Versions(6->8) : I seem to remember reading that certain versions of lucene were : incompatible with some java versions although I cannot find anyth

Lucene Version Upgrade (3->4) and Java JVM Versions(6->8)

2015-01-27 Thread kiwi clive
Hello guys, We currently run with Lucene 3.6 and Java6. In view of the fact that Java7 is soon to be deprecated, we are keen to move to Java8 and also to move to the latest version of Lucene. I understand Lucene 5 is coming although we are happy to move to 4.x as there are lots of goodies there

Re: search-time facetting in Lucene

2013-05-06 Thread kiwi clive
runs (we have 10,000+ shards). If it were not for the consolidation required, I thin bobo would have been the way forward. I appreciate you taking the time to explain the situation. Clive From: Shai Erera To: "java-user@lucene.apache.org" ; kiw

search-time facetting in Lucene

2013-05-05 Thread kiwi clive
Hello all Lucene version 3.6.1. Sorry if this is a really stupid question, but is it possible to use search-time facetting on an existing lucene index without the need to reindex? My (limited) understanding is that FacetsCollector will pull facet data from indexes that have been created with

Re: Lucene 4.1 org.apache.lucene.document.Field Deprecation

2013-03-19 Thread kiwi clive
Thank you Mike, Much appreciated :-) From: Michael McCandless To: java-user@lucene.apache.org; kiwi clive Sent: Monday, March 18, 2013 4:14 PM Subject: Re: Lucene 4.1 org.apache.lucene.document.Field Deprecation You need to create your own FieldType, e.g

Lucene 4.1 org.apache.lucene.document.Field Deprecation

2013-03-18 Thread kiwi clive
Hi chaps, Lucene 4.1.0: I notice org.apache.lucene.document.Field(String name, String value, Field.Store store, Field.Index index, Field.TermVector termVector) is marked as deprecated while its suggested replacements (TextField and StringField) to not seem to have support for Term Vectors. I

BlockJoin and RawTermFilter (lucene 4.0.0)

2013-01-16 Thread kiwi clive
Hi Guys, Apologies if this has been asked before but I could not an appropriate post. The Lucene Documentation stresses the use of a RawTermFiler when building up the parentFilter and I was previously using the following in lucene 3.6.0:     Filter parentQueryFilter = new CachingWrappe

Re: Using Lucene 2.3 indices with Lucene 4.0

2012-11-28 Thread kiwi clive
Be aware that StandardAnalyzer changed slightly. This is particularly important if you use it to analyze email addresses and certain text-numeral combinations. My understanding is that the newer version of StandardAnalyzer is more consistent with what it should be doing but if you relied on its

Re: Changing behavior of StandardAnalyzer

2012-11-14 Thread kiwi clive
Hi Bin Lan, This bit me too. You can choose to StandardAnalyzer and set the version number to 2.9. Otherwise you can try using ClassicAnalyzer which I belive is 'old' Standard Analyzer before it was tidied up. Clive From: Bin Lan To: java-user@lucene.apac

Re: Combining The results from DB and Index Regd.,

2012-11-13 Thread kiwi clive
the top page-size records.  This way the ramindex only has to be small and the database does the heavy lifting. - Although at the cost of some sql trickery :-) From: selvakumar netaji To: java-user@lucene.apache.org; kiwi clive Sent: Tuesday, November 13

Re: Combining The results from DB and Index Regd.,

2012-11-13 Thread kiwi clive
I have used the last solution you mention many times to good effect as you can sort across the two data sources and merge the results. Obviously it depends on your architecture, RAM and and the amount of data you are dealing with. Clive From: selvakumar netaj

Re: Lucene 3.6.0 high CPU usage

2012-11-09 Thread kiwi clive
org; kiwi clive Sent: Friday, November 9, 2012 10:04 AM Subject: Re: Lucene 3.6.0 high CPU usage Are you getting the same, improved or worse performance/throughput? Has the bottleneck switched from IO to CPU? -- Ian. On Thu, Nov 8, 2012 at 12:40 PM, kiwi clive wrote: > Having played with

Re: Lucene 3.6.0 high CPU usage

2012-11-08 Thread kiwi clive
of documents through the indexer and it seams to work admirably although I would be happier if the load average came down. Any lucene devs out there who could shed some light on this behaviour ? Thanks, Clive From: kiwi clive To: "java-user@lucene.apach

Re: A large number of files in an index (3.6)

2012-10-29 Thread kiwi clive
difference. Clive From: Lance Norskog To: java-user@lucene.apache.org; kiwi clive Sent: Sunday, October 28, 2012 11:09 PM Subject: Re: A large number of files in an index (3.6) An option: instead of merging continuously as you run, you can optimize with

Re: Lucene 3.6.0 Index Size

2012-10-26 Thread kiwi clive
mind is that the default merge policy has changed in 3.6 from 2.3.2 (I'm almost certain of that). So it's just a hunch but you may have some unmerged segments left over at the end. Try calling IndexWriter.close(true) after you're done indexing. On Fri, Oct 26, 2012 at 10:50 AM, ki

Lucene 3.6.0 Index Size

2012-10-26 Thread kiwi clive
Hello. We have an index that when creted using lucene2.3.2, has a size of about 4G. Creating the same index (with the same parameters) with lucene 3.6.0 results in an 11G index. Could someone shed some light into why the index is so much larger, given the same data and the same parameters? I

Re: StandardAnalyzer functionality change

2012-10-25 Thread kiwi clive
I did some tests and found for our need, ClassicAnalyzer was better (backwards compatible). Our analyzer uses different tokenizers on certain fields but (used to) fall back to StandardAnalyzer by default. ClassicAnalyzer will meet our needs but I see we should move onto a newer implementation su

Re: StandardAnalyzer functionality change

2012-10-24 Thread kiwi clive
rg/apache/lucene/analysis/standard/StandardTokenizer.html > http://lucene.apache.org/core/4_0_0-BETA/analyzers-common/org/apache/lucene/analysis/standard/ClassicTokenizer.html > > -- Jack Krupansky > > -Original Message- From: kiwi clive > Sent: Wednesday, October 24, 2012 6:42 AM > To:

StandardAnalyzer functionality change

2012-10-24 Thread kiwi clive
Hi all, Sorry if I'm asking an age old question but we have migrated to lucene 3.6.0 and I see StandardAnalyzer has changed its behaviour, particularly when tokenizing email addresses. From reading the forums, I understand StandardAnalyzer was renamed to ClassicAnalyzer - is this the case ? I

Re: createJoinQuery use

2012-04-04 Thread kiwi clive
Searcher); Query toQuery = JoinUtil.createJoinQuery("to", "from", actualQuery, indexSearcher); And then use a boolean query to combine it? What is it you want to achieve? Martijn On 29 March 2012 14:04, kiwi clive wrote: > Hi Chaps, > > JoinUtil.createJoinQuery() spec

Re: JoinUtil.createJoinQuery in 3.6 ?

2012-03-29 Thread kiwi clive
impl whilst the trunk (4.0) doesn't have this limitation. Martijn On 29 March 2012 14:39, Michael McCandless wrote: > It'll be in both 3.6 and 4.0. > > Mike McCandless > > http://blog.mikemccandless.com > > On Thu, Mar 29, 2012 at 7:55 AM, kiwi clive wrote: > >

createJoinQuery use

2012-03-29 Thread kiwi clive
Hi Chaps, JoinUtil.createJoinQuery() specifies a Query for the from side of the join. Is it possible to query over both sides of the join (while still providing the two join fields) ? If not, what is the recommended best practice to do this? Thanks, and apologies for the dumb questions C

JoinUtil.createJoinQuery in 3.6 ?

2012-03-29 Thread kiwi clive
Hi Guys, Will this be available in Lucene 3.6 or is it only going into version 4.0 ? Clive

BlockJoinQuery Clarification

2012-03-22 Thread kiwi clive
Hello I've been looking at the BlockJoinQuery in Lucene 3.4.0 and would like to clarify my understanding. Suppose we have a parent document that we index with (say) 4 child documents. My understanding is that these go in as an atomic unit and allows us to query and join across the documents.

Re: Closing IndexWriter can be very slow on large indexes

2011-07-31 Thread kiwi clive
Hi Mike, The problem was due to close().  A shutdown was calling close() which seems to cause lucene to perform a merge. For a busy very large index (with lots of deletes and updates), the merge process could take a very long time to complete (hours). Calling close(false) solved the problem as