ToChildBlockJoinQuery question

2015-01-21 Thread McKinley, James T
Hi, I'm attempting to use ToChildBlockJoinQuery in Lucene 4.8.1 by following Mike McCandless' blog post: http://blog.mikemccandless.com/2012/01/searching-relational-content-with.html I have a set of child documents which are named works and a set of parent documents which are named persons tha

RE: ToChildBlockJoinQuery question

2015-01-21 Thread McKinley, James T
ric "Empty" DocIdSet. Which leads to the exception from ToChildBlockJoinQuery. The summary is, make sure that your source query only hits documents that were actually added using 'addDocuments()'. Since it looks like you're extracting your block relationships from the exis

RE: ToChildBlockJoinQuery question

2015-01-22 Thread McKinley, James T
ument. A child can't be its own parent. BlockJoin is a very powerful feature, but what it's really doing is modelling relationships using an index that doesn't know what a relationship is. The relationships are determined by a combination of the order that you indexed the block,
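
To make the point about indexing order concrete, here is a minimal sketch that uses only java.util.BitSet, not Lucene's own classes. It assumes the usual block-join layout, in which each block is indexed as its children followed by the parent as the last document; that layout comes from the Lucene block-join documentation rather than from this thread. The class and method names (BlockJoinSketch, childrenOf, parentOf) are hypothetical.

```java
import java.util.ArrayList;
import java.util.BitSet;
import java.util.List;

public class BlockJoinSketch {

    /**
     * Returns the doc IDs of the children of parentDoc, assuming every block
     * is laid out as [child, child, ..., parent] and `parents` marks parent docs.
     */
    static List<Integer> childrenOf(BitSet parents, int parentDoc) {
        if (!parents.get(parentDoc)) {
            throw new IllegalArgumentException("doc " + parentDoc + " is not a parent");
        }
        // The block starts right after the previous parent; previousSetBit
        // returns -1 when there is none, so the first block starts at doc 0.
        int firstChild = parents.previousSetBit(parentDoc - 1) + 1;
        List<Integer> children = new ArrayList<>();
        for (int doc = firstChild; doc < parentDoc; doc++) {
            children.add(doc);
        }
        return children;
    }

    /** Returns the parent of childDoc: the next doc marked as a parent. */
    static int parentOf(BitSet parents, int childDoc) {
        if (parents.get(childDoc)) {
            // Mirrors the rule quoted above: a child can't be its own parent.
            throw new IllegalArgumentException("doc " + childDoc + " is itself a parent");
        }
        int parent = parents.nextSetBit(childDoc);
        if (parent < 0) {
            throw new IllegalStateException("doc " + childDoc + " has no parent after it");
        }
        return parent;
    }

    public static void main(String[] args) {
        // Docs 0-1 are children of parent 2; docs 3-5 are children of parent 6.
        BitSet parents = new BitSet();
        parents.set(2);
        parents.set(6);
        System.out.println(childrenOf(parents, 6)); // children of the second block
        System.out.println(parentOf(parents, 0));   // parent of the first child
    }
}
```

Nothing in the bit set records a relationship as such. The join falls out of document order plus the set of parent positions, which is why a document indexed outside a block, or a query that matches both parents and children, breaks the assumption.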

RE: ToChildBlockJoinQuery question

2015-01-22 Thread McKinley, James T
r) -Mike On 1/22/15 11:27 AM, McKinley, James T wrote: > Hi Greg, > > Thanks for describing how block join queries were intended to work. Your > description makes sense to me, however according to the API docs: > > http://lucene.apache.org/core/4_8_0/join/org/apache/lucene/search/join/ToC

RE: Lucene Version Upgrade (3->4) and Java JVM Versions(6->8)

2015-01-27 Thread McKinley, James T
Why do you say not to use G1GC? We are using Java 7 & G1GC with Lucene 4.8.1 in production. Thanks. Jim From: Uwe Schindler [u...@thetaphi.de] Sent: Tuesday, January 27, 2015 2:49 PM To: java-user@lucene.apache.org; 'kiwi clive' Subject: RE: Lucene Versi

RE: Lucene Version Upgrade (3->4) and Java JVM Versions(6->8)

2015-01-27 Thread McKinley, James T
h_these_settings.html) In fact, the problems with G1GC can sometimes lead to index corruption, and are hard to reproduce. So better don't use... Uwe - Uwe Schindler H.-H.-Meier-Allee 63, D-28213 Bremen http://www.thetaphi.de eMail: u...@thetaphi.de > -Original Message-

RE: Lucene Version Upgrade (3->4) and Java JVM Versions(6->8)

2015-02-06 Thread McKinley, James T
/lucene-java/JavaBugs? Regards Piotr On Tue, Jan 27, 2015 at 9:55 PM, McKinley, James T < james.mckin...@cengage.com> wrote: > Hi Uwe, > > OK, thanks for the info. We'll see if we can download the Lucene test > suite and check it out. > > FWIW, we use G1GC in our pro

RE: Lucene Version Upgrade (3->4) and Java JVM Versions(6->8)

2015-02-09 Thread McKinley, James T
course, is critically important to Solr so from that perspective it is about Solr too. https://wiki.apache.org/lucene-java/JavaBugs And, I assume, it also applies to your custom app. FWIW, Erick On Fri, Feb 6, 2015 at 12:10 PM, McKinley, James T wrote: > Just to be clear in case there wa

RE: Lucene Version Upgrade (3->4) and Java JVM Versions(6->8)

2015-02-11 Thread McKinley, James T
ng G1GC with the 64-bit JVM given that it has better performance on large heaps which are becoming more common today. FWIW, Jim From: McKinley, James T [james.mckin...@cengage.com] Sent: Monday, February 09, 2015 11:00 AM To: java-user@lucene.apache.org Subject: RE:

RE: Lucene Version Upgrade (3->4) and Java JVM Versions(6->8)

2015-02-12 Thread McKinley, James T
are away users, because I don't want them having index corruption. I am sick of people asking "but isn't it fine on the latest version" and so on. It is not. On Wed, Feb 11, 2015 at 11:41 AM, McKinley, James T wrote: > Hi, > > A couple mailing list members have brought t

Re: Lucene Searcher Caching and Performance

2015-08-04 Thread McKinley, James T
Hi Clive, We essentially do what you're suggesting, namely we create a single index searcher (as well as the directory reader it uses) on each partition that is shared amongst all threads. We also perform various index operations (searching, browsing terms etc.) for a while to "warm up" Lucene

Re: How to index & search arrays of double?

2015-08-06 Thread McKinley, James T
Hi Stan, I played around with LIRE a couple years ago. I don't know exactly how it works, but it doesn't just use Lucene from what I remember, it has its own classes built around Lucene to perform the image search. There used to be a PDF of a paper on the site, but I couldn't find a link when

Re: Lucene 5 : any merge performance metrics compared to 4.x?

2015-09-29 Thread McKinley, James T
Hi Adrien and Will, Thanks for your responses. I work with Selva and he's busy right now with other things, so I'll add some more context to his question in an attempt to improve clarity. The merge in question is part of our batch indexing workflow wherein we index new content for a given par

Re: Lucene 5 : any merge performance metrics compared to 4.x?

2015-09-29 Thread McKinley, James T
compared to 4.x? This sounds robust. Is the index batch creation workflow a separate process? Distributed shared filesystems? --will -Original Message- From: McKinley, James T [mailto:james.mckin...@cengage.com] Sent: Tuesday, September 29, 2015 2:22 PM To: java-user@lucene.apache.org

Re: Lucene 5 : any merge performance metrics compared to 4.x?

2015-09-30 Thread McKinley, James T
ks? Meaning Adrien's comment is a > whole lot significant? Merges ALWAYS pre-merge CheckIntegrity? Is this > a 5.0 feature drop? You can't deprecate, um, er totally remove an > index time audit feature on a point release of any level IMHO. > > > -Original Message-

Re: Scoring over Multiple Indexes

2015-10-22 Thread McKinley, James T
Hi Scott, I don't know your reasons for splitting your index up, but assuming you want to do that and then merge the search results back together I think you could re-unify the term document frequencies across all your indexes and then extend IndexSearcher and override termStatistics and collec
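
The "re-unify the term document frequencies" step could look roughly like the sketch below. It is plain Java with no Lucene dependency, and every name in it (GlobalTermStats, mergeDocFreqs, totalDocCount) is hypothetical. It assumes each index can report a per-term document frequency and its own document count; an extended IndexSearcher, as the post suggests, would then return the merged numbers instead of its local ones.

```java
import java.util.HashMap;
import java.util.List;
import java.util.Map;

public class GlobalTermStats {

    /** Sums per-index document frequencies into one global table keyed by term. */
    static Map<String, Long> mergeDocFreqs(List<Map<String, Long>> perIndexDocFreqs) {
        Map<String, Long> global = new HashMap<>();
        for (Map<String, Long> docFreqs : perIndexDocFreqs) {
            for (Map.Entry<String, Long> e : docFreqs.entrySet()) {
                // A term missing from one index simply contributes nothing there.
                global.merge(e.getKey(), e.getValue(), Long::sum);
            }
        }
        return global;
    }

    /** Sums the document counts of all indexes to get the global collection size. */
    static long totalDocCount(List<Long> perIndexDocCounts) {
        long total = 0;
        for (long count : perIndexDocCounts) {
            total += count;
        }
        return total;
    }
}
```

With the same global statistics fed into every index's searcher, a given term gets the same weight regardless of which index a hit came from, so scores from the separate indexes become comparable when the result lists are merged.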

Re: Serializing Queries

2016-03-18 Thread McKinley, James T
We use Kryo to pass query objects between hosts: https://github.com/EsotericSoftware/kryo We initially had some trouble with it creating dynamic classes and running out of PermGen space but we got around that using an ObjectPool: http://commons.apache.org/proper/commons-pool/api-1.6/org/apache/

Re: Persistence/Serialization of Automaton

2016-03-24 Thread McKinley, James T
Here's an archive link from this mailing list regarding serializing queries, I guess this would work for Automaton objects as well. http://mail-archives.apache.org/mod_mbox/lucene-java-user/201603.mbox/browser Hope it helps. Jim From: José Tomás Atria S