Re: Simultaneous Indexing and searching

2020-09-09 Thread Christoph Kaser
Hi Richard, it seems like Lucene index replication could help you here: you could create the index on the backend server and replicate it to the frontend servers. http://shaierera.blogspot.com/2013/05/the-replicator.html http://blog

Re: searchAfter is missing results when custom noncontinuous slices are used

2017-06-01 Thread Christoph Kaser
an issue. Maybe we just improve the javadocs as you suggested, but the situation sure is trappy today. Thanks, Mike McCandless http://blog.mikemccandless.com On Wed, May 24, 2017 at 10:06 AM, Christoph Kaser wrote: Hello everybody, I have observed an unexpected behavior in Lucene, and I am un

searchAfter is missing results when custom noncontinuous slices are used

2017-05-24 Thread Christoph Kaser
Hello everybody, I have observed an unexpected behavior in Lucene, and I am unsure whether this is a bug, or a missing warning in the documentation: I am using the IndexSearcher with an ExecutorService in order to take advantage of multiple CPU cores during the searches. I want to limit the

Re: Lucene commit

2016-08-21 Thread Christoph Kaser
Hello Paul, this is already possible using DirectoryReader.openIfChanged(indexReader, indexWriter). This will give you an IndexReader that already "sees" all changes made by the writer (up to that point), even though the changes have not yet been committed: https://lucene.apache.org/core/6_1_0/core

Re: Newbie Questions

2016-08-10 Thread Christoph Kaser
There is no way to "update" a document in Lucene; you always have to remove the existing document and add the updated version with ALL its fields. The updateDocument method of IndexWriter exists only for convenience (and to ensure the operation is atomic), but internally it does just that: remo
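
The delete-then-add semantics described above can be sketched with a toy in-memory model. This is only an illustration, not the Lucene API: `ToyIndex` and its methods are hypothetical names, and a plain map stands in for the index. It shows why the replacement document must carry every field, since nothing from the old version survives.

```java
import java.util.HashMap;
import java.util.Map;

// Toy in-memory stand-in for an index (hypothetical, NOT the Lucene API).
class ToyIndex {
    private final Map<String, Map<String, String>> docsById = new HashMap<>();

    // Stores a copy of the given fields under the given id.
    void add(String id, Map<String, String> fields) {
        docsById.put(id, new HashMap<>(fields));
    }

    // "Update" is modelled as: delete the old document, then add the
    // replacement exactly as given. Fields missing from newFields are lost.
    void update(String id, Map<String, String> newFields) {
        docsById.remove(id);
        add(id, newFields);
    }

    // Returns the stored fields, or null if no document has this id.
    Map<String, String> get(String id) {
        return docsById.get(id);
    }
}
```

If a document with the fields `title` and `price` is updated with a map that only contains `title`, the stored document no longer has a `price` field.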

Re: Replacement for Filter-as-abstract-class in Lucene 5.4?

2016-01-15 Thread Christoph Kaser
Never mind, I missed the part about it being a base class for your own queries. Sorry for the confusion! On 15.01.2016 at 09:49, Christoph Kaser wrote: Isn't that what ConstantScoreQuery does? The only difference is that it returns 1.0f as score instead of 0.0f. Regards Christoph On 15.01

Re: Replacement for Filter-as-abstract-class in Lucene 5.4?

2016-01-15 Thread Christoph Kaser
Isn't that what ConstantScoreQuery does? The only difference is that it returns 1.0f as score instead of 0.0f. Regards Christoph On 15.01.2016 at 09:27, Uwe Schindler wrote: I had the same problem while migrating old code. Filter is very convenient to use, so why is it deprecated? I agree we

Re: IndexWriter.addIndexes with LeafReader parameter

2016-01-13 Thread Christoph Kaser
You could try using the org.apache.lucene.index.SlowCodecReaderWrapper to wrap your index reader: SlowCodecReaderWrapper.wrap(indexReader) returns a CodecReader from an index reader. Regards Christoph On 13.01.2016 at 09:09, Manner Róbert wrote: Unfortunately I cannot use that, because I do not wa

Re: How to merge several Taxonomy indexes

2015-04-02 Thread Christoph Kaser
together (Correct me if I am wrong). Can it be used to merge several taxonomyDirectories together and create one taxonomy index? On Mon, Mar 23, 2015 at 9:19 PM, Christoph Kaser wrote: Hi Gimantha, have a look at the class org.apache.lucene.facet.taxonomy.TaxonomyMergeUtils, which does exactly

Re: How to merge several Taxonomy indexes

2015-03-23 Thread Christoph Kaser
IndexSearcher. But I am stuck at merging the taxonomy indexes. Is there a way to merge taxonomy indexes? -- Dipl.-Inf. Christoph Kaser IconParc GmbH Sophienstrasse 1 80333 München www.iconparc.de Tel +49 -89- 15 90 06 - 21 Fax +49 -89- 15 90 06 - 49 Geschäftsleitung: Dipl.-Ing. Roland Brückner

Re: Can't get case insensitive keyword analyzer to work

2014-08-12 Thread Christoph Kaser
heQuery, theCollector); ScoreDoc[] theHits = theCollector.topDocs().scoreDocs; System.out.println("Number of results found: " + theHits.length); } -- Regards Milind -- Regards Milind --

Re: search performance

2014-06-02 Thread Christoph Kaser
Can you take thread stacktraces (repeatedly) during those 5 minute searches? That might give you (or someone on the mailing list) a clue where all that time is spent. You could try using jstack for that: http://docs.oracle.com/javase/7/docs/technotes/tools/share/jstack.html Regards Christoph

Re: IndexReplication Client and IndexWriter

2014-04-16 Thread Christoph Kaser
12:02 PM, Christoph Kaser wrote: Hello Shai and Mike, thank you for your answers! I created LUCENE-5597 for this feature. Unfortunately, I am not sure I will be able to provide patches: I don't need this feature at the moment (my interest was more academic) and unfortunately don't h

Re: IndexReplication Client and IndexWriter

2014-04-11 Thread Christoph Kaser
rk on this. Additionally, I created LUCENE-5599, which provides a patch to fix a small performance issue I had with the replicator when replicating large indexes. Regards, Christoph Kaser On 08.04.2014 at 12:45, Michael McCandless wrote: You might be able to use a class on the NRT replication branch (L

IndexReplication Client and IndexWriter

2014-04-08 Thread Christoph Kaser
the client side? Usually, this should not be needed, as only the master should be changed, however if I want to implement a "tree topology", I need an IndexWriter on a non-leaf client, because the IndexRevision that I need to publish needs one. Regards, Christoph -- Dipl.-Inf. Chris

Re: BooleanFilter vs BooleanQuery performance

2013-12-16 Thread Christoph Kaser
il: u...@thetaphi.de -Original Message- From: Christoph Kaser [mailto:lucene_l...@iconparc.de] Sent: Monday, December 16, 2013 2:30 PM To: java-user@lucene.apache.org Subject: BooleanFilter vs BooleanQuery performance Hi all, from my tests on an index with 22 million entries, it seems that in many ca

BooleanFilter vs BooleanQuery performance

2013-12-16 Thread Christoph Kaser
without scoring. Is there a better alternative to using a BooleanFilter? Regards Christoph -- Dipl.-Inf. Christoph Kaser IconParc GmbH Sophienstrasse 1 80333 München www.iconparc.de Tel +49 -89- 15 90 06 - 21 Fax +49 -89- 15 90 06 - 49 Geschäftsleitung: Dipl.-Ing. Roland Brückner, Dipl.-Inf

Using DocValues with CollationKeyAnalyzer

2012-11-06 Thread Christoph Kaser
s SortedBytesDocValuesField meant to be used like that? Best Regards, Christoph Kaser - To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org For additional commands, e-mail: java-user-h...@lucene.apache.org

Re: ToParentBlockJoinQuery - Faceting on Parent and Child Documents

2012-08-03 Thread Christoph Kaser
searcher.search(query,null,facetCollector) This seems to work fine. Best regards, Christoph Kaser On 03.08.2012 at 13:50, Martijn v Groningen wrote: Hi Jayendra, This isn't supported yet. You could implement this by creating a custom Lucene collector. This collector could count the unique hits i

Re: Nested indexing doubt.

2012-06-08 Thread Christoph Kaser
Hi Ananth, You have to add the child documents before the parent document, otherwise the block join query won't work. Regards, Christoph On 08.06.2012 at 10:18, Ananth V wrote: Hey guys, I'm trying to index nested documents in Lucene 3.6. I have the parent document having a 'typ
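
Why the order matters can be sketched with a simplified model. This is not Lucene code, and `BlockModel` is a hypothetical name: it assumes that a child document is attributed to the next parent document that follows it in the index, which is modelled here as a lookup on a `java.util.BitSet` of parent doc ids.

```java
import java.util.BitSet;

// Simplified model of block-join document order (hypothetical, NOT Lucene code).
class BlockModel {
    // parents has one bit set per parent doc id.
    // Returns the doc id of the first parent at or after childDoc,
    // or -1 if no parent follows the child.
    static int parentOf(BitSet parents, int childDoc) {
        return parents.nextSetBit(childDoc);
    }
}
```

With children at doc ids 0 and 1 followed by their parent at doc id 2, both children resolve to parent 2. If the parent is added first (doc id 0) and the children after it (doc ids 1 and 2), the lookup for the children finds no following parent in this model.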

Re: ToParentBlockJoinQuery$BlockJoinWeight cannot explain match on parent document

2012-05-29 Thread Christoph Kaser
parent doc is scored the way it is. Martijn On 25 May 2012 13:20, Christoph Kaser wrote: Hello all, I try to calculate score explanations for a query that contains a ToParentBlockJoinQuery and get the following exception: java.lang.UnsupportedOperationException

ToParentBlockJoinQuery$BlockJoinWeight cannot explain match on parent document

2012-05-25 Thread Christoph Kaser
Hello all, I try to calculate score explanations for a query that contains a ToParentBlockJoinQuery and get the following exception: java.lang.UnsupportedOperationException: org.apache.lucene.search.join.ToParentBlockJoinQuery$BlockJoinWeight cannot explain match on parent document at org.a

Re: ToParentBlockJoinQuery and grand-children

2012-05-25 Thread Christoph Kaser
eated the issue for the missing access to the computed scores: https://issues.apache.org/jira/browse/LUCENE-4077 Regards, Christoph On 24.05.2012 at 18:32, Michael McCandless wrote: On Thu, May 24, 2012 at 11:48 AM, Christoph Kaser wrote: thank you for your response. Unfortunately, I won

Re: ToParentBlockJoinQuery and grand-children

2012-05-24 Thread Christoph Kaser
from the ToParentBlockJoinCollector? I can tell the collector to track the scores and the max score, but I did not find a way to retrieve either the parent scores or the max score (of the parent documents). Christoph Kaser On 23.05.2012 at 20:10, Michael McCandless wrote: You do have to

ToParentBlockJoinQuery and grand-children

2012-05-23 Thread Christoph Kaser
Hello, I would like to use the ToParentBlockJoinQuery and its collector to query a document with children and grand children, but I can't figure out how to get the document ids that represent grand children. I know how to build the query and get the parent and child documents: /Example

Re: Memory question

2012-05-16 Thread Christoph Kaser
Another option to consider is to *decrease* the JVM maximum heap size. This in effect leaves more memory for swapped in mmio pages and decreases the GC effort, which might increase system performance and stability. Regards, Christoph On 15.05.2012 at 21:38, Chris Bamford wrote: Thanks Uwe. W

Parallel searching use ExecutorService and Collectors

2012-05-08 Thread Christoph Kaser
still use my own collectors? Best regards, Christoph Kaser - To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org For additional commands, e-mail: java-user-h...@lucene.apache.org

Re: Document-Ids and Merges

2012-04-05 Thread Christoph Kaser
tra lookup. Christoph Kaser On 28.03.2012 at 19:40, Michael McCandless wrote: On Wed, Mar 28, 2012 at 3:37 AM, Christoph Kaser wrote: Thank you for your answer! That's too bad. I thought of using my own ID-field, but I wanted to save the additional indirection (from docId to my ID t

Re: Document-Ids and Merges

2012-03-28 Thread Christoph Kaser
eld and use that, if you need a stable ID. Mike McCandless http://blog.mikemccandless.com On Tue, Mar 27, 2012 at 3:29 AM, Christoph Kaser wrote: Hi all, I have a search application with 16 million documents that uses custom scores per document using a ValueSource. These values are updated

Re: Document-Ids and Merges

2012-03-28 Thread Christoph Kaser
ild the array from the documents that are in that segment only. It's like FieldCache in a way. The array is relevant as long as the segment exists (i.e. not merged away). Hope this helps. Shai On Mar 27, 2012 9:29 AM, "Christoph Kaser" wrote: Hi all, I have a search applicati
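
The per-segment array idea mentioned above can be sketched in plain Java. This is only an illustration under stated assumptions, not Lucene code: `SegmentScores`, `stableIds`, and `scoreByStableId` are hypothetical names, and the stable application-level ID of each document in the segment is assumed to be available as a string array indexed by segment-local doc id.

```java
import java.util.Map;

// Sketch of a per-segment score cache keyed by a stable ID
// (hypothetical helper, NOT part of Lucene).
class SegmentScores {
    // stableIds[doc] is the application-level ID of the document with
    // segment-local id "doc". The result is indexed the same way, so a
    // scorer can read scores[doc] without a further lookup. It is only
    // valid while that segment exists, i.e. until it is merged away.
    static float[] build(String[] stableIds,
                         Map<String, Float> scoreByStableId,
                         float defaultScore) {
        float[] scores = new float[stableIds.length];
        for (int doc = 0; doc < stableIds.length; doc++) {
            Float score = scoreByStableId.get(stableIds[doc]);
            scores[doc] = (score != null) ? score : defaultScore;
        }
        return scores;
    }
}
```

The indirection from doc id to stable ID is paid once per segment when the array is built, not once per scored hit.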

Document-Ids and Merges

2012-03-27 Thread Christoph Kaser
Hi all, I have a search application with 16 million documents that uses custom scores per document using a ValueSource. These values are updated a lot (and sometimes all at once), so I can't really write them into the index for performance reasons. Instead, I simply have a huge array of float

Re: Indexing product keys with and without spaces in them

2012-01-03 Thread Christoph Kaser
ing would be better if you could do that: CRXUSB2.016GB ==>"CRX USB2.0 16GB", to be indexed as well as the base value. If you can't either have a dedicated product id search field or standardise the product ids, this is going to be hard. -- Ian, On Tue, Jan 3, 2012 at 8:44

Re: Indexing product keys with and without spaces in them

2012-01-03 Thread Christoph Kaser
analysis/injection at indexing would be better if you could do that: CRXUSB2.016GB ==> "CRX USB2.0 16GB", to be indexed as well as the base value. If you can't either have a dedicated product id search field or standardise the product ids, this is going to be hard. -- Ian,

Re: Indexing product keys with and without spaces in them

2012-01-03 Thread Christoph Kaser
e rest of the text. Any help would be great! Best regards, Christoph - To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org For additional commands, e-mail: java-user-help@lucene.apache.org -- Dipl.-Inf. Christoph Kaser IconParc GmbH Sophienstrasse 1 80333 München

Indexing product keys with and without spaces in them

2012-01-03 Thread Christoph Kaser
Hello, we use Lucene as the search engine in an online shop. The products in this shop often contain product keys like CRXUSB2.0-16GB. We would like our customers to be able to find products by entering their key. The problem is that product keys sometimes contain spaces or dashes and customers so
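
One possible way to make such keys comparable is to normalize them identically at index time and at query time. The following is a minimal sketch under assumptions: `ProductKeys.normalize` is a hypothetical helper, not something from the thread, and it simply drops whitespace and dashes and lowercases the rest. It would typically feed a dedicated product key field rather than the general text field.

```java
import java.util.Locale;

// Hypothetical normalizer for product keys (illustrative only).
class ProductKeys {
    // Removes all whitespace and dashes and lowercases the remainder,
    // so differently spaced or dashed spellings of a key compare equal.
    static String normalize(String key) {
        return key.replaceAll("[\\s\\-]+", "").toLowerCase(Locale.ROOT);
    }
}
```

With this rule, `CRXUSB2.0-16GB` and `CRX USB2.0 16GB` both normalize to the same string. It does not help when a customer enters only part of a key.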

Re: Numeric field min max values

2011-11-08 Thread Christoph Kaser
ext()); Hope that helps, Christoph Kaser On 07.11.2011 at 21:07, Uwe Schindler wrote: This is caused by lower-precision terms used by NumericField to allow fast NumericRangeQuery. You have to filter those values by looking at the first few bits, which contain the precision. - Uwe Schindler

Re: Merging several taxonomy indexes for faceted search

2011-10-24 Thread Christoph Kaser
any help! Best Regards, Christoph -- Dipl.-Inf. Christoph Kaser IconParc GmbH Sophienstrasse 1 80333 München www.iconparc.de Tel +49 -89- 15 90 06 - 21 Fax +49 -89- 15 90 06 - 49 Geschäftsleitung: Dipl.-Ing. Roland Brückner, Dipl.-Inf. Sven Angerer. HRB 121830, Amtsgericht München

Merging several taxonomy indexes for faceted search

2011-10-19 Thread Christoph Kaser
Hi all, I am planning to change my existing Lucene index to use the new facets introduced in Lucene 3.4.0. Unfortunately, I could not find an answer to my question in the documentation: I create a relatively large index of 8 million books by dividing it into several smaller groups of docume
