Re: Reversing norms to the field length

2025-04-02 Thread David Smiley
I did this: https://github.com/apache/lucene/pull/14433/files but using the newer LongValuesSource API not the older ValueSource API. On Wed, Apr 2, 2025 at 3:12 PM David Smiley wrote: > Actually, I think all that's needed is a new ValueSource that gets the > norm and calls SmallFloat.byte4ToIn

Re: [Question] What is the `close` contract of sliced IndexInput?

2025-04-02 Thread Bruno Roustant
Recently I wondered the same question because BufferedIndexInput.SliceIndexInput.close() calls base.close() where base is a clone of the original input (so in the end it's a noop?) ByteBuffersIndexInput.close() simply sets in = null but does not call in.close() (shouldn't it do so?)

Re: [Question] What is the `close` contract of sliced IndexInput?

2025-04-02 Thread Bruno Roustant
Ah, ByteBuffersDataInput isn't closeable, as explained by Michael above. Le mer. 2 avr. 2025 à 15:34, Bruno Roustant a écrit : > Recently I wondered the same question because > > BufferedIndexInput.SliceIndexInput.close() calls base.close() where base > is a clone of the original input (so in th

Re: Reversing norms to the field length

2025-04-02 Thread David Smiley
Actually, I think all that's needed is a new ValueSource that gets the norm and calls SmallFloat.byte4ToInt, which inverts Similarity.computeNorm (which calls SmallFloat.intToByte4). All the Similarity impls keep that same implementation of computeNorm. Admittedly I'm unsure about NormValueSource

Re: Planned OS Upgrade for lucene-solr-1 and lucene-solr-2

2025-04-02 Thread Uwe Schindler
Hi, go ahead. I preserved some config files. Actually the Gradle builds are easier to maintain as Ant, so basically it is just 2 files. Uwe Am 02.04.2025 um 17:21 schrieb Uwe Schindler: Oh sorry, will do! I ping you. Uwe Am 02.04.2025 um 17:02 schrieb Christos Malliaridis: @Uwe Did you hav

Re: Planned OS Upgrade for lucene-solr-1 and lucene-solr-2

2025-04-02 Thread Uwe Schindler
Can you make sure that the same people get sudo/root acces slike before? It would also be ideal to maybe ask them to preserve the jenkins directory. I cleaned them up a bit already. If we get a new machine, could you ask them for more power and memory? The current machines are very slow, looks

Re: Planned OS Upgrade for lucene-solr-1 and lucene-solr-2

2025-04-02 Thread Christos Malliaridis
@Uwe Did you have the time to run a backup of the home directory? I would proceed and open an infra ticket if we are ready, I just need to know what "all packages we need for Jenkins Slaves" includes, as I am not sure how to look this up to include it in the ticket. Christos Hi, > I tried the d

Re: Welcome Michael Froh as Lucene committer

2025-04-02 Thread David Smiley
Welcome Michael! (belated) On Thu, Mar 6, 2025 at 3:07 AM Dawid Weiss wrote: > > Hello everyone, > > I'm pleased to announce that Michael Froh has accepted the PMC's > invitation to become a Lucene committer. > > Michael, the tradition is for new committers to shortly introduce > themselves. >

Grouping cfs cfe and si

2025-04-02 Thread Bruno Roustant
When compound files are enabled, Lucene90CompoundFormat groups all the files of a segment into two files cfs and cfe. Then the segment info is written to the si file. And optionally a liv file is also generated. With a goal of reducing further the number of files of a segment, to be more cost effe

Re: Proposing a 10.2.0 release

2025-04-02 Thread Ignacio Vera
Hey Tommaso, I have no objections if you can get the PR merged today. Cheers, Ignacio On Tue, Apr 1, 2025 at 8:14 PM Tommaso Teofili wrote: > > I'd like to get https://github.com/apache/lucene/pull/14094 in by tomorrow, > unless anyone objects. > > > On Mon, 31 Mar 2025 at 17:15, Robert Muir

Re: Proposing a 10.2.0 release

2025-04-02 Thread Mayya Sharipova
Hello everyone, We have discovered that a change introduced in 10.2 doesn't work nicely when two fields are indexed with the same analyzer. There is a workaround, but for now I will revert this change for th

[ Question ] Text Field with IndexOptions.DOCS don't use skip data due to SlowImpactsEnum with Lucene 9.12.0 compared to 9.11.1

2025-04-02 Thread ANIKETH JAIN
Hey folks, While investigating a regression in OpenSearch versions 2.17.1 ( Lucene 9.11.1 ) and 2.18.0 ( Lucene 9.12.0 ) for simple Term Query in Big5 workload over process.name field, I noticed that the new Lucene912PostingsReader creates the ImpactsEnum by wrapping SlowImpactsEnum over postings

Re: [ Question ] Text Field with IndexOptions.DOCS don't use skip data due to SlowImpactsEnum with Lucene 9.12.0 compared to 9.11.1

2025-04-02 Thread Michael Froh
Hey, Full disclosure -- I sit at a desk next to Aniketh, so we chatted about this one in the real world. Our current working theory is as follows: We're using the TOP_SCORES score mode on both the old and new code paths. On the old code path, we were returning a BlockImpactsDocsEnum, even though

Re: MultiRangeQuery for DocValues

2025-04-02 Thread Mikhail Khludnev
Hello, Here is extension this idea to numeric dvs https://github.com/apache/lucene/pull/14404 Really appreciate your review. Thanks in advance. On Fri, Nov 29, 2024 at 8:37 PM Greg Miller wrote: > Ah right OK. Apologies. I misread your original email in haste. What > you’re proposing makes sense

Re: [ Question ] Text Field with IndexOptions.DOCS don't use skip data due to SlowImpactsEnum with Lucene 9.12.0 compared to 9.11.1

2025-04-02 Thread Adrien Grand
> It was fast because (once the collector has filled its priority queue), we'd check the (constant) impacts to find the first block that's strictly better than the min competitive score. Since all scores are equal, that would quickly skip to the end This sounds correct to me. To expand on your ana