Re: How to find RAM/disk usage of each vector field

2024-11-05 Thread Patrick Zhai
I wouldn't call this a good way, but as the last resort you can parse the metadata files yourself, as it is not so hard to parse (yet), the logics are in: Lucene99HnswVectorsFormat.java Lucene99FlatVectorsFormat.java The risk for sure is that whenever the format is changed the parsing logic will ne

Re: How to find RAM/disk usage of each vector field

2024-11-05 Thread Adrien Grand
I cannot think of good ways to do this. Why is it important to break down per field as opposed to scaling based on the total volume of vector data? On Tue, Nov 5, 2024 at 10:58 PM Tanmay Goel wrote: > Hi Rui > > Thanks for your response and the snippet that you shared is great but not > exactly

Re: How to find RAM/disk usage of each vector field

2024-11-05 Thread Tanmay Goel
Hi Rui Thanks for your response and the snippet that you shared is great but not exactly what I was looking for. With this snippet we are able to find the total size of the .vec files, but I want to see inside the .vec files and try to compute a map of vector_field_name to the number of bytes on d

RE: Indexing multiple numeric ranges

2024-11-05 Thread Siraj Haider
That’s great! I will look into it. Thanks a lot! -Siraj -Original Message- From: Adrien Grand Sent: Tuesday, November 5, 2024 11:19 AM To: java-user@lucene.apache.org Subject: Re: Indexing multiple numeric ranges Hello Siraj, You can do this by creating a Lucene document that has 3 o

Re: Indexing multiple numeric ranges

2024-11-05 Thread Adrien Grand
Hello Siraj, You can do this by creating a Lucene document that has 3 org.apache.lucene.document.IntRange fields in it, one for each of the ranges that you would like to index. Lucene will then match the document if any of the ranges matches. On Tue, Nov 5, 2024 at 5:16 PM Siraj Haider wrote: >

Indexing multiple numeric ranges

2024-11-05 Thread Siraj Haider
Hello there, I want to index multiple numeric ranges in lucene index and then perform range searches on it. For examples, I want to index 3 numeric ranges 1-5, 7-10, 13-20 and then run a search with a range (i.e. 2-4) as criteria and have it return the document if searched range is part of any o