Re: Num Found of docs containing vector fields

2023-10-22 Thread Alessandro Benedetti
Hi, Using the knn query parser is not going to cut it as it's driven by the topK parameter (and you do all distance similarity and stuff). Normally in Solr you just do: field:[* TO *] to retrieve docs that have at least one value in a field. I can't try right now, can you check if that's implement

Re: SOLR 8.11.2 Cloud issue

2023-10-22 Thread Ishan Chattopadhyaya
Unfortunately, the mailing list doesn't allow images to be attached. I think it is time to retire this mailing list, move to a modern discussion forum. Thankfully, some community members are taking a lead on that front. On Mon, 23 Oct, 2023, 11:47 am Scott Q., wrote: > Hmm...yes. > > Attaching i

Re: SOLR 8.11.2 Cloud issue

2023-10-22 Thread Scott Q.
Hmm...yes. Attaching image - this is when I access the ~cloud URL via direct link. But notice there's no Cloud menu on the left side... If I access the direct links On Sunday, 22/10/2023 at 21:38 Ishan Chattopadhyaya wrote: Are you sure you're running Solr in SolrCloud mode? On Sun, 22 Oct,

Re: Num Found of docs containing vector fields

2023-10-22 Thread ufuk yılmaz
Hi, I don’t know much about vectors but in order to avoid “414 Request-URI Too Large” error you can send a post request with type application/x-www-form-urlencoded and put q and value in the request body ~ufuk yilmaz — > On 23 Oct 2023, at 02:35, rajani m wrote: > > 414 Request-URI Too Large

Re: SOLR 8.11.2 Cloud issue

2023-10-22 Thread Ishan Chattopadhyaya
Are you sure you're running Solr in SolrCloud mode? On Sun, 22 Oct, 2023, 3:36 pm Scott Q., wrote: > I have SOLR 8.11.2 running with 5 nodes in Cloud > configuration and it works great. In fact it was an upgrade from an > earlier version that's been stable for a few years. > > > Since upgrading

Num Found of docs containing vector fields

2023-10-22 Thread rajani m
Hi Solr Users, How do you query Solr to get a count of the number of docs consisting of vector fields? Would you make use of knn qp and populate a query vector of the same length with 1's? For example, localhost:8983/solr/v9/select?q={!knn f=vector_field topK=10}[512 dim vector]. Also, this ht

Re: SOLR 9.3 - JVM Crashes

2023-10-22 Thread Shawn Heisey
On 10/21/2023 2:31 AM, Ing. Andrea Vettori wrote: Hello, we’re using two SOLR servers (same hw, same version of solr and java, same solr config). The SOLR version is 9.3 and JVM is Adoptium JDK 17.0.8.1 on Linux. They both were running fine since a couple years (we upgraded from SOLR 8 to 9 wi

Re: SOLR 9.3 - JVM Crashes

2023-10-22 Thread rajani m
Hi Andrea, Good to hear that. What type of *cache* config do you have in the old configuration? I ask because Legacy Solr Cache from 8.x does not work when solr is upgraded to 9.x. As per the docs

Re: what is SOLR syntax to remove duplicated documents

2023-10-22 Thread Mikhail Khludnev
You can find id terms repeating in an index via https://solr.apache.org/guide/solr/latest/query-guide/terms-component.html and terms.mincount=2 or do the same via facets q=*:*&facet=true&facet.field=id&facet.limit=-1&facet.mincount=2 (just on top of my head) Then you can query duplicated ids one by

Re: SOLR 9.3 - JVM Crashes

2023-10-22 Thread Ing. Andrea Vettori
Update: after restoring the old configuration (i.e. without CaffeineCache) the error has not happened again. And we also reindexed half the documents to put some load on the server. — Ing. Andrea Vettori Sistemi Informativi B2BIres s.r.l. > On 21 Oct 2023, at 10:31, Ing. Andrea Vettori wrote:

Re: what is SOLR syntax to remove duplicated documents

2023-10-22 Thread Dmitri Maziuk
On 10/22/23 12:25, Gus Heck wrote: Echoing what Thomas says, this problem indicates your indexing system probably has a significant design flaw. For most systems, you should have a notion of document identity that is external to Solr, and that should be used as (or to deterministically generate)

Re: what is SOLR syntax to remove duplicated documents

2023-10-22 Thread Gus Heck
Echoing what Thomas says, this problem indicates your indexing system probably has a significant design flaw. For most systems, you should have a notion of document identity that is external to Solr, and that should be used as (or to deterministically generate) the id in Solr. If you don't do this

Re: what is SOLR syntax to remove duplicated documents

2023-10-22 Thread Thomas Corthals
Hi Vince I would fix whatever indexing process caused the doubles and just rebuild the index from the source data. That's something you should always be able to do anyway. Thomas Op zo 22 okt 2023 om 14:38 schreef Vince McMahon < sippingonesandze...@gmail.com>: > all fields are the same will be

Re: what is SOLR syntax to remove duplicated documents

2023-10-22 Thread Vince McMahon
all fields are the same will be the duplicated documents. i wanted to remove those, first. On Sun, Oct 22, 2023 at 8:27 AM ufuk yılmaz wrote: > When do you consider two documents are duplicates? When 1 field has the > same value, when multiple fields have the same value, or all fields etc? > >

RE: what is SOLR syntax to remove duplicated documents

2023-10-22 Thread ufuk yılmaz
When do you consider two documents are duplicates? When 1 field has the same value, when multiple fields have the same value, or all fields etc? Sent from Mail for Windows From: Vince McMahon Sent: Sunday, October 22, 2023 3:22 PM To: users@solr.apache.org Subject: what is SOLR syntax to remove

what is SOLR syntax to remove duplicated documents

2023-10-22 Thread Vince McMahon
I have a SOLR 8.X. I suspect one of the core has duplicates and wants to remove the duplicated documents. Signature, as in the SOLR guide, is not implemented. https://solr.apache.org/guide/6_6/de-duplication.html in sql, a query without the use of a hash column will be liked: ;WITH CTE AS (

SOLR 8.11.2 Cloud issue

2023-10-22 Thread Scott Q.
I have SOLR 8.11.2 running with 5 nodes in Cloud configuration and it works great. In fact it was an upgrade from an earlier version that's been stable for a few years. Since upgrading to 8.11.2 the WEB UI no longer displays the Cloud menu on the left. I can still access it via direct URL because