Re: NGinx .conf Setup for Solr

2024-06-04 Thread Thomas Corthals
Hi Lee, Solr was installed on a different server that doesn't run anything else so we can tweak the resources independently of what the web server needs. But I don't see a reason why this workaround wouldn't work with a Solr install on the same server. The subdomain isn't tied to Solr in any way.

Re: NGinx .conf Setup for Solr

2024-06-04 Thread Dmitri Maziuk
On 6/3/24 21:53, Lee Daniel wrote: *## Drupal* The Drupal site is hosted on the same server as Solr. I installed it on the same server. This server host multiple sites, which we plan to use Solr for some of them. One IP multiple sites. You can't have 2 different programs listen to the same p

Ignore unknown fields when indexing PDFs

2024-06-04 Thread Uwe Amberger
Hallo! Problem description: I want to index a wide variety of PDFs whose content I have no knowledge of. So I cannot define any fields in advance. Users should be able to search for terms, and every PDF containing these terms should be found. I think that a schemaless schema (which adds unknown

Re: Ignore unknown fields when indexing PDFs

2024-06-04 Thread Walter Underwood
PDFs don’t have fields. PDFs are instructions for a monkey with rubber stamps to make a printed page. They have instructions to move to a location and put a character there. As an XML developer friend said, turning a PDF document into structured text is like turning hamburger back into a cow.

Solr 9 and splainer.io

2024-06-04 Thread Jérôme BERNARDES
Hello, we migrated from Solr 7.3.1 to Solr 9.2.1. We used to debug solr requests using splainer.io Splainer jsonp requests to solr are now blocked because of CORS and Content Security Policy restrictions. Is there a way to configure solr jetty to allow requests from splainer.io or at least a loc

Re: Ignore unknown fields when indexing PDFs

2024-06-04 Thread Thomas Corthals
When you extra text from PDF with Tika, it includes additional metadata fields. This is the document I get after executing the example from the ref guide at https://solr.apache.org/guide/solr/latest/indexing-guide/indexing-with-tika.html#trying-out-solr-cell { "responseHeader":{ "status":0,

Re: Ignore unknown fields when indexing PDFs - thanks to wunder

2024-06-04 Thread solr
Walter Underwood, thanks for this clear description of the PDF world - I love it! Walter Claassen Von:"Walter Underwood" An: users@solr.apache.org Datum: 04.06.2024 18:20 Betreff:Re: Ignore unknown fields when indexing PDFs PDFs don’t have fields. PDFs are instructions for

Re: Ignore unknown fields when indexing PDFs

2024-06-04 Thread Jeremy Buckley - IQS-C
Try this. In your schema, explicitly define all the fields that you want in your collection. Then, as the last field entry, add: On Tue, Jun 4, 2024 at 1:06 PM Thomas Corthals wrote: > When you extra text from PDF with Tika, it includes additional metadata > fields. This is the document I ge

what happens when copyField overflows?

2024-06-04 Thread Dmitri Maziuk
Hi all, playing with the suggester, I find that you can't use it on fields longer than ~32KB. A StrField would be the right size but is not tokenized, a TextField works but I don't see a way to limit its size. Either way, what happens if I `copyField` more than 32K into a field limited to 32

Re: Solr 9 and splainer.io

2024-06-04 Thread Eric Pugh
There is a solr package for splainer! https://github.com/o19s/splainer?tab=readme-ov-file#splainer-package-for-solr On Tue, Jun 4, 2024 at 12:42 PM Jérôme BERNARDES wrote: > Hello, > > we migrated from Solr 7.3.1 to Solr 9.2.1. > We used to debug solr requests using splainer.io > Splainer json

GCP-Solr indexing performance is slow in 9.6.0 compare to 7.2.1

2024-06-04 Thread Sathish Ponnusamy
Hi, We are migrating the solr from 7.2.1 to 9.6.0 in GCP and setup is done in GCP VM. [3 nodes for solr/3 nodes for ZK [6 VMs]]. While indexing the data in the solr 9.6.0 version, I noticed the indexing took 1 Hr 20 minutes for 47 GB data. The same data took only 45 mins in the 7.2.1 version. Jus

Solr 9.5.0 - Enabling Open telemetry module makes solr node down

2024-06-04 Thread Sathish Ponnusamy
Hi, We are using solr 9.5.0 and enabled the open telemetry module by passing below options. Our goal is to export the metrics to the OTEL collector which is installed in the same solr node 4317 port. Whenever we enable the open telemetry module with below parameters in the solr.in.sh file, The sol