[ANNOUNCE] Apache Solr 8.11.0 released

2021-11-17 Thread Adrien Grand
The Solr PMC is pleased to announce the release of Apache Solr 8.11 Solr is the popular, blazing fast, open source NoSQL search platform from the Apache Solr project. Its major features include powerful full-text search, hit highlighting, faceted search and analytics, rich document parsing, geospa

Alternate to CDCR

2021-11-17 Thread Vaddi, Seshasai
Hi Solr Team, I’m looking for an alternatives to SOLR CDCR, As you mentioned it’s getting deprecated. Could you help me other best alternatives to achieve for disaster recovery Sent from Mail for Windows

Re: OpenNLP dictionary-based lemmatizer memory issue

2021-11-17 Thread Spyros Kapnissis
Thank you Alessandro for your comments and getting back so quickly, that sounds great! On Tue, Nov 16, 2021 at 7:35 PM Alessandro Benedetti wrote: > Hi, > I've done an initial review and it looks ok to me! > Before committing I added a couple of other committers to the loop, let's > see if they

NRT Searching (getting it working)

2021-11-17 Thread Derek C
Hi all, I'm trying to get Near Real Time searching working with SOLR (so that documents I insert, or documents I update, are visible in a SOLR query as quickly as possible). I've tried configuring autoCommit and autoSoftCommit in solrconfig.xml but it's taking about 10 minutes to see the updates

Re: NRT Searching (getting it working)

2021-11-17 Thread Shawn Heisey
On 11/17/21 7:05 AM, Derek C wrote: Hi all, I'm trying to get Near Real Time searching working with SOLR (so that documents I insert, or documents I update, are visible in a SOLR query as quickly as possible). I have about 2.2 million documents in a SOLR core (quite a lot of fields too - mayb

Re: NRT Searching (getting it working)

2021-11-17 Thread Derek C
Hi Shawn, Thanks for the help. Now that you mention it. Just after I sent the email I did start looking at top and I was seeing 100%, 200%, 300% CPU usage (and the VM only has 4 cores so I was looking at maxed out cores). I've grabbed screenshots now and I'm not seeing those high numbers but

[Operator] [ANNOUNCE] Apache Solr Operator v0.5.0 released

2021-11-17 Thread Houston Putman
The Apache Solr PMC is pleased to announce the release of the Apache Solr Operator v0.5.0. The Apache Solr Operator is a safe and easy way of managing a Solr ecosystem in Kubernetes. This release contains numerous bug fixes, optimizations, and improvements, some of which are highlighted below. Th

Solr limit in words search

2021-11-17 Thread Scott Q.
I am facing a weird issue, possibly caused by my config. I have indexed a document which has a field called subject, subject is defined as:                                                                                                                                                

Solr upgrade 3.6.1 TO 8.10.1 : ERROR Data at the root level is invalid. Line 1, position 1.

2021-11-17 Thread Heller, George A III CTR (USA)
We have an existing ASP>NET C# application that currently uses Solr 3.6.1 for indexing and searching of documents. Using Solr 3.6.1,everything works fine. I built a Solr 8.10.1 server and when I try to upload documents to a Solr collection in 8.10.1, I get a "Data is invalid at root level" error

Solr limit in words search - take 2

2021-11-17 Thread Scott
My apologies for the previous e-mail…should have never sent that as html I am facing a weird issue, possibly caused by my config. I have indexed a document which has a field called subject, subject is defined as:                                                                            

Re: Solr upgrade 3.6.1 TO 8.10.1 : ERROR Data at the root level is invalid. Line 1, position 1.

2021-11-17 Thread Scott Q.
I'm no expert but I see you're expecting XML and as far as I know, the default response in Solr 8 is JSON. Maybe check that ? On Wednesday, 17-11-2021 at 11:16 Heller, George A III CTR (USA) wrote: We have an existing ASP>NET C# application that currently uses Solr 3.6.1 for indexing and s

Re: Solr limit in words search - take 2

2021-11-17 Thread Michael Gibney
This is not the most thorough answer, but hopefully gets you headed in the right direction: Very strange things can happen when your index-time analysis chain generates "graph" token-streams (as yours does). A couple of things you could try: 1. experiment with setting `enableGraphQueries=false` on

RE: Solr limit in words search - take 2

2021-11-17 Thread Scott
Thanks Michael, let me look at those links. I forgot to mention initially but I'm running version 8.6.2 Cloud/ZooKeeper -Original Message- From: Michael Gibney Sent: Wednesday, November 17, 2021 12:07 PM To: users@solr.apache.org Subject: Re: Solr limit in words search - take 2 This i

Re: NRT Searching (getting it working)

2021-11-17 Thread Andy Lester
> > I'm not sure how to understand the solr_gc.log file (but I'd like to) There’s a product called gceasy at gceasy.io . You can get a basic report on your GC log from uploading your log to them for analysis. Andy

RE: Solr limit in words search - take 2

2021-11-17 Thread Scott
Could this be related ? https://solr.apache.org/guide/6_6/filter-descriptions.html#FilterDescriptions-WordDelimiterGraphFilter "If you use this filter during indexing, you must follow it with a Flatten Graph Filter to squash tokens on top of one another like the Word Delimiter Filter, because t

Re: Solr limit in words search - take 2

2021-11-17 Thread Michael Gibney
Right, sorry I forgot to mention the absence of FlattenGraphFilter. Tbh I'm not 100% clear on what cases it helps out with; but at the end of the day it has no effect on underlying issues having to do with the fact that if your index-time analysis chain produces "graph" tokenstreams, the Lucene `[D

Re: NRT Searching (getting it working)

2021-11-17 Thread Derek C
That's an amazing online tool - thanks Andy (I think looking at the generated charts/graphs that the Garbage collection and memory usage is OK) Derek On Wed, Nov 17, 2021 at 5:53 PM Andy Lester wrote: > > > > I'm not sure how to understand the solr_gc.log file (but I'd like to) > > There’s a p

RE: Solr limit in words search - take 2

2021-11-17 Thread Scott
Ok, I'll add in the indexer and see what happens. It's so weird that it works, even in this state, when the docs say : This filter _must_ be included I would have expected the indexer to throw errors if this filter is really required... Thanks! -Original Message- From: Michael Gibne

Re: NRT Searching (getting it working)

2021-11-17 Thread Andy Lester
> On Nov 17, 2021, at 12:41 PM, Derek C wrote: > > That's an amazing online tool - thanks Andy It was Shawn Heisey who pointed me to it. There many other JVM GC tools out there if you search a bit. https://sematext.com/blog/java-gc-log-analysis-tools/

omitting term frequencies but keeping positions?

2021-11-17 Thread Edward Turner
Hi there, Is there a way to omit only term frequencies but keep positions? I see it's possible to omit frequencies and positions, or just positions for a field in a schema.xml ( https://solr.apache.org/guide/8_5/field-type-definitions-and-properties.html#field-default-properties), but we would lik

Re: Solr limit in words search

2021-11-17 Thread Shawn Heisey
On 11/17/21 9:00 AM, Scott Q. wrote: I am facing a weird issue, possibly caused by my config. I have indexed a document which has a field called subject, subject is defined as: -- the definition you included is blank in the email that I got.  I do not know why.  If it was an email attachment

Re: NRT Searching (getting it working)

2021-11-17 Thread Shawn Heisey
On 11/17/21 8:24 AM, Derek C wrote: Now that you mention it. Just after I sent the email I did start looking at top and I was seeing 100%, 200%, 300% CPU usage (and the VM only has 4 cores so I was looking at maxed out cores). I've grabbed screenshots now and I'm not seeing those high number

Execute bulk partial update on solr

2021-11-17 Thread Karan Jain
Hi *Everyone*, We want to update a field of more than one document without ID being passed. We may need to update 1.5 M documents in one go. It seems to us that executing atomic update for a field in Solr cluster

Re: Solr upgrade 3.6.1 TO 8.10.1 : ERROR Data at the root level is invalid. Line 1, position 1.

2021-11-17 Thread Shawn Heisey
On 11/17/21 9:16 AM, Heller, George A III CTR (USA) wrote: We have an existing ASP>NET C# application that currently uses Solr 3.6.1 for indexing and searching of documents. Using Solr 3.6.1,everything works fine. I built a Solr 8.10.1 server and when I try to upload documents to a Solr col

Re: Execute bulk partial update on solr

2021-11-17 Thread Shawn Heisey
On 11/17/21 5:45 PM, Karan Jain wrote: 1) Can we try atomic update for updating a field of more than one document without ID being passed? If not, please suggest another way to do this. I have never heard of a

Re: [ANNOUNCE] Apache Solr 8.11.0 released

2021-11-17 Thread David Smiley
I'll start the Docker image release process now; it should be out by the weekend hopefully. ~ David Smiley Apache Lucene/Solr Search Developer http://www.linkedin.com/in/davidwsmiley On Wed, Nov 17, 2021 at 3:54 AM Adrien Grand wrote: > The Solr PMC is pleased to announce the release of Apache

RE: Solr limit in words search

2021-11-17 Thread Scott
Thanks Shawn, not sure if you saw, but I resent without html formatting and it came through fine. I'll put it here again along with the preliminary conclusion that I was missing the Flatten filter in my indexer. Here are the schema details + output you requested:

Re: Solr limit in words search

2021-11-17 Thread Shawn Heisey
On 11/17/2021 9:36 PM, Scott wrote: Thanks Shawn, not sure if you saw, but I resent without html formatting and it came through fine. I'll put it here again along with the preliminary conclusion that I was missing the Flatten filter in my indexer. Here are the schema details + output you reque

Re: Incremental backup for Standalone Solr

2021-11-17 Thread Abeleshev Artyom
Thanks, Jason, for the detailed answer. Now I got the point and It makes sense. Best regards, Artem Abeleshev On Mon, Nov 15, 2021 at 10:05 PM Jason Gerlowski wrote: > Hey Artem, > > Incremental backups were written primarily with SolrCloud in mind. > Many of the APIs (backup listing, backup d