Re: Fastest way to index data to solr

2022-09-30 Thread Thomas Corthals
Hi Gus, I have a followup question. Is JSON parsed faster than XML by Solr if they represent the exact same documents? Thomas Op vr 30 sep. 2022 om 06:58 schreef Gus Heck : > If you are using a non-java language you can use JSON. >

Re: Fastest way to index data to solr

2022-09-30 Thread Dave
I don’t have any tests but I know anything is faster than xml. You may as well stick to text files. Xml is garbage that’s why they made yaml which is the parent of json > On Sep 30, 2022, at 3:47 AM, Thomas Corthals wrote: > > Hi Gus, > > I have a followup question. Is JSON parsed faster tha

Re: Fastest way to index data to solr

2022-09-30 Thread Andy Lester
I can’t imagine a case where the speed in parsing the input data won’t be dwarfed by the time spent on everything else. You’re talking about an in-memory operation that does a ton of I/O. It’s not going to make a noticeable difference once way or the other. > I have a followup question. Is JS

Is this possible? Transform JSON to conform to existing UpdateRequestHandler

2022-09-30 Thread Matthew Castrigno
Hello, I am new to SOLR and I am trying to configure it to accept existing API definition. I need to create a requestHandler that will perform index operations based on the value in a specific field as opposed to using a parameter in the request. Ie. A payload that may look like this: {

Re: Is this possible? Transform JSON to conform to existing UpdateRequestHandler

2022-09-30 Thread Matthew Castrigno
Maybe this is what I need? https://solr.apache.org/guide/8_11/json-request-api.html JSON Request API | Apache Solr Reference Guide 8.11 JSON Parameter Merging. If multiple json parameters are provided in a single request, Solr attempts to

Re: Fastest way to index data to solr

2022-09-30 Thread Joel Bernstein
Unless something has changed recently, you will have a memory leak if you don't atleast soft commit during the load. This is due to the in-memory tlog data used for real-time get. This in-memory tlog data is released when a new searcher is opened. So, if you're having memory issues while bulk load

Is this possible?

2022-09-30 Thread Matthew Castrigno
Hello, I am new to SOLR and I am trying to configure it to accept existing API definition. I need to create a requestHandler that will perform index operations based on the value in a specific field as opposed to using a parameter in the request. Ie. A payload that may look like this: {"id"