DIH and SolrCloud

2022-03-31 Thread James Greene
Are there any working examples of using the DataImportHandler's delta import with SolrCloid? I have a single zookeeper instance with four solr instances (using docker-compose). Once all services are online I use the zk upload tool to upload my conf configuration directory to zookeeper. From there

Re: DIH and SolrCloud

2022-03-31 Thread Jan Høydahl
DIH is deprecated and removed from Solr 9.0. DIH did not support SolrCloud very well, as it runs on one local core, not distributed. Please seek support from its new 3rd party plugin home in GitHub: https://github.com/rohitbemax/dataimporthandler

DIH and import from other core

2022-03-31 Thread Tealdi Paolo
Hi all, I'm searching for alternative to DIH functionality for record ingestion from one core to another. It's very useful, simple and quick function to check for new configurations. The new external plugin seems to support only database connection. Any hints ? Best regards, Paolo Tealdi Ing.

Re: DIH and import from other core

2022-03-31 Thread Eric Pugh
I’ve used thehttps://solr.apache.org/guide/8_1/collections-api.html#reindexcollection https://solr.apache.org/guide/8_1/collections-api.html#reindexcollection> command for testing new configurations… It uses Solr Streaming under the covers: https://solr.apache.org/guide/8_1/streaming-expressi

RE: query with quoted string unexpected results

2022-03-31 Thread Teresa McMains
An update: I did get access to the solr admin page and I'm trying a query simply on Lead to succeed Vs "Lead to succeed" And the first returns records, including the one I want and the second does not. Can someone explain, though, the debug output for the quoted search string -- specifically,

R: DIH and import from other core

2022-03-31 Thread Tealdi Paolo
Hi Eric Many thanks for the answer. I noticed that reindexcollection seems to be SLOWER than DIH import. Best regards, Paolo Tealdi Ing. Paolo Tealdi Area IT - Politecnic

SOLR 7.3 - does the _text_ field work as a catch-all by default ?

2022-03-31 Thread Serban Alexe
Hi all, I started working on a SOLR-based search system, and I'm still trying to figure-out what's-what... I noticed that the *_text_* field works as a *catch-all* field, although there is no *copyField* defined to do this. It simply gathers *everything* (documents content, tags, keywords, etc...

Re: Problem with facet in SOLR

2022-03-31 Thread Andy Lester
> I have indexed 4 fields and want to use facet on "taxo_domain_mother" but i > am not getting any result It looks like you don’t have facet.field=taxo_domain_mother specified in your query. It’s hard to tell exactly because screenshots make it difficult to figure things out. > Do i need t

Intermittent multi core creation failure from a configset

2022-03-31 Thread Hari Iyer
Hi, I have a batch process that indexes/re-indexes multiple cores from a database where each core data is processed by one thread. I am using solr 8.11.1. I get the following SAX parse exception (see below) on the managed schema when creating any one of the cores. At times, I get a “Unable to cr

Re: R: DIH and import from other core

2022-03-31 Thread dmitri maziuk
On 2022-03-31 9:29 AM, Tealdi Paolo wrote: Hi Eric Many thanks for the answer. I noticed that reindexcollection seems to be SLOWER than DIH import. (Warning: there be python there) This is trimmed down from a working script: https://gist.github.com/dmaziuk/57b9c1926578bc10f12c0999c4b7ab53

Re: DIH and import from other core

2022-03-31 Thread matthew sporleder
> On Mar 31, 2022, at 12:05 PM, dmitri maziuk wrote: > > On 2022-03-31 9:29 AM, Tealdi Paolo wrote: >> Hi Eric >> Many thanks for the answer. >> I noticed that reindexcollection seems to be SLOWER than DIH import. > > (Warning: there be python there) > > This is trimmed down from a working

Re: DIH and import from other core

2022-03-31 Thread dmitri maziuk
On 2022-03-31 11:11 AM, matthew sporleder wrote: commitWithin will help you a ton, even a short one Yeah. It's a small table and a "playground" Solr index so I haven't bothered tuning it... yet. Dima

Re: R: DIH and import from other core

2022-03-31 Thread Thomas Corthals
Op do 31 mrt. 2022 om 18:05 schreef dmitri maziuk : > On 2022-03-31 9:29 AM, Tealdi Paolo wrote: > > Hi Eric > > > > Many thanks for the answer. > > I noticed that reindexcollection seems to be SLOWER than DIH import. > > (Warning: there be python there) > > This is trimmed down from a working scr

Re: Atomic indexing without whole document getting indexed again

2022-03-31 Thread gnandre
Thanks, this is what I was looking for. Although, when I am experimenting with them now, I see no performance improvement. I suspect that it is still doing atomic updates and not in-place updates. How do I confirm whether in-place updates are happening vs atomic updates? I can't tell it simply by l

Re: Atomic indexing without whole document getting indexed again

2022-03-31 Thread gnandre
Here is what I tried to confirm if it is still doing atomic indexing and not in-place indexing. I changed one other unrelated field's name and reloaded the schema. Now, when I performed the indexing just for the field that I wanted to update in-place, it should not have complained about this other

Re: R: DIH and import from other core

2022-03-31 Thread dmitri maziuk
On 2022-03-31 11:17 AM, Thomas Corthals wrote: You can speed that up significantly by sending multiple documents in the same request and only committing once: https://web.archive.org/web/20170418205443/http://www.raspberry.nl/2011/04/08/solr-update-performance/ Yes: batching multiple documents

Re: DIH and SolrCloud

2022-03-31 Thread James Greene
Sure a redirect was expected but this solr users right? Not solr team itself... does anyone have dataimport.properties being maintained with zookeeper? I'm okay with a single instance running the import which DOES properly distribute the documents to multiple shards. On Thu, Mar 31, 2022, 6:38 AM

Re: DIH and SolrCloud

2022-03-31 Thread dmitri maziuk
On 2022-03-31 6:35 PM, James Greene wrote: Sure a redirect was expected but this solr users right? Not solr team itself... does anyone have dataimport.properties being maintained with zookeeper? I'm okay with a single instance running the import which DOES properly distribute the documents to mu

Search query with uppercase getting different results

2022-03-31 Thread ozatomic
Hi, While doing some testing with my instance of solr I have noticed that if i mix cases in a search string it is treated like 2 words? is this the expected behavior? or a config issue. Examples "myword" returns results with: - myword - myWord - MYWord "myWORD" returns results with: - myw

Re: Search query with uppercase getting different results

2022-03-31 Thread James Greene
Expected, your search term is getting tokenized to the lowercase to uppercase is similar to a natural language 'word break' thus it gets tokenized to 'my word'. Cheers, JAG On Thu, Mar 31, 2022 at 10:27 PM ozatomic wrote: > Hi, > > While doing some testing with my instance of solr I have notice

Re: Search query with uppercase getting different results

2022-03-31 Thread ozatomic
Thanks JAG, Is this something that can configured so that it does not tokenize? On 1/04/2022 1:30 pm, James Greene wrote: Expected, your search term is getting tokenized to the lowercase to uppercase is similar to a natural language 'word break' thus it gets tokenized to 'my word'. Cheers, JAG

Re: Search query with uppercase getting different results

2022-03-31 Thread James Greene
Yes, you can update the field type from a TextField to a String or change the analyzer: https://solr.apache.org/guide/8_11/analyzers.html Cheers, JAG On Thu, Mar 31, 2022 at 10:41 PM ozatomic wrote: > Thanks JAG, > > Is this something that can configured so that it does not tokenize? > > On 1