Re: Advice on ways forward with or without Data Import Handler

2025-06-06 Thread Eric Pugh
We have a bit of a mismatch: the people who generally provide the minimal support for DIH are not the folks deploying and using it. It would be great to get more users of DIH to contribute fixes and help maintain the plugin. DIH was spun out of core Solr so that it could build a proper communi

Re: Advice on ways forward with or without Data Import Handler

2025-06-06 Thread Sarah Weissman
After hearing from a few people who were able to get DIH to keep working by building the jars into Solr 9, I did go back and eventually build a custom Solr image with the jars I needed in place and I did get that to work with a Solr 8 base image. I haven't tried with Solr 9 yet. In hindsight, bu

Re: Advice on ways forward with or without Data Import Handler

2025-06-03 Thread Gus Heck
Perhaps try out JesterJ? It has a database connector, and I encourage you to try it out. https://github.com/nsoft/jesterj There are discussion forums, issue reporting and a Discord channel if you have questions or feedback. Full disclosure: I wrote most of JesterJ, though the JDBC connector was

RE: Advice on ways forward with or without Data Import Handler

2025-06-02 Thread Andrew Witt
About a year ago, we were in a similar position, running Solr v7 and using built-in DIH. We upgraded to Solr v9.4 and updated DIH using the contrib package from the SearchScale GitHub repo. It was a very successful upgrade. (Solr v9.5 was available at the time, but in early 2024, the latest Se

Re: Advice on ways forward with or without Data Import Handler

2025-05-30 Thread Mikhail Khludnev
Hello Sarah. It's hard to propose to stay on DIH, but fwiw, couldn't it still be deployed by placing jars into some /lib dir and configuring like in the good old days? I guess it still works and suits fine for custom images, where there's no need for runtime plugin deploy. Also, recently I added an

Re: Advice on ways forward with or without Data Import Handler

2025-05-29 Thread Andy Lester
> We’ve been using Solr with DIH for about 8 years or so but now we’re hitting > an impasse with DIH being deprecated in Solr 9. Additionally, I’m looking to > move our Solr deploy to Kubernetes and I’ve been struggling to figure out > what to do with the DIH component in a cloud setting. I s

Re: Advice on ways forward with or without Data Import Handler

2025-05-29 Thread Walter Underwood
Yep, good ol’ ETL. My solution was dumping the data as one JSON object per document, an optional transform step, then a multi-threaded Python loader that was schema-independent. The multi-threaded loader ran way faster than DIH. This approach also easily supports re-indexing and re-do after a f
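
The dump / transform / load pattern described above can be sketched roughly as follows. This is a minimal illustration, not the poster's actual loader: the endpoint URL, the collection name `mycollection`, and the function names (`read_docs`, `batches`, `post_batch`, `load`) are assumptions made for the example. It assumes the dump is a JSON Lines file (one JSON object per line) and that batches are sent to Solr's JSON update handler as a JSON array. Committing is left to the caller or to Solr's autoCommit settings.

```python
import json
from concurrent.futures import ThreadPoolExecutor
from itertools import islice
from urllib.request import Request, urlopen

# Hypothetical endpoint: adjust host and collection name for your deployment.
SOLR_UPDATE_URL = "http://localhost:8983/solr/mycollection/update"


def read_docs(lines):
    """Yield one dict per non-empty line of JSON Lines input."""
    for line in lines:
        line = line.strip()
        if line:
            yield json.loads(line)


def batches(docs, size):
    """Group an iterable of documents into lists of at most `size` items."""
    it = iter(docs)
    while True:
        batch = list(islice(it, size))
        if not batch:
            return
        yield batch


def post_batch(batch, url=SOLR_UPDATE_URL):
    """POST one batch to Solr as a JSON array of documents."""
    req = Request(
        url,
        data=json.dumps(batch).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )
    with urlopen(req, timeout=60) as resp:
        resp.read()
    return len(batch)


def load(lines, send=post_batch, transform=None, batch_size=500, workers=4):
    """Read, optionally transform, and send documents using a thread pool.

    The loader never inspects field names, so it is independent of the schema.
    Note: pool.map submits every batch up front, so the whole dump is held in
    memory; a very large dump would need a bounded queue instead.
    """
    docs = read_docs(lines)
    if transform is not None:
        docs = map(transform, docs)
    with ThreadPoolExecutor(max_workers=workers) as pool:
        return sum(pool.map(send, batches(docs, batch_size)))
```

To use it, open the dump file and pass the file object to `load`; it returns the number of documents sent. Because the dump file is kept on disk, re-indexing or retrying after a failure is just a matter of running the loader again. The `send` parameter is pluggable so the pipeline can be exercised without a running Solr instance.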

Re: Advice on ways forward with or without Data Import Handler

2025-05-29 Thread Dmitri Maziuk
On 5/29/25 11:43, Sarah Weissman wrote: I’ve been banging my head against this all week and I’m trying to figure out the best way forward at this point. Is DIH still a viable option or should I be moving off of that to something else? Any advice or perspectives on this would be appreciated. Al