Document path representation as multivalued fields: sensible?

2024-11-21 Thread Marc
ing method likely to keel over at some stage if the number of documents keeps growing? Best, Marc

Re: Document path representation as multivalued fields: sensible?

2024-11-25 Thread Marc
ences with the graph query parser? Best, Marc Am 22.11.2024 10:54, schrieb ufuk yılmaz: I don’t like multivalued fields much because they don’t play nice with docValues which enable many cool features I like about Solr. They also don’t match indexes (find docs that have this value on the 3rd pos

How Spark alone vs Spark-Solr performances compare?

2023-01-25 Thread marc nicole
Hello - I have hard times understanding the benefits of storing (indexing) documents from Spark for a later read into Spark again vs reading using Spark only. How Solr could improve querying time of data through indexing? Is the read operation using Solr faster than using spark.read(someFile.csv)?

How to create indexes for CSV dataset (with schema) in Java

2023-01-26 Thread marc nicole
Hello - I want to know how it is possible to create indexes for a certain attribute (and its different values) of a document, using Java. Thanks.

Re: How to create indexes for CSV dataset (with schema) in Java

2023-01-26 Thread marc nicole
In, java i want to create index of a collection named person and say an "Age" attribute for the values between 20 and 35. so that when i query Solr on those values it retrieves them quickly how to do that ? Le ven. 27 janv. 2023 à 03:42, Shawn Heisey a écrit : > On 1/26/2023

Re: How to create indexes for CSV dataset (with schema) in Java

2023-01-27 Thread marc nicole
ks/query?q=*:*&fq=price:%5B7%20TO%20*%5D > > You could of course query this from Java using our SolrJ client library. > > Perhaps that is a starting point for you to formulate in a better way what > you are trying to achieve? > > Jan > > > 27. jan. 2023 kl. 08:38 s

how to create and set up Zookeeper for Solr

2023-01-28 Thread marc nicole
Hello, I want to set up Zookeeper for Solr, how to do that for locally working with Solr ? The port should be 9983. For now I got an error saying can't connect to Zookeeper Thanks.

Re: how to create and set up Zookeeper for Solr

2023-01-28 Thread marc nicole
More into details, when i try to connect a Spark client to Solr (through a spark-solr connector) i get the following message: Caused by: org.apache.solr.common.SolrException: Cannot connect to cluster at localhost:9983/solr: cluster not found/not ready Le sam. 28 janv. 2023 à 09:36, marc nicole

Re: how to create and set up Zookeeper for Solr

2023-01-28 Thread marc nicole
start in "cloud mode". > > You can also run zookeeper on its own set of clusters but try to match > up the version with what comes embedded. > > On Sat, Jan 28, 2023 at 8:05 AM marc nicole wrote: > > > > More into details, when i try to connect a Spark client

Re: how to create and set up Zookeeper for Solr

2023-01-28 Thread marc nicole
try to match > up the version with what comes embedded. > > On Sat, Jan 28, 2023 at 8:05 AM marc nicole wrote: > > > > More into details, when i try to connect a Spark client to Solr (through > a > > spark-solr connector) i get the following message: > > Caused b

Re: how to create and set up Zookeeper for Solr

2023-01-28 Thread marc nicole
If Solr cloud is launched at port 8984, zooKeeper will be at port 9984 right ? Le sam. 28 janv. 2023 à 15:05, matthew sporleder a écrit : > https://solr.apache.org/guide/8_1/solr-control-script-reference.html > > On Sat, Jan 28, 2023 at 8:56 AM marc nicole wrote: > > >

Re: how to create and set up Zookeeper for Solr

2023-01-28 Thread marc nicole
Thanks matthew it is now working! Le sam. 28 janv. 2023 à 15:15, marc nicole a écrit : > If Solr cloud is launched at port 8984, zooKeeper will be at port 9984 > right ? > > Le sam. 28 janv. 2023 à 15:05, matthew sporleder a > écrit : > >> https://solr.apache.org/guide

Solr Query time performance

2023-01-28 Thread marc nicole
Hello guys, I have been playing with Solr lately, and I tested it over a csv file of about 200 rows (that I indexed in Solr). I also read the file in Spark and perform filtering over an attribute value and compute time of processing when the dataset is loaded from File System vs. Solr. I find the

Re: Solr Query time performance

2023-01-29 Thread marc nicole
ter > is much simpler 😉 > > Sorry, you cannot benchmark like this. You have to select a real use case > and then select technology base on the requirements at hand. And to > benchmark you must use a realistic data set. > > Jan Høydahl > > > 28. jan. 2023 kl. 23

When to index data into Solr?

2023-01-29 Thread marc nicole
Hello - I want to know whether it is common practice to index all the datasets from the start or the indexation should be performed when the data is being queried? Also, is there a size limit on the data to index into Solr? Thanks.

Re: When to index data into Solr?

2023-01-29 Thread marc nicole
for systems of that size > costs many hundreds of thousands or more annually) > > -Gus > > On Sun, Jan 29, 2023 at 10:19 AM marc nicole wrote: > > > Hello - I want to know whether it is common practice to index all the > > datasets from the start or the indexation

Re: Solr Query time performance

2023-01-29 Thread marc nicole
Much appreciated. Le dim. 29 janv. 2023 à 17:47, Andy Lester a écrit : > > > > On Jan 29, 2023, at 4:45 AM, marc nicole wrote: > > > > Let's say you're right about the 200 rows being too few. From which row > > count I can see the difference refle

How to index a csv dataset into Solr using SolrJ

2023-01-29 Thread marc nicole
Hi guys, I can't find a reference on how to index a dataset.csv file into Solr using SolrJ. https://solr.apache.org/guide/6_6/using-solrj.html Thanks.

Re: How to index a csv dataset into Solr using SolrJ

2023-01-29 Thread marc nicole
e each line > yourself if the format is fixed. > > If you need to post csv directly to Solr you’d use a plain http post with > content-type csv, but in most cases your app would do that. > > Jan Høydahl > > > 29. jan. 2023 kl. 20:21 skrev marc nicole : > > > >

Re: How to index a csv dataset into Solr using SolrJ

2023-02-10 Thread marc nicole
ant to use SolrJ to move it to Solr, rather than > curl or some other http client lib? > > Jan > > > 29. jan. 2023 kl. 20:44 skrev marc nicole : > > > > The Java code should perform the post. Any piece of code to show to > better > > explain this? &g

Re: How to index a csv dataset into Solr using SolrJ

2023-02-10 Thread marc nicole
e there's a way > to do > : > it by using some Generic request type and overriding content type.. > Can you > : > explain more what you app will do, where that CSV file comes from in > the > : > first place and why you'd want to use SolrJ to move it to Solr, rather > tha