Documents indexing

2023-12-04 Thread Ashkar
Hi Solr Users, I have a few questions. 1. Can I crawl One Drive and index the documents? 2. Are we able to crawl a website that has a login? 3. Can we crawl documents from an HTTP/HTTPS-based portal and do the indexing? Regards, [Logo] Ashkar System Analyst M +91 9605043

Re: Documents indexing

2023-12-04 Thread Charlie Hull
Hi Ashkar, Yes you can do all these things  - but not with Solr, which doesn't come with a built-in website crawler. You'll need to look at some other projects for that such as: http://crawler.archive.org/index.html Heritrix http://lucene.apache.org/nutch/ Nutch (created by Doug Cutting who a

Re: Invalid PHPS response for Luke request

2023-12-04 Thread Mikhail Khludnev
Hello Thomas. It's not really my wheelhouse. There might be a problem in LukeHandler or its' compatibility with phps. e.g. under "solr" key it puts Lucene's Document but not SolrDocument really (see LukeRequestHandler.handleRequestBody). Do you really need /luke can't you obtain a doc via /select o

Re: [TOKYO Lucene/Solr meetup] ~ How to Integrate Open Source Large Language with Apache Solr (Alessandro) ~

2023-12-04 Thread Mikhail Khludnev
Hello, Alessandro, thank you. It's a great concise talk! I was a little bit late to the party. Koji thanks for the link! Wouldn't you share a recording? On Fri, Dec 1, 2023 at 11:16 AM Koji Sekiguchi wrote: > https://meet.google.com/zrf-khvk-ifw > > > 2023年12月1日(金) 17:09 Mikhail Khludnev : > >

Re: [TOKYO Lucene/Solr meetup] ~ How to Integrate Open Source Large Language with Apache Solr (Alessandro) ~

2023-12-04 Thread Moriyasu Kannami
Hi Mikhail, I was the host of the Google Meet at that meet-up. At the meet-up, I didn't record the Google meet video. Regards, moriyasu 2023年12月4日(月) 18:29 Mikhail Khludnev : > Hello, > Alessandro, thank you. It's a great concise talk! > I was a little bit late to the party. Koji thanks for th

RE: [TOKYO Lucene/Solr meetup] ~ How to Integrate Open Source LargeLanguage with Apache Solr (Alessandro) ~

2023-12-04 Thread ufuk yılmaz
Hi, Thanks for inviting us, it was very informative! Is it possible to have a copy of the slides? If they are private/closed source, than I understand. Sincerely, Ufuk yilmaz -ufuk yilmaz Sent from Mail for Windows From: Moriyasu Kannami Sent: Monday, December 4, 2023 12:58 PM To: Mikhail

Possible to identify _all_ fields in the index with actual multiple values?

2023-12-04 Thread Jimi Hullegård
We have a lot of fields that are defined as multiValued="true", but where I suspect that we only actually use it as a single value field. And I would like to tidy up the schema a bit, and only use multiValued="true" for fields where it's actually needed. So, I would like to get an overview of al

Re: Invalid PHPS response for Luke request

2023-12-04 Thread Thomas Corthals
Hello Mikhail I wrote the Luke functionality for the Solarium PHP client. There is no use case here, just the desire to fully support the functionality. If we can agree this is a bug in the PHPS output (JSON output doesn't contain errors) I'll open a JIRA issue for it and implement a forward comp

Re: Invalid PHPS response for Luke request

2023-12-04 Thread Mikhail Khludnev
It looks like https://github.com/apache/solr/pull/2114 I see 'i:0' in actual output. I'm not sure whether it's wrong or right. On Mon, Dec 4, 2023 at 3:23 PM Thomas Corthals wrote: > Hello Mikhail > > I wrote the Luke functionality for the Solarium PHP client. There is no use > case here, just t

Re: Invalid PHPS response for Luke request

2023-12-04 Thread Mikhail Khludnev
I noticed that every array element, and Solr document field is prepended with a numeric key which is a sequential element index. So, these i:0 might be pretty valid. On Mon, Dec 4, 2023 at 4:04 PM Mikhail Khludnev wrote: > It looks like https://github.com/apache/solr/pull/2114 > I see 'i:0' in a

Re: knn query parser, number of results and filtering by score

2023-12-04 Thread Christine Poerschke (BLOOMBERG/ LONDON)
Hi Mirko, Thanks for sharing both the cloud and non-cloud test scenarios here! Earlier today I merged a (one line) change for the https://issues.apache.org/jira/browse/SOLR-16952 issue. From reading this thread is appears that it could be correlated, as you mentioned, i.e. the 0.0 in "0.0 = no

Re: Invalid PHPS response for Luke request

2023-12-04 Thread Mikhail Khludnev
Probably, /like/admin might be out of scope for particular client api, since it's more like low level internal facility, subject to change -and errorprone-. пн, 4 дек. 2023 г., 15:23 Thomas Corthals : > Hello Mikhail > > I wrote the Luke functionality for the Solarium PHP client. There is no use

Re: [TOKYO Lucene/Solr meetup] ~ How to Integrate Open Source LargeLanguage with Apache Solr (Alessandro) ~

2023-12-04 Thread Alessandro Benedetti
Thanks guys! The slides will be available roughly in one month here: https://sease.io/conferences-attended I'll write again here when available. If you want them more urgently feel free to reach me on Linkedin and I can share the pdf there with you. Cheers On Mon, 4 Dec 2023, 20:32 ufuk yılmaz,