Re: Nested documents

2025-05-26 Thread Mikhail Khludnev
Hi Marc, There are two concerns to consider: - what you need to have in search results. If you need to have unique files, nested is the way to go. If it is fine for many versions of the same file to appear, denormalize them. FieldCollapsing, Grouping and query time join provide some flexibility but require

Nested documents

2025-05-26 Thread Marc
Hi there, I am working on a new Solr installation for our application. Here, we want to store file information in file documents and file versions in fileversion documents. All file versions are related to file documents, so the question arises whether it would be a good idea to store

Re: Is nested documents good fit for collection of documents that grow over time?

2025-05-21 Thread Mikhail Khludnev
Hi, Nested is a big no-no for modelling collections. It's either query time join or field collapsing/grouping or just an aggregation or so. On Wed, May 21, 2025 at 3:56 PM Yaşar Arabacı wrote: > Hi, > > I am trying to figure out a schema for indexing documents that I will > download from anoth

Is nested documents good fit for collection of documents that grow over time?

2025-05-21 Thread Yaşar Arabacı
Hi, I am trying to figure out a schema for indexing documents that I will download from another document management system. In the original source, documents are stored inside collections. Collections act like folders with extra metadata (they cannot be nested though). Each document must be part

Re: Performance Considerations While Indexing Nested Documents: SolrV9.6.1

2025-04-05 Thread David Smiley
ternal file and loads after each commit, instead of indexing into > solr > >> > for each change] > >> > > >> > > >> > https://solr.apache.org/guide/solr/latest/indexing-guide/external-files-processes.html > >> > Observation: > >>

Re: Performance Considerations While Indexing Nested Documents: SolrV9.6.1

2025-03-24 Thread Uday Kumar
> >> > 2. POC on Inplace update: (Which helps in indexing fields which contains >> > changes, but not entire document) >> > >> > >> https://solr.apache.org/guide/solr/latest/indexing-guide/partial-document-updates.html#in-place-updates >> > Observa

Re: Performance Considerations While Indexing Nested Documents: SolrV9.6.1

2025-03-17 Thread Uday Kumar
es, but not entire document) > > > > > https://solr.apache.org/guide/solr/latest/indexing-guide/partial-document-updates.html#in-place-updates > > Observation: > > a. Works with only single values fields > > b. Looks promising wrt indexing optimization but not suitable

Re: Performance Considerations While Indexing Nested Documents: SolrV9.6.1

2025-03-04 Thread Alessandro Benedetti
> Observation: > a. Works with only single values fields > b. Looks promising wrt indexing optimization but not suitable wrt our > schema (as we have more multivalued fields). so, dropped > > > Then we moved for alternatives which is expected to help in optimizing > respo

Re: Performance Considerations While Indexing Nested Documents: SolrV9.6.1

2025-03-04 Thread Uday Kumar
multivalued fields). so, dropped Then we moved for alternatives which is expected to help in optimizing response times *To improve Solr Response time:* Nested Documents POC: https://solr.apache.org/guide/solr/latest/indexing-guide/indexing-nested-documents.html *wrt this statement:* "In ter

Re: Performance Considerations While Indexing Nested Documents: SolrV9.6.1

2025-03-03 Thread Uday Kumar
u talk about grouping, have you considered field collapsing? > > According to my experience going with nested documents rarely justify the > performance and functional overhead both at indexing and query time. > > But sometimes you need them. > > Also in place updates happen on very

Re: Performance Considerations While Indexing Nested Documents: SolrV9.6.1

2025-02-28 Thread Alessandro Benedetti
What is your problem? Rather than asking about a solution you attempted, it is usually better to start from the problem. You talk about grouping, have you considered field collapsing? In my experience, going with nested documents rarely justifies the performance and functional overhead both

Re: Performance Considerations While Indexing Nested Documents: SolrV9.6.1

2025-02-27 Thread Uday Kumar
25 at 8:05 AM Uday Kumar > > wrote: > > > Hi all, > > We are doing a POC on indexing nested documents in expectation of > reducing > > grouping overhead while querying time. > > > > On Prod Indexing, we are using the traditional approach of reindexing t

Re: Performance Considerations While Indexing Nested Documents: SolrV9.6.1

2025-02-26 Thread Mikhail Khludnev
Changing one child rewrites the whole block, period. However, in-place updating of child docValues is promising in theory, although I don't know how it works in practice. On Thu, Feb 27, 2025 at 8:05 AM Uday Kumar wrote: > Hi all, > We are doing a POC on indexing nested documents in exp

Performance Considerations While Indexing Nested Documents: SolrV9.6.1

2025-02-26 Thread Uday Kumar
Hi all, We are doing a POC on indexing nested documents in expectation of reducing grouping overhead at query time. On Prod Indexing, we are using the traditional approach of reindexing the entire document if there is any change in any of the fields. [we reindex ~2cr documents per day, FYI

Re: Nested documents limitations and cautions

2024-02-01 Thread Isabella Trevisan
Hi Alessandro, Thank you for your detailed answer. As I understand it, nested documents speed up searches on hierarchical documents compared to Query Time Joins and consume fewer resources but could create problems during reindexing. Are there any studies to understand if there are limits to the

Re: Nested documents limitations and cautions

2024-01-31 Thread Isabella Trevisan
Hi Alessandro, Thank you for your detailed answer. As I understand it, nested documents speed up searches on hierarchical documents compared to Query Time Joins and consume fewer resources but could create problems during reindexing. Are there any studies to understand if there are limits to the

Re: Nested documents limitations and cautions

2024-01-31 Thread Alessandro Benedetti
Hi Isabella, back in the day I wrote a blog post about nested documents, not strictly related to pros and cons but can be useful: https://sease.io/2019/06/apache-solr-childfilter-transformer.html In terms of pros and cons, exploring the details of nested documents will surely require a bit of

Nested documents limitations and cautions

2024-01-31 Thread Isabella Trevisan
Hi, We are studying a solution that takes advantage of nested documents and therefore we are looking for information on the pros and cons and limitations that this solution offers. Furthermore, we wish to understand in which cases it is better to use nested documents or query time joins. Further Have

Re: LTR Features on nested documents

2023-07-27 Thread Alessandro Benedetti
Hi Sergio, in the block join, under the hood, nested docs are separate Lucene (Solr) documents. Assuming you are retrieving parents after querying children (https://solr.apache.org/guide/solr/latest/query-guide/block-join-query-parser.html#block-join-parent-query-parser) that's all you got for rera

LTR Features on nested documents

2023-07-24 Thread Sergio García Maroto
Hi, I am trying to set up a list of features within LTR. I have a collection *"person"* with a design of two levels. I have Person documents with nested documents classified as jobs. Within the job level I have two fields describing if the job is current and recency. I would like to incorporate

Re: Issue indexing nested documents

2023-04-12 Thread Charlie Hubbard
> On Tue, Apr 11, 2023 at 4:57 AM Charlie Hubbard > > wrote: > > > Hi, > > > > I'm not sure what you mean by "naming children" exactly. I don't have a > > stack trace, but this video discusses nested documents, and refers to > > enhanced supp

Re: Issue indexing nested documents

2023-04-11 Thread Mikhail Khludnev
(named fields) at 6.6? On Tue, Apr 11, 2023 at 4:57 AM Charlie Hubbard wrote: > Hi, > > I'm not sure what you mean by "naming children" exactly. I don't have a > stack trace, but this video discusses nested documents, and refers to > enhanced support in 6.x so I

Re: Issue indexing nested documents

2023-04-10 Thread Charlie Hubbard
Hi, I'm not sure what you mean by "naming children" exactly. I don't have a stack trace, but this video discusses nested documents, and refers to enhanced support in 6.x so I know 6.x supports nested documents. https://youtu.be/qV0fIg-LGBE?t=446 Here is what is

Re: Issue indexing nested documents

2023-04-10 Thread Mikhail Khludnev
... >}, >... >] > } > > It clearly doesn't like the field `pages` which isn't defined in the > schema, but reading the various documents from Solr seems like it doesn't > need to be? That Solr should recognize the nested documents

Re: Issue indexing nested documents

2023-04-10 Thread dmitri maziuk
On 2023-04-10 12:47 PM, Charlie Hubbard wrote: So I did try to add `_nest_path_` but I can't define it because the class `solr.NestPathField` isn't available in Solr 6.6.6. I also have another strange error message about atomic updates like so: ... _root_ needs to be stored and/or docValues =t

Re: Issue indexing nested documents

2023-04-10 Thread Charlie Hubbard
So I did try to add `_nest_path_` but I can't define it because the class `solr.NestPathField` isn't available in Solr 6.6.6. I also have another strange error message about atomic updates like so: some_document.pdf was rejected by the server for Error from server at http://localhost:8983/solr/fu

Re: Issue indexing nested documents

2023-04-10 Thread dmitri maziuk
On 2023-04-10 11:12 AM, Charlie Hubbard wrote: Hi I'm encountering the following error when indexing a parent and nested children documents. I'm using managed schemas, Solr 6.6.6, and Solrj to send the documents up to Solr for indexing. I have the default `_root_` field defined in the schema,

Issue indexing nested documents

2023-04-10 Thread Charlie Hubbard
_pageNumber: 1, _type: "page", content: "lorem ipsum dolor" }, SolrInputDocument { ... }, ... ] } It clearly doesn't like the field `pages` which isn't defined in the schema, but reading the various documen

highlighting on all hierarchies in nested documents

2022-11-20 Thread 123456780sss
t" in order to get the root document that contains the children that match our conditions (in fq). For example: fq = {!parent which='doc_type:root'}(+doc_type=child_doc +doc_body:word) fl=*.[child limit=-1]. We got the full document with all the nested documents in the result, but th
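The request described in this snippet can be sketched as a set of query parameters. This is only a minimal illustration, not the poster's actual code: the helper name `build_parent_query_params` and the `q=*:*` main query are assumptions (the snippet's own `q` is cut off), and the sketch assumes that `doc_type=child_doc` and `fl=*.[child ...]` in the post were meant as `doc_type:child_doc` and `fl=*,[child ...]`.

```python
from urllib.parse import urlencode, parse_qs


def build_parent_query_params(parent_mask, child_clauses, child_limit=-1):
    """Build Solr request parameters for a block join parent filter.

    Hypothetical helper for illustration only.
    parent_mask   -- query that identifies the root documents
    child_clauses -- clauses that a child document must all match
    child_limit   -- value passed to the [child] transformer's limit
    """
    required = " ".join("+" + clause for clause in child_clauses)
    fq = "{!parent which='%s'}(%s)" % (parent_mask, required)
    return {
        "q": "*:*",  # assumption: the main query is not visible in the post
        "fq": fq,
        "fl": "*,[child limit=%d]" % child_limit,
    }


params = build_parent_query_params(
    "doc_type:root",
    ["doc_type:child_doc", "doc_body:word"],
)
query_string = urlencode(params)  # ready to append to a /select URL
```

Building the parameters as a dictionary and encoding them in one step avoids hand-escaping the braces, quotes and plus signs in the local-params syntax.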

Use Update Processor in nested fileds (nested documents)

2022-05-25 Thread dsaewqdsa
Hi, I want to use a Solr update processor on nested fields, for example: Use UniqFieldsUpdateProcessorFactory, on field: "subject" in documents: { "id":"dsaewqhgfytr", "lecturers":[ { "id":"dsaewqhgfytr.dsadsa321321dsa", "subject":"english" }, { "id":"dsaewqhgfytr.dsadsa321321ewq", "subject":"mat

Re: Search inside parent and nested documents

2021-11-01 Thread govind nitk
n improvement for [child] . >> >> Will try this out. >> >> On Sat, Oct 30, 2021 at 1:37 PM govind nitk >> wrote: >> >>> Hi, >>> >>> I am using edismax parser for querying parent documents. With [child ] >>> transformer, I am able

Re: Search inside parent and nested documents

2021-10-30 Thread govind nitk
parser for querying parent documents. With [child ] >> transformer, I am able to get the child/nested documents of matching >> parents. >> >> But I am curious to know any way to user queryfields for child docs? >> >> For example, q= A B C >> I am trying to g

Re: Search inside parent and nested documents

2021-10-30 Thread govind nitk
o get the child/nested documents of matching > parents. > > But I am curious to know any way to user queryfields for child docs? > > For example, q= A B C > I am trying to get child documents matching these query words ( at least 1 > or all). > How to get querying on child docs

Search inside parent and nested documents

2021-10-30 Thread govind nitk
Hi, I am using edismax parser for querying parent documents. With [child] transformer, I am able to get the child/nested documents of matching parents. But I am curious to know any way to use queryfields for child docs? For example, q= A B C I am trying to get child documents matching these

RE: Nested documents mixup with block join query parent mask

2021-06-21 Thread Manoj Mokashi
ts welcome. -Original Message- From: Manoj Mokashi Sent: Friday, June 18, 2021 4:17 PM To: users@solr.apache.org Subject: RE: Nested documents mixup with block join query parent mask A similar problem is seen with the parents parser, children belonging to other nodes sometimes appear as c

RE: Nested documents mixup with block join query parent mask

2021-06-18 Thread Manoj Mokashi
@solr.apache.org Subject: Nested documents mixup with block join query parent mask Hi, I am using solr 8.6.2 I have the following doc types marked by a type attribute : TR -> has child doc -> IN PB -> has child doc -> AU So the type IN docs exist only under TR and type AU only under PB But when I

Nested documents mixup with block join query parent mask

2021-06-18 Thread Manoj Mokashi
Hi, I am using Solr 8.6.2. I have the following doc types, marked by a type attribute: TR -> has child doc -> IN; PB -> has child doc -> AU. So the type IN docs exist only under TR and type AU only under PB. But when I run a query like type:IN AND {!child of='type:PB'}type:PB, I get a subset of IN
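One plausible explanation for the mix-up reported here is the block mask: the Solr block join documentation states that the `of` / `which` filter should match all parent documents, not only the parent type of interest. The toy model below is not Solr or Lucene code. It is a simplified sketch that assumes children are stored directly before their parent and that block boundaries are derived only from documents matching the mask; the function name `children_of` and the sample ids are made up for illustration.

```python
def children_of(docs, parent_mask, parent_query):
    """Toy model of a block join child query.

    docs         -- documents in index order, children before their parent
    parent_mask  -- predicate that marks block boundaries (the `of` filter)
    parent_query -- predicate selecting the parents whose children we want
    """
    result = []
    block_start = 0
    for position, doc in enumerate(docs):
        if parent_mask(doc):
            if parent_query(doc):
                # Everything since the previous masked parent counts as a child.
                result.extend(docs[block_start:position])
            block_start = position + 1
    return result


docs = [
    {"id": "in1", "type": "IN"},  # child of tr1
    {"id": "tr1", "type": "TR"},
    {"id": "au1", "type": "AU"},  # child of pb1
    {"id": "pb1", "type": "PB"},
]


def is_pb(doc):
    return doc["type"] == "PB"


def is_any_parent(doc):
    return doc["type"] in ("TR", "PB")


# Mask that only knows about PB parents, as in of='type:PB'
narrow_ids = [d["id"] for d in children_of(docs, is_pb, is_pb)]

# Mask that covers every parent type, as in of='type:(TR PB)'
full_ids = [d["id"] for d in children_of(docs, is_any_parent, is_pb)]
```

In this model the narrow mask yields `in1`, `tr1` and `au1` as "children" of `pb1`, so intersecting with `type:IN` returns IN documents, while the full mask yields only `au1`. If the same mechanism applies to the reported case, widening the mask to all parent types would be the first thing to try.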

RE: Atomic updates with nested documents

2021-06-01 Thread Chandramohan
Hi Adi, I am running Solr 8.8. I have implemented nested documents with _root_ stored=true and _nest_path_. Partial update is failing for me. It works once when I restart Solr and subsequent partial updates fail again until the next restart. It fails with Error 500: "msg":"Did not fi

Atomic Update of parent document in nested documents fails and needs solr restart

2021-05-28 Thread chandru
Version: Solr 8.8.0 I have a core with nested documents. I am trying to do a partial update on the parent document. It works the first time and fails after that, and needs a restart to do a partial update again. (Note: Normal update works fine. Partial update works fine when I don't have n

Re: Solr equivalent of relational joins on nested documents

2021-05-07 Thread Alain Rogister
} }, { "match": { "level2.field4": "mqnmnq" } } ] }

Re: Solr equivalent of relational joins on nested documents

2021-05-05 Thread Norbert Kutasi
Hello Alain, If I was going to implement queries on deeply nested documents, I can see quite a few examples here: https://solr.apache.org/guide/8_8/searching-nested-documents.html#parent-query-parser In a hierarchy like below the following query with a criteria on _nest_path_ is supposed to

Re: Solr equivalent of relational joins on nested documents

2021-05-04 Thread Timothy Potter
d there is some Event where event.item1='..' and > > event.item2='..' and there is some ActionPlan that has an Objective where > > Objective.item3='..' and this Objective has an Action where > > Action.item1='..' and Action.item2='..'

Re: Solr equivalent of relational joins on nested documents

2021-05-04 Thread Alain Rogister
and > event.item2='..' and there is some ActionPlan that has an Objective where > Objective.item3='..' and this Objective has an Action where Action.item1='..' > and Action.item2='..' > > This is fairly trivial in a relational DB. But is it achieva

Re: Solr equivalent of relational joins on nested documents

2021-04-23 Thread Alain Rogister
re is some ActionPlan that has an Objective where > > > Objective.item3='..' and this Objective has an Action where > > > Action.item1='..' and Action.item2='..' > > > > > > This is fairly trivial in a relational DB.

Re: Solr equivalent of relational joins on nested documents

2021-04-23 Thread Alain Rogister
> Profile.item1='...' and there is some Event where event.item1='..' and > > > event.item2='..' and there is some ActionPlan that has an Objective where > > > Objective.item3='..' and this Objective has an Action where > > > Action.item1=

Re: Solr equivalent of relational joins on nested documents

2021-04-22 Thread Timothy Potter
.' and there is some ActionPlan that has an Objective where > > Objective.item3='..' and this Objective has an Action where > > Action.item1='..' and Action.item2='..' > > > > This is fairly trivial in a relational DB. But is it achieva

Re: Solr equivalent of relational joins on nested documents

2021-04-22 Thread Alain Rogister
Hi, Thanks, I knew about that one, but note the date : 2012. A lot has changed in the meantime, specifically block joins and improved nested documents support. But I wonder about what can be expressed, even if we leave performance issues aside for another post. I can't find a comprehe

Re: Solr equivalent of relational joins on nested documents

2021-04-22 Thread Eran Buchnick
1='..' and > event.item2='..' and there is some ActionPlan that has an Objective where > Objective.item3='..' and this Objective has an Action where > Action.item1='..' and Action.item2='..' > > This is fairly trivial in a relati

Solr equivalent of relational joins on nested documents

2021-04-21 Thread Alain Rogister
' This is fairly trivial in a relational DB. But is it achievable in Solr ? Needless to say, there is no way a flat schema can work here. So I have assumed that nested documents were the only way to go. I have created a schema that includes these prerequisites, where "docType" i