On Tue, Feb 18, 2025 at 05:44:23PM +0000, Rahul Goswami wrote:
> The official documentation should be a good starting point for your
> questions   :
> https://solr.apache.org/guide/solr/latest/indexing-guide/copy-fields.html
> 
> Reading the above documentation should help answer some of them and provide
> a general structure to your understanding of copyFields.

Yes, it should have.  I had read it.

> Overall, copyFields are not much different from a regular field in terms of
> attributes that you can define (stored, indexed, docValues etc). It is a

Thank you.  This is good to know.  In
  https://solr.apache.org/guide/solr/latest/indexing-guide/copy-fields.html
there is no list of defined attributes and no statement that copyField
accepts the same attributes as field.  The only attributes documented
there are source, dest, and maxChars.

> destination for one or more of your source fields in cases when you might
> want a catchall field, or a field with a different analysis pipeline for
> search than the one defined for the source field.
> 
> What Solr does is before handing over the document for indexing to the
> lower Lucene layer, based on the copy field configuration in your defined
> schema, extracts fields from the document and copies them over to the copy
> field to form the "complete" document.

It's not stated in the page linked above whether the field value is
copied or referenced, and I've found no way to infer it.

> From this point on, the copy field gets stored(if stored=true), analyzed
> and indexed (if indexed=true) just like any other field.

Again, good to know.  Thank you.

This answers my immediate concern:  yes, we should pay attention to
the attributes of the source field and the copyField so that we can
avoid un-necessary duplication of storage.  We're working with full
texts of academic papers, which can be quite large, and we might save
quite a bit of storage and I/O with careful schema design.

> On Tue, Feb 18, 2025 at 12:20 PM <mw...@iu.edu> wrote:
> 
> > So *how* does copyField work?  Do I wind up with two identical copies
> > of the data stored in the index (if stored='true')?  Does the
> > copyField element accept a 'stored' attribute, and can it have a
> > different value than the source field declares?  What *are* the
> > defined attributes for copyField, and do they interact with the source
> > field's attributes?  Where should I have found answers to these
> > questions?
> >
> > --
> > Mark H. Wood
> > Lead Technology Analyst
> >
> > University Library
> > Indiana University Indianapolis
> > 755 W. Michigan Street
> > Indianapolis, IN 46202
> > 317-274-0749
> > library.indianapolis.iu.edu
> >

-- 
Mark H. Wood
Lead Technology Analyst

University Library
Indiana University Indianapolis
755 W. Michigan Street
Indianapolis, IN 46202
317-274-0749
library.indianapolis.iu.edu

Attachment: signature.asc
Description: PGP signature

Reply via email to