Like Pandas, in the R package we store R-specific metadata in the schema;
this lets us preserve all R class and other attributes when round-tripping
data to Feather or Parquet. These are not stored in a way that is readable
or useful to other implementations, partly by design. If there were type
information/metadata that was meaningful across libraries, it would make
sense to define (and document) an extension type.

Some discussion here:
https://arrow.apache.org/docs/r/articles/arrow.html#r-object-attributes

Neal

On Thu, Apr 8, 2021 at 10:52 AM Micah Kornfield <emkornfi...@gmail.com>
wrote:

>  1.  Do the standard libraries handle that metadata key automatically ?
>
> C++, Python and Java have facilities to support them automatically
> (extensions needs to register themselves), I'm not sure about other
> languages.
>
> 2. Are there other standard or best practice metadata keys that either
> people generally use or the standard libraries expect?
>
> I think there are some things related to Pandas types and parquet data that
> are propagated automatically through metadata.  Rust was looking into
> including file sources but I'm not sure that is being externalized.  There
> might be other things, hopefully people will chime in.
>
> On Thu, Apr 8, 2021 at 10:47 AM Michael Lavina <michael.lav...@factset.com
> >
> wrote:
>
> > Hey Team,
> >
> > I noticed that under extension type the Arrow docs specifically call out
> > to `ARROW:extension:name`  and `ARROW:extension:metadata` as
> > recommended/reserved metadata keys to handle extension types.
> >
> > 2 quick questions
> >
> >
> >   1.  Do the standard libraries handle that metadata key automatically ?
> >   2.  Are there other standard or best practice metadata keys that either
> > people generally use or the standard libraries expect?
> >
> > Thank you,
> > Michael
> >
>

Reply via email to