Just to be clear, I think we are referring to a "well known"/canonical
extension type [1] here?   I'd also be in favor of this (Disclaimer I'm a
colleague of Padeep's)

[1] https://arrow.apache.org/docs/format/Columnar.html#extension-types


On Fri, Jul 29, 2022 at 3:19 PM Wes McKinney <wesmck...@gmail.com> wrote:

> This seems like a common-enough data type that having a first-class
> logical type would be a good idea (perhaps even more so than UUID!).
> Compute engines would be able to implement kernels that provide
> manipulations of JSON data similar to what you can do with jq or
> GraphQL.
>
> On Fri, Jul 29, 2022 at 1:43 PM Pradeep Gollakota
> <pgollak...@google.com.invalid> wrote:
> >
> > Hi Team!
> >
> > I filed ARROW-17255 to support the JSON logical type in Arrow. Initially
> > I'm only interested in C++ support that wraps a string. I imagine that as
> > Arrow and Parquet get more sophisticated, we might want to do more
> > interesting things (shredding?) with the JSON.
> >
> > David mentioned that there have been discussions around other "common"
> > extensions like UUID. Is this something that the community would be
> > interested in? My goal at the moment is to be able to export data from
> > BigQuery to Parquet with the correct LogicalType set in the exported
> files.
> >
> > Thanks!
> > Pradeep
>

Reply via email to