[ https://issues.apache.org/jira/browse/ARROW-608?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17657640#comment-17657640 ]
Rok Mihevc commented on ARROW-608: ---------------------------------- This issue has been migrated to [issue #16238|https://github.com/apache/arrow/issues/16238] on GitHub. Please see the [migration documentation|https://github.com/apache/arrow/issues/14542] for further details. > [Format] Days since epoch date type > ----------------------------------- > > Key: ARROW-608 > URL: https://issues.apache.org/jira/browse/ARROW-608 > Project: Apache Arrow > Issue Type: New Feature > Components: Format > Reporter: Wes McKinney > Assignee: Wes McKinney > Priority: Major > Fix For: 0.3.0 > > > While we've decided to make the primary IPC date type be int64 milliseconds > since the UNIX epoch, in many libraries dates are represented as integer > (int32, usually) days since some epoch. In the Python standard library, the > epoch is the year 0: > {code} > >>> d = datetime.date(2017, 1, 17) > >>> d.toordinal() > 736346 > >>> d.toordinal() / 365 > 2017 > {code} > At least in Cplusplus-land, in working on ARROW-452 I ran into the problem of > how to do zero-copy reads of such data, while preserving the metadata to know > that the values are dates. I added a cpp-only "date32" type to support this > use case https://github.com/apache/arrow/pull/365 > I'm not sure whether we should add a new logical type, but thought it would > be worth bringing up in any case -- This message was sent by Atlassian Jira (v8.20.10#820010)