Antoine Pitrou created ARROW-3997:
-------------------------------------
Summary: [C++] [Doc] Clarify dictionary encoding integer signedness (and width?)
Key: ARROW-3997
URL: https://issues.apache.org/jira/browse/ARROW-3997
Project: Apache Arrow
Issue Type: Improvement
Components: C++, Documentation, Format
Affects Versions: 0.11.1
Reporter: Antoine Pitrou
The Arrow spec states that a dictionary-encoded array uses int32 indices.
Signed or unsigned? The spec doesn't say.
Also, the C++ implementation supports all kinds of integers as indices (8- to
64-bit, signed and unsigned). I wonder if we should at least mandate a specific
signedness.
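
To make the ambiguity concrete, here is a minimal sketch in plain C++ (standard library only, not Arrow code) of how the same stored index can be read two ways. The helper names `ReadAsInt8`, `IsValidIndex` and `MaxIndex` are hypothetical and exist only for this illustration, and the 8-bit width is chosen just to keep the numbers small.

```cpp
#include <cstddef>
#include <cstdint>
#include <cstring>
#include <limits>

// Hypothetical helper: reinterpret the bits of an unsigned 8-bit index
// as a signed 8-bit index, as a reader assuming signed indices would.
inline std::int8_t ReadAsInt8(std::uint8_t raw) {
  std::int8_t value;
  std::memcpy(&value, &raw, sizeof(value));
  return value;
}

// Hypothetical helper: an index is usable only if it is non-negative
// and falls inside the dictionary.
inline bool IsValidIndex(std::int64_t index, std::size_t dictionary_size) {
  return index >= 0 &&
         static_cast<std::uint64_t>(index) < dictionary_size;
}

// Hypothetical helper: largest dictionary position that a given index
// type can address.
template <typename IndexType>
constexpr std::uint64_t MaxIndex() {
  return static_cast<std::uint64_t>(std::numeric_limits<IndexType>::max());
}
```

With a 256-entry dictionary, a writer using unsigned 8-bit indices can store index 200, while a reader assuming signed 8-bit indices sees a negative value for the same byte on two's-complement platforms and has to reject it. `MaxIndex<std::int8_t>()` versus `MaxIndex<std::uint8_t>()` shows the addressable range halving when the indices are signed.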