Antoine Pitrou created ARROW-3997: ------------------------------------- Summary: [C++] [Doc] Clarify dictionary encoding integer signedness (and width?) Key: ARROW-3997 URL: https://issues.apache.org/jira/browse/ARROW-3997 Project: Apache Arrow Issue Type: Improvement Components: C++, Documentation, Format Affects Versions: 0.11.1 Reporter: Antoine Pitrou
The Arrow spec states that a dictionary-encoded array uses int32 indices. Signed or unsigned? The spec doesn't say. Also, the C++ implementation supports all kinds of integers as indices (8- to 64-bit, signed and unsigned). I wonder if we should at least mandate a specific signedness. -- This message was sent by Atlassian JIRA (v7.6.3#76005)