[jira] [Created] (ARROW-4100) [Gandiva][C++] Fix regex to ignore.

2018-12-20 Thread Praveen Kumar Desabandu (JIRA)
Praveen Kumar Desabandu created ARROW-4100: -- Summary: [Gandiva][C++] Fix regex to ignore. Key: ARROW-4100 URL: https://issues.apache.org/jira/browse/ARROW-4100 Project: Apache Arrow

[jira] [Created] (ARROW-4099) [Python] Pretty printing very large ChunkedArray objects can use unbounded memory

2018-12-20 Thread Wes McKinney (JIRA)
Wes McKinney created ARROW-4099: --- Summary: [Python] Pretty printing very large ChunkedArray objects can use unbounded memory Key: ARROW-4099 URL: https://issues.apache.org/jira/browse/ARROW-4099 Project

[jira] [Created] (ARROW-4098) [Python] Deprecate pyarrow.open_stream,open_file in favor of pa.ipc.open_stream/open_file

2018-12-20 Thread Wes McKinney (JIRA)
Wes McKinney created ARROW-4098: --- Summary: [Python] Deprecate pyarrow.open_stream,open_file in favor of pa.ipc.open_stream/open_file Key: ARROW-4098 URL: https://issues.apache.org/jira/browse/ARROW-4098

[jira] [Created] (ARROW-4096) [C++] Preserve "ordered" metadata in some special cases in dictionary unification

2018-12-20 Thread Wes McKinney (JIRA)
Wes McKinney created ARROW-4096: --- Summary: [C++] Preserve "ordered" metadata in some special cases in dictionary unification Key: ARROW-4096 URL: https://issues.apache.org/jira/browse/ARROW-4096 Project

[jira] [Created] (ARROW-4097) [C++] Add function to "conform" a dictionary array to a target new dictionary

2018-12-20 Thread Wes McKinney (JIRA)
Wes McKinney created ARROW-4097: --- Summary: [C++] Add function to "conform" a dictionary array to a target new dictionary Key: ARROW-4097 URL: https://issues.apache.org/jira/browse/ARROW-4097 Project: Ap

[jira] [Created] (ARROW-4095) [C++] Implement optimizations for dictionary unification where dictionaries are prefixes of the unified dictionary

2018-12-20 Thread Wes McKinney (JIRA)
Wes McKinney created ARROW-4095: --- Summary: [C++] Implement optimizations for dictionary unification where dictionaries are prefixes of the unified dictionary Key: ARROW-4095 URL: https://issues.apache.org/jira/brows

[jira] [Created] (ARROW-4094) [Python] Store RangeIndex in Parquet files as metadata rather than a physical data column

2018-12-20 Thread Wes McKinney (JIRA)
Wes McKinney created ARROW-4094: --- Summary: [Python] Store RangeIndex in Parquet files as metadata rather than a physical data column Key: ARROW-4094 URL: https://issues.apache.org/jira/browse/ARROW-4094

[jira] [Created] (ARROW-4093) [C++] Deprecated method suggests wrong method

2018-12-20 Thread Kouhei Sutou (JIRA)
Kouhei Sutou created ARROW-4093: --- Summary: [C++] Deprecated method suggests wrong method Key: ARROW-4093 URL: https://issues.apache.org/jira/browse/ARROW-4093 Project: Apache Arrow Issue Type:

[jira] [Created] (ARROW-4092) Implement common Reader / DataSource trait for CSV and Parquet

2018-12-20 Thread Andy Grove (JIRA)
Andy Grove created ARROW-4092: - Summary: Implement common Reader / DataSource trait for CSV and Parquet Key: ARROW-4092 URL: https://issues.apache.org/jira/browse/ARROW-4092 Project: Apache Arrow

[jira] [Created] (ARROW-4091) [C++] Curate default list of CSV null spellings

2018-12-20 Thread Antoine Pitrou (JIRA)
Antoine Pitrou created ARROW-4091: - Summary: [C++] Curate default list of CSV null spellings Key: ARROW-4091 URL: https://issues.apache.org/jira/browse/ARROW-4091 Project: Apache Arrow Issue

[jira] [Created] (ARROW-4090) Table.flatten() doesn't work recursively

2018-12-20 Thread Francisco Sanchez (JIRA)
Francisco Sanchez created ARROW-4090: Summary: Table.flatten() doesn't work recursively Key: ARROW-4090 URL: https://issues.apache.org/jira/browse/ARROW-4090 Project: Apache Arrow Issue T

Re: pyarrow and DictionaryBatches

2018-12-20 Thread Wes McKinney
hi Naveen -- there is not an API to generate them separately right now; they are written as part of serializing the schema. I don't see an issue with adding APIs to create them as buffers separate from the schema message. Feel free to open a JIRA and/or submit a PR - Wes On Thu, Dec 20, 2018 at 9

pyarrow and DictionaryBatches

2018-12-20 Thread Naveen Michaud-Agrawal
Hello, Is there any way to created DictionaryBatches from pyarrow? From the docs it looks like they are just regular RecordBatches with some extra metadata, but I don't see an API to create them directly. Thanks, Naveen

[jira] [Created] (ARROW-4089) [Plasma] The tutorial is wrong regarding the parameter type of PlasmaClient.Create

2018-12-20 Thread Kousuke Saruta (JIRA)
Kousuke Saruta created ARROW-4089: - Summary: [Plasma] The tutorial is wrong regarding the parameter type of PlasmaClient.Create Key: ARROW-4089 URL: https://issues.apache.org/jira/browse/ARROW-4089 P

[jira] [Created] (ARROW-4088) Table.from_batches() fails when passed a schema with metadata

2018-12-20 Thread Thomas Buhrmann (JIRA)
Thomas Buhrmann created ARROW-4088: -- Summary: Table.from_batches() fails when passed a schema with metadata Key: ARROW-4088 URL: https://issues.apache.org/jira/browse/ARROW-4088 Project: Apache Arrow

[jira] [Created] (ARROW-4087) [C++] Make CSV nulls configurable

2018-12-20 Thread Antoine Pitrou (JIRA)
Antoine Pitrou created ARROW-4087: - Summary: [C++] Make CSV nulls configurable Key: ARROW-4087 URL: https://issues.apache.org/jira/browse/ARROW-4087 Project: Apache Arrow Issue Type: Improvem

[jira] [Created] (ARROW-4086) [Java] Add api to fetch summary of root allocator

2018-12-20 Thread Pindikura Ravindra (JIRA)
Pindikura Ravindra created ARROW-4086: - Summary: [Java] Add api to fetch summary of root allocator Key: ARROW-4086 URL: https://issues.apache.org/jira/browse/ARROW-4086 Project: Apache Arrow