[jira] [Updated] (ARROW-439) [Python] Add option in "to_pandas" conversions to yield Categorical from String/Binary arrays

2023-01-10 Thread Rok Mihevc (Jira)
[ https://issues.apache.org/jira/browse/ARROW-439?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rok Mihevc updated ARROW-439: - External issue URL: https://github.com/apache/arrow/issues/15339 > [Python] Add option in "to_pandas" conv

[jira] [Commented] (ARROW-438) [Python] Concatenate Table instances with equal schemas

2023-01-10 Thread Rok Mihevc (Jira)
[ https://issues.apache.org/jira/browse/ARROW-438?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17657472#comment-17657472 ] Rok Mihevc commented on ARROW-438: -- This issue has been migrated to [issue #16085|https:

[jira] [Updated] (ARROW-442) [Python] Add public Python API to inspect Parquet file metadata

2023-01-10 Thread Rok Mihevc (Jira)
[ https://issues.apache.org/jira/browse/ARROW-442?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rok Mihevc updated ARROW-442: - External issue URL: https://github.com/apache/arrow/issues/16088 > [Python] Add public Python API to inspe

[jira] [Commented] (ARROW-431) [Python] Review GIL release and acquisition in to_pandas conversion

2023-01-10 Thread Rok Mihevc (Jira)
[ https://issues.apache.org/jira/browse/ARROW-431?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17657465#comment-17657465 ] Rok Mihevc commented on ARROW-431: -- This issue has been migrated to [issue #16078|https:

[jira] [Updated] (ARROW-444) [Python] Avoid unnecessary memory copies from use of PyBytes_* C APIs

2023-01-10 Thread Rok Mihevc (Jira)
[ https://issues.apache.org/jira/browse/ARROW-444?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rok Mihevc updated ARROW-444: - External issue URL: https://github.com/apache/arrow/issues/16090 > [Python] Avoid unnecessary memory copie

[jira] [Commented] (ARROW-359) Need to document ARROW_LIBHDFS_DIR

2023-01-10 Thread Rok Mihevc (Jira)
[ https://issues.apache.org/jira/browse/ARROW-359?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17657393#comment-17657393 ] Rok Mihevc commented on ARROW-359: -- This issue has been migrated to [issue #15901|https:

[jira] [Updated] (ARROW-448) [Python] Load HdfsClient default options from core-site.xml or hdfs-site.xml, if available

2023-01-10 Thread Rok Mihevc (Jira)
[ https://issues.apache.org/jira/browse/ARROW-448?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rok Mihevc updated ARROW-448: - External issue URL: https://github.com/apache/arrow/issues/16093 > [Python] Load HdfsClient default option

[jira] [Commented] (ARROW-362) Python: Calling to_pandas on a table read from Parquet leaks memory

2023-01-10 Thread Rok Mihevc (Jira)
[ https://issues.apache.org/jira/browse/ARROW-362?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17657396#comment-17657396 ] Rok Mihevc commented on ARROW-362: -- This issue has been migrated to [issue #15907|https:

[jira] [Updated] (ARROW-364) [Python] Multithreaded conversion between Arrow record batches as NumPy arrays (for pandas)

2023-01-10 Thread Rok Mihevc (Jira)
[ https://issues.apache.org/jira/browse/ARROW-364?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rok Mihevc updated ARROW-364: - External issue URL: https://github.com/apache/arrow/issues/15916 > [Python] Multithreaded conversion betwe

[jira] [Updated] (ARROW-366) [java] implement Dictionary vector

2023-01-10 Thread Rok Mihevc (Jira)
[ https://issues.apache.org/jira/browse/ARROW-366?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rok Mihevc updated ARROW-366: - External issue URL: https://github.com/apache/arrow/issues/15923 > [java] implement Dictionary vector > --

[jira] [Updated] (ARROW-361) Python: Support reading a column-selection from Parquet files

2023-01-10 Thread Rok Mihevc (Jira)
[ https://issues.apache.org/jira/browse/ARROW-361?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rok Mihevc updated ARROW-361: - External issue URL: https://github.com/apache/arrow/issues/15903 > Python: Support reading a column-select

[jira] [Commented] (ARROW-367) [java] converter csv/json <=> Arrow file format for Integration tests

2023-01-10 Thread Rok Mihevc (Jira)
[ https://issues.apache.org/jira/browse/ARROW-367?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17657401#comment-17657401 ] Rok Mihevc commented on ARROW-367: -- This issue has been migrated to [issue #15925|https:

[jira] [Commented] (ARROW-369) [Python] Add ability to convert multiple record batches at once to pandas

2023-01-10 Thread Rok Mihevc (Jira)
[ https://issues.apache.org/jira/browse/ARROW-369?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17657403#comment-17657403 ] Rok Mihevc commented on ARROW-369: -- This issue has been migrated to [issue #15932|https:

[jira] [Commented] (ARROW-372) Create JSON arrow file format for integration tests

2023-01-10 Thread Rok Mihevc (Jira)
[ https://issues.apache.org/jira/browse/ARROW-372?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17657406#comment-17657406 ] Rok Mihevc commented on ARROW-372: -- This issue has been migrated to [issue #15938|https:

[jira] [Commented] (ARROW-373) [C++] Implement C++ version of JSON file format for testing

2023-01-10 Thread Rok Mihevc (Jira)
[ https://issues.apache.org/jira/browse/ARROW-373?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17657407#comment-17657407 ] Rok Mihevc commented on ARROW-373: -- This issue has been migrated to [issue #15331|https:

[jira] [Updated] (ARROW-373) [C++] Implement C++ version of JSON file format for testing

2023-01-10 Thread Rok Mihevc (Jira)
[ https://issues.apache.org/jira/browse/ARROW-373?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rok Mihevc updated ARROW-373: - External issue URL: https://github.com/apache/arrow/issues/15331 > [C++] Implement C++ version of JSON fil

[jira] [Commented] (ARROW-375) columns parameter in parquet.read_table() raises KeyError for valid column

2023-01-10 Thread Rok Mihevc (Jira)
[ https://issues.apache.org/jira/browse/ARROW-375?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17657409#comment-17657409 ] Rok Mihevc commented on ARROW-375: -- This issue has been migrated to [issue #15943|https:

[jira] [Updated] (ARROW-374) Python: clarify unicode vs. binary in API

2023-01-10 Thread Rok Mihevc (Jira)
[ https://issues.apache.org/jira/browse/ARROW-374?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rok Mihevc updated ARROW-374: - External issue URL: https://github.com/apache/arrow/issues/15942 > Python: clarify unicode vs. binary in A

[jira] [Updated] (ARROW-378) Python: Respect timezone on conversion of Pandas datetime columns

2023-01-10 Thread Rok Mihevc (Jira)
[ https://issues.apache.org/jira/browse/ARROW-378?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rok Mihevc updated ARROW-378: - External issue URL: https://github.com/apache/arrow/issues/15951 > Python: Respect timezone on conversion

[jira] [Updated] (ARROW-380) [Java] optimize null count when serializing vectors.

2023-01-10 Thread Rok Mihevc (Jira)
[ https://issues.apache.org/jira/browse/ARROW-380?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rok Mihevc updated ARROW-380: - External issue URL: https://github.com/apache/arrow/issues/15332 > [Java] optimize null count when seriali

[jira] [Commented] (ARROW-383) [C++] Implement C++ version of ARROW-367 integration test validator

2023-01-10 Thread Rok Mihevc (Jira)
[ https://issues.apache.org/jira/browse/ARROW-383?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17657417#comment-17657417 ] Rok Mihevc commented on ARROW-383: -- This issue has been migrated to [issue #15982|https:

[jira] [Updated] (ARROW-382) Python: Extend API documentation

2023-01-10 Thread Rok Mihevc (Jira)
[ https://issues.apache.org/jira/browse/ARROW-382?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rok Mihevc updated ARROW-382: - External issue URL: https://github.com/apache/arrow/issues/15972 > Python: Extend API documentation >

[jira] [Commented] (ARROW-385) [Java] Refactor metrics system

2023-01-10 Thread Rok Mihevc (Jira)
[ https://issues.apache.org/jira/browse/ARROW-385?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17657419#comment-17657419 ] Rok Mihevc commented on ARROW-385: -- This issue has been migrated to [issue #15996|https:

[jira] [Commented] (ARROW-448) [Python] Load HdfsClient default options from core-site.xml or hdfs-site.xml, if available

2023-01-10 Thread Rok Mihevc (Jira)
[ https://issues.apache.org/jira/browse/ARROW-448?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17657482#comment-17657482 ] Rok Mihevc commented on ARROW-448: -- This issue has been migrated to [issue #16093|https:

[jira] [Commented] (ARROW-384) Align Java and C++ RecordBatch data and metadata layout

2023-01-10 Thread Rok Mihevc (Jira)
[ https://issues.apache.org/jira/browse/ARROW-384?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17657418#comment-17657418 ] Rok Mihevc commented on ARROW-384: -- This issue has been migrated to [issue #15990|https:

[jira] [Commented] (ARROW-450) Python: Fixes for PARQUET-818

2023-01-10 Thread Rok Mihevc (Jira)
[ https://issues.apache.org/jira/browse/ARROW-450?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17657484#comment-17657484 ] Rok Mihevc commented on ARROW-450: -- This issue has been migrated to [issue #16095|https:

[jira] [Updated] (ARROW-450) Python: Fixes for PARQUET-818

2023-01-10 Thread Rok Mihevc (Jira)
[ https://issues.apache.org/jira/browse/ARROW-450?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rok Mihevc updated ARROW-450: - External issue URL: https://github.com/apache/arrow/issues/16095 > Python: Fixes for PARQUET-818 > ---

[jira] [Commented] (ARROW-451) [C++] Override DataType::Equals for other types with additional metadata

2023-01-10 Thread Rok Mihevc (Jira)
[ https://issues.apache.org/jira/browse/ARROW-451?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17657485#comment-17657485 ] Rok Mihevc commented on ARROW-451: -- This issue has been migrated to [issue #16096|https:

[jira] [Commented] (ARROW-386) [Java] Respect case of struct / map field names

2023-01-10 Thread Rok Mihevc (Jira)
[ https://issues.apache.org/jira/browse/ARROW-386?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17657420#comment-17657420 ] Rok Mihevc commented on ARROW-386: -- This issue has been migrated to [issue #16001|https:

[jira] [Updated] (ARROW-387) [C++] arrow::io::BufferReader does not permit shared memory ownership in zero-copy reads

2023-01-10 Thread Rok Mihevc (Jira)
[ https://issues.apache.org/jira/browse/ARROW-387?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rok Mihevc updated ARROW-387: - External issue URL: https://github.com/apache/arrow/issues/16005 > [C++] arrow::io::BufferReader does not

[jira] [Updated] (ARROW-389) Python: Write Parquet files to pyarrow.io.NativeFile objects

2023-01-10 Thread Rok Mihevc (Jira)
[ https://issues.apache.org/jira/browse/ARROW-389?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rok Mihevc updated ARROW-389: - External issue URL: https://github.com/apache/arrow/issues/16007 > Python: Write Parquet files to pyarrow.

[jira] [Commented] (ARROW-388) [C++] Add a "shifted file" abstraction

2023-01-10 Thread Rok Mihevc (Jira)
[ https://issues.apache.org/jira/browse/ARROW-388?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17657422#comment-17657422 ] Rok Mihevc commented on ARROW-388: -- This issue has been migrated to [issue #16006|https:

[jira] [Commented] (ARROW-462) [C++] Implement in-memory conversions between non-nested primitive types and DictionaryArray equivalent

2023-01-10 Thread Rok Mihevc (Jira)
[ https://issues.apache.org/jira/browse/ARROW-462?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17657496#comment-17657496 ] Rok Mihevc commented on ARROW-462: -- This issue has been migrated to [issue #16106|https:

[jira] [Updated] (ARROW-456) C++: Add jemalloc based MemoryPool

2023-01-10 Thread Rok Mihevc (Jira)
[ https://issues.apache.org/jira/browse/ARROW-456?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rok Mihevc updated ARROW-456: - External issue URL: https://github.com/apache/arrow/issues/16100 > C++: Add jemalloc based MemoryPool > --

[jira] [Commented] (ARROW-390) C++: CMake fails on json-integration-test with ARROW_BUILD_TESTS=OFF

2023-01-10 Thread Rok Mihevc (Jira)
[ https://issues.apache.org/jira/browse/ARROW-390?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17657424#comment-17657424 ] Rok Mihevc commented on ARROW-390: -- This issue has been migrated to [issue #16008|https:

[jira] [Commented] (ARROW-455) [C++] BufferOutputStream dtor does not call Close()

2023-01-10 Thread Rok Mihevc (Jira)
[ https://issues.apache.org/jira/browse/ARROW-455?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17657489#comment-17657489 ] Rok Mihevc commented on ARROW-455: -- This issue has been migrated to [issue #15341|https:

[jira] [Updated] (ARROW-455) [C++] BufferOutputStream dtor does not call Close()

2023-01-10 Thread Rok Mihevc (Jira)
[ https://issues.apache.org/jira/browse/ARROW-455?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rok Mihevc updated ARROW-455: - External issue URL: https://github.com/apache/arrow/issues/15341 > [C++] BufferOutputStream dtor does not

[jira] [Commented] (ARROW-454) pojo.Field doesn't implement hashCode()

2023-01-10 Thread Rok Mihevc (Jira)
[ https://issues.apache.org/jira/browse/ARROW-454?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17657488#comment-17657488 ] Rok Mihevc commented on ARROW-454: -- This issue has been migrated to [issue #16099|https:

[jira] [Updated] (ARROW-464) C++: More intelligent array growing

2023-01-10 Thread Rok Mihevc (Jira)
[ https://issues.apache.org/jira/browse/ARROW-464?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rok Mihevc updated ARROW-464: - External issue URL: https://github.com/apache/arrow/issues/16108 > C++: More intelligent array growing > -

[jira] [Updated] (ARROW-385) [Java] Refactor metrics system

2023-01-10 Thread Rok Mihevc (Jira)
[ https://issues.apache.org/jira/browse/ARROW-385?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rok Mihevc updated ARROW-385: - External issue URL: https://github.com/apache/arrow/issues/15996 > [Java] Refactor metrics system > --

[jira] [Commented] (ARROW-457) Python: Better control over memory pool

2023-01-10 Thread Rok Mihevc (Jira)
[ https://issues.apache.org/jira/browse/ARROW-457?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17657491#comment-17657491 ] Rok Mihevc commented on ARROW-457: -- This issue has been migrated to [issue #16101|https:

[jira] [Commented] (ARROW-466) C++: ExternalProject for jemalloc

2023-01-10 Thread Rok Mihevc (Jira)
[ https://issues.apache.org/jira/browse/ARROW-466?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17657500#comment-17657500 ] Rok Mihevc commented on ARROW-466: -- This issue has been migrated to [issue #16110|https:

[jira] [Commented] (ARROW-394) Add integration tests for boolean, list, struct, and other basic types

2023-01-10 Thread Rok Mihevc (Jira)
[ https://issues.apache.org/jira/browse/ARROW-394?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17657428#comment-17657428 ] Rok Mihevc commented on ARROW-394: -- This issue has been migrated to [issue #16021|https:

[jira] [Commented] (ARROW-467) [Python] Run parquet-cpp unit tests in Travis CI

2023-01-10 Thread Rok Mihevc (Jira)
[ https://issues.apache.org/jira/browse/ARROW-467?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17657501#comment-17657501 ] Rok Mihevc commented on ARROW-467: -- This issue has been migrated to [issue #16111|https:

[jira] [Commented] (ARROW-470) [Python] Add "FileSystem" abstraction to access directories of files in a uniform way

2023-01-10 Thread Rok Mihevc (Jira)
[ https://issues.apache.org/jira/browse/ARROW-470?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17657504#comment-17657504 ] Rok Mihevc commented on ARROW-470: -- This issue has been migrated to [issue #16114|https:

[jira] [Commented] (ARROW-468) Python: Conversion of nested data in pd.DataFrames to/from Arrow structures

2023-01-10 Thread Rok Mihevc (Jira)
[ https://issues.apache.org/jira/browse/ARROW-468?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17657502#comment-17657502 ] Rok Mihevc commented on ARROW-468: -- This issue has been migrated to [issue #16112|https:

[jira] [Commented] (ARROW-471) [Python] Enable ParquetFile to pass down separately-obtained file metadata

2023-01-10 Thread Rok Mihevc (Jira)
[ https://issues.apache.org/jira/browse/ARROW-471?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17657505#comment-17657505 ] Rok Mihevc commented on ARROW-471: -- This issue has been migrated to [issue #16115|https:

[jira] [Updated] (ARROW-468) Python: Conversion of nested data in pd.DataFrames to/from Arrow structures

2023-01-10 Thread Rok Mihevc (Jira)
[ https://issues.apache.org/jira/browse/ARROW-468?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rok Mihevc updated ARROW-468: - External issue URL: https://github.com/apache/arrow/issues/16112 > Python: Conversion of nested data in pd

[jira] [Updated] (ARROW-469) C++: Add option so that resize doesn't decrease the capacity

2023-01-10 Thread Rok Mihevc (Jira)
[ https://issues.apache.org/jira/browse/ARROW-469?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rok Mihevc updated ARROW-469: - External issue URL: https://github.com/apache/arrow/issues/16113 > C++: Add option so that resize doesn't

[jira] [Commented] (ARROW-473) [C++/Python] Add public API for retrieving block locations for a particular HDFS file

2023-01-10 Thread Rok Mihevc (Jira)
[ https://issues.apache.org/jira/browse/ARROW-473?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17657507#comment-17657507 ] Rok Mihevc commented on ARROW-473: -- This issue has been migrated to [issue #16117|https:

[jira] [Updated] (ARROW-470) [Python] Add "FileSystem" abstraction to access directories of files in a uniform way

2023-01-10 Thread Rok Mihevc (Jira)
[ https://issues.apache.org/jira/browse/ARROW-470?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rok Mihevc updated ARROW-470: - External issue URL: https://github.com/apache/arrow/issues/16114 > [Python] Add "FileSystem" abstraction t

[jira] [Updated] (ARROW-463) C++: Support jemalloc 4.x

2023-01-10 Thread Rok Mihevc (Jira)
[ https://issues.apache.org/jira/browse/ARROW-463?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rok Mihevc updated ARROW-463: - External issue URL: https://github.com/apache/arrow/issues/16107 > C++: Support jemalloc 4.x > ---

[jira] [Commented] (ARROW-477) [Java] Add support for second/microsecond/nanosecond timestamps in-memory and in IPC/JSON layer

2023-01-10 Thread Rok Mihevc (Jira)
[ https://issues.apache.org/jira/browse/ARROW-477?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17657511#comment-17657511 ] Rok Mihevc commented on ARROW-477: -- This issue has been migrated to [issue #16121|https:

[jira] [Commented] (ARROW-399) [Java] ListVector.loadFieldBuffers ignores the ArrowFieldNode length metadata

2023-01-10 Thread Rok Mihevc (Jira)
[ https://issues.apache.org/jira/browse/ARROW-399?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17657433#comment-17657433 ] Rok Mihevc commented on ARROW-399: -- This issue has been migrated to [issue #16027|https:

[jira] [Commented] (ARROW-398) [Java] Java file format requires bitmaps of all 1's to be written when there are no nulls

2023-01-10 Thread Rok Mihevc (Jira)
[ https://issues.apache.org/jira/browse/ARROW-398?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17657432#comment-17657432 ] Rok Mihevc commented on ARROW-398: -- This issue has been migrated to [issue #16026|https:

[jira] [Commented] (ARROW-476) [Integration] Add integration tests for Binary / Varbytes type

2023-01-10 Thread Rok Mihevc (Jira)
[ https://issues.apache.org/jira/browse/ARROW-476?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17657510#comment-17657510 ] Rok Mihevc commented on ARROW-476: -- This issue has been migrated to [issue #16120|https:

[jira] [Updated] (ARROW-478) [Python] Accept a PyBytes object in the pyarrow.io.BufferReader ctor

2023-01-10 Thread Rok Mihevc (Jira)
[ https://issues.apache.org/jira/browse/ARROW-478?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rok Mihevc updated ARROW-478: - External issue URL: https://github.com/apache/arrow/issues/16122 > [Python] Accept a PyBytes object in the

[jira] [Updated] (ARROW-399) [Java] ListVector.loadFieldBuffers ignores the ArrowFieldNode length metadata

2023-01-10 Thread Rok Mihevc (Jira)
[ https://issues.apache.org/jira/browse/ARROW-399?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rok Mihevc updated ARROW-399: - External issue URL: https://github.com/apache/arrow/issues/16027 > [Java] ListVector.loadFieldBuffers igno

[jira] [Commented] (ARROW-475) [Python] High level support for reading directories of Parquet files (as a single Arrow table) from supported file system interfaces

2023-01-10 Thread Rok Mihevc (Jira)
[ https://issues.apache.org/jira/browse/ARROW-475?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17657509#comment-17657509 ] Rok Mihevc commented on ARROW-475: -- This issue has been migrated to [issue #16119|https:

[jira] [Commented] (ARROW-482) [Java] Provide API access to "custom_metadata" Field attribute in IPC setting

2023-01-10 Thread Rok Mihevc (Jira)
[ https://issues.apache.org/jira/browse/ARROW-482?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17657516#comment-17657516 ] Rok Mihevc commented on ARROW-482: -- This issue has been migrated to [issue #16126|https:

[jira] [Updated] (ARROW-482) [Java] Provide API access to "custom_metadata" Field attribute in IPC setting

2023-01-10 Thread Rok Mihevc (Jira)
[ https://issues.apache.org/jira/browse/ARROW-482?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rok Mihevc updated ARROW-482: - External issue URL: https://github.com/apache/arrow/issues/16126 > [Java] Provide API access to "custom_me

[jira] [Updated] (ARROW-479) Python: Test for expected schema in Pandas conversion

2023-01-10 Thread Rok Mihevc (Jira)
[ https://issues.apache.org/jira/browse/ARROW-479?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rok Mihevc updated ARROW-479: - External issue URL: https://github.com/apache/arrow/issues/16123 > Python: Test for expected schema in Pan

[jira] [Updated] (ARROW-404) [Python] Closing an HdfsClient while there are still open file handles results in a crash

2023-01-10 Thread Rok Mihevc (Jira)
[ https://issues.apache.org/jira/browse/ARROW-404?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rok Mihevc updated ARROW-404: - External issue URL: https://github.com/apache/arrow/issues/16052 > [Python] Closing an HdfsClient while th

[jira] [Updated] (ARROW-341) [Python] Making libpyarrow available to third parties

2023-01-10 Thread Rok Mihevc (Jira)
[ https://issues.apache.org/jira/browse/ARROW-341?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rok Mihevc updated ARROW-341: - External issue URL: https://github.com/apache/arrow/issues/15831 > [Python] Making libpyarrow available to

[jira] [Commented] (ARROW-409) Python: Change pyarrow.Table.dataframe_from_batches API to create Table instead

2023-01-10 Thread Rok Mihevc (Jira)
[ https://issues.apache.org/jira/browse/ARROW-409?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17657443#comment-17657443 ] Rok Mihevc commented on ARROW-409: -- This issue has been migrated to [issue #16058|https:

[jira] [Commented] (ARROW-338) [C++] Refactor IPC vector "loading" and "unloading" to be based on cleaner visitor pattern

2023-01-10 Thread Rok Mihevc (Jira)
[ https://issues.apache.org/jira/browse/ARROW-338?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17657372#comment-17657372 ] Rok Mihevc commented on ARROW-338: -- This issue has been migrated to [issue #15327|https:

[jira] [Commented] (ARROW-407) BitVector.copyFromSafe() should re-allocate if necessary instead of returning false

2023-01-10 Thread Rok Mihevc (Jira)
[ https://issues.apache.org/jira/browse/ARROW-407?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17657441#comment-17657441 ] Rok Mihevc commented on ARROW-407: -- This issue has been migrated to [issue #16056|https:

[jira] [Updated] (ARROW-333) Make writers update their internal schema even when no data is written.

2023-01-10 Thread Rok Mihevc (Jira)
[ https://issues.apache.org/jira/browse/ARROW-333?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rok Mihevc updated ARROW-333: - External issue URL: https://github.com/apache/arrow/issues/15805 > Make writers update their internal sche

[jira] [Commented] (ARROW-406) [C++] Large HDFS reads must utilize the set file buffer size when making RPCs

2023-01-10 Thread Rok Mihevc (Jira)
[ https://issues.apache.org/jira/browse/ARROW-406?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17657440#comment-17657440 ] Rok Mihevc commented on ARROW-406: -- This issue has been migrated to [issue #16055|https:

[jira] [Commented] (ARROW-343) [C++] Symbol visibility for PrimitiveBuilder subclasses is not uniform across platforms

2023-01-10 Thread Rok Mihevc (Jira)
[ https://issues.apache.org/jira/browse/ARROW-343?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17657377#comment-17657377 ] Rok Mihevc commented on ARROW-343: -- This issue has been migrated to [issue #15837|https:

[jira] [Commented] (ARROW-413) DATE type is not specified clearly

2023-01-10 Thread Rok Mihevc (Jira)
[ https://issues.apache.org/jira/browse/ARROW-413?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17657447#comment-17657447 ] Rok Mihevc commented on ARROW-413: -- This issue has been migrated to [issue #16062|https:

[jira] [Commented] (ARROW-334) [Python] OS X rpath issues on some configurations

2023-01-10 Thread Rok Mihevc (Jira)
[ https://issues.apache.org/jira/browse/ARROW-334?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17657368#comment-17657368 ] Rok Mihevc commented on ARROW-334: -- This issue has been migrated to [issue #15807|https:

[jira] [Updated] (ARROW-406) [C++] Large HDFS reads must utilize the set file buffer size when making RPCs

2023-01-10 Thread Rok Mihevc (Jira)
[ https://issues.apache.org/jira/browse/ARROW-406?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rok Mihevc updated ARROW-406: - External issue URL: https://github.com/apache/arrow/issues/16055 > [C++] Large HDFS reads must utilize the

[jira] [Updated] (ARROW-345) libhdfs integration doesn't work for Mac

2023-01-10 Thread Rok Mihevc (Jira)
[ https://issues.apache.org/jira/browse/ARROW-345?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rok Mihevc updated ARROW-345: - External issue URL: https://github.com/apache/arrow/issues/15840 > libhdfs integration doesn't work for Ma

[jira] [Updated] (ARROW-408) [C++/Python] Remove defunct conda recipes

2023-01-10 Thread Rok Mihevc (Jira)
[ https://issues.apache.org/jira/browse/ARROW-408?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rok Mihevc updated ARROW-408: - External issue URL: https://github.com/apache/arrow/issues/16057 > [C++/Python] Remove defunct conda recip

[jira] [Updated] (ARROW-344) Instructions for building with conda

2023-01-10 Thread Rok Mihevc (Jira)
[ https://issues.apache.org/jira/browse/ARROW-344?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rok Mihevc updated ARROW-344: - External issue URL: https://github.com/apache/arrow/issues/15839 > Instructions for building with conda >

[jira] [Commented] (ARROW-341) [Python] Making libpyarrow available to third parties

2023-01-10 Thread Rok Mihevc (Jira)
[ https://issues.apache.org/jira/browse/ARROW-341?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17657375#comment-17657375 ] Rok Mihevc commented on ARROW-341: -- This issue has been migrated to [issue #15831|https:

[jira] [Commented] (ARROW-342) Set Python version on release

2023-01-10 Thread Rok Mihevc (Jira)
[ https://issues.apache.org/jira/browse/ARROW-342?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17657376#comment-17657376 ] Rok Mihevc commented on ARROW-342: -- This issue has been migrated to [issue #15832|https:

[jira] [Commented] (ARROW-478) [Python] Accept a PyBytes object in the pyarrow.io.BufferReader ctor

2023-01-10 Thread Rok Mihevc (Jira)
[ https://issues.apache.org/jira/browse/ARROW-478?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17657512#comment-17657512 ] Rok Mihevc commented on ARROW-478: -- This issue has been migrated to [issue #16122|https:

[jira] [Commented] (ARROW-325) make TestArrowFile not dependent on timezone

2023-01-10 Thread Rok Mihevc (Jira)
[ https://issues.apache.org/jira/browse/ARROW-325?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17657359#comment-17657359 ] Rok Mihevc commented on ARROW-325: -- This issue has been migrated to [issue #15787|https:

[jira] [Commented] (ARROW-412) [Format] Handling of buffer padding in the IPC metadata

2023-01-10 Thread Rok Mihevc (Jira)
[ https://issues.apache.org/jira/browse/ARROW-412?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17657446#comment-17657446 ] Rok Mihevc commented on ARROW-412: -- This issue has been migrated to [issue #16061|https:

[jira] [Commented] (ARROW-361) Python: Support reading a column-selection from Parquet files

2023-01-10 Thread Rok Mihevc (Jira)
[ https://issues.apache.org/jira/browse/ARROW-361?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17657395#comment-17657395 ] Rok Mihevc commented on ARROW-361: -- This issue has been migrated to [issue #15903|https:

[jira] [Updated] (ARROW-411) [Java] Move Intergration.compare and Intergration.compareSchemas to a public utils class

2023-01-10 Thread Rok Mihevc (Jira)
[ https://issues.apache.org/jira/browse/ARROW-411?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rok Mihevc updated ARROW-411: - External issue URL: https://github.com/apache/arrow/issues/16060 > [Java] Move Intergration.compare and In

[jira] [Updated] (ARROW-354) Connot compare an array of empty strings to another

2023-01-10 Thread Rok Mihevc (Jira)
[ https://issues.apache.org/jira/browse/ARROW-354?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rok Mihevc updated ARROW-354: - External issue URL: https://github.com/apache/arrow/issues/15872 > Connot compare an array of empty string

[jira] [Updated] (ARROW-349) Six is missing as a requirement in the python setup.py

2023-01-10 Thread Rok Mihevc (Jira)
[ https://issues.apache.org/jira/browse/ARROW-349?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rok Mihevc updated ARROW-349: - External issue URL: https://github.com/apache/arrow/issues/15848 > Six is missing as a requirement in the

[jira] [Updated] (ARROW-329) [Java] ComplexWriterImpl.setValueCount() doesn't set the value count of its "root" container

2023-01-10 Thread Rok Mihevc (Jira)
[ https://issues.apache.org/jira/browse/ARROW-329?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rok Mihevc updated ARROW-329: - External issue URL: https://github.com/apache/arrow/issues/15793 > [Java] ComplexWriterImpl.setValueCount(

[jira] [Commented] (ARROW-368) Document use of LD_LIBRARY_PATH when using Python

2023-01-10 Thread Rok Mihevc (Jira)
[ https://issues.apache.org/jira/browse/ARROW-368?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17657402#comment-17657402 ] Rok Mihevc commented on ARROW-368: -- This issue has been migrated to [issue #15928|https:

[jira] [Updated] (ARROW-362) Python: Calling to_pandas on a table read from Parquet leaks memory

2023-01-10 Thread Rok Mihevc (Jira)
[ https://issues.apache.org/jira/browse/ARROW-362?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rok Mihevc updated ARROW-362: - External issue URL: https://github.com/apache/arrow/issues/15907 > Python: Calling to_pandas on a table re

[jira] [Commented] (ARROW-358) [C++] libhdfs can be in non-standard locations in some Hadoop distributions

2023-01-10 Thread Rok Mihevc (Jira)
[ https://issues.apache.org/jira/browse/ARROW-358?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17657392#comment-17657392 ] Rok Mihevc commented on ARROW-358: -- This issue has been migrated to [issue #15899|https:

[jira] [Updated] (ARROW-416) C++: Add Equals implementation to compare Columns

2023-01-10 Thread Rok Mihevc (Jira)
[ https://issues.apache.org/jira/browse/ARROW-416?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rok Mihevc updated ARROW-416: - External issue URL: https://github.com/apache/arrow/issues/16065 > C++: Add Equals implementation to compa

[jira] [Updated] (ARROW-368) Document use of LD_LIBRARY_PATH when using Python

2023-01-10 Thread Rok Mihevc (Jira)
[ https://issues.apache.org/jira/browse/ARROW-368?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rok Mihevc updated ARROW-368: - External issue URL: https://github.com/apache/arrow/issues/15928 > Document use of LD_LIBRARY_PATH when us

[jira] [Commented] (ARROW-405) [C++] Be less stringent about finding include/hdfs.h in HADOOP_HOME

2023-01-10 Thread Rok Mihevc (Jira)
[ https://issues.apache.org/jira/browse/ARROW-405?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17657439#comment-17657439 ] Rok Mihevc commented on ARROW-405: -- This issue has been migrated to [issue #16054|https:

[jira] [Updated] (ARROW-372) Create JSON arrow file format for integration tests

2023-01-10 Thread Rok Mihevc (Jira)
[ https://issues.apache.org/jira/browse/ARROW-372?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rok Mihevc updated ARROW-372: - External issue URL: https://github.com/apache/arrow/issues/15938 > Create JSON arrow file format for integ

[jira] [Commented] (ARROW-376) Python: Convert non-range Pandas indices (optionally) to Arrow

2023-01-10 Thread Rok Mihevc (Jira)
[ https://issues.apache.org/jira/browse/ARROW-376?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17657410#comment-17657410 ] Rok Mihevc commented on ARROW-376: -- This issue has been migrated to [issue #15949|https:

[jira] [Updated] (ARROW-322) [C++] Do not build HDFS IO interface optionally

2023-01-10 Thread Rok Mihevc (Jira)
[ https://issues.apache.org/jira/browse/ARROW-322?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rok Mihevc updated ARROW-322: - External issue URL: https://github.com/apache/arrow/issues/15778 > [C++] Do not build HDFS IO interface op

[jira] [Updated] (ARROW-413) DATE type is not specified clearly

2023-01-10 Thread Rok Mihevc (Jira)
[ https://issues.apache.org/jira/browse/ARROW-413?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rok Mihevc updated ARROW-413: - External issue URL: https://github.com/apache/arrow/issues/16062 > DATE type is not specified clearly > --

[jira] [Commented] (ARROW-379) Python: Use setuptools_scm/setuptools_scm_git_archive to provide the version number

2023-01-10 Thread Rok Mihevc (Jira)
[ https://issues.apache.org/jira/browse/ARROW-379?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17657413#comment-17657413 ] Rok Mihevc commented on ARROW-379: -- This issue has been migrated to [issue #15961|https:

[jira] [Updated] (ARROW-377) Python: Add support for conversion of Pandas.Categorical

2023-01-10 Thread Rok Mihevc (Jira)
[ https://issues.apache.org/jira/browse/ARROW-377?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rok Mihevc updated ARROW-377: - External issue URL: https://github.com/apache/arrow/issues/15950 > Python: Add support for conversion of P

[jira] [Updated] (ARROW-381) [C++] Simplify primitive array type builders to use a default type singleton

2023-01-10 Thread Rok Mihevc (Jira)
[ https://issues.apache.org/jira/browse/ARROW-381?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rok Mihevc updated ARROW-381: - External issue URL: https://github.com/apache/arrow/issues/15970 > [C++] Simplify primitive array type bui

[jira] [Commented] (ARROW-348) [Python] CMake build type should be configurable on the command line

2023-01-10 Thread Rok Mihevc (Jira)
[ https://issues.apache.org/jira/browse/ARROW-348?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17657382#comment-17657382 ] Rok Mihevc commented on ARROW-348: -- This issue has been migrated to [issue #15328|https:

<    2   3   4   5   6   7   8   9   10   11   >