[jira] [Updated] (ARROW-12688) [R] Use DuckDB to query an Arrow Dataset

2022-07-12 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-12688?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-12688: Component/s: (was: C++) > [R] Use DuckDB to query an Arrow Dataset > -

[jira] [Resolved] (ARROW-16776) [R] dplyr::glimpse method for arrow table and datasets

2022-07-12 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-16776?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson resolved ARROW-16776. - Resolution: Fixed Issue resolved by pull request 13563 [https://github.com/apache/arrow/

[jira] [Updated] (ARROW-17062) [C#] write_feather() in R doesn't interop with ArrowFileReader.ReadNextRecordBatch()

2022-07-13 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17062?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-17062: Summary: [C#] write_feather() in R doesn't interop with ArrowFileReader.ReadNextRecordBatc

[jira] [Updated] (ARROW-17062) [C#] Support compression in IPC format

2022-07-13 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17062?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-17062: Summary: [C#] Support compression in IPC format (was: [C#] write_feather() in R doesn't i

[jira] [Commented] (ARROW-17062) [C#] Support compression in IPC format

2022-07-13 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17062?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17566301#comment-17566301 ] Neal Richardson commented on ARROW-17062: - It looks like the C# implementation d

[jira] [Updated] (ARROW-15938) [R][C++] Segfault in left join with empty right table when filtered on partition

2022-07-13 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-15938?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-15938: Component/s: (was: Compute IR) > [R][C++] Segfault in left join with empty right table

[jira] [Commented] (ARROW-15938) [R][C++] Segfault in left join with empty right table when filtered on partition

2022-07-13 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-15938?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17566322#comment-17566322 ] Neal Richardson commented on ARROW-15938: - Confirmed that this is still an issue

[jira] [Updated] (ARROW-15938) [R][C++] Segfault in left join with empty right table when filtered on partition

2022-07-13 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-15938?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-15938: Labels: query-engine (was: ) > [R][C++] Segfault in left join with empty right table when

[jira] [Updated] (ARROW-15938) [R][C++] Segfault in left join with empty right table when filtered on partition

2022-07-13 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-15938?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-15938: Fix Version/s: 9.0.0 > [R][C++] Segfault in left join with empty right table when filtered

[jira] [Commented] (ARROW-16575) [R] arrow::write_dataset() does nothing with 0 row dataframes in R

2022-07-13 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-16575?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17566324#comment-17566324 ] Neal Richardson commented on ARROW-16575: - This matches my expectations. write_d

[jira] [Commented] (ARROW-16863) [R] open_dataset() silently drops the missing values from a csv file

2022-07-13 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-16863?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17566426#comment-17566426 ] Neal Richardson commented on ARROW-16863: - I think this is only an issue because

[jira] [Closed] (ARROW-16863) [R] open_dataset() silently drops the missing values from a csv file

2022-07-13 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-16863?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson closed ARROW-16863. --- Assignee: Neal Richardson Resolution: Not A Problem > [R] open_dataset() silently drop

[jira] [Commented] (ARROW-17072) [R] Rename *_feather functions

2022-07-14 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17072?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17566862#comment-17566862 ] Neal Richardson commented on ARROW-17072: - Perhaps related: ARROW-8324 > [R] Re

[jira] [Updated] (ARROW-11749) [C++][Dataset] Support projections between children of UnionDatasets

2022-07-14 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11749?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-11749: Fix Version/s: (was: 9.0.0) > [C++][Dataset] Support projections between children of U

[jira] [Resolved] (ARROW-16977) [R] Update dataset row counting so no integer overflow on large datasets

2022-07-14 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-16977?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson resolved ARROW-16977. - Fix Version/s: 9.0.0 Resolution: Fixed Issue resolved by pull request 13514 [http

[jira] [Commented] (ARROW-17085) [R] group_vars() returns NULL

2022-07-15 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17085?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17567229#comment-17567229 ] Neal Richardson commented on ARROW-17085: - https://github.com/apache/arrow/blob/

[jira] [Commented] (ARROW-17085) [R] group_vars() returns NULL

2022-07-15 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17085?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17567232#comment-17567232 ] Neal Richardson commented on ARROW-17085: - You may also want to change groups()

[jira] [Assigned] (ARROW-14330) [C++] Create DataHolder that can be used for caching during exec plans

2022-07-15 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14330?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson reassigned ARROW-14330: --- Assignee: (was: Alexander Ocsa) > [C++] Create DataHolder that can be used for

[jira] [Updated] (ARROW-14445) [C++] Implement memory management for DataHolder

2022-07-15 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14445?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-14445: Fix Version/s: (was: 9.0.0) > [C++] Implement memory management for DataHolder > -

[jira] [Updated] (ARROW-14330) [C++] Create DataHolder that can be used for caching during exec plans

2022-07-15 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14330?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-14330: Fix Version/s: (was: 9.0.0) > [C++] Create DataHolder that can be used for caching dur

[jira] [Assigned] (ARROW-14445) [C++] Implement memory management for DataHolder

2022-07-15 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14445?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson reassigned ARROW-14445: --- Assignee: (was: Alexander Ocsa) > [C++] Implement memory management for DataHol

[jira] [Updated] (ARROW-14445) [C++] Implement memory management for DataHolder

2022-07-15 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14445?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-14445: Labels: query-engine (was: pull-request-available query-engine) > [C++] Implement memory

[jira] [Assigned] (ARROW-14134) [C++][Compute] Standardize generator dispatchers

2022-07-15 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14134?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson reassigned ARROW-14134: --- Fix Version/s: (was: 9.0.0) Assignee: (was: Pradeep Garigipati)

[jira] [Assigned] (ARROW-15277) [Python] Use Make to create ChunkedArray and remove checks

2022-07-15 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-15277?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson reassigned ARROW-15277: --- Fix Version/s: (was: 9.0.0) Assignee: (was: Eduardo Ponce)

[jira] [Assigned] (ARROW-14443) [C++] Implement Plan Fragments support for ExecPlan.

2022-07-15 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14443?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson reassigned ARROW-14443: --- Fix Version/s: (was: 9.0.0) Assignee: (was: Percy Camilo Triveño Au

[jira] [Assigned] (ARROW-12723) [C++][Compute] GroupBy: add unittests for individual components of hash group by

2022-07-15 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-12723?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson reassigned ARROW-12723: --- Fix Version/s: (was: 9.0.0) Assignee: (was: Michal Nowakiewicz)

[jira] [Assigned] (ARROW-14332) [C++] Rename type traits utilities to improve semantic consistency

2022-07-15 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14332?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson reassigned ARROW-14332: --- Fix Version/s: (was: 9.0.0) Assignee: (was: Eduardo Ponce)

[jira] [Assigned] (ARROW-16337) [Python] Expose parameter that determines to store Arrow schema in Parquet metadata in Python

2022-07-15 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-16337?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson reassigned ARROW-16337: --- Fix Version/s: (was: 9.0.0) Assignee: (was: Joris Van den Bossche)

[jira] [Assigned] (ARROW-14725) [C++][Compute] Extract Expression simplification passes to an extensible registry

2022-07-15 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14725?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson reassigned ARROW-14725: --- Fix Version/s: (was: 9.0.0) Assignee: (was: Ben Kietzman)

[jira] [Assigned] (ARROW-15329) [Python] Add character limit to ChunkedArray repr

2022-07-15 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-15329?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson reassigned ARROW-15329: --- Fix Version/s: (was: 9.0.0) Assignee: (was: Will Jones)

[jira] [Assigned] (ARROW-12535) [C++] Enable metadata writing in the ORCWriter

2022-07-15 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-12535?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson reassigned ARROW-12535: --- Fix Version/s: (was: 9.0.0) Assignee: (was: Ian Alexander Joiner)

[jira] [Assigned] (ARROW-16178) [C++] Add a ThreadLocalState concept built on thread local

2022-07-15 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-16178?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson reassigned ARROW-16178: --- Assignee: (was: Weston Pace) Labels: pull-request-available stop (was: pu

[jira] [Assigned] (ARROW-14477) [C++] Timezone-aware kernels should also handle offset strings

2022-07-15 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14477?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson reassigned ARROW-14477: --- Assignee: (was: Rok Mihevc) Labels: kernel pull-request-available stop (w

[jira] [Updated] (ARROW-14895) [C++] Vcpkg install error for abseil on windows when building Arrow C++

2022-07-15 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14895?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-14895: Labels: pull-request-available stop (was: pull-request-available) This issue has been ina

[jira] [Assigned] (ARROW-15612) [C++] Migrate Flight APIs to Result<>

2022-07-15 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-15612?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson reassigned ARROW-15612: --- Fix Version/s: (was: 9.0.0) Assignee: (was: Tobias Zagorni)

[jira] [Assigned] (ARROW-11197) [C++] Add support for the dictionary type in the C++ ORC writer

2022-07-15 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11197?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson reassigned ARROW-11197: --- Fix Version/s: (was: 9.0.0) Assignee: (was: Ian Alexander Joiner)

[jira] [Assigned] (ARROW-13593) [C++][Dataset][Parquet] Support parquet modular encryption in the new Dataset API

2022-07-15 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13593?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson reassigned ARROW-13593: --- Fix Version/s: (was: 9.0.0) Assignee: (was: Maya Anderson)

[jira] [Updated] (ARROW-13745) [CI][C++] conda python turbodbc nightly job failed

2022-07-15 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13745?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-13745: Labels: stop (was: ) This issue has been inactive for 3 months, so it has been unassigned

[jira] [Assigned] (ARROW-11296) [C++][Python] Add ReaderOptions for ORC

2022-07-15 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11296?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson reassigned ARROW-11296: --- Fix Version/s: (was: 9.0.0) Assignee: (was: Ian Alexander Joiner)

[jira] [Assigned] (ARROW-14923) [Java] AllocationListener should be called during ownership transferring

2022-07-15 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14923?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson reassigned ARROW-14923: --- Fix Version/s: (was: 9.0.0) Assignee: (was: Hongze Zhang)

[jira] [Assigned] (ARROW-14906) [C++] Enable CSV Writer to control the type of escape used for quoting

2022-07-15 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14906?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson reassigned ARROW-14906: --- Assignee: (was: Ákos Hadnagy) Labels: good-first-issue pull-request-availa

[jira] [Updated] (ARROW-16379) [C++][Python] Change Memory Mapping to be off by default

2022-07-15 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-16379?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-16379: Fix Version/s: (was: 9.0.0) > [C++][Python] Change Memory Mapping to be off by default

[jira] [Updated] (ARROW-11402) [C++][Dataset] Allow more aggresive implicit casts for literals

2022-07-15 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11402?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-11402: Fix Version/s: (was: 9.0.0) > [C++][Dataset] Allow more aggresive implicit casts for l

[jira] [Updated] (ARROW-2659) [Python] More graceful reading of empty String columns in ParquetDataset

2022-07-15 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-2659?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-2659: --- Fix Version/s: (was: 9.0.0) > [Python] More graceful reading of empty String columns in P

[jira] [Updated] (ARROW-13706) [C++][Compute] Add Find method to Grouper

2022-07-15 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13706?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-13706: Fix Version/s: (was: 9.0.0) > [C++][Compute] Add Find method to Grouper >

[jira] [Updated] (ARROW-12175) [C++] CMake's find_package(Parquet) does not find Parquet with Arrow 3.0.0

2022-07-15 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-12175?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-12175: Fix Version/s: (was: 9.0.0) > [C++] CMake's find_package(Parquet) does not find Parque

[jira] [Updated] (ARROW-9171) [C++] Comments in FindArrow.cmake misleading

2022-07-15 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9171?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-9171: --- Fix Version/s: (was: 9.0.0) > [C++] Comments in FindArrow.cmake misleading >

[jira] [Updated] (ARROW-11118) [C++] Add union support in ORC reader & writer

2022-07-15 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-8: Fix Version/s: (was: 9.0.0) > [C++] Add union support in ORC reader & writer > ---

[jira] [Updated] (ARROW-12632) [C++][Dataset][Compute] Add support for dictionary_encode to Expression

2022-07-15 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-12632?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-12632: Fix Version/s: (was: 9.0.0) > [C++][Dataset][Compute] Add support for dictionary_encod

[jira] [Updated] (ARROW-10142) [C++] RecordBatchStreamReader should use StreamDecoder

2022-07-15 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10142?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-10142: Fix Version/s: (was: 9.0.0) > [C++] RecordBatchStreamReader should use StreamDecoder >

[jira] [Updated] (ARROW-11465) [C++] Parquet file writer snapshot API and proper ColumnChunk.file_path utilization

2022-07-15 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11465?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-11465: Fix Version/s: (was: 9.0.0) > [C++] Parquet file writer snapshot API and proper Column

[jira] [Updated] (ARROW-11762) [C++][Dataset] Refactor Partitioning to explicitly treat null and absent fields identically

2022-07-15 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11762?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-11762: Fix Version/s: (was: 9.0.0) > [C++][Dataset] Refactor Partitioning to explicitly treat

[jira] [Updated] (ARROW-12358) [C++][Python][R][Dataset] Control overwriting vs appending when writing to existing dataset

2022-07-15 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-12358?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-12358: Fix Version/s: (was: 9.0.0) > [C++][Python][R][Dataset] Control overwriting vs appendi

[jira] [Updated] (ARROW-13773) [C++] Provide a cross platform helper for definition of library init code

2022-07-15 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13773?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-13773: Fix Version/s: (was: 9.0.0) > [C++] Provide a cross platform helper for definition of

[jira] [Updated] (ARROW-12341) [C++] Get rid of Result>

2022-07-15 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-12341?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-12341: Fix Version/s: (was: 9.0.0) > [C++] Get rid of Result> > -

[jira] [Updated] (ARROW-14233) [C++] Improve ExecPlan::ToString

2022-07-15 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14233?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-14233: Fix Version/s: (was: 9.0.0) > [C++] Improve ExecPlan::ToString > -

[jira] [Updated] (ARROW-11647) [C++][Compute] CastFromNull does not use preallocated buffers

2022-07-15 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11647?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-11647: Fix Version/s: (was: 9.0.0) > [C++][Compute] CastFromNull does not use preallocated bu

[jira] [Updated] (ARROW-11378) [C++][Dataset] Writing partitions with timestamp type give mis-formatted (integers) directory names

2022-07-15 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11378?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-11378: Fix Version/s: (was: 9.0.0) > [C++][Dataset] Writing partitions with timestamp type gi

[jira] [Updated] (ARROW-15938) [R][C++] Segfault in left join with empty right table when filtered on partition

2022-07-15 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-15938?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-15938: Priority: Critical (was: Major) > [R][C++] Segfault in left join with empty right table w

[jira] [Updated] (ARROW-17040) [Go] Add a new StructBuilder constructor to support some specific use cases

2022-07-15 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17040?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-17040: Fix Version/s: (was: 9.0.0) > [Go] Add a new StructBuilder constructor to support some

[jira] [Updated] (ARROW-17086) [C++] Install java/dataset include file and fix debug build failed by compiler flag

2022-07-15 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17086?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-17086: Fix Version/s: (was: 9.0.0) > [C++] Install java/dataset include file and fix debug bu

[jira] [Updated] (ARROW-16728) [Python] Switch default and deprecate use_legacy_dataset=True in ParquetDataset

2022-07-15 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-16728?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-16728: Fix Version/s: (was: 9.0.0) > [Python] Switch default and deprecate use_legacy_dataset

[jira] [Updated] (ARROW-16432) [Docs] Update verify RC instructions - JDK8

2022-07-15 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-16432?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-16432: Fix Version/s: (was: 9.0.0) > [Docs] Update verify RC instructions - JDK8 > --

[jira] [Updated] (ARROW-16661) [Docs] Move verify Release candidate documentation to development guide

2022-07-15 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-16661?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-16661: Fix Version/s: (was: 9.0.0) > [Docs] Move verify Release candidate documentation to de

[jira] [Updated] (ARROW-12755) [C++][Compute] Add quotient and modulo kernels

2022-07-15 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-12755?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-12755: Fix Version/s: (was: 9.0.0) > [C++][Compute] Add quotient and modulo kernels > ---

[jira] [Updated] (ARROW-16754) [Java] StructVector's child vectors get unexpectedly reordered after adding vectors with duplicated fields

2022-07-15 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-16754?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-16754: Fix Version/s: (was: 9.0.0) > [Java] StructVector's child vectors get unexpectedly reo

[jira] [Updated] (ARROW-8221) [Python][Dataset] Expose schema inference / validation options in the factory

2022-07-15 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8221?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-8221: --- Fix Version/s: (was: 9.0.0) > [Python][Dataset] Expose schema inference / validation opti

[jira] [Updated] (ARROW-11776) [Java][Dataset] Support writing to files within dataset scanner via JNI

2022-07-15 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11776?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-11776: Fix Version/s: (was: 9.0.0) > [Java][Dataset] Support writing to files within dataset

[jira] [Updated] (ARROW-16409) [C++][Python][R] Deprecate "scanner" (but keep "scan node") from public API

2022-07-15 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-16409?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-16409: Fix Version/s: (was: 9.0.0) > [C++][Python][R] Deprecate "scanner" (but keep "scan nod

[jira] [Updated] (ARROW-14034) [Java] Unexpected Allocator states created after allocating buffer whose AllocationManager has different size from the requested size

2022-07-15 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14034?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-14034: Fix Version/s: (was: 9.0.0) > [Java] Unexpected Allocator states created after allocat

[jira] [Updated] (ARROW-11502) [C++] Optimize Arrow ByteStreamSplitDecode with Neon

2022-07-15 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11502?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-11502: Fix Version/s: (was: 9.0.0) > [C++] Optimize Arrow ByteStreamSplitDecode with Neon > -

[jira] [Updated] (ARROW-16707) [C++] Implement Rank kernel on chunked arrays

2022-07-15 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-16707?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-16707: Fix Version/s: (was: 9.0.0) > [C++] Implement Rank kernel on chunked arrays >

[jira] [Updated] (ARROW-14656) [Python] Add sort_by helper method to StructArray

2022-07-15 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14656?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-14656: Fix Version/s: (was: 9.0.0) > [Python] Add sort_by helper method to StructArray >

[jira] [Updated] (ARROW-15251) [C++] Temporal floor/ceil/round handle ambiguous/nonexistent local time

2022-07-15 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-15251?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-15251: Fix Version/s: (was: 9.0.0) > [C++] Temporal floor/ceil/round handle ambiguous/nonexis

[jira] [Updated] (ARROW-11441) [R] Read CSV from character vector

2022-07-15 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11441?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-11441: Fix Version/s: (was: 9.0.0) > [R] Read CSV from character vector > ---

[jira] [Updated] (ARROW-16673) [Java] C data interface: Allow ownership transferring for imported buffer

2022-07-15 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-16673?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-16673: Fix Version/s: (was: 9.0.0) > [Java] C data interface: Allow ownership transferring fo

[jira] [Updated] (ARROW-16430) [Python] Read/Write record batch custom metadata API in pyarrow

2022-07-15 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-16430?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-16430: Fix Version/s: (was: 9.0.0) > [Python] Read/Write record batch custom metadata API in

[jira] [Updated] (ARROW-16674) [Java] C data interface: Reading as nioBuffer from imported buffer causes error

2022-07-15 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-16674?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-16674: Fix Version/s: (was: 9.0.0) > [Java] C data interface: Reading as nioBuffer from impor

[jira] [Updated] (ARROW-12084) [C++][Compute] Add remainder and quotient compute::Function

2022-07-15 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-12084?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-12084: Fix Version/s: (was: 9.0.0) > [C++][Compute] Add remainder and quotient compute::Funct

[jira] [Updated] (ARROW-16865) [C++] Implement cumulative product, max, and min compute functions

2022-07-15 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-16865?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-16865: Fix Version/s: (was: 9.0.0) > [C++] Implement cumulative product, max, and min compute

[jira] [Updated] (ARROW-16852) [C++] Migrate SCALAR_AGGREGATE, HASH_AGGREGATE functions to use ExecSpan

2022-07-15 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-16852?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-16852: Fix Version/s: (was: 9.0.0) > [C++] Migrate SCALAR_AGGREGATE, HASH_AGGREGATE functions

[jira] [Updated] (ARROW-9285) [C++] Detect unauthorized memory allocations in function kernels

2022-07-15 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9285?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-9285: --- Fix Version/s: (was: 9.0.0) > [C++] Detect unauthorized memory allocations in function ke

[jira] [Resolved] (ARROW-17085) [R] group_vars() should not return NULL

2022-07-15 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17085?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson resolved ARROW-17085. - Fix Version/s: 9.0.0 Resolution: Fixed Issue resolved by pull request 13621 [http

[jira] [Updated] (ARROW-16106) [R] Support for filename-based partitioning

2022-07-15 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-16106?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-16106: Fix Version/s: 10.0.0 > [R] Support for filename-based partitioning >

[jira] [Resolved] (ARROW-14575) [R] Allow functions with {{pkg::}} prefixes

2022-07-15 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14575?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson resolved ARROW-14575. - Resolution: Fixed Issue resolved by pull request 13160 [https://github.com/apache/arrow/

[jira] [Assigned] (ARROW-16612) [R] parquet files with compression extensions should use parquet writer for compression

2022-07-15 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-16612?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson reassigned ARROW-16612: --- Assignee: Neal Richardson > [R] parquet files with compression extensions should us

[jira] [Commented] (ARROW-17110) [C++] Move away from C++11

2022-07-18 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17110?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17568100#comment-17568100 ] Neal Richardson commented on ARROW-17110: - Slight correction: GCC 4.8 is not an

[jira] [Comment Edited] (ARROW-17110) [C++] Move away from C++11

2022-07-18 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17110?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17568100#comment-17568100 ] Neal Richardson edited comment on ARROW-17110 at 7/18/22 4:41 PM:

[jira] [Updated] (ARROW-12693) [R] add unique() methods for ArrowTabluar, datasets

2022-07-18 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-12693?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-12693: Summary: [R] add unique() methods for ArrowTabluar, datasets (was: [R] Usage of compute f

[jira] [Commented] (ARROW-12693) [R] add unique() methods for ArrowTabluar, datasets

2022-07-18 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-12693?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17568134#comment-17568134 ] Neal Richardson commented on ARROW-12693: - I think we can use the same function

[jira] [Commented] (ARROW-12693) [R] add unique() methods for ArrowTabluar, datasets

2022-07-18 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-12693?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17568169#comment-17568169 ] Neal Richardson commented on ARROW-12693: - > Because this isn't a dplyr function

[jira] [Resolved] (ARROW-17102) [R] Test fails on R minimal nightly builds due to Parquet writing

2022-07-18 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17102?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson resolved ARROW-17102. - Fix Version/s: 9.0.0 Resolution: Fixed Issue resolved by pull request 13631 [http

[jira] [Resolved] (ARROW-8324) [R] Add read/write_ipc_file separate from _feather

2022-07-18 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8324?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson resolved ARROW-8324. Fix Version/s: 9.0.0 Resolution: Fixed Issue resolved by pull request 13626 [https:/

[jira] [Updated] (ARROW-17120) [C++][R] copy_files() does not take paths to specific files

2022-07-19 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17120?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-17120: Summary: [C++][R] copy_files() does not take paths to specific files (was: copy_files() d

[jira] [Closed] (ARROW-12213) [R] copy_files doesn't make it easy to copy a single file

2022-07-19 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-12213?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson closed ARROW-12213. --- Resolution: Duplicate > [R] copy_files doesn't make it easy to copy a single file >

[jira] [Updated] (ARROW-17120) copy_files() does not take paths to specific files

2022-07-19 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17120?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-17120: Component/s: C++ > copy_files() does not take paths to specific files > --

[jira] [Updated] (ARROW-16612) [R] Support inferring compression from filename for all readers/writers

2022-07-20 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-16612?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-16612: Summary: [R] Support inferring compression from filename for all readers/writers (was: [R

[jira] [Commented] (ARROW-17132) [R] Mutate in compare_dplyr_binding returns wrong type

2022-07-20 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17132?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17569011#comment-17569011 ] Neal Richardson commented on ARROW-17132: - The assertion is failing on "time", y

[jira] [Commented] (ARROW-17143) [R] Add examples working with `tidyr::unnest`and `tidyr::unnest_longer`

2022-07-20 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17143?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17569015#comment-17569015 ] Neal Richardson commented on ARROW-17143: - These tidyr functions aren't working

[jira] [Commented] (ARROW-17132) [R] Mutate in compare_dplyr_binding returns wrong type

2022-07-20 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17132?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17569030#comment-17569030 ] Neal Richardson commented on ARROW-17132: - Right, transmute drops the input colu

  1   2   3   4   5   6   7   8   9   10   >