[jira] [Commented] (ARROW-10957) Expanding pyarrow buffer size more than 2GB for pandas_udf functions

2020-12-29 Thread Dmitry Kravchuk (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10957?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17255871#comment-17255871 ] Dmitry Kravchuk commented on ARROW-10957: - [~fan_li_ya] Where should I search Ja

[jira] [Created] (ARROW-11061) [Rust] Validate array properties against schema

2020-12-29 Thread Neville Dipale (Jira)
Neville Dipale created ARROW-11061: -- Summary: [Rust] Validate array properties against schema Key: ARROW-11061 URL: https://issues.apache.org/jira/browse/ARROW-11061 Project: Apache Arrow Is

[jira] [Commented] (ARROW-10925) [Rust] Validate temporal data that has restrictions

2020-12-29 Thread Neville Dipale (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10925?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17255893#comment-17255893 ] Neville Dipale commented on ARROW-10925: [~andygrove] [~jorgecarleitao] [~alamb]

[jira] [Created] (ARROW-11062) When writing to flight stream, Spark's mapPartitions is not working

2020-12-29 Thread Ravi Shankar (Jira)
Ravi Shankar created ARROW-11062: Summary: When writing to flight stream, Spark's mapPartitions is not working Key: ARROW-11062 URL: https://issues.apache.org/jira/browse/ARROW-11062 Project: Apache A

[jira] [Updated] (ARROW-11061) [Rust] Validate array properties against schema

2020-12-29 Thread Neville Dipale (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11061?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neville Dipale updated ARROW-11061: --- Component/s: Rust > [Rust] Validate array properties against schema > --

[jira] [Created] (ARROW-11063) [Rust] Validate null counts when building arrays

2020-12-29 Thread Neville Dipale (Jira)
Neville Dipale created ARROW-11063: -- Summary: [Rust] Validate null counts when building arrays Key: ARROW-11063 URL: https://issues.apache.org/jira/browse/ARROW-11063 Project: Apache Arrow I

[jira] [Updated] (ARROW-11063) [Rust] Validate null counts when building arrays

2020-12-29 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11063?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-11063: --- Labels: pull-request-available (was: ) > [Rust] Validate null counts when building arrays >

[jira] [Commented] (ARROW-11061) [Rust] Validate array properties against schema

2020-12-29 Thread Neville Dipale (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11061?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17255926#comment-17255926 ] Neville Dipale commented on ARROW-11061: [~andygrove] [~alamb] [~jorgecarleitao]

[jira] [Assigned] (ARROW-10095) [Rust] [Parquet] Update for IPC changes

2020-12-29 Thread Neville Dipale (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10095?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neville Dipale reassigned ARROW-10095: -- Assignee: Carol Nichols > [Rust] [Parquet] Update for IPC changes > -

[jira] [Assigned] (ARROW-9908) [Rust] Support temporal data types in JSON reader

2020-12-29 Thread Neville Dipale (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9908?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neville Dipale reassigned ARROW-9908: - Assignee: Christoph Schulze > [Rust] Support temporal data types in JSON reader > --

[jira] [Assigned] (ARROW-9934) [Rust] Shape and stride check in tensor

2020-12-29 Thread Neville Dipale (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9934?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neville Dipale reassigned ARROW-9934: - Assignee: Fernando Herrera > [Rust] Shape and stride check in tensor > -

[jira] [Commented] (ARROW-10957) Expanding pyarrow buffer size more than 2GB for pandas_udf functions

2020-12-29 Thread Liya Fan (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10957?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17255968#comment-17255968 ] Liya Fan commented on ARROW-10957: -- Sure. Please find the test case here: https://gith

[jira] [Created] (ARROW-11064) [Rust][DataFusion] Speed up hash join on smaller batches

2020-12-29 Thread Daniël Heres (Jira)
Daniël Heres created ARROW-11064: Summary: [Rust][DataFusion] Speed up hash join on smaller batches Key: ARROW-11064 URL: https://issues.apache.org/jira/browse/ARROW-11064 Project: Apache Arrow

[jira] [Updated] (ARROW-11064) [Rust][DataFusion] Speed up hash join on smaller batches

2020-12-29 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11064?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-11064: --- Labels: pull-request-available (was: ) > [Rust][DataFusion] Speed up hash join on smaller b

[jira] [Comment Edited] (ARROW-11062) When writing to flight stream, Spark's mapPartitions is not working

2020-12-29 Thread David Li (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11062?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17256009#comment-17256009 ] David Li edited comment on ARROW-11062 at 12/29/20, 2:10 PM: -

[jira] [Commented] (ARROW-11062) When writing to flight stream, Spark's mapPartitions is not working

2020-12-29 Thread David Li (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11062?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17256009#comment-17256009 ] David Li commented on ARROW-11062: -- Hi Ravi, can you provide the output of {{mvn depend

[jira] [Created] (ARROW-11065) Installation of Apache Arrow C++ failed on AIX7.2

2020-12-29 Thread Xiaobo Zhang (Jira)
Xiaobo Zhang created ARROW-11065: Summary: Installation of Apache Arrow C++ failed on AIX7.2 Key: ARROW-11065 URL: https://issues.apache.org/jira/browse/ARROW-11065 Project: Apache Arrow Issu

[jira] [Created] (ARROW-11066) [Java] Is there a bug in flight AddWritableBuffer

2020-12-29 Thread Kangping Huang (Jira)
Kangping Huang created ARROW-11066: -- Summary: [Java] Is there a bug in flight AddWritableBuffer Key: ARROW-11066 URL: https://issues.apache.org/jira/browse/ARROW-11066 Project: Apache Arrow

[jira] [Resolved] (ARROW-11064) [Rust][DataFusion] Speed up hash join on smaller batches

2020-12-29 Thread Andy Grove (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11064?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andy Grove resolved ARROW-11064. Fix Version/s: 3.0.0 Resolution: Fixed Issue resolved by pull request 9042 [https://github.

[jira] [Updated] (ARROW-11065) [C++] Installation failed on AIX7.2

2020-12-29 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11065?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-11065: Summary: [C++] Installation failed on AIX7.2 (was: Installation of Apache Arrow C++ faile

[jira] [Updated] (ARROW-11062) [Java] When writing to flight stream, Spark's mapPartitions is not working

2020-12-29 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11062?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-11062: Summary: [Java] When writing to flight stream, Spark's mapPartitions is not working (was:

[jira] [Created] (ARROW-11067) read_csv_arrow silently fails to read some strings and returns nulls

2020-12-29 Thread John Sheffield (Jira)
John Sheffield created ARROW-11067: -- Summary: read_csv_arrow silently fails to read some strings and returns nulls Key: ARROW-11067 URL: https://issues.apache.org/jira/browse/ARROW-11067 Project: Apa

[jira] [Updated] (ARROW-11067) [R] read_csv_arrow silently fails to read some strings and returns nulls

2020-12-29 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11067?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-11067: Summary: [R] read_csv_arrow silently fails to read some strings and returns nulls (was: r

[jira] [Resolved] (ARROW-10995) [Rust] [DataFusion] Improve parallelism when reading Parquet files

2020-12-29 Thread Andy Grove (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10995?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andy Grove resolved ARROW-10995. Resolution: Fixed Issue resolved by pull request 9029 [https://github.com/apache/arrow/pull/9029]

[jira] [Updated] (ARROW-11067) [R] read_csv_arrow silently fails to read some strings and returns nulls

2020-12-29 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11067?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-11067: Fix Version/s: 3.0.0 > [R] read_csv_arrow silently fails to read some strings and returns

[jira] [Commented] (ARROW-10578) [C++] Comparison kernels crashing for string array with null string scalar

2020-12-29 Thread Kirill Lykov (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10578?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17256068#comment-17256068 ] Kirill Lykov commented on ARROW-10578: -- Problem is still reproducible. It happens o

[jira] [Updated] (ARROW-11058) [Rust] [DataFusion] Implement "coalesce batches" operator

2020-12-29 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11058?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-11058: --- Labels: pull-request-available (was: ) > [Rust] [DataFusion] Implement "coalesce batches" o

[jira] [Commented] (ARROW-11058) [Rust] [DataFusion] Implement "coalesce batches" operator

2020-12-29 Thread Andy Grove (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11058?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17256096#comment-17256096 ] Andy Grove commented on ARROW-11058: [~jorgecarleitao] I think that the PR [https:/

[jira] [Created] (ARROW-11068) [Rust] [DataFusion] Wrap HashJoinExec in CoalesceBatchExec

2020-12-29 Thread Andy Grove (Jira)
Andy Grove created ARROW-11068: -- Summary: [Rust] [DataFusion] Wrap HashJoinExec in CoalesceBatchExec Key: ARROW-11068 URL: https://issues.apache.org/jira/browse/ARROW-11068 Project: Apache Arrow

[jira] [Commented] (ARROW-11030) [Rust] [DataFusion] Poor join performance with smaller batches

2020-12-29 Thread Andy Grove (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11030?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17256105#comment-17256105 ] Andy Grove commented on ARROW-11030: I have a theory on what might be happening here

[jira] [Updated] (ARROW-11030) [Rust] [DataFusion] MutableArrayData slow with many batches

2020-12-29 Thread Andy Grove (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11030?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andy Grove updated ARROW-11030: --- Summary: [Rust] [DataFusion] MutableArrayData slow with many batches (was: [Rust] [DataFusion] Poor

[jira] [Commented] (ARROW-11030) [Rust] [DataFusion] MutableArrayData slow with many batches

2020-12-29 Thread Daniël Heres (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11030?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17256109#comment-17256109 ] Daniël Heres commented on ARROW-11030: -- One comment I put in a PR, which is I think

[jira] [Commented] (ARROW-11030) [Rust] [DataFusion] MutableArrayData slow with many batches

2020-12-29 Thread Daniël Heres (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11030?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17256115#comment-17256115 ] Daniël Heres commented on ARROW-11030: -- It's not directly related to the mutablearr

[jira] [Commented] (ARROW-11058) [Rust] [DataFusion] Implement "coalesce batches" operator

2020-12-29 Thread Jorge Leitão (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11058?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17256132#comment-17256132 ] Jorge Leitão commented on ARROW-11058: -- Thank you so much for your explanation, [~a

[jira] [Commented] (ARROW-11030) [Rust] [DataFusion] MutableArrayData slow with many batches

2020-12-29 Thread Daniël Heres (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11030?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17256138#comment-17256138 ] Daniël Heres commented on ARROW-11030: -- But the same is applicable for mutablearray

[jira] [Commented] (ARROW-11030) [Rust] [DataFusion] MutableArrayData slow with many batches

2020-12-29 Thread Jorge Leitão (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11030?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17256141#comment-17256141 ] Jorge Leitão commented on ARROW-11030: -- [~andygrove], the MutableArrayData is basic

[jira] [Created] (ARROW-11069) Parquet writer incorrect data being written when data type is dictionary

2020-12-29 Thread Palash Goel (Jira)
Palash Goel created ARROW-11069: --- Summary: Parquet writer incorrect data being written when data type is dictionary Key: ARROW-11069 URL: https://issues.apache.org/jira/browse/ARROW-11069 Project: Apach

[jira] [Updated] (ARROW-11069) Parquet writer incorrect data being written when data type is dictionary

2020-12-29 Thread Palash Goel (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11069?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Palash Goel updated ARROW-11069: Description: When writing a dict column using pyarrow. This incorrect results start appearing

[jira] [Updated] (ARROW-11069) Parquet writer incorrect data being written when data type is dictionary

2020-12-29 Thread Palash Goel (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11069?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Palash Goel updated ARROW-11069: Attachment: image-2020-12-30-01-19-20-491.png image-2020-12-30-01-19-42-739.png

[jira] [Updated] (ARROW-11069) Parquet writer incorrect data being written when data type is dictionary

2020-12-29 Thread Palash Goel (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11069?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Palash Goel updated ARROW-11069: Attachment: image-2020-12-30-01-20-45-183.png > Parquet writer incorrect data being written when d

[jira] [Commented] (ARROW-11067) [R] read_csv_arrow silently fails to read some strings and returns nulls

2020-12-29 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11067?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17256149#comment-17256149 ] Neal Richardson commented on ARROW-11067: - Thanks for the detailed summary. Sinc

[jira] [Created] (ARROW-11070) [C++] [R] Implement exponentiation compute kernel

2020-12-29 Thread Jonathan Keane (Jira)
Jonathan Keane created ARROW-11070: -- Summary: [C++] [R] Implement exponentiation compute kernel Key: ARROW-11070 URL: https://issues.apache.org/jira/browse/ARROW-11070 Project: Apache Arrow

[jira] [Updated] (ARROW-11068) [Rust] [DataFusion] Wrap more operators in CoalesceBatchExec

2020-12-29 Thread Andy Grove (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11068?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andy Grove updated ARROW-11068: --- Summary: [Rust] [DataFusion] Wrap more operators in CoalesceBatchExec (was: [Rust] [DataFusion] Wra

[jira] [Updated] (ARROW-11068) [Rust] [DataFusion] Wrap more operators in CoalesceBatchExec

2020-12-29 Thread Andy Grove (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11068?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andy Grove updated ARROW-11068: --- Description: Once [https://github.com/apache/arrow/pull/9043] is merged, we should extend this to w

[jira] [Commented] (ARROW-11030) [Rust] [DataFusion] MutableArrayData slow with many batches

2020-12-29 Thread Andy Grove (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11030?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17256172#comment-17256172 ] Andy Grove commented on ARROW-11030: Thanks [~jorgecarleitao] and [~Dandandan] for

[jira] [Assigned] (ARROW-11030) [Rust] [DataFusion] MutableArrayData slow with many batches

2020-12-29 Thread Andy Grove (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11030?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andy Grove reassigned ARROW-11030: -- Assignee: (was: Andy Grove) > [Rust] [DataFusion] MutableArrayData slow with many batches

[jira] [Updated] (ARROW-11030) [Rust] [DataFusion] HashJoinExec slow with many batches

2020-12-29 Thread Andy Grove (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11030?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andy Grove updated ARROW-11030: --- Summary: [Rust] [DataFusion] HashJoinExec slow with many batches (was: [Rust] [DataFusion] MutableA

[jira] [Created] (ARROW-11071) [R][CI] Use processx to set up minio and flight servers in tests

2020-12-29 Thread Neal Richardson (Jira)
Neal Richardson created ARROW-11071: --- Summary: [R][CI] Use processx to set up minio and flight servers in tests Key: ARROW-11071 URL: https://issues.apache.org/jira/browse/ARROW-11071 Project: Apach

[jira] [Updated] (ARROW-11065) [C++] Installation failed on AIX7.2

2020-12-29 Thread Kouhei Sutou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11065?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kouhei Sutou updated ARROW-11065: - Fix Version/s: (was: 2.0.0) > [C++] Installation failed on AIX7.2 >

[jira] [Updated] (ARROW-11065) [C++] Installation failed on AIX7.2

2020-12-29 Thread Kouhei Sutou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11065?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kouhei Sutou updated ARROW-11065: - Flags: (was: Important) > [C++] Installation failed on AIX7.2 > --

[jira] [Updated] (ARROW-11065) [C++] Installation failed on AIX7.2

2020-12-29 Thread Kouhei Sutou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11065?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kouhei Sutou updated ARROW-11065: - Issue Type: New Feature (was: Bug) > [C++] Installation failed on AIX7.2 >

[jira] [Updated] (ARROW-11065) [C++] Installation failed on AIX7.2

2020-12-29 Thread Kouhei Sutou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11065?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kouhei Sutou updated ARROW-11065: - Labels: (was: build) > [C++] Installation failed on AIX7.2 > -

[jira] [Updated] (ARROW-11065) [C++] Installation failed on AIX7.2

2020-12-29 Thread Kouhei Sutou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11065?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kouhei Sutou updated ARROW-11065: - Description: My installation of pyarrow on AIX7.2 failed due to missing ARROW and I was told I

[jira] [Assigned] (ARROW-11063) [Rust] Validate null counts when building arrays

2020-12-29 Thread Andrew Lamb (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11063?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Lamb reassigned ARROW-11063: --- Assignee: Neville Dipale > [Rust] Validate null counts when building arrays > -

[jira] [Updated] (ARROW-11063) [Rust] Validate null counts when building arrays

2020-12-29 Thread Andrew Lamb (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11063?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Lamb updated ARROW-11063: Component/s: Rust > [Rust] Validate null counts when building arrays > ---

[jira] [Resolved] (ARROW-11063) [Rust] Validate null counts when building arrays

2020-12-29 Thread Andrew Lamb (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11063?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Lamb resolved ARROW-11063. - Fix Version/s: 3.0.0 Resolution: Fixed Issue resolved by pull request 9041 [https://githu

[jira] [Commented] (ARROW-9147) [C++][Dataset] Support null -> other type promotion in Dataset scanning

2020-12-29 Thread Gabriel Bassett (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9147?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17256193#comment-17256193 ] Gabriel Bassett commented on ARROW-9147: I received the following error with arro

[jira] [Assigned] (ARROW-6582) [R] Arrow to R fails with embedded nuls in strings

2020-12-29 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6582?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson reassigned ARROW-6582: -- Assignee: Neal Richardson > [R] Arrow to R fails with embedded nuls in strings > -

[jira] [Updated] (ARROW-7288) [C++][R] read_parquet() freezes on Windows with Japanese locale

2020-12-29 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7288?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-7288: --- Fix Version/s: (was: 3.0.0) 4.0.0 > [C++][R] read_parquet() freezes on

[jira] [Assigned] (ARROW-9856) [R] Add bindings for string compute functions

2020-12-29 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9856?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson reassigned ARROW-9856: -- Assignee: Jonathan Keane > [R] Add bindings for string compute functions > ---

[jira] [Assigned] (ARROW-9187) [R] Add bindings for arithmetic kernels

2020-12-29 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9187?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson reassigned ARROW-9187: -- Assignee: Jonathan Keane (was: Neal Richardson) > [R] Add bindings for arithmetic ker

[jira] [Assigned] (ARROW-9187) [R] Add bindings for arithmetic kernels

2020-12-29 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9187?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson reassigned ARROW-9187: -- Assignee: Neal Richardson (was: Jonathan Keane) > [R] Add bindings for arithmetic ker

[jira] [Updated] (ARROW-8470) [Python][R] Expose incremental write API for Feather files

2020-12-29 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8470?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-8470: --- Fix Version/s: (was: 3.0.0) 4.0.0 > [Python][R] Expose incremental wri

[jira] [Commented] (ARROW-7288) [C++][R] read_parquet() freezes on Windows with Japanese locale

2020-12-29 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7288?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17256202#comment-17256202 ] Antoine Pitrou commented on ARROW-7288: --- [~jonkeane] Is this something you could tr

[jira] [Updated] (ARROW-11067) [R] read_csv_arrow silently fails to read some strings and returns nulls

2020-12-29 Thread John Sheffield (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11067?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] John Sheffield updated ARROW-11067: --- Attachment: arrowbug1.png > [R] read_csv_arrow silently fails to read some strings and retur

[jira] [Updated] (ARROW-11067) [R] read_csv_arrow silently fails to read some strings and returns nulls

2020-12-29 Thread John Sheffield (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11067?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] John Sheffield updated ARROW-11067: --- Attachment: arrow_failure_cases.csv > [R] read_csv_arrow silently fails to read some strings

[jira] [Commented] (ARROW-11067) [R] read_csv_arrow silently fails to read some strings and returns nulls

2020-12-29 Thread John Sheffield (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11067?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17256206#comment-17256206 ] John Sheffield commented on ARROW-11067: I pulled a few strings over a much larg

[jira] [Comment Edited] (ARROW-11067) [R] read_csv_arrow silently fails to read some strings and returns nulls

2020-12-29 Thread John Sheffield (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11067?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17256206#comment-17256206 ] John Sheffield edited comment on ARROW-11067 at 12/29/20, 11:38 PM: --

[jira] [Issue Comment Deleted] (ARROW-11067) [R] read_csv_arrow silently fails to read some strings and returns nulls

2020-12-29 Thread John Sheffield (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11067?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] John Sheffield updated ARROW-11067: --- Comment: was deleted (was: I pulled a few strings over a much larger dataset and came to som

[jira] [Updated] (ARROW-11067) [R] read_csv_arrow silently fails to read some strings and returns nulls

2020-12-29 Thread John Sheffield (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11067?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] John Sheffield updated ARROW-11067: --- Attachment: arrow_failure_cases.csv > [R] read_csv_arrow silently fails to read some strings

[jira] [Commented] (ARROW-11067) [R] read_csv_arrow silently fails to read some strings and returns nulls

2020-12-29 Thread John Sheffield (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11067?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17256207#comment-17256207 ] John Sheffield commented on ARROW-11067: I pulled a few strings over a much larg

[jira] [Updated] (ARROW-11067) [R] read_csv_arrow silently fails to read some strings and returns nulls

2020-12-29 Thread John Sheffield (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11067?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] John Sheffield updated ARROW-11067: --- Attachment: arrowbug1.png > [R] read_csv_arrow silently fails to read some strings and retur

[jira] [Commented] (ARROW-11067) [R] read_csv_arrow silently fails to read some strings and returns nulls

2020-12-29 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11067?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17256208#comment-17256208 ] Neal Richardson commented on ARROW-11067: - That's really helpful, thanks for sha

[jira] [Commented] (ARROW-9019) [Python] hdfs fails to connect to for HDFS 3.x cluster

2020-12-29 Thread Bradley Miro (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9019?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17256212#comment-17256212 ] Bradley Miro commented on ARROW-9019: - Hello! I'm on the GCP Dataproc team and was wo

[jira] [Updated] (ARROW-11067) [R] read_csv_arrow silently fails to read some strings and returns nulls

2020-12-29 Thread John Sheffield (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11067?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] John Sheffield updated ARROW-11067: --- Attachment: arrow_explanation.png > [R] read_csv_arrow silently fails to read some strings a

[jira] [Commented] (ARROW-11067) [R] read_csv_arrow silently fails to read some strings and returns nulls

2020-12-29 Thread John Sheffield (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11067?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17256218#comment-17256218 ] John Sheffield commented on ARROW-11067: (Sorry for the fragmented report here,

[jira] [Comment Edited] (ARROW-11067) [R] read_csv_arrow silently fails to read some strings and returns nulls

2020-12-29 Thread John Sheffield (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11067?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17256218#comment-17256218 ] John Sheffield edited comment on ARROW-11067 at 12/30/20, 1:35 AM: ---

[jira] [Commented] (ARROW-11067) [R] read_csv_arrow silently fails to read some strings and returns nulls

2020-12-29 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11067?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17256237#comment-17256237 ] Weston Pace commented on ARROW-11067: - I'll look into it a bit more tomorrow but at