[jira] [Updated] (ARROW-13542) [C++][Compute][Dataset] Add dataset::WriteNode for writing rows from an ExecPlan to disk

2021-08-26 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13542?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-13542: --- Labels: dataset pull-request-available query-engine (was: dataset query-engine) > [C++][Co

[jira] [Commented] (ARROW-13761) [R] arrow::filter() crashes (aborts R session)

2021-08-26 Thread Carl Boettiger (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13761?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17405566#comment-17405566 ] Carl Boettiger commented on ARROW-13761: Thanks Weston, that's great to hear!  T

[jira] [Commented] (ARROW-12960) [C++][R] Option for is_nan(null) to evaluate to false or true

2021-08-26 Thread Christian Cordova (Jira)
[ https://issues.apache.org/jira/browse/ARROW-12960?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17405559#comment-17405559 ] Christian Cordova commented on ARROW-12960: --- As a humble opinion (and as a use

[jira] [Created] (ARROW-13781) [Python] Allow per column encoding in parquet writer

2021-08-26 Thread Brian Kiefer (Jira)
Brian Kiefer created ARROW-13781: Summary: [Python] Allow per column encoding in parquet writer Key: ARROW-13781 URL: https://issues.apache.org/jira/browse/ARROW-13781 Project: Apache Arrow

[jira] [Updated] (ARROW-13780) [Gandiva][UDF] Fix bug in udf space/rpad/lpad

2021-08-26 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13780?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-13780: --- Labels: pull-request-available (was: ) > [Gandiva][UDF] Fix bug in udf space/rpad/lpad > --

[jira] [Updated] (ARROW-13780) [Gandiva][UDF] Fix bug in udf space/rpad/lpad

2021-08-26 Thread ZMZ91 (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13780?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ZMZ91 updated ARROW-13780: -- Description: - return length out of range sometimes makes the udfs crash - the return length for rpad/lpad is

[jira] [Created] (ARROW-13780) [Gandiva][UDF] Fix bug in udf space/rpad/lpad

2021-08-26 Thread ZMZ91 (Jira)
ZMZ91 created ARROW-13780: - Summary: [Gandiva][UDF] Fix bug in udf space/rpad/lpad Key: ARROW-13780 URL: https://issues.apache.org/jira/browse/ARROW-13780 Project: Apache Arrow Issue Type: Bug

[jira] [Commented] (ARROW-12960) [C++][R] Option for is_nan(null) to evaluate to false or true

2021-08-26 Thread Ian Cook (Jira)
[ https://issues.apache.org/jira/browse/ARROW-12960?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17405525#comment-17405525 ] Ian Cook commented on ARROW-12960: -- I haven't opened Jiras for them all yet, but eventu

[jira] [Assigned] (ARROW-13737) [C++] Support scalar columns in hash aggregations (was: hash_sum on scalar column segfaults)

2021-08-26 Thread Ian Cook (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13737?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ian Cook reassigned ARROW-13737: Assignee: David Li (was: Ian Cook) > [C++] Support scalar columns in hash aggregations (was: has

[jira] [Assigned] (ARROW-13737) [C++] Support scalar columns in hash aggregations (was: hash_sum on scalar column segfaults)

2021-08-26 Thread Ian Cook (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13737?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ian Cook reassigned ARROW-13737: Assignee: Ian Cook (was: David Li) > [C++] Support scalar columns in hash aggregations (was: has

[jira] [Updated] (ARROW-13310) [C++] Implement hash_aggregate mode kernel

2021-08-26 Thread Ian Cook (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13310?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ian Cook updated ARROW-13310: - Component/s: C++ > [C++] Implement hash_aggregate mode kernel >

[jira] [Resolved] (ARROW-13776) [C++] Offline thirdparty versions.txt is missing extensions for some files

2021-08-26 Thread Kouhei Sutou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13776?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kouhei Sutou resolved ARROW-13776. -- Fix Version/s: 6.0.0 Resolution: Fixed Issue resolved by pull request 11015 [https://gi

[jira] [Created] (ARROW-13779) [R] Disallow expressions that depend on order after arrange()

2021-08-26 Thread Neal Richardson (Jira)
Neal Richardson created ARROW-13779: --- Summary: [R] Disallow expressions that depend on order after arrange() Key: ARROW-13779 URL: https://issues.apache.org/jira/browse/ARROW-13779 Project: Apache A

[jira] [Created] (ARROW-13778) [R] Handle complex summarize expressions

2021-08-26 Thread Neal Richardson (Jira)
Neal Richardson created ARROW-13778: --- Summary: [R] Handle complex summarize expressions Key: ARROW-13778 URL: https://issues.apache.org/jira/browse/ARROW-13778 Project: Apache Arrow Issue T

[jira] [Created] (ARROW-13777) [R] mutate after group_by should be ok as long as there are only scalar functions

2021-08-26 Thread Neal Richardson (Jira)
Neal Richardson created ARROW-13777: --- Summary: [R] mutate after group_by should be ok as long as there are only scalar functions Key: ARROW-13777 URL: https://issues.apache.org/jira/browse/ARROW-13777

[jira] [Updated] (ARROW-13776) [C++] Offline thirdparty versions.txt is missing extensions for some files

2021-08-26 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13776?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-13776: --- Labels: pull-request-available (was: ) > [C++] Offline thirdparty versions.txt is missing e

[jira] [Updated] (ARROW-13776) [C++] Offline thirdparty versions.txt is missing extensions for some files

2021-08-26 Thread Karl Dunkle Werner (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13776?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Karl Dunkle Werner updated ARROW-13776: --- Parent: ARROW-12981 Issue Type: Sub-task (was: Bug) > [C++] Offline thirdpa

[jira] [Commented] (ARROW-13761) [R] arrow::filter() crashes (aborts R session)

2021-08-26 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13761?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17405511#comment-17405511 ] Weston Pace commented on ARROW-13761: - I suspect the memory issue is ARROW-13611 whi

[jira] [Updated] (ARROW-13611) [C++] Scanning datasets does not enforce back pressure

2021-08-26 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13611?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Weston Pace updated ARROW-13611: Summary: [C++] Scanning datasets does not enforce back pressure (was: [C++] Scanning datasets in

[jira] [Updated] (ARROW-13611) [C++] Scanning datasets does not enforce back pressure

2021-08-26 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13611?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Weston Pace updated ARROW-13611: Description: I have a simple test case where I scan the batches of a 4GB dataset and print out th

[jira] [Updated] (ARROW-13611) [C++] Scanning datasets in pyarrow does not enforce back pressure

2021-08-26 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13611?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Weston Pace updated ARROW-13611: Summary: [C++] Scanning datasets in pyarrow does not enforce back pressure (was: [C++][Python] Sc

[jira] [Created] (ARROW-13776) [C++] Offline thirdparty versions.txt is missing extensions for some files

2021-08-26 Thread Karl Dunkle Werner (Jira)
Karl Dunkle Werner created ARROW-13776: -- Summary: [C++] Offline thirdparty versions.txt is missing extensions for some files Key: ARROW-13776 URL: https://issues.apache.org/jira/browse/ARROW-13776

[jira] [Updated] (ARROW-13775) [C++] Allow Partitioning objects to be created with a vector of field names

2021-08-26 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13775?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-13775: --- Labels: pull-request-available (was: ) > [C++] Allow Partitioning objects to be created wit

[jira] [Assigned] (ARROW-13775) [C++] Allow Partitioning objects to be created with a vector of field names

2021-08-26 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13775?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Weston Pace reassigned ARROW-13775: --- Assignee: Weston Pace > [C++] Allow Partitioning objects to be created with a vector of fie

[jira] [Created] (ARROW-13775) [C++] Allow Partitioning objects to be created with a vector of field names

2021-08-26 Thread Weston Pace (Jira)
Weston Pace created ARROW-13775: --- Summary: [C++] Allow Partitioning objects to be created with a vector of field names Key: ARROW-13775 URL: https://issues.apache.org/jira/browse/ARROW-13775 Project: Ap

[jira] [Commented] (ARROW-13761) [R] arrow::filter() crashes (aborts R session)

2021-08-26 Thread Carl Boettiger (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13761?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17405506#comment-17405506 ] Carl Boettiger commented on ARROW-13761: Thanks all for the explanations. I can

[jira] [Assigned] (ARROW-12084) [C++][Compute] Add remainder and quotient compute::Function

2021-08-26 Thread Eduardo Ponce (Jira)
[ https://issues.apache.org/jira/browse/ARROW-12084?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Eduardo Ponce reassigned ARROW-12084: - Assignee: Eduardo Ponce > [C++][Compute] Add remainder and quotient compute::Function >

[jira] [Created] (ARROW-13774) [C++][Python][Compute] Number to string hex conversion

2021-08-26 Thread Eduardo Ponce (Jira)
Eduardo Ponce created ARROW-13774: - Summary: [C++][Python][Compute] Number to string hex conversion Key: ARROW-13774 URL: https://issues.apache.org/jira/browse/ARROW-13774 Project: Apache Arrow

[jira] [Commented] (ARROW-12657) [C++][Python][Compute] String hex to numeric conversion and bit shifting

2021-08-26 Thread Eduardo Ponce (Jira)
[ https://issues.apache.org/jira/browse/ARROW-12657?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17405491#comment-17405491 ] Eduardo Ponce commented on ARROW-12657: --- Also, Arrow already supports some hex par

[jira] [Commented] (ARROW-13695) Github data scraping for issue status

2021-08-26 Thread David Dali Susanibar Arce (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13695?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17405490#comment-17405490 ] David Dali Susanibar Arce commented on ARROW-13695: --- GITHUB: 1. get w

[jira] [Comment Edited] (ARROW-12657) [C++][Python][Compute] String hex to numeric conversion and bit shifting

2021-08-26 Thread Eduardo Ponce (Jira)
[ https://issues.apache.org/jira/browse/ARROW-12657?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17405482#comment-17405482 ] Eduardo Ponce edited comment on ARROW-12657 at 8/26/21, 9:49 PM: -

[jira] [Commented] (ARROW-12657) [C++][Python][Compute] String hex to numeric conversion and bit shifting

2021-08-26 Thread Eduardo Ponce (Jira)
[ https://issues.apache.org/jira/browse/ARROW-12657?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17405482#comment-17405482 ] Eduardo Ponce commented on ARROW-12657: --- The inverse operation, commonly called *h

[jira] [Assigned] (ARROW-12657) [C++][Python][Compute] String hex to numeric conversion and bit shifting

2021-08-26 Thread William Malpica (Jira)
[ https://issues.apache.org/jira/browse/ARROW-12657?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] William Malpica reassigned ARROW-12657: --- Assignee: William Malpica > [C++][Python][Compute] String hex to numeric conversion

[jira] [Commented] (ARROW-12657) [C++][Python][Compute] String hex to numeric conversion and bit shifting

2021-08-26 Thread Eduardo Ponce (Jira)
[ https://issues.apache.org/jira/browse/ARROW-12657?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17405462#comment-17405462 ] Eduardo Ponce commented on ARROW-12657: --- Generally, compute functions in Arrow per

[jira] [Updated] (ARROW-13773) [C++] Provide a cross platform helepr for definition of library init code

2021-08-26 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13773?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-13773: --- Labels: pull-request-available (was: ) > [C++] Provide a cross platform helepr for definiti

[jira] [Created] (ARROW-13773) [C++] Provide a cross platform helepr for definition of library init code

2021-08-26 Thread Ben Kietzman (Jira)
Ben Kietzman created ARROW-13773: Summary: [C++] Provide a cross platform helepr for definition of library init code Key: ARROW-13773 URL: https://issues.apache.org/jira/browse/ARROW-13773 Project: Ap

[jira] [Resolved] (ARROW-13757) [R] Fix download of C++ source for CRAN patch releases

2021-08-26 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13757?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson resolved ARROW-13757. - Resolution: Fixed Issue resolved by pull request 11000 [https://github.com/apache/arrow/

[jira] [Comment Edited] (ARROW-13761) [R] arrow::filter() crashes (aborts R session)

2021-08-26 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13761?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17405434#comment-17405434 ] Neal Richardson edited comment on ARROW-13761 at 8/26/21, 7:28 PM: ---

[jira] [Commented] (ARROW-13761) [R] arrow::filter() crashes (aborts R session)

2021-08-26 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13761?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17405434#comment-17405434 ] Neal Richardson commented on ARROW-13761: - Yep, that's the difference: in the fi

[jira] [Closed] (ARROW-13771) [C++] Allow HivePartitioningFactory to be created from a vector of names

2021-08-26 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13771?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Weston Pace closed ARROW-13771. --- Resolution: Invalid I was turned around. The directory partitioning factory needs names because th

[jira] [Commented] (ARROW-13761) [R] arrow::filter() crashes (aborts R session)

2021-08-26 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13761?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17405411#comment-17405411 ] Weston Pace commented on ARROW-13761: - An empty table can be represented as a chunke

[jira] [Created] (ARROW-13772) [R] Binding for median aggregation

2021-08-26 Thread Nic Crane (Jira)
Nic Crane created ARROW-13772: - Summary: [R] Binding for median aggregation Key: ARROW-13772 URL: https://issues.apache.org/jira/browse/ARROW-13772 Project: Apache Arrow Issue Type: Improvement

[jira] [Assigned] (ARROW-13771) [C++] Allow HivePartitioningFactory to be created from a vector of names

2021-08-26 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13771?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Weston Pace reassigned ARROW-13771: --- Assignee: Weston Pace > [C++] Allow HivePartitioningFactory to be created from a vector of

[jira] [Created] (ARROW-13771) [C++] Allow HivePartitioningFactory to be created from a vector of names

2021-08-26 Thread Weston Pace (Jira)
Weston Pace created ARROW-13771: --- Summary: [C++] Allow HivePartitioningFactory to be created from a vector of names Key: ARROW-13771 URL: https://issues.apache.org/jira/browse/ARROW-13771 Project: Apach

[jira] [Updated] (ARROW-13616) [R] Cheat Sheet Structure

2021-08-26 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13616?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-13616: --- Labels: pull-request-available (was: ) > [R] Cheat Sheet Structure > --

[jira] [Commented] (ARROW-13769) [C++] BitmapAnd, BitmapOr... could return the number of set bits

2021-08-26 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13769?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17405393#comment-17405393 ] Antoine Pitrou commented on ARROW-13769: Yes, that was my thought. We could also

[jira] [Created] (ARROW-13770) [R] Add MapType and MapArray support to R bindings

2021-08-26 Thread Ian Cook (Jira)
Ian Cook created ARROW-13770: Summary: [R] Add MapType and MapArray support to R bindings Key: ARROW-13770 URL: https://issues.apache.org/jira/browse/ARROW-13770 Project: Apache Arrow Issue Type:

[jira] [Commented] (ARROW-13769) [C++] BitmapAnd, BitmapOr... could return the number of set bits

2021-08-26 Thread David Li (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13769?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17405387#comment-17405387 ] David Li commented on ARROW-13769: -- This would let us pre-compute the null count in if_

[jira] [Updated] (ARROW-13480) [C++] [R] [Python] Dataset SyncScanner may freeze on error

2021-08-26 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13480?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou updated ARROW-13480: --- Summary: [C++] [R] [Python] Dataset SyncScanner may freeze on error (was: [C++] [R] [Python

[jira] [Updated] (ARROW-13480) [C++] [R] [Python] C-interface error propagation

2021-08-26 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13480?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou updated ARROW-13480: --- Fix Version/s: 5.0.1 > [C++] [R] [Python] C-interface error propagation > -

[jira] [Resolved] (ARROW-13480) [C++] [R] [Python] C-interface error propagation

2021-08-26 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13480?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou resolved ARROW-13480. Fix Version/s: 6.0.0 Resolution: Fixed Issue resolved by pull request 10993 [https:

[jira] [Commented] (ARROW-12960) [C++][R] Option for is_nan(null) to evaluate to false or true

2021-08-26 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-12960?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17405380#comment-17405380 ] Antoine Pitrou commented on ARROW-12960: Well, we could an option to control han

[jira] [Comment Edited] (ARROW-12960) [C++][R] Option for is_nan(null) to evaluate to false or true

2021-08-26 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-12960?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17405380#comment-17405380 ] Antoine Pitrou edited comment on ARROW-12960 at 8/26/21, 5:13 PM:

[jira] [Commented] (ARROW-12099) [Python] Explode array column

2021-08-26 Thread Ian Cook (Jira)
[ https://issues.apache.org/jira/browse/ARROW-12099?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17405379#comment-17405379 ] Ian Cook commented on ARROW-12099: -- I think for the initial implementation, we should l

[jira] [Resolved] (ARROW-12959) [C++][R] Option for is_null(NaN) to evaluate to true

2021-08-26 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-12959?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou resolved ARROW-12959. Resolution: Fixed Issue resolved by pull request 10896 [https://github.com/apache/arrow/pu

[jira] [Comment Edited] (ARROW-12099) [Python] Explode array column

2021-08-26 Thread Ian Cook (Jira)
[ https://issues.apache.org/jira/browse/ARROW-12099?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17405371#comment-17405371 ] Ian Cook edited comment on ARROW-12099 at 8/26/21, 5:05 PM:

[jira] [Commented] (ARROW-12099) [Python] Explode array column

2021-08-26 Thread Ian Cook (Jira)
[ https://issues.apache.org/jira/browse/ARROW-12099?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17405371#comment-17405371 ] Ian Cook commented on ARROW-12099: -- +1 Hive also has an [{{explode}}|https://cwiki.apa

[jira] [Commented] (ARROW-13769) [C++] BitmapAnd, BitmapOr... could return the number of set bits

2021-08-26 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13769?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17405365#comment-17405365 ] Antoine Pitrou commented on ARROW-13769: cc [~lidavidm] > [C++] BitmapAnd, Bitm

[jira] [Created] (ARROW-13769) [C++] BitmapAnd, BitmapOr... could return the number of set bits

2021-08-26 Thread Antoine Pitrou (Jira)
Antoine Pitrou created ARROW-13769: -- Summary: [C++] BitmapAnd, BitmapOr... could return the number of set bits Key: ARROW-13769 URL: https://issues.apache.org/jira/browse/ARROW-13769 Project: Apache

[jira] [Comment Edited] (ARROW-12960) [C++][R] Option for is_nan(null) to evaluate to false or true

2021-08-26 Thread Eduardo Ponce (Jira)
[ https://issues.apache.org/jira/browse/ARROW-12960?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17405364#comment-17405364 ] Eduardo Ponce edited comment on ARROW-12960 at 8/26/21, 4:55 PM: -

[jira] [Commented] (ARROW-12960) [C++][R] Option for is_nan(null) to evaluate to false or true

2021-08-26 Thread Eduardo Ponce (Jira)
[ https://issues.apache.org/jira/browse/ARROW-12960?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17405364#comment-17405364 ] Eduardo Ponce commented on ARROW-12960: --- After more careful thought, I do understa

[jira] [Comment Edited] (ARROW-12960) [C++][R] Option for is_nan(null) to evaluate to false or true

2021-08-26 Thread Eduardo Ponce (Jira)
[ https://issues.apache.org/jira/browse/ARROW-12960?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17405364#comment-17405364 ] Eduardo Ponce edited comment on ARROW-12960 at 8/26/21, 4:53 PM: -

[jira] [Updated] (ARROW-13764) [C++] Implement ScalarAggregateOptions for count_distinct (grouped)

2021-08-26 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13764?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-13764: --- Labels: kernel pull-request-available (was: kernel) > [C++] Implement ScalarAggregateOption

[jira] [Commented] (ARROW-12960) [C++][R] Option for is_nan(null) to evaluate to false or true

2021-08-26 Thread Ian Cook (Jira)
[ https://issues.apache.org/jira/browse/ARROW-12960?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17405335#comment-17405335 ] Ian Cook commented on ARROW-12960: -- [~edponce] I think that's a fair point, but I assum

[jira] [Updated] (ARROW-13768) [R] Allow JSON to be an optional component

2021-08-26 Thread Karl Dunkle Werner (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13768?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Karl Dunkle Werner updated ARROW-13768: --- Parent: ARROW-12981 Issue Type: Sub-task (was: Task) > [R] Allow JSON to be

[jira] [Comment Edited] (ARROW-13764) [C++] Implement ScalarAggregateOptions for count_distinct (grouped)

2021-08-26 Thread Nic Crane (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13764?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17405324#comment-17405324 ] Nic Crane edited comment on ARROW-13764 at 8/26/21, 3:59 PM: -

[jira] [Created] (ARROW-13768) [R] Allow JSON to be an optional component

2021-08-26 Thread Karl Dunkle Werner (Jira)
Karl Dunkle Werner created ARROW-13768: -- Summary: [R] Allow JSON to be an optional component Key: ARROW-13768 URL: https://issues.apache.org/jira/browse/ARROW-13768 Project: Apache Arrow

[jira] [Commented] (ARROW-13764) [C++] Implement ScalarAggregateOptions for count_distinct (grouped)

2021-08-26 Thread Nic Crane (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13764?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17405324#comment-17405324 ] Nic Crane commented on ARROW-13764: --- Ah, my mistake, CountOptions would be fantastic,

[jira] [Updated] (ARROW-13767) [R] Add Arrow methods slice(), slice_head(), slice_tail()

2021-08-26 Thread Ian Cook (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13767?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ian Cook updated ARROW-13767: - Description: Implement [{{slice()}}, {{slice_head()}}, and {{slice_tail()}}|https://dplyr.tidyverse.org/

[jira] [Updated] (ARROW-13766) [R] Add Arrow methods slice_min(), slice_max()

2021-08-26 Thread Ian Cook (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13766?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ian Cook updated ARROW-13766: - Description: Implement [{{slice_min()}} and {{slice_max()}}|https://dplyr.tidyverse.org/reference/slice

[jira] [Updated] (ARROW-13766) [R] Add Arrow methods slice_min(), slice_max()

2021-08-26 Thread Ian Cook (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13766?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ian Cook updated ARROW-13766: - Summary: [R] Add Arrow methods slice_min(), slice_max() (was: [R] Add Arrow methods for slice_min(), sl

[jira] [Updated] (ARROW-13766) [R] Add Arrow methods for slice_min(), slice_max()

2021-08-26 Thread Ian Cook (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13766?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ian Cook updated ARROW-13766: - Description: Implement [{{slice_min()}} and {{slice_max()}}|https://dplyr.tidyverse.org/reference/slice

[jira] [Updated] (ARROW-13767) [R] Add Arrow methods slice(), slice_head(), slice_tail()

2021-08-26 Thread Ian Cook (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13767?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ian Cook updated ARROW-13767: - Summary: [R] Add Arrow methods slice(), slice_head(), slice_tail() (was: [R] Add Arrow methods for slic

[jira] [Updated] (ARROW-13766) [R] Add Arrow methods for slice_min(), slice_max()

2021-08-26 Thread Ian Cook (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13766?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ian Cook updated ARROW-13766: - Summary: [R] Add Arrow methods for slice_min(), slice_max() (was: [R] Add bindings for slice_min(), sli

[jira] [Created] (ARROW-13767) [R] Add Arrow methods for slice(), slice_head(), slice_tail()

2021-08-26 Thread Ian Cook (Jira)
Ian Cook created ARROW-13767: Summary: [R] Add Arrow methods for slice(), slice_head(), slice_tail() Key: ARROW-13767 URL: https://issues.apache.org/jira/browse/ARROW-13767 Project: Apache Arrow

[jira] [Commented] (ARROW-12960) [C++][R] Option for is_nan(null) to evaluate to false or true

2021-08-26 Thread Eduardo Ponce (Jira)
[ https://issues.apache.org/jira/browse/ARROW-12960?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17405314#comment-17405314 ] Eduardo Ponce commented on ARROW-12960: --- >From an API perspective, invoking a func

[jira] [Closed] (ARROW-12615) [C++] Add options for handling NAs to stddev and variance

2021-08-26 Thread David Li (Jira)
[ https://issues.apache.org/jira/browse/ARROW-12615?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] David Li closed ARROW-12615. Fix Version/s: 6.0.0 Assignee: David Li Resolution: Duplicate > [C++] Add options for hand

[jira] [Commented] (ARROW-12615) [C++] Add options for handling NAs to stddev and variance

2021-08-26 Thread David Li (Jira)
[ https://issues.apache.org/jira/browse/ARROW-12615?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17405311#comment-17405311 ] David Li commented on ARROW-12615: -- This was duplicated by ARROW-13691 which has a PR u

[jira] [Created] (ARROW-13766) [R] Add bindings for slice_min(), slice_max()

2021-08-26 Thread Ian Cook (Jira)
Ian Cook created ARROW-13766: Summary: [R] Add bindings for slice_min(), slice_max() Key: ARROW-13766 URL: https://issues.apache.org/jira/browse/ARROW-13766 Project: Apache Arrow Issue Type: Impr

[jira] [Commented] (ARROW-13764) [C++] Implement ScalarAggregateOptions for count_distinct (grouped)

2021-08-26 Thread David Li (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13764?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17405301#comment-17405301 ] David Li commented on ARROW-13764: -- Ok, so this is about the values - then we should su

[jira] [Commented] (ARROW-13764) [C++] Implement ScalarAggregateOptions for count_distinct (grouped)

2021-08-26 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13764?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17405299#comment-17405299 ] Neal Richardson commented on ARROW-13764: - [~lidavidm] Neither of those, if I un

[jira] [Created] (ARROW-13765) Create front end UI to display and summarize information about nightly jobs branches

2021-08-26 Thread David Dali Susanibar Arce (Jira)
David Dali Susanibar Arce created ARROW-13765: - Summary: Create front end UI to display and summarize information about nightly jobs branches Key: ARROW-13765 URL: https://issues.apache.org/jira/browse

[jira] [Commented] (ARROW-13764) [C++] Implement ScalarAggregateOptions for count_distinct (grouped)

2021-08-26 Thread David Li (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13764?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17405297#comment-17405297 ] David Li commented on ARROW-13764: -- Well, it would be neither, since both of those are

[jira] [Updated] (ARROW-13686) [Python] Update deprecated pytest yield_fixture functions

2021-08-26 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13686?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-13686: --- Labels: beginner pull-request-available (was: beginner) > [Python] Update deprecated pytest

[jira] [Commented] (ARROW-13764) [C++] Implement ScalarAggregateOptions for count_distinct (grouped)

2021-08-26 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13764?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17405295#comment-17405295 ] Neal Richardson commented on ARROW-13764: - Side note: should this be CountOption

[jira] [Commented] (ARROW-13764) [C++] Implement ScalarAggregateOptions for count_distinct (grouped)

2021-08-26 Thread David Li (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13764?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17405274#comment-17405274 ] David Li commented on ARROW-13764: -- And to be clear, this is about the groups, not the

[jira] [Updated] (ARROW-13764) [C++] Implement ScalarAggregateOptions for count_distinct (grouped)

2021-08-26 Thread David Li (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13764?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] David Li updated ARROW-13764: - Labels: kernel (was: ) > [C++] Implement ScalarAggregateOptions for count_distinct (grouped) > ---

[jira] [Updated] (ARROW-13764) [C++] Implement ScalarAggregateOptions for count_distinct (grouped)

2021-08-26 Thread David Li (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13764?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] David Li updated ARROW-13764: - Fix Version/s: 6.0.0 > [C++] Implement ScalarAggregateOptions for count_distinct (grouped) > --

[jira] [Assigned] (ARROW-13764) [C++] Implement ScalarAggregateOptions for count_distinct (grouped)

2021-08-26 Thread David Li (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13764?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] David Li reassigned ARROW-13764: Assignee: David Li > [C++] Implement ScalarAggregateOptions for count_distinct (grouped) > -

[jira] [Updated] (ARROW-13694) [R] Arrow filter crashes (R aborted session)

2021-08-26 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13694?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-13694: Priority: Critical (was: Blocker) > [R] Arrow filter crashes (R aborted session) > --

[jira] [Commented] (ARROW-13546) [Python] Breaking API change in FSSpecHandler, requires metadata argument

2021-08-26 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13546?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17405265#comment-17405265 ] Joris Van den Bossche commented on ARROW-13546: --- Ah, it might be possible

[jira] [Commented] (ARROW-13761) [R] arrow::filter() crashes (aborts R session)

2021-08-26 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13761?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17405261#comment-17405261 ] Neal Richardson commented on ARROW-13761: - Interestingly, this doesn't crash on

[jira] [Commented] (ARROW-13764) [C++] Implement ScalarAggregateOptions for count_distinct (grouped)

2021-08-26 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13764?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17405259#comment-17405259 ] Antoine Pitrou commented on ARROW-13764: One way would probably to remove the "n

[jira] [Commented] (ARROW-13546) [Python] Breaking API change in FSSpecHandler, requires metadata argument

2021-08-26 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13546?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17405258#comment-17405258 ] Joris Van den Bossche commented on ARROW-13546: --- [~maartenbreddels] sorry,

[jira] [Assigned] (ARROW-13696) [Python] Support for MapType with Fields

2021-08-26 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13696?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche reassigned ARROW-13696: - Assignee: Jason Reid > [Python] Support for MapType with Fields > -

[jira] [Resolved] (ARROW-13696) [Python] Support for MapType with Fields

2021-08-26 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13696?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche resolved ARROW-13696. --- Fix Version/s: 6.0.0 Resolution: Fixed Issue resolved by pull request

[jira] [Updated] (ARROW-13620) [R] Binding for n_distinct()

2021-08-26 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13620?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-13620: --- Labels: pull-request-available query-engine (was: query-engine) > [R] Binding for n_distinc

[jira] [Comment Edited] (ARROW-13309) [C++] Implement hash_aggregate exact quantile kernel

2021-08-26 Thread Ian Cook (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13309?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17405240#comment-17405240 ] Ian Cook edited comment on ARROW-13309 at 8/26/21, 1:56 PM:

[jira] [Commented] (ARROW-13309) [C++] Implement hash_aggregate exact quantile kernel

2021-08-26 Thread Ian Cook (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13309?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17405240#comment-17405240 ] Ian Cook commented on ARROW-13309: -- [~lidavidm] thanks, I think it's fine to wait for n

[jira] [Created] (ARROW-13764) [C++] Implement ScalarAggregateOptions for count_distinct (grouped)

2021-08-26 Thread Nic Crane (Jira)
Nic Crane created ARROW-13764: - Summary: [C++] Implement ScalarAggregateOptions for count_distinct (grouped) Key: ARROW-13764 URL: https://issues.apache.org/jira/browse/ARROW-13764 Project: Apache Arrow

  1   2   >