Re: [Docs] sphinx-tabs and build errors

2022-03-21 Thread Ian Joiner
Here is the PR: https://github.com/apache/arrow/pull/12684 > On Mar 21, 2022, at 2:52 PM, Joris Van den Bossche > wrote: > > On Sun, 20 Mar 2022 at 07:50, Ian Joiner wrote: > >> Hi, >> >> I’d like to ask about how the documentation is built. I have followed the >> instructions to build and i

Re: [Rust][DataFusion] profiling TPC-H benchmarks with flamegraphs

2022-03-21 Thread Bob Tinsman
Wow, hotspot looks very cool! However, I was only able to download v1.1 which is very slow at processing the perf.data; it took something like 30 minutes for a perf.data file of about 0.5G. I am resorting to building Qt from source, so I can build hotspot from source...😖 On Mon, Mar 21, 2022 at 2:

[C++][Java][FlightSQL] GetSqlInfo properties

2022-03-21 Thread James Duong
I was examining the properties in GetSqlInfo and have a few questions as far as the proper way to approach some issues. Some bitmask properties have options that are intended to be mutually exclusive. For example the SQL_GROUP_BY property is a bitmask where possible values are: - SQL_GROUP_BY_UNRE

Re: [Rust][DataFusion] profiling TPC-H benchmarks with flamegraphs

2022-03-21 Thread Daniël Heres
Hi Bob, One command I've been using succesfully some time for profiling is as follows (with different flags, csv file works just as well): perf record --call-graph=dwarf ../target/release/tpch benchmark datafusion --path [path] --format parquet --query 6 --iterations 1 -n 16 And using hotspot (ht

Re: [Rust][DataFusion] profiling TPC-H benchmarks with flamegraphs

2022-03-21 Thread Bob Tinsman
Andrew, thanks for your feedback! I started looking at IOx and pprof, and I'm slowly getting a better picture of DataFusion performance work. In particular, I can see that IOx is driving some of this (in particular [1]). I'm still in sponge mode, but I can think of a few useful things to do around

Re: [VOTE] Extend Arrow Flight SQL with GetXdbcTypeInfo, SQL type info in schemas

2022-03-21 Thread Andrew Lamb
BTW thank you all for your work in this matter (making JDBC/ODBC clients)! I think it is super valuable for the overall ecosystem. I am sorry for missing the conversation, but I am not clear on what we are voting on. Can we please clarify what changes are proposed to FlightSQL? The PRs appear to

Re: [Docs] sphinx-tabs and build errors

2022-03-21 Thread Ian Joiner
Hi Joris, Thanks for replying! No but dataset isn’t the only thing missing here. It is also complaining that pyarrow.fs is not a package. Of course it is actually a module. Moreover the build process errors out and no docs appear at all so these can’t simply be ignored. As for sphinx-build I can

Re: [R] how to rbind or concat two Arrow Tables?

2022-03-21 Thread Will Jones
I've created a Jira issue to track rbind implementation in R: https://issues.apache.org/jira/browse/ARROW-15989 On Mon, Mar 21, 2022 at 12:15 PM Will Jones wrote: > Hi Andrew, > > I don't think we've implemented rbind yet, unfortunately. We've just > implemented concat_arrays (also bound to c())

Re: [R] how to rbind or concat two Arrow Tables?

2022-03-21 Thread Will Jones
Hi Andrew, I don't think we've implemented rbind yet, unfortunately. We've just implemented concat_arrays (also bound to c()) [1], and that will be available in the next release (or nightlies right now). The one way you could "rbind" multiple feather files, if they have the same schema, is by con

Re: [Docs] sphinx-tabs and build errors

2022-03-21 Thread Joris Van den Bossche
On Sun, 20 Mar 2022 at 07:50, Ian Joiner wrote: > Hi, > > I’d like to ask about how the documentation is built. I have followed the > instructions to build and install the C++ and Python libraries in my > virtual environment and then followed the instructions for building the > documentation. How

[R] how to rbind or concat two Arrow Tables?

2022-03-21 Thread Andrew Piskorski
Hi, I am using the latest R arrow package from CRAN, 7.0.0. https://cran.r-project.org/web/packages/arrow/ What is the right way to concatenate rows from two Arrow Tables together into one Table? AKA, rbind() in base R. Can I do this as a zero-copy view in memory, or will I need to write the

Re: [VOTE] Extend Arrow Flight SQL with GetXdbcTypeInfo, SQL type info in schemas

2022-03-21 Thread Antoine Pitrou
Moral +1 from me. I've posted minor comments on the specs changes in the PRs. Le 16/03/2022 à 20:50, David Li a écrit : Hello, Jose Almeida and James Duong have proposed two additions to Arrow Flight SQL, an experimental protocol for interacting with SQL databases over Arrow Flight. The

Re: [VOTE] Extend Arrow Flight SQL with GetXdbcTypeInfo, SQL type info in schemas

2022-03-21 Thread Iury Salino
+1 (non-binding) On Mon, Mar 21, 2022 at 2:54 PM Vinicius Fraga wrote: > +1 (non-binding) > > On Mon, 21 Mar 2022 at 14:53, Rafael Telles wrote: > > > +1 (non-binding) > > > > Em seg., 21 de mar. de 2022 às 09:23, David Li > > escreveu: > > > > > My vote: +1 > > > > > > Are any PMC members ava

Re: [VOTE] Extend Arrow Flight SQL with GetXdbcTypeInfo, SQL type info in schemas

2022-03-21 Thread Gabriel Escobar (CW)
+1 (non-binding) On Mon, Mar 21, 2022 at 2:54 PM Vinicius Fraga wrote: > +1 (non-binding) > > On Mon, 21 Mar 2022 at 14:53, Rafael Telles wrote: > > > +1 (non-binding) > > > > Em seg., 21 de mar. de 2022 às 09:23, David Li > > escreveu: > > > > > My vote: +1 > > > > > > Are any PMC members ava

Re: [VOTE] Extend Arrow Flight SQL with GetXdbcTypeInfo, SQL type info in schemas

2022-03-21 Thread Vinicius Fraga
+1 (non-binding) On Mon, 21 Mar 2022 at 14:53, Rafael Telles wrote: > +1 (non-binding) > > Em seg., 21 de mar. de 2022 às 09:23, David Li > escreveu: > > > My vote: +1 > > > > Are any PMC members available to take a look? (Also note, anyone can > still > > cast non-binding votes.) > > > > -Davi

Re: [VOTE] Extend Arrow Flight SQL with GetXdbcTypeInfo, SQL type info in schemas

2022-03-21 Thread Rafael Telles
+1 (non-binding) Em seg., 21 de mar. de 2022 às 09:23, David Li escreveu: > My vote: +1 > > Are any PMC members available to take a look? (Also note, anyone can still > cast non-binding votes.) > > -David > > On Wed, Mar 16, 2022, at 15:50, David Li wrote: > > Hello, > > > > Jose Almeida and Jam

Re: Add JSON data to Arrow-Testing

2022-03-21 Thread Andrew Lamb
I would personally suggest putting the JSON in the apache-datafusion repository first, and then if it would help other implementations we can move it to the shared `arrow-testing` repository. Andrew On Wed, Mar 16, 2022 at 11:18 AM Matthew Turner < matthew.m.tur...@outlook.com> wrote: > Hello, >

Re: [Rust][DataFusion] profiling TPC-H benchmarks with flamegraphs

2022-03-21 Thread Andrew Lamb
Thank you for writing up your findings If you use the `--mem-table` / `-m` command, the CSV file is read once and then the query is executed subsequently As for better ways of profiling rust, we have had good luck using `pprof` [1] in InfluxDB IOx (which also uses DataFusion), so I have mostly ne

Re: [VOTE][RUST] Release Apache Arrow Rust 11.0.0 RC1

2022-03-21 Thread Andrew Lamb
The vote passes with +8 (3 binding) The release is available here: https://dist.apache.org/repos/dist/release/arrow/arrow-rs-11.0.0 It has also been uploaded to crates.io: https://crates.io/crates/arrow/11.0.0 https://crates.io/crates/arrow-flight/11.0.0 https://crates.io/crates/parquet/11.0.0

Re: [VOTE][RUST] Release Apache Arrow Rust 11.0.0 RC1

2022-03-21 Thread Yang hao
+1 (non-binding) Verified on macOS 12.2 Apple M1 Pro Thanks, Remzi Yang From: Jörn Horstmann Date: Monday, March 21, 2022 at 17:41 To: dev@arrow.apache.org Subject: Re: [VOTE][RUST] Release Apache Arrow Rust 11.0.0 RC1 +1 (non-binding) Thanks, Jörn On Sat, Mar 19, 2022 at 5:29 AM Yijie Shen

Re: [VOTE] Extend Arrow Flight SQL with GetXdbcTypeInfo, SQL type info in schemas

2022-03-21 Thread David Li
My vote: +1 Are any PMC members available to take a look? (Also note, anyone can still cast non-binding votes.) -David On Wed, Mar 16, 2022, at 15:50, David Li wrote: > Hello, > > Jose Almeida and James Duong have proposed two additions to Arrow > Flight SQL, an experimental protocol for inter

Re: [VOTE][RUST] Release Apache Arrow Rust 11.0.0 RC1

2022-03-21 Thread Jörn Horstmann
+1 (non-binding) Thanks, Jörn On Sat, Mar 19, 2022 at 5:29 AM Yijie Shen wrote: > +1 non-binding > > Thanks, > Yijie > > On Sat, Mar 19, 2022 at 10:23 AM LM wrote: > > > +1 (non-binding) > > > > Verified on macOS 12.3 on M1Max > > > > Thanks, > > Lin > > > > On Fri, Mar 18, 2022 at 5:52 PM QP