Re: [Rust][DataFusion] profiling TPC-H benchmarks with flamegraphs

2022-03-21 Thread Bob Tinsman
--format parquet --query 6 --iterations 1 -n 16 > And using hotspot (https://github.com/KDAB/hotspot) to load/visualize the > perf.data file (this takes 1-30s for me depending on the size of the file). > > Best regards, > > Daniël > > On Mon, 21 Mar 2022 at 21:12, Bob T

Re: [Rust][DataFusion] profiling TPC-H benchmarks with flamegraphs

2022-03-21 Thread Bob Tinsman
he tpch benchmark program directly > > Making the profiling process easier / documenting it would definitely be > useful in my opinion > > Andrew > > > [1] https://crates.io/crates/pprof > > On Fri, Mar 18, 2022 at 6:10 PM Bob Tinsman wrote: > > > I've

Re: [DISCUSS][Rust] Performance Measurements (was Biweekly sync call for arrow/datafusion again?)

2022-03-14 Thread Bob Tinsman
he/arrow-datafusion/tree/master/benchmarks > [2] https://github.com/apache/arrow-datafusion/pull/1791 > > [3] https://github.com/apache/arrow-datafusion/pull/1928 On Fri, Mar 11, 2022 at 4:56 PM Bob Tinsman wrote: > > > I just missed the call, but I watched the recording (thank

Re: [DISCUSS][Rust] Biweekly sync call for arrow/datafusion again?

2022-03-11 Thread Bob Tinsman
I just missed the call, but I watched the recording (thank you to Andrew for posting [1]). Really interesting! I'm diving into Arrow because I have some previous experience with in-memory query engines. I'm following discussions around improving performance and adding features so I can determine ho

Re: [C++] Adopting a library for (distributed) tracing

2021-05-01 Thread Bob Tinsman
I agree that OpenTelemetry is the future; I have been following the observability space off and on and I knew about OpenTracing; I just realized that OpenTelemetry is its successor. [1] I have found tracing to be a very powerful approach; at one point, I did a POC of a trace recorder inside a Ja

Re: [JS] Exploring usage of apache arrow at my company for complex table rendering

2021-04-19 Thread Bob Tinsman
I know a lot about the general subject of grouping, although this was in a closed-source framework I created which also uses columnar in-memory data; it was specific to Java but at its heart it has a lot in common with Arrow. This framework was able to render charts, multi-level tables, and cross

Re: [Java] Source control of generated flatbuffers code

2021-04-15 Thread Bob Tinsman
elease phase, there are still a few blocking > > > issues that people are trying to fix (according to the arrow-sync call > > > earlier today). I don't mind jumping in and doing this; it appears that > > > there might be enough time for such a small change to make it

Re: [Java] Source control of generated flatbuffers code

2021-04-14 Thread Bob Tinsman
I apologize for leaving this hanging, but it looks like 4.0 is leaving the station :( Yes, it makes sense to bump it to 1.12, but you can't do that in isolation, because the flatc binary which is fetched as a Maven dependency is only available for 1.9. I will get back onto this and finish it thi

[JIRA] Request contributor role

2021-03-19 Thread Bob Tinsman
I've logged a couple bugs and would like to assign myself. My id is bobtinsman on JIRA; here is one of the bugs I logged: [ARROW-12006] updates to make dev on Java and Windows easier - ASF JIRA. I tr

[JAVA] [n00b] issues encountered during build

2021-03-11 Thread Bob Tinsman
I've been mostly lurking for a while, but I would like to start picking off some bugs in the Java implementation. In the process of slogging through the build, I've bumped into various issues. I'm happy to document them in java/README.md or make any other changes that might be helpful to others. I

Hello to the Arrow dev community

2020-09-22 Thread Bob Tinsman
I'd like to introduce myself, because I've had an interest in Arrow for a long time and now I have a chance to help out. Up until now, I haven't really contributed much in open source, although I've been an avid consumer, so I'd like to change that! My main areas of work have been performance opt