Re: [DISCUSS][C++] Help needed to refactor Skyhook

2024-03-14 Thread Jayjeet Chakraborty
yhook. > Please reply to let us know if you are actively using Skyhook or if > you are interested in helping to refactor Skyhook. > > Sincerely, > Ben Kietzman > > [1] > > https://arrow.apache.org/blog/2022/01/31/skyhook-bringing-computation-to-storage-with-apache-arrow/ > > -- *Jayjeet Chakraborty* CS PhD student UC Santa Cruz California, USA

Re: Trying to understand Arrow Flight internals better

2023-12-04 Thread Jayjeet Chakraborty
Thanks David > On Dec 4, 2023, at 12:25 PM, David Li wrote: > > Hi Jayjeet, > > Start from > https://github.com/apache/arrow/blob/main/cpp/src/arrow/flight/transport/grpc/customize_grpc.h > > -David > > On Sun, Dec 3, 2023, at 02:04, Jayjeet Chakraborty wrote

Trying to understand Arrow Flight internals better

2023-12-02 Thread Jayjeet Chakraborty
gards, *Jayjeet Chakraborty* Ph.D. Student Department of Computer Science and Engineering University of California, Santa Cruz *Email: jayje...@ucsc.edu *

Re: Transactional semantics in Acero

2022-09-11 Thread Jayjeet Chakraborty
tegration unnecessary). > > As Weston said I think at the Acero/Datasets level I think the right thing > to focus on is abstractions that allow plugging in any storage subsystem. > > On Friday, September 9, 2022, Jayjeet Chakraborty < > jayjeetchakrabort...@gmail.com> wrote:

Re: Transactional semantics in Acero

2022-09-09 Thread Jayjeet Chakraborty
ngine. That said, it can definitely be used as a > > > component for building a full database engine, which could implement > its > > > own locking of rows while Acero executes on them. You could also check > out > > > DuckDB, which can operate on

Transactional semantics in Acero

2022-09-09 Thread Jayjeet Chakraborty
interface over Acero in C++. Thanks. -- *Jayjeet Chakraborty* CS PhD student UC Santa Cruz California, USA

Re: Using Acero in a distributed environment

2022-08-31 Thread Jayjeet Chakraborty
several other compute engines that support these > > types of execution on top of arrow dataformat (eg: Cylon which I'm > working > > on ATM) > > > > [1] https://dl.acm.org/doi/abs/10.1145/2588555.2610507 > > > > On Wed, Aug 24, 2022 at 10:00 AM Jayjeet Chakr

Using Acero in a distributed environment

2022-08-24 Thread Jayjeet Chakraborty
if not already planning for it ? Thanks. Jayjeet Chakraborty -- *Jayjeet Chakraborty* CS PhD student UC Santa Cruz California, USA

Re: Discuss a very fast way to serialize a large in-memory Arrow IPC table to a void* buffer for sending over the network

2021-07-08 Thread Jayjeet Chakraborty
am only > > getting ~3 times improvement, I am wondering what am I doing wrong on my > > end. I am sharing my benchmark code here [1]. It will be great if someone > > could take a > look at it (mainly the Serialize function). Looking forward > > to hearing back from

Re: Arrow Dataset API on Ceph

2021-06-22 Thread Jayjeet Chakraborty
Do you have test cases > for the benchmarking? Thanks. > > Yibo > > On 6/8/21 4:18 PM, Jayjeet Chakraborty wrote: > > Hi Yibo, > > > > Thanks a lot for your interest in our work. Please refer to this [1] > guide to deploy a complete environment on a cluster of nodes. Regardi

Discuss a very fast way to serialize a large in-memory Arrow IPC table to a void* buffer for sending over the network

2021-06-10 Thread Jayjeet Chakraborty
Hello Arrow Community, I am a student working on a project where I need to serialize an in-memory Arrow Table of size around 700MB to a uint8_t* buffer. I am currently using the arrow::ipc::RecordBatchStreamWriter API to serialize the table to a arrow::Buffer, but it is taking nearly 1000ms to

Re: Arrow Dataset API on Ceph

2021-06-08 Thread Jayjeet Chakraborty
row PR, it looks great. IIUC, there should be a > corresponding Ceph patch implementing the object class with Arrow. > > I wonder the best approach to deploy a complete environment for a quick > evaluation. Any comment is welcomed. Thanks. > > Yibo > > On 6/2/21 3

Re: Arrow Dataset API on Ceph

2021-06-01 Thread Jayjeet Chakraborty
row soon and hence look forward to your comments and thoughts on this new feature. Please let us know if you have any questions. Thank you. Best regards, Jayjeet Chakraborty On 2020/09/15 18:06:56, Micah Kornfield wrote: > gmock is already a dependency. We haven't upgraded gmock/gtest