Re: Reading data from Iceberg table into Apache Arrow in Java

2021-03-03 Thread Ryan Blue
2021 6:41 AM > *To:* dev@iceberg.apache.org > *Cc:* Ryan Blue > *Subject:* RE: Reading data from Iceberg table into Apache Arrow in Java > > > > Thanks for finding out Peter. > > > > Should we proceed with this pr and later add support for vectorized reads &g

RE: Reading data from Iceberg table into Apache Arrow in Java

2021-03-03 Thread Mayur Srivastava
prs. Thanks, Mayur From: Peter Vary mailto:pv...@cloudera.com.INVALID>> Sent: Tuesday, March 2, 2021 10:38 AM To: Iceberg Dev List mailto:dev@iceberg.apache.org>> Cc: Ryan Blue mailto:rb...@netflix.com>> Subject: Re: Reading data from Iceberg table into Apache Arrow in Java After so

RE: Reading data from Iceberg table into Apache Arrow in Java

2021-03-03 Thread Mayur Srivastava
2, 2021 10:38 AM To: Iceberg Dev List Cc: Ryan Blue Subject: Re: Reading data from Iceberg table into Apache Arrow in Java After some more digging, I think there is no solution yet for vectorized reads with deletes. this.readUsingBatch = batchReadsEnabled && hasNoDel

Re: Reading data from Iceberg table into Apache Arrow in Java

2021-03-02 Thread Peter Vary
; Thanks, > > Mayur > > > > *From:* Peter Vary > *Sent:* Tuesday, March 2, 2021 8:51 AM > *To:* Iceberg Dev List > *Cc:* rb...@netflix.com > *Subject:* Re: Reading data from Iceberg table into Apache Arrow in Java > > > > Hi Mayur, > > > >

RE: Reading data from Iceberg table into Apache Arrow in Java

2021-03-02 Thread Mayur Srivastava
from someone who understands the Spark vectorized read path. But, I’ll read the code to understand the deletes. Thanks, Mayur From: Peter Vary Sent: Tuesday, March 2, 2021 8:51 AM To: Iceberg Dev List Cc: rb...@netflix.com Subject: Re: Reading data from Iceberg table into Apache Arrow in Java

Re: Reading data from Iceberg table into Apache Arrow in Java

2021-03-02 Thread Peter Vary
th these > types that write to parquet? > b. Types not supported: TimeType, ListType, MapType, StructType. What is > the path to add Arrow support for these data types? > > Thanks, > Mayur > > From: Mayur Srivastava > Sent: Friday, February 12, 2021 7:

RE: Reading data from Iceberg table into Apache Arrow in Java

2021-03-01 Thread Mayur Srivastava
s not supported: TimeType, ListType, MapType, StructType. What is the path to add Arrow support for these data types? Thanks, Mayur From: Mayur Srivastava Sent: Friday, February 12, 2021 7:41 PM To: dev@iceberg.apache.org; rb...@netflix.com Subject: RE: Reading data from Iceberg table into Apach

RE: Reading data from Iceberg table into Apache Arrow in Java

2021-02-12 Thread Mayur Srivastava
you if I’ve any question or need any more pointers. Thanks, Mayur From: Ryan Blue Sent: Friday, February 12, 2021 2:26 PM To: Iceberg Dev List Subject: Re: Reading data from Iceberg table into Apache Arrow in Java Hi Mayur, We built the Arrow support with Spark as the first use case, so the

Re: Reading data from Iceberg table into Apache Arrow in Java

2021-02-12 Thread Ryan Blue
Hi Mayur, We built the Arrow support with Spark as the first use case, so the best examples of how to use it are in Spark. The generic reader does two things: it plans a scan and sets up an iterator of file readers to produce generic records. What you want to do is the same thing, but set up the