Re: Flight Python EC2 Server for parquet on S3

2024-05-10 Thread Bruno Murino
Hi, It’s my first time here on this mailing list as well. Regarding the EC2 instance size, I wonder if you’re hitting the IOPS limits of the T instance, given the large volumes of data coming out? I could be way off, but that’s where my mind went. Cheers, Bruno Murino > On 10 May 2024

Re: Flight Python EC2 Server for parquet on S3

2024-05-10 Thread Bryce Mecum
Hi Christian, welcome. Your code looks reasonable to me at first glance. It does seem possible you're resource-constrained with that t2.micro instance. You might try using a larger instance or reducing the batch size in your call to iter_batches [1] to some very small number. [1] https://arrow.a

Flight Python EC2 Server for parquet on S3

2024-05-10 Thread Christian Casazza
Hello everyone, This is my first time emailing this mailing list, so I hope I am explaining things correctly below. I am attempting to get started with Arrow Flight. I am storing Parquet files and Iceberg tables on S3. I would like to use Arrow Flight as the interface data consumers use to access