Re: Support for selected vector based processing during ORC Read

2023-04-03 Thread Pavan Lanka
Hi, A gentle ping that this still needs a look, in terms of background it will be great to have someone with background of Vectorized processing to take a look. Please do let me know in case I can ease this process in any other way. Thanks, Pavan > On Mar 27, 2023, at 7:23 AM, Pavan La

Support for selected vector based processing during ORC Read

2023-03-27 Thread Pavan Lanka
Hi, I have raised an issue regarding support for the selected vector during ORC Read. https://github.com/apache/iceberg/issues/7191 I also have a PR for the proposed change. Which currently only includes Spark 3.3 related changes, once we have an agreement on this I can port this further to o

Re: FilleIO Support for Apache ORC

2022-12-15 Thread Pavan Lanka
Thanks everyone for the feedback and comments on the PR. I addressed most of the comments and responded to the remaining ones. Another look would be much appreciated. Thanks, Pavan > On Dec 6, 2022, at 1:55 PM, Pavan Lanka wrote: > > Hi, > > We have an existing issue <

FilleIO Support for Apache ORC

2022-12-06 Thread Pavan Lanka
Hi, We have an existing issue regarding the lack of FileIO support with Apache ORC. I have raised a PR that tries to address this gap. https://github.com/apache/iceberg/pull/6293 Added FileIO Support for ORC Reader and Writers by pavibhai · Pull Req

Re: Configuration of HiveMetastoreClient using Catalog Properties

2022-11-14 Thread Pavan Lanka
Sorry about the delay. I have raised the following PR to revert the change.  Regards, Pavan > On Nov 4, 2022, at 9:33 AM, Ryan Blue wrote: > > No problem! Thanks for contributing and helping to keep the code clean. > > On Tue, Nov 1, 2022 at 4:58 PM Pavan Lanka wr

Re: Configuration of HiveMetastoreClient using Catalog Properties

2022-11-01 Thread Pavan Lanka
; overall I think it looks good. > > On Thu, Oct 20, 2022 at 8:21 AM Pavan Lanka wrote: >> Hi, >> >> I have raised an issue regarding the means for configuring the >> HiveMetastoreClient to allow for different properties while connection to >> different

Configuration of HiveMetastoreClient using Catalog Properties

2022-10-20 Thread Pavan Lanka
Hi, I have raised an issue regarding the means for configuring the HiveMetastoreClient to allow for different properties while connection to different metastore services. https://github.com/apache/iceberg/issues/5988 I also have a PR for the sam

ORC write optimization regarding stripe offsets

2022-09-26 Thread Pavan Lanka
Hi, I have raised an issue regarding an improvement in the ORC Write process with Iceberg that helps avoid reading files written for determining stripe offsets. https://github.com/apache/iceberg/issues/5777 I also have a PR submitted for the same

Re: Shaded and UnShaded?

2021-05-26 Thread Pavan Lanka
tion which > includes a shaded version of Iceberg at runtime. In that case you should > reach out to the providers of that runtime > and ask them if they can figure out another solution for you. > >> On May 26, 2021, at 11:48 AM, Pavan Lanka > <mailto:pla...@apple.com.INVALI

Shaded and UnShaded?

2021-05-26 Thread Pavan Lanka
Hi, Currently I see that Iceberg shades a bunch of libraries and does not offer(please correct me in case I am mistaken here) any releases that does not shade these libraries. For some integrations (Avro, ORC, Parquet) I was won

Re: [VOTE] Release Apache Iceberg 0.9.0 RC5

2020-07-13 Thread Pavan Lanka
+1 (non-binding) Environment OSX openjdk 1.8.0_252 Build from source with tests Build time ~7mins Except for some warnings looks good > On Jul 10, 2020, at 9:20 AM, Ryan Murray wrote: > > 1. Verify the signature: OK > 2. Verify the checksum: OK > 3. Untar the archive tarball: OK > 4. Run RAT c