Re: [DISCUSS] Apache Iceberg Summit 2025 - Selection Committee

2024-11-28 Thread Gaurav Agarwal
I also want to volunteer if anything I can do from India Thanks On Thu, Nov 28, 2024, 11:11 PM Matt Topol wrote: > I'm also happy to volunteer in any way I can JB! > > On Thu, Nov 28, 2024, 10:35 AM Amogh Jahagirdar <2am...@gmail.com> wrote: > >> Thanks JB, happy to help in any way I can! >> >>

Re: Invitation to contribute to OneTable

2023-12-04 Thread Gaurav Agarwal
HI Thanks for this mail , I would like to know is there any group discussion also happened or any call to discuss the issues. thanks On Tue, Dec 5, 2023 at 9:29 AM Walaa Eldin Moustafa wrote: > Thanks Jesus for sharing OneTable. Looks like it touches upon some of the > topics we discussed in t

Re: [PROPOSAL] Preparing first Apache Iceberg Summit

2023-08-04 Thread Gaurav Agarwal
I am from India can I contribute in some way for the summit ? On Fri, Aug 4, 2023, 8:59 PM Jun H. wrote: > I would like to be involved and share our incremental processing work > using iceberg at Netflix. > > Best, > Jun > > > On Wed, Aug 2, 2023 at 7:00 AM Jean-Baptiste Onofré > wrote: > >> Hi

iceberg queries

2023-06-15 Thread Gaurav Agarwal
Hi Team, Sample Merge query: df.createOrReplaceTempView("source") MERGE INTO iceberg_hive_cat.iceberg_poc_db.iceberg_tab target USING (SELECT * FROM source) ON target.col1 = source.col1// this is my bucket column WHEN MATCHED THEN UPDATE SET * WHEN NOT MATCHED THEN INSERT * The source dataset

Merge into

2023-06-15 Thread Gaurav Agarwal
Hi Is there any way i can update specific columns in the merge into statement of iceberg . The table has 2000 columns i am trying to use the merge into strategy with spark where we have only bucket level partitions in table it's failing there with oome of executor so someone suggested columns ar

Spark iceberg query

2023-05-31 Thread Gaurav Agarwal
Hi We are loading the data in spark dataset We don't do any repartitioning on the loaded dataset If we want to load the in the iceberg partitioned table . If we do dataset.sortwithinpartition() before doing append or merge or overwrite. Will that be the best practice shall I go ahead? Or if I g

Re: Iceberg transaction support with spark sql

2023-05-26 Thread Gaurav Agarwal
unately, Spark SQL does not have an API for transactions. However, > you may use Iceberg WAP to stage multiple changes and commit them as a > single Iceberg transaction. > > > > - Anton > > > >> On May 25, 2023, at 12:49 PM, Gaurav Agarwal > wrote: > >&g

Iceberg transaction support with spark sql

2023-05-25 Thread Gaurav Agarwal
Hi We want to delete and insert rows in iceberg in one transaction and we are using spark SQL to execute delet e queries is there a way we can use it or implement in our application. I see somewhere dynamo db manager in your code will that help Thanks

Re: rewrite action for collate how can we pass date range?

2023-05-24 Thread Gaurav Agarwal
teDataFilesProcedure.java#L133-L135 > . > > - Wing Yew > > > On Tue, May 23, 2023 at 10:13 PM Gaurav Agarwal > wrote: > >> >> On Wed, May 24, 2023, 10:41 AM Gaurav Agarwal >> wrote: >> >>> I have one more query we are trying to compact fil

Re: rewrite action for collate how can we pass date range?

2023-05-23 Thread Gaurav Agarwal
On Wed, May 24, 2023, 10:41 AM Gaurav Agarwal wrote: > I have one more query we are trying to compact files currently it is > taking time as have never compacted till now this is the first time we are > trying to perform compaction after 5 months of continuously loading data > W

Orphan files

2023-05-23 Thread Gaurav Agarwal
Hello We have orphan files in the table does it impact the read performance of the data from the table if we are passing the partition column in the read query? Would like to know what is the impact of orphan files ? Thanks

Copyonwrite scan

2023-05-23 Thread Gaurav Agarwal
Hi We are getting " runtime file filtering exception the table has been concurrently modified row level operation scan snapshot id " This exception we got while trying to delete the data from the table and has copyonwrite setting for delete operation . I check the code this is mentioned as in spa

Re: Slack invitation

2023-05-10 Thread Gaurav Agarwal
g we just have to add a new link, so that's what i'm asking. Do we > need to update the link, or does it work? > > On May 10, 2023, at 11:16 AM, Gaurav Agarwal > wrote: > > there also it says if you dont apache email id then get the invitation > Don’t have an *@apach

Re: Slack invitation

2023-05-10 Thread Gaurav Agarwal
there also it says if you dont apache email id then get the invitation Don’t have an *@apache.org * email address? Contact the workspace administrator at *apache-iceberg* for an invitation. On Wed, May 10, 2023 at 9:13 PM Russell Spitzer wrote: > Does this link no longer work?

Need help in running test case of iceberg locall git hub 1.2.x

2023-05-10 Thread Gaurav Agarwal
HI All I am trying to set up the workspace and running the test case to understand the flow of delete functionality with different mode copyonwrite and mergeonread. the test case is failing locally. TestStoragePartitionedJoinsInRowLevelOperations.java in spark 3.3_2.12 I tried setting up hadoop