from:"Emil Ejbyfeldt"

Any committers interested in reviewing SPARK-52023

2025-06-28 Thread Emil Ejbyfeldt

Create this MR https://github.com/apache/spark/pull/50827 that fixes a segfault/data corruption issue when using a udaf returning an option. Anyone committer willing to have a look? Best, Emil --

Re: [Reminder] Spark 3.5 RC Cut

2023-08-01 Thread Emil Ejbyfeldt

pache Hadoop 3.3.5 and 3.3.6. HADOOP-18757 seems to be merged just two weeks ago and there is no Apache Hadoop release with it, isn't it? Could you check your local branch once more, please? Dongjoon. On Tue, Aug 1, 2023 at 9:46 PM Emil Ejbyfeldt <mailto:eejbyfe...@liveintent.c

Re: [Reminder] Spark 3.5 RC Cut

2023-08-01 Thread Emil Ejbyfeldt

mean another JIRA? Dongjoon. On Tue, Aug 1, 2023 at 2:59 AM Emil Ejbyfeldt wrote: Hi, We previously ran some experiments on builds from the 3.5 branch and noticed that Hadoop had a regression (https://issues.apache.org/jira/browse/HADOOP-18568 <https://issues.apache.

Re: [Reminder] Spark 3.5 RC Cut

2023-08-01 Thread Emil Ejbyfeldt

Hi, We previously ran some experiments on builds from the 3.5 branch and noticed that Hadoop had a regression (https://issues.apache.org/jira/browse/HADOOP-18568) in their s3a committer affecting 3.3.5 and 3.3.6 (Spark 3.4 uses hadoop 3.3.4). This fix has been merged into Hadoop and will be p

Re: Spark 3.4.0 with Hadoop2.7 cannot be downloaded

2023-04-20 Thread Emil Ejbyfeldt

Hi, I think this is expected as it was dropped from the release process in https://issues.apache.org/jira/browse/SPARK-40651 Also I don't see a Hadoop2.7 option when selecting Spark 3.4.0 on https://spark.apache.org/downloads.html Not really sure why you could be seeing that. Best, Emil O

Re: [VOTE] Release Apache Spark 3.4.0 (RC7)

2023-04-12 Thread Emil Ejbyfeldt

+1 (non-binding) Ran some tests with the Scala 2.13 build using part of our internal spark workload. On 12/04/2023 19:52, Chris Nauroth wrote: +1 (non-binding) * Verified all checksums. * Verified all signatures. * Built from source, with multiple profiles, to full success: * build/mvn

Re: [VOTE] Release Apache Spark 3.4.0 (RC3)

2023-03-09 Thread Emil Ejbyfeldt

It might being caused by the v3.4.0-rc3 tag not being part of the 3.4 branch branch-3.4: $ git log --pretty='format:%d %h' --graph origin/branch-3.4 v3.4.0-rc3 | head -n 10 * (HEAD, origin/branch-3.4) e38e619946 * f3e69a1fe2 * 74cf1a32b0 * 0191a5bde0 * afced91348 | * (tag: v3.4.0-rc3) b

Re: Missing data in spark output

2022-10-18 Thread Emil Ejbyfeldt

Hi, We have observed similar behavior in older versions of spark. But we were are currently using 3.3.0 where we have not seen such issues. Which version of Spark and Hadoop are you using? On 18/10/2022 19:48, Sandeep Vinayak wrote: Hello Everyone, We are recently observing an intermittent

Re: [VOTE] Release Spark 3.3.0 (RC2)

2022-05-19 Thread Emil Ejbyfeldt

Hi, When testing out Spark 3.3.0 on our production spark workload it was noticed that https://issues.apache.org/jira/browse/SPARK-38681 is actually a regression from 3.2 (I did not know this a the time of creating the ticket) seem like the bug was introduced in https://github.com/apache/spark

MetadataFetchFailedException due to decommission block migrations

2022-02-02 Thread Emil Ejbyfeldt

As noted in SPARK-34939 there is race when using broadcast for map output status. Explanation from SPARK-34939 > After map statuses are broadcasted and the executors obtain serialized broadcasted map statuses. If any fetch failure happens after, Spark scheduler invalidates cached map statuses

Re: [SPARK-20384][SQL] Support value class in schema of Dataset (third time's a charm)

2021-08-02 Thread Emil Ejbyfeldt

ed in this feature to leave a review/comments on that PR. Also it would be great if some admin would be able to review it. Or any tips on steps that should be taken in order to have the PR reviewed would be appreciated. / Emil On 25/05/2021 16:33, Emil Ejbyfeldt wrote: Hi dev, I am inter

[SPARK-20384][SQL] Support value class in schema of Dataset (third time's a charm)

2021-05-25 Thread Emil Ejbyfeldt

Hi dev, I am interested getting the support value classes in schemas of Dataset merged and I am willing to work on it. There are two previous PRs created for this JIRA (SPARK-20384) first https://github.com/apache/spark/pull/22309 and more recently https://github.com/apache/spark/pull/27153

Any committers interested in reviewing SPARK-52023

Re: [Reminder] Spark 3.5 RC Cut

Re: [Reminder] Spark 3.5 RC Cut

Re: [Reminder] Spark 3.5 RC Cut

Re: Spark 3.4.0 with Hadoop2.7 cannot be downloaded

Re: [VOTE] Release Apache Spark 3.4.0 (RC7)

Re: [VOTE] Release Apache Spark 3.4.0 (RC3)

Re: Missing data in spark output

Re: [VOTE] Release Spark 3.3.0 (RC2)

MetadataFetchFailedException due to decommission block migrations

Re: [SPARK-20384][SQL] Support value class in schema of Dataset (third time's a charm)

[SPARK-20384][SQL] Support value class in schema of Dataset (third time's a charm)

12 matches

Site Navigation

Mail list logo

Footer information