Should Spark 4.x use Java modules (those you define with module-info.java sources)?

2023-12-03 Thread Marc Le Bihan
ing pattern of the Spark jars for the 4.x could be a good idea,     but beyond that, what about attempting to integrate Spark into modules, it's submodules defining module-info.java?     Is it something that you think that [must | should | might | should not | must not] be done? Regards, Marc Le Bihan

Re: [VOTE] Release Spark 3.4.2 (RC1)

2023-11-25 Thread Marc Le Bihan
PARK-45311 Thanks ! Marc Le Bihan On 25/11/2023 11:48, Dongjoon Hyun wrote: Please vote on releasing the following candidate as Apache Spark version 3.4.2. The vote is open until November 30th 1AM (PST) and passes if a majority +1 PMC votes are cast, with a minimum of 3 +1 votes. [ ] +1 Re

De-serialization by Java encoder : Spark 3.4.x doesn't support anymore fields having an accessor but no setter? (Encoder fails on many "NoSuchElementException: None.get" since 3.4.x [SPARK-45311])

2023-11-12 Thread Marc Le Bihan
t ? Where ? an error log like : "/no setter associated to the accessor {} for field {} in class {}/" would be useful for the developer. With maybe mentioned into it, the workaround I suggest. Regards, Marc Le Bihan [Encoder fails on many "NoSuchElementException: None.get&qu

Re: My I report a special comparaison of executions leading on issues on Spark JIRA ?

2020-10-02 Thread Marc Le Bihan
Yes. As I explained at the beginning of the message. For com/fasterxml/jackson/module/scala/ScalaObjectMapper missing I will check myself with spark-core and spark-sql become unable to load this dependency But I see nothing in Spark Migration Guide 2.4.6 to 3.0 explaining the apparition of this m

Re: My I report a special comparaison of executions leading on issues on Spark JIRA ?

2020-10-02 Thread Marc Le Bihan
Few tests (that are working on 2.4.6 and 2.4.7) are failling in 3.0.1 Some with this message : *java.lang.ClassNotFoundException: com/fasterxml/jackson/module/scala/ScalaObjectMapper* Coming from : at org.apache.spark.sql.catalyst.util.RebaseDateTime.lastSwitchJulianDay(RebaseDateTime.sc

My I report a special comparaison of executions leading on issues on Spark JIRA ?

2020-10-01 Thread Marc Le Bihan
Hello, I currently run a Spark project based on cities, local authorities, enterprises, local communities, etc. Ten Datasets written in Java are doing operations going from simple join to elaborate ones. Language used is Java. 20 integrations tests with the whole data (20 GB) takes seven hour. *A