Thank you, Bjørn. I have updated my PR [ https://github.com/apache/spark/pull/47522 ] to follow the PR template.
+ What changes were proposed in this pull request?
+ Why are the changes needed?
+ Does this PR introduce any user-facing change?
+ How was this patch tested?
+ Was this patch authored or co-authored using generative AI tooling?

On Sun, 25 Aug 2024 at 15:47, Bjørn Jørgensen <bjornjorgen...@gmail.com> wrote:

> Apache Spark has a template for PRs:
> https://github.com/apache/spark/blob/master/.github/PULL_REQUEST_TEMPLATE
>
> On Sun, 25 Aug 2024 at 13:41, Mich Talebzadeh <mich.talebza...@gmail.com> wrote:
>
>> Unfortunately, it is not that straightforward:
>>
>> 1. Committer Votes: The PR needs a sufficient number of "+1" votes
>>    from *committers*.
>> 2. Review Process: Address feedback from the community and committers
>>    to ensure the PR meets the necessary standards.
>> 3. Approval: Once approved by committers, the PR can be merged into
>>    the main codebase.
>>
>> HTH
>>
>> On Sun, 25 Aug 2024 at 08:17, Mark Andreev <mark.andr...@gmail.com> wrote:
>>
>>> Thank you for your review.
>>>
>>> Could you explain how to merge this commit into the upstream? I don't
>>> want this PR to be abandoned.
>>>
>>> Best regards,
>>> Mark Andreev
>>>
>>> On Wed, 21 Aug 2024 at 23:08, Mich Talebzadeh <mich.talebza...@gmail.com> wrote:
>>>
>>>> Hi Mark,
>>>>
>>>> You have already done that and have made the request for review.
>>>>
>>>> +1 for me
>>>>
>>>> Mich Talebzadeh,
>>>>
>>>> Architect | Data Engineer | Data Science | Financial Crime
>>>> PhD <https://en.wikipedia.org/wiki/Doctor_of_Philosophy> Imperial
>>>> College London <https://en.wikipedia.org/wiki/Imperial_College_London>
>>>> London, United Kingdom
>>>>
>>>> view my Linkedin profile
>>>> <https://www.linkedin.com/in/mich-talebzadeh-ph-d-5205b2/>
>>>>
>>>> https://en.everybodywiki.com/Mich_Talebzadeh
>>>>
>>>> *Disclaimer:* The information provided is correct to the best of my
>>>> knowledge but of course cannot be guaranteed.
>>>> It is essential to note that, as with any advice, "one test result is
>>>> worth one-thousand expert opinions" (Wernher von Braun
>>>> <https://en.wikipedia.org/wiki/Wernher_von_Braun>).
>>>>
>>>> On Wed, 21 Aug 2024 at 22:20, Mark Andreev <mark.andr...@gmail.com> wrote:
>>>>
>>>>> Thank you, Mich.
>>>>>
>>>>> What is the correct procedure to request a review?
>>>>>
>>>>> On Tue, 20 Aug 2024 at 22:57, Mich Talebzadeh <mich.talebza...@gmail.com> wrote:
>>>>>
>>>>>> Hi Mark,
>>>>>>
>>>>>> I added a comment to the Jira ticket to make the Description clearer:
>>>>>>
>>>>>> When encountering mixed schema rows, the current error message
>>>>>> "{actual} is not a valid external type for schema of {expected}" lacks
>>>>>> sufficient detail to identify the problematic column. This ambiguity
>>>>>> hinders troubleshooting and increases development time.
>>>>>>
>>>>>> To enhance error clarity, we propose incorporating the source column
>>>>>> name into the error message. For example: "Column 'my_column' has an
>>>>>> actual type of {actual} which is not a valid external type for the
>>>>>> expected schema of {expected}."
>>>>>>
>>>>>> By providing this additional context, developers can more efficiently
>>>>>> pinpoint and resolve schema mismatches.
>>>>>>
>>>>>> HTH
>>>>>>
>>>>>> Mich Talebzadeh
>>>>>>
>>>>>> On Tue, 20 Aug 2024 at 21:59, Mark Andreev <mark.andr...@gmail.com> wrote:
>>>>>>
>>>>>>> Hi,
>>>>>>>
>>>>>>> Could you review my small PR [SPARK-49044][SQL] ValidateExternalType
>>>>>>> should return a child in error (
>>>>>>> https://github.com/apache/spark/pull/47522 )? The changes include
>>>>>>> tests that verify the results.
>>>>>>>
>>>>>>> TL;DR: After the fix, the error message will contain extra information:
>>>>>>> [B is not a valid external type for schema of string at
>>>>>>> getexternalrowfield(assertnotnull(input[0, org.apache.spark.sql.Row,
>>>>>>> true]), 1, f3)
>>>>>>>
>>>>>>> If you need more information, please let me know. If you're busy,
>>>>>>> please let me know the best time to reach you again.
>>>>>>>
>>>>>>> On Mon, 29 Jul 2024 at 18:15, Mark Andreev <mark.andr...@gmail.com> wrote:
>>>>>>>
>>>>>>>> Hi Spark Devs,
>>>>>>>>
>>>>>>>> Please review my PR [ https://github.com/apache/spark/pull/47522 ]
>>>>>>>> that relates to ticket [
>>>>>>>> https://issues.apache.org/jira/browse/SPARK-49044 ].
>>>>>>>>
>>>>>>>> Context: When we have mixed schema rows, the error message
>>>>>>>> "{actual} is not a valid external type for schema of {expected}"
>>>>>>>> doesn't help identify the column with the problem. I suggest adding
>>>>>>>> information about the source column.
>>>>>>>>
>>>>>>>> Example:
>>>>>>>> https://github.com/mrk-andreev/example-spark-schema/blob/main/spark_4.0.0/src/test/scala/ErrorMsgSuite.scala
>>>>>>>>
>>>>>>>> Before fix: [B is not a valid external type for schema of string
>>>>>>>> After fix: [B is not a valid external type for schema of string at
>>>>>>>> getexternalrowfield(assertnotnull(input[0, org.apache.spark.sql.Row,
>>>>>>>> true]), 1, f3)
>>>>>>>>
>>>>>>>> --
>>>>>>>> Best regards,
>>>>>>>> Mark Andreev
>>>>>>>
>>>>>>> --
>>>>>>> Best regards,
>>>>>>> Mark Andreev
>>>>>
>>>>> --
>>>>> Best regards,
>>>>> Mark Andreev
>
> --
> Bjørn Jørgensen
> Vestre Aspehaug 4, 6010 Ålesund
> Norway
>
> +47 480 94 297

--
Best regards,
Mark Andreev