There is no record "345" here it seems, right? it's not that it exists and
has null fields; it's invalid w.r.t. the schema that the rest suggests.

On Wed, Feb 23, 2022 at 11:57 AM Sid <flinkbyhe...@gmail.com> wrote:

> Hello experts,
>
> I have a JSON data like below:
>
> [
>   {
>     "123": {
>       "Party1": {
>         "FIRSTNAMEBEN": "ABC",
>         "ALIASBEN": "",
>         "RELATIONSHIPTYPE": "ABC, FGHIJK LMN",
>         "DATEOFBIRTH": "7/Oct/1969"
>       },
>       "Party2": {
>         "FIRSTNAMEBEN": "ABCC",
>         "ALIASBEN": "",
>         "RELATIONSHIPTYPE": "ABC, FGHIJK LMN",
>         "DATEOFBIRTH": "7/Oct/1969"
>       }
>     },
>     "GeneratedTime": "2022-01-30 03:09:26"
>   },
>   {
>     "456": {
>       "Party1": {
>         "FIRSTNAMEBEN": "ABCD",
>         "ALIASBEN": "",
>         "RELATIONSHIPTYPE": "ABC, FGHIJK LMN",
>         "DATEOFBIRTH": "7/Oct/1969"
>       },
>       "Party2": {
>         "FIRSTNAMEBEN": "ABCDD",
>         "ALIASBEN": "",
>         "RELATIONSHIPTYPE": "ABC, FGHIJK LMN",
>         "DATEOFBIRTH": "7/Oct/1969"
>       },
>       "Party3": {
>         "FIRSTNAMEBEN": "ABCDDE",
>         "ALIASBEN": "",
>         "RELATIONSHIPTYPE": "ABC, FGHIJK LMN",
>         "DATEOFBIRTH": "7/Oct/1969"
>       }
>     },
>     "GeneratedTime": "2022-01-30 03:09:26"
>   },
>   {
>     "345": {
>
>
>     },
>     "GeneratedTime": "2022-01-30 03:09:26"
>   }
> ]
>
> However, when I try to display this JSON using below code, it doesn't show
> the blank records. In my case I don't get any records for 345 since it is
> null but I want to display it in the final flattened dataset.
>
> val df = spark.read.option("multiline",
> true).json("/home/siddhesh/Documents/nested_json.json")
>
> Spark version:3.1.1
>
> Thanks,
> Sid
>

Reply via email to