Underneath the covers, jsonFile uses TextInputFormat, which will split
files correctly based on new lines.  Thus, there is no fixed maximum size
for a json object (other than the fact that it must fit into memory on the
executors).

On Mon, Dec 15, 2014 at 7:22 AM, Madabhattula Rajesh Kumar <
mrajaf...@gmail.com> wrote:
>
> Hi Peter,
>
> Thank you for the clarification.
>
> Now we need to store each JSON object into one line. Is there any
> limitation of length of JSON object? So, JSON object will not go to the
> next line.
>
> What will happen if JSON object is a big/huge one?  Will it store in a
> single line in HDFS?
>
> What will happen, if JSON object contains BLOB/CLOB value? Is this entire
> JSON object stores in single line of HDFS?
>
> What will happen, if JSON object exceeding the HDFS block size. For
> example, single JSON object split into two different worker nodes. In this
> case, How Spark will read this JSON object?
>
> Could you please clarify above questions
>
> Regards,
> Rajesh
>
>
> On Mon, Dec 15, 2014 at 6:52 PM, Peter Vandenabeele <
> pe...@vandenabeele.com> wrote:
>>
>>
>>
>> On Sat, Dec 13, 2014 at 5:43 PM, Helena Edelson <
>> helena.edel...@datastax.com> wrote:
>>
>>> One solution can be found here:
>>> https://spark.apache.org/docs/1.1.0/sql-programming-guide.html#json-datasets
>>>
>>>
>> As far as I understand, the people.json file is not really a proper json
>> file, but a file documented as:
>>
>>   "... JSON files where each line of the files is a JSON object.".
>>
>> This means that is a file with multiple lines, but each line needs to
>> have a fully self-contained JSON object
>> (initially confusing, this will not parse a standard multi-line JSON
>> file). We are working to clarify this in
>> https://github.com/apache/spark/pull/3517
>>
>> HTH,
>>
>> Peter
>>
>>
>>
>>
>>> - Helena
>>> @helenaedelson
>>>
>>> On Dec 13, 2014, at 11:18 AM, Madabhattula Rajesh Kumar <
>>> mrajaf...@gmail.com> wrote:
>>>
>>> Hi Team,
>>>
>>> I have a large JSON file in Hadoop. Could you please let me know
>>>
>>> 1. How to read the JSON file
>>> 2. How to parse the JSON file
>>>
>>> Please share any example program based on Scala
>>>
>>> Regards,
>>> Rajesh
>>>
>>>
>>>
>>
>>
>> --
>> Peter Vandenabeele
>> http://www.allthingsdata.io
>> http://www.linkedin.com/in/petervandenabeele
>> https://twitter.com/peter_v
>> gsm: +32-478-27.40.69
>> e-mail: pe...@vandenabeele.com
>> skype: peter_v_be
>>
>

Reply via email to