CVS with schema inference is a full read of the data, so that could be one of 
the problems. Do it at most once, print out the schema and use it from then on 
during ingress & use something else for persistence

On 6 Aug 2018, at 05:44, makatun 
<[email protected]<mailto:[email protected]>> wrote:

         a. csv and parquet formats (parquet created from the same csv):
.format(<csv/parquet>)
         b. schema-on-read on/off:  .option(inferSchema=<true/false>)

Reply via email to