Hi Dimitri,

you can do the following:

1. Create an initial DataFrame from an empty CSV (or from an empty list with an explicit schema).

2. Use union() to append new rows.

Do not forget that Spark cannot replace a DBMS. Spark is mainly used for analytics.

If you need select/insert/delete/update capabilities, perhaps you should look at a DBMS.


An alternative, in case you need "append only" semantics, is to use streaming or Structured Streaming.


regards,

Apostolos




On 30/06/2018 05:46 μμ, dimitris plakas wrote:
I am new to PySpark and want to initialize a new empty DataFrame with sqlContext() with two columns ("Column1", "Column2"), and I want to append rows dynamically in a for loop.
Is there any way to achieve this?

Thank you in advance.

--
Apostolos N. Papadopoulos, Associate Professor
Department of Informatics
Aristotle University of Thessaloniki
Thessaloniki, GREECE
tel: ++0030312310991918
email: papad...@csd.auth.gr
twitter: @papadopoulos_ap
web: http://delab.csd.auth.gr/~apostol


---------------------------------------------------------------------
To unsubscribe e-mail: user-unsubscr...@spark.apache.org
