Hi Teng,
Thanks for the answer. I've switched to pandas during proof of concept
process in order to be able to plot graphs easily.
Actually, pandas DataFrame object itself has `plot` methods, so these
objects can plot themselves on most cases easily (it uses matplotlib
inside).
I wonder if spark
e... then this sounds like a feature requirement for matplotlib, you
need to make matplotlib's APIs support RDD or spark DataFrame object,
i checked the API of mplot3d
(http://matplotlib.org/mpl_toolkits/mplot3d/tutorial.html#mpl_toolkits.mplot3d.Axes3D.scatter),
it only supports "array-like" input
Thanks for help, but the example that you referenced gets the values from
RDD as list and plots that list.
What I am specifically asking was that is there a convenient way to plot a
DataFrame object directly?(like pandas DataFrame objects)
On Wed, Mar 23, 2016 at 11:47 AM Teng Qiu wrote:
> not
not sure about 3d plot, but there is a nice example:
https://github.com/zalando/spark-appliance/blob/master/examples/notebooks/PySpark_sklearn_matplotlib.ipynb
for plotting rdd or dataframe using matplotlib.
Am Mittwoch, 23. März 2016 schrieb Yavuz Nuzumlalı :
> Hi all,
> I'm trying to plot the r
Hi all,
I'm trying to plot the result of a simple PCA operation, but couldn't find
a clear documentation about plotting data frames.
Here is the output of my data frame:
++
|pca_features