On 23/02/16 18:03, Jeff Steinmetz wrote:
Hello zeppelin dev group, Regarding the R Interpreter Pull requests 208 and 702. I am trying to figure out if the functionality between these are overlapping, or one supports something different than the other. Is 702 a super set of 208 (702 is a fork of 208)?
702 is not a fork of 208, just something that was in a public repo since long and finally decided to make a PR to deal with the points expressed on [1]
[1] https://github.com/apache/incubator-zeppelin/pull/208#issuecomment-170337289
Can you pass the reference of a distributed (parallelized) dataframe built in %spark (scala) to the R interpreter? Similar to z.put(“myDF", myDF)?
I expect passing a dataframe via the Zeppelin context will fail, but as the interperters run on the same Spark REPL, the dataframe are accessible in both R and Scala (see links to screenshots)
https://raw.githubusercontent.com/datalayer/datalayer-zeppelin/rscala/_Rimg/r-scala-dataframe-binding.png https://raw.githubusercontent.com/datalayer/datalayer-zeppelin/rscala/_Rimg/scala-r-dataframe-binding.png
Similarly, since R doesn’t support serialization of functions (unless you use something from the SparkR library) is there an example of collecting the parallel DF to a local DF (which I realize it means the dataset needs to fit in local memory on the zeppelin server).
That's something I am thinking to since long, especially to allow visualizing small or subset datasets with R visualizations. I will try to show an example.
I can to dig into this a bit and help out where appropriate, however its unclear which PR to focus my efforts on.
I guess you can checkout both and play with them to better feel what they offer.
Best, Jeff Steinmetz Principal Architect Akili Interactive Labs On 2/23/16, 8:01 AM, "elbamos" <[email protected]> wrote:Github user elbamos commented on the pull request: https://github.com/apache/incubator-zeppelin/pull/702#issuecomment-187764059 @btiernay support for that has been in 208 all along... > On Feb 23, 2016, at 9:27 AM, Bob Tiernay <[email protected]> wrote: > > @echarles This is great! Thanks for all your hard work. Very much appreciated! > > ╉ > Reply to this email directly or view it on GitHub. > --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at [email protected] or file a JIRA ticket with INFRA. ---
