> The #bigdata channel over on Clojurians slack is also suspiciously quiet, > as are many of the Google groups. > > Ray. >
I worked with Sparkling and Flambo about a year ago, while Mr. Macbeth is a fellow Portlander and has a solid API, I found Sparkling to be somewhat more direct and compact. For ETL via Hadoop I wouldn't hesitate to try either of these libraries. I found them to be stable and preferable to using Spark in Scala. However, I used Powderkeg (https://github.com/HCADatalab/powderkeg) a bit and found it the most intriguing. Christophe Grand last updated PowderKeg three hours ago (from time of my posting obviously). Powderkeg relies heavily on Clojure transducers and is the only Clojure Spark library I am aware of which doesn't require AOT compilation -- you can use a Clojure repl to directly spawn jobs on a Spark cluster. If you are interested in Clojure interoperability with Spark, I would look at Powderkeg first. If you require Spark Streaming, you might be better off writing Scala, or considering another streaming solution such as Storm. The closest I have come to getting Spark Streaming to work in Clojure was with Powderkeg. It might be worth seeing if Powderkeg has made progress in this area. -- You received this message because you are subscribed to the Google Groups "Clojure" group. To post to this group, send email to clojure@googlegroups.com Note that posts from new members are moderated - please be patient with your first post. To unsubscribe from this group, send email to clojure+unsubscr...@googlegroups.com For more options, visit this group at http://groups.google.com/group/clojure?hl=en --- You received this message because you are subscribed to the Google Groups "Clojure" group. To unsubscribe from this group and stop receiving emails from it, send an email to clojure+unsubscr...@googlegroups.com. For more options, visit https://groups.google.com/d/optout.