I think this proposal is a very good thing giving Spark a standard way of
getting to and calling UDFs.
I like having the ScalarFunction as the API to call the UDFs. It is simple,
yet covers all of the polymorphic type cases well. I think it would also
simplify using the functions in other contexts
The ORC community is really eager to get this work integrated in to Spark
so that Spark users can have fast access to their ORC data. Let us know if
we can help the integration.
Thanks,
Owen
On Fri, Aug 4, 2017 at 8:05 AM, Dong Joon Hyun
wrote:
> Hi, All.
>
>
>
> Apache Spark always has been
The DataWorks Summit EU 2017 (including Hadoop Summit) is going to be in
Munich April 5-6 2017
. I’ve pasted the text from the CFP below.
Would you like to share your knowledge with the best and brightest in the
data community? If so, we encourage you to submit an abstract for DataWorks
Summit wi
+1 (non-binding)
I think this is an important step to improve Spark as an Apache project.
.. Owen
On Mon, May 23, 2016 at 11:18 AM, Holden Karau wrote:
> +1 non-binding (as a contributor anything which speed things up is worth
> a try, and git blame is a good enough substitute for the list whe