I have a PR for it - https://github.com/apache/spark/pull/12574
Sadly I've been tied up and haven't had a chance to work further on it.
The main outstanding issues are deciding on the transform semantics and
performance testing.
Any comments / feedback welcome especially on transform semant
Hi Zak,
Indeed, the function is missing from the DataFrame-based API. I can probably
provide a quick prototype to see if we can merge the function into the
next release. I will post updates here; feel free to give it a try.
Regards,
Yuhao
2016-11-01 10:00 GMT-07:00 Zak H:
> Hi,
>
> I'm using the