Re: [Spark R]: dapply only works for very small datasets

2017-11-29 Thread Kunft, Andreas
Thanks a lot. I will have a look at the issues. From: Felix Cheung Sent: Wednesday, 29 November 2017 04:47 To: Kunft, Andreas; user@spark.apache.org Subject: Re: [Spark R]: dapply only works for very small datasets You can find more discussions in https

Re: [Spark R]: dapply only works for very small datasets

2017-11-28 Thread Kunft, Andreas
time, but it seems there must be something else off considering these numbers. From: Felix Cheung Sent: Monday, 27 November 2017 20:20 To: Kunft, Andreas; user@spark.apache.org Subject: Re: [Spark R]: dapply only works for very small datasets What's

[Spark R]: dapply only works for very small datasets

2017-11-27 Thread Kunft, Andreas
Hello, I tried to execute some user-defined functions with R using the airline arrival performance dataset. While the examples from the documentation for the `<-` apply operator work perfectly fine on a dataset of ~9GB, the `dapply` operator fails to finish even after ~4 hours. I'm using a functi