Easiest way is to broadcast it. On 13 May 2015 10:40, "Charles Hayden" <[email protected]> wrote:
> In pySpark, I am writing a map with a lambda that calls random.shuffle. > For testing, I want to be able to give it a seed, so that successive runs > will produce the same shuffle. > I am looking for a way to set this same random seed once on each worker. > Is there any simple way to do it? > > >
