Hi, You just need to extend Partitioner and override the numPartitions and getPartition methods, see below
class MyPartitioner extends partitioner { def numPartitions: Int = // Return the number of partitions def getPartition(key Any): Int = // Return the partition for a given key } On Tue, Sep 1, 2015 at 10:15 AM shahid qadri <shahidashr...@icloud.com> wrote: > Hi Sparkians > > How can we create a customer partition in pyspark > > --------------------------------------------------------------------- > To unsubscribe, e-mail: user-unsubscr...@spark.apache.org > For additional commands, e-mail: user-h...@spark.apache.org > >