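One way to do this:

1. Get the underlying RDD from your data frame (df.rdd).
2. Call rdd.zipWithIndex to get a new RDD of (row, index) pairs.
3. Convert that new RDD back into a data frame, with the index as an extra column.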
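
The steps above could be sketched roughly as follows in Scala. This is only a sketch: withIndex and its colName parameter are names made up for illustration, and it assumes the DataFrame API from Spark 1.3 or later. On Spark 2.0+, df.sparkSession.createDataFrame should work the same way as df.sqlContext.createDataFrame.

```scala
import org.apache.spark.sql.{DataFrame, Row}
import org.apache.spark.sql.types.{LongType, StructField, StructType}

// Hypothetical helper (not part of Spark): appends a 0-based index column.
def withIndex(df: DataFrame, colName: String = "index"): DataFrame = {
  // Steps 1 and 2: take the underlying RDD[Row] and pair each row with its position.
  val indexedRows = df.rdd.zipWithIndex().map { case (row, idx) =>
    // Append the Long index to the row's existing values.
    Row.fromSeq(row.toSeq :+ idx)
  }

  // Step 3: extend the original schema with a Long column and rebuild the data frame.
  val schema = StructType(df.schema.fields :+ StructField(colName, LongType, nullable = false))
  df.sqlContext.createDataFrame(indexedRows, schema)
}

// Usage, assuming df is the data frame with the flag and price columns:
// val indexedDf = withIndex(df)
// indexedDf.show()
```

Note that zipWithIndex numbers rows by partition order and then by position within each partition, so the index only matches a particular row order if the data frame was sorted beforehand. It also runs a Spark job when the RDD has more than one partition.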

On Fri, May 29, 2015 at 5:43 AM, Cesar Flores <ces...@gmail.com> wrote:

>
> Assuming that I have the following data frame:
>
> flag | price
> ----------------------
> 1    |47.808764653746
> 1    |47.808764653746
> 1    |31.9869279512204
> 1    |47.7907893713564
> 1    |16.7599200038239
> 1    |16.7599200038239
> 1    |20.3916014172137
>
> How can I create a data frame with an extra index column, like the following one:
>
> flag | price          | index
> ----------------------|-------
> 1    |47.808764653746 | 0
> 1    |47.808764653746 | 1
> 1    |31.9869279512204| 2
> 1    |47.7907893713564| 3
> 1    |16.7599200038239| 4
> 1    |16.7599200038239| 5
> 1    |20.3916014172137| 6
>
> --
> Cesar Flores
>
