Weichen Xu created SPARK-18003:
----------------------------------
Summary: RDD zipWithIndex generates wrong results when one partition
contains more than 2147483647 records.
Key: SPARK-18003
URL: https://issues.apache.org/jira/browse/SPARK-18003
Project: Spark
Issue Type: Bug
Reporter: Weichen Xu
RDD zipWithIndex generates wrong results when one partition contains more than
Int.MaxValue records.
When an RDD contains a partition with more than 2147483647 (Int.MaxValue) records,
the generated index overflows and wraps around to negative values.
For example, if partition-0 has more than 2147483647 records, the indices become:
0, 1, ..., 2147483647, -2147483648, -2147483647, -2147483646, ...
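
The wraparound in the sequence above matches ordinary 32-bit integer overflow. The following is a minimal standalone sketch, not Spark's actual code: the object IndexOverflowSketch and its methods nextIntIndex and zipWithLongIndex are hypothetical names introduced only for illustration. It shows how an Int counter wraps past Int.MaxValue, and how a Long counter would keep counting, assuming the per-partition index is tracked as a 64-bit value.

```scala
// Illustration only; hypothetical helper names, not part of the Spark API.
object IndexOverflowSketch {

  // A 32-bit counter silently wraps to Int.MinValue after Int.MaxValue.
  def nextIntIndex(i: Int): Int = i + 1

  // Pair each element with a 64-bit index starting at startIndex,
  // so the count can pass Int.MaxValue without wrapping.
  def zipWithLongIndex[T](iter: Iterator[T], startIndex: Long): Iterator[(T, Long)] = {
    var index = startIndex - 1L
    iter.map { x =>
      index += 1L
      (x, index)
    }
  }

  def main(args: Array[String]): Unit = {
    // Int counter: the index after 2147483647 is negative.
    println(nextIntIndex(Int.MaxValue)) // -2147483648

    // Long counter: indices continue past Int.MaxValue.
    val start = Int.MaxValue.toLong - 1L
    zipWithLongIndex(Iterator("a", "b", "c"), start).foreach(println)
    // (a,2147483646)
    // (b,2147483647)
    // (c,2147483648)
  }
}
```

The sketch starts the Long counter just below Int.MaxValue so the boundary can be observed without materializing billions of records.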