Are there any plans to generalize the type of VertexId in GraphX?
Our keys are particularly long. We could use the hashCode() trick, but the
chance of collisions is not acceptable. Given our data volume, we have
encountered hashCode() collisions more than once.
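
To illustrate the problem, here is a minimal sketch in plain Java (no Spark needed). The class and method names are made up for this example. It shows two distinct keys colliding under the 32-bit hashCode() trick, and one possible mitigation that I am only assuming might be useful: taking the first 8 bytes of a SHA-256 digest as a 64-bit id. That makes collisions far less likely, but it does not eliminate them, so it is not a substitute for a generalized VertexId.

```java
import java.nio.ByteBuffer;
import java.nio.charset.StandardCharsets;
import java.security.MessageDigest;
import java.security.NoSuchAlgorithmException;

public class VertexIdSketch {
    // The "hashCode() trick": widen the 32-bit String.hashCode() to a long.
    // Only 2^32 distinct values are possible, so collisions are expected
    // at large data volumes.
    static long idFromHashCode(String key) {
        return (long) key.hashCode();
    }

    // Hypothetical alternative (an assumption, not a GraphX API): use the
    // first 8 bytes of a SHA-256 digest as a 64-bit id. Collisions become
    // much less likely but remain possible.
    static long idFromSha256(String key) {
        try {
            MessageDigest md = MessageDigest.getInstance("SHA-256");
            byte[] digest = md.digest(key.getBytes(StandardCharsets.UTF_8));
            return ByteBuffer.wrap(digest).getLong();
        } catch (NoSuchAlgorithmException e) {
            // SHA-256 is required to be available on every Java platform.
            throw new IllegalStateException(e);
        }
    }

    public static void main(String[] args) {
        // "Aa" and "BB" are different keys with the same String.hashCode().
        System.out.println("hashCode ids equal: "
                + (idFromHashCode("Aa") == idFromHashCode("BB")));
        System.out.println("SHA-256 ids equal:  "
                + (idFromSha256("Aa") == idFromSha256("BB")));
    }
}
```

With hashCode(), both keys map to the same vertex id, so their vertices would be silently merged in the graph.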

I found this JIRA issue, but it is specific to UUIDs:

https://issues.apache.org/jira/browse/SPARK-1153

Looking through the code, generalizing the type seems doable, but I'm not
sure what the consequences would be.
Is the restriction to a 64-bit Long purely a performance decision?

Support for an arbitrary type would be ideal, but arbitrarily long byte
arrays are a reasonable compromise, if that helps.



-----
--
Madhu
https://www.linkedin.com/in/msiddalingaiah