Spark doesn't offer a native graph database the way Neo4j does; GraphX
is built on top of RDDs, which are a tabular data structure. Spark
doesn't have a GQL or Cypher query engine either. Instead, GraphX
exposes an API modeled on Google's Pregel for graph processing. I don't
see any prospect of Spark implementing any kind of GQL engine in the
near future.
Implementing a query engine would be a huge task and very difficult to
optimize in general, especially when dealing with millions if not
billions of vertices and edges. Since any graph query would ultimately
translate to API calls anyway, your only option is to use the graph
APIs in Spark directly (currently available only in Scala). They aren't
hard to learn or use at all; they just require a different way of
thinking about the problem.
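
To give a rough feel for that "different way of thinking", here is a
minimal sketch of the Pregel-style, vertex-centric model in plain
Python. It is not the GraphX API and does not use Spark; the function
name pregel_sssp, its parameters, and the sample graph are all made up
for illustration. It computes single-source shortest paths by
repeating supersteps: each vertex merges its incoming messages, updates
its own value, and then sends new messages along its outgoing edges.

```python
import math


def pregel_sssp(vertices, edges, source, max_iterations=20):
    """Single-source shortest paths as a Pregel-style superstep loop.

    vertices: iterable of vertex ids
    edges:    list of (src, dst, weight) tuples (directed)
    source:   id of the start vertex
    """
    # Vertex state: best known distance from the source.
    dist = {v: math.inf for v in vertices}
    # Initial message: only the source is told "your distance is 0".
    inbox = {source: 0.0}

    for _ in range(max_iterations):
        if not inbox:
            break  # no messages in flight, so the computation has converged

        # Vertex program: keep the smaller of the current value and the message.
        changed = set()
        for v, msg in inbox.items():
            if msg < dist[v]:
                dist[v] = msg
                changed.add(v)

        # Send phase: vertices that changed propose distances to neighbours.
        outbox = {}
        for src, dst, weight in edges:
            if src in changed:
                candidate = dist[src] + weight
                if candidate < dist[dst]:
                    # Merge step: several messages to one vertex reduce to the min.
                    outbox[dst] = min(candidate, outbox.get(dst, math.inf))
        inbox = outbox

    return dist


if __name__ == "__main__":
    sample_vertices = ["a", "b", "c", "d"]
    sample_edges = [("a", "b", 1.0), ("b", "c", 2.0), ("a", "c", 5.0)]
    print(pregel_sssp(sample_vertices, sample_edges, "a"))
```

The point is that you never write a declarative query such as "find
the shortest path from a to c"; you describe what a single vertex does
with the messages it receives, and the result emerges after a few
rounds. Vertex "d" has no incoming edges here, so it stays at infinity.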
On 10/16/22 12:36 AM, ERSyrfw212oe wrote:
I think I've seen GraphX mentioned here and there. Is it a
re-implementation of openCypher, or is it a graph DB for Spark?
I wanted to create a graph DB and query it with the Cypher language. I
looked around the docs and didn't see any relevant guide, and SO seems
to tackle only specific problems. I currently don't even know where to
start, or whether GQL/Cypher queries are actually supported at all.