Hi Russell,

I've just got the OOM error during Test 13. I'm running it from IntelliJ on
Windows with Java 11.

[image: image.png]
I'll look into it over the course of the next week.

Regards,
Ángel

On Sat, 11 Jan 2025 at 9:23, Russell Jurney (<russell.jur...@gmail.com>)
wrote:

> Friends of GraphFrames (github.com/graphframes/graphframes), I have a
> question for you...
>
> I can't get the unit test 'two components and two dangling vertices' in
> the org.graphframes.lib.ConnectedComponentsSuite
> <https://github.com/graphframes/graphframes/blob/649094caf58cfda0eea3e8cd66785aa38104d771/src/test/scala/org/graphframes/lib/ConnectedComponentsSuite.scala#L138-L148>
> to pass. It fails with an 'OutOfMemoryError: Java heap space' error. I am a
> little stuck on completing a docs release with a motif finding tutorial
> <https://github.com/graphframes/graphframes/pull/473> due to this issue.
>
> The problem is outlined in this gist:
> https://gist.github.com/rjurney/6abeffbd59c67df5e5243c8f6619b6bf
>
> Can someone else please try this and see if it passes on the master branch?
>
> > build/sbt clean compile package test
>
> I've tried giving it lots of RAM just to see if it would help, as much as
> 32g for the driver and 16g for executors, and it has no effect. The test
> graph is 8 nodes and 6 edges
> <https://gist.github.com/rjurney/6abeffbd59c67df5e5243c8f6619b6bf#file-connectedcomponentsuite-scala-L22-L26>,
> so it shouldn't have a memory problem. Yet when it runs, all 24 cores of
> my CPU get used and it spikes, as indicated in the image in the gist.
>
> I am running the following setup:
>
> * Ubuntu 20.04 (22.04 in the Docker image)
> * OpenJDK 11 (I also tried 8, same problem)
> * Scala 2.12.20 (I also tried 2.13, same problem)
> * Python 3.11 (I also tried 3.9, same problem)
>
> Alternatively, I run the Dockerfile in the gist
> <https://gist.github.com/rjurney/6abeffbd59c67df5e5243c8f6619b6bf#file-dockerfile>.
>
> Any help is much appreciated! Thanks.
>
> -----------------------------------------------------------------
> Oh, some new community stuff for GraphFrames. Hackathon announced next
> week :)
>
>
>    - GraphFrames Mailing List <https://groups.google.com/g/graphframes/>:
>    ask questions about GraphFrames on our Google Group
>    - #graphframes Discord Channel on GraphGeeks
>    <https://discord.com/channels/1162999022819225631/1326257052368113674>
>
> Thanks!
> Russell Jurney @rjurney <http://twitter.com/rjurney>
> russell.jur...@gmail.com LI <http://linkedin.com/in/russelljurney> FB
> <http://facebook.com/jurney> datasyndrome.com
>
