[ 
https://issues.apache.org/jira/browse/FLINK-21929?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Yun Tang resolved FLINK-21929.
------------------------------
    Resolution: Fixed

> flink-statebackend-rocksdb crashes with Error occurred in starting fork
> -----------------------------------------------------------------------
>
>                 Key: FLINK-21929
>                 URL: https://issues.apache.org/jira/browse/FLINK-21929
>             Project: Flink
>          Issue Type: Bug
>          Components: Runtime / State Backends
>    Affects Versions: 1.13.0
>            Reporter: Robert Metzger
>            Assignee: Yun Tang
>            Priority: Blocker
>              Labels: pull-request-available, test-stability
>             Fix For: 1.13.0
>
>         Attachments: image-2021-03-23-13-18-41-836.png
>
>
> https://dev.azure.com/rmetzger/Flink/_build/results?buildId=9001&view=results
> {code}
> 2021-03-23T09:11:12.1861967Z [INFO] BUILD FAILURE
> 2021-03-23T09:11:12.1863007Z [INFO] 
> ------------------------------------------------------------------------
> 2021-03-23T09:11:12.1863492Z [INFO] Total time: 42:35 min
> 2021-03-23T09:11:12.1864171Z [INFO] Finished at: 2021-03-23T09:11:12+00:00
> 2021-03-23T09:11:12.8003245Z [INFO] Final Memory: 137M/806M
> 2021-03-23T09:11:12.8006310Z [INFO] 
> ------------------------------------------------------------------------
> 2021-03-23T09:11:12.8082409Z [ERROR] Failed to execute goal 
> org.apache.maven.plugins:maven-surefire-plugin:2.22.1:test (default-test) on 
> project flink-statebackend-rocksdb_2.11: There are test failures.
> 2021-03-23T09:11:12.8086652Z [ERROR] 
> 2021-03-23T09:11:12.8092462Z [ERROR] Please refer to 
> /__w/1/s/flink-state-backends/flink-statebackend-rocksdb/target/surefire-reports
>  for the individual test results.
> 2021-03-23T09:11:12.8096948Z [ERROR] Please refer to dump files (if any 
> exist) [date].dump, [date]-jvmRun[N].dump and [date].dumpstream.
> 2021-03-23T09:11:12.8101388Z [ERROR] ExecutionException Error occurred in 
> starting fork, check output in log
> 2021-03-23T09:11:12.8105868Z [ERROR] 
> org.apache.maven.surefire.booter.SurefireBooterForkException: 
> ExecutionException Error occurred in starting fork, check output in log
> 2021-03-23T09:11:12.8110518Z [ERROR] at 
> org.apache.maven.plugin.surefire.booterclient.ForkStarter.awaitResultsDone(ForkStarter.java:510)
> 2021-03-23T09:11:12.8115518Z [ERROR] at 
> org.apache.maven.plugin.surefire.booterclient.ForkStarter.runSuitesForkOnceMultiple(ForkStarter.java:382)
> 2021-03-23T09:11:12.8120811Z [ERROR] at 
> org.apache.maven.plugin.surefire.booterclient.ForkStarter.run(ForkStarter.java:297)
> 2021-03-23T09:11:12.8126356Z [ERROR] at 
> org.apache.maven.plugin.surefire.booterclient.ForkStarter.run(ForkStarter.java:246)
> 2021-03-23T09:11:12.8127129Z [ERROR] at 
> org.apache.maven.plugin.surefire.AbstractSurefireMojo.executeProvider(AbstractSurefireMojo.java:1183)
> 2021-03-23T09:11:12.8131291Z [ERROR] at 
> org.apache.maven.plugin.surefire.AbstractSurefireMojo.executeAfterPreconditionsChecked(AbstractSurefireMojo.java:1011)
> 2021-03-23T09:11:12.8132369Z [ERROR] at 
> org.apache.maven.plugin.surefire.AbstractSurefireMojo.execute(AbstractSurefireMojo.java:857)
> 2021-03-23T09:11:12.8133397Z [ERROR] at 
> org.apache.maven.plugin.DefaultBuildPluginManager.executeMojo(DefaultBuildPluginManager.java:132)
> 2021-03-23T09:11:12.8134116Z [ERROR] at 
> org.apache.maven.lifecycle.internal.MojoExecutor.execute(MojoExecutor.java:208)
> 2021-03-23T09:11:12.8134793Z [ERROR] at 
> org.apache.maven.lifecycle.internal.MojoExecutor.execute(MojoExecutor.java:153)
> 2021-03-23T09:11:12.8135621Z [ERROR] at 
> org.apache.maven.lifecycle.internal.MojoExecutor.execute(MojoExecutor.java:145)
> 2021-03-23T09:11:12.8136323Z [ERROR] at 
> org.apache.maven.lifecycle.internal.LifecycleModuleBuilder.buildProject(LifecycleModuleBuilder.java:116)
> 2021-03-23T09:11:12.8141570Z [ERROR] at 
> org.apache.maven.lifecycle.internal.LifecycleModuleBuilder.buildProject(LifecycleModuleBuilder.java:80)
> 2021-03-23T09:11:12.8142374Z [ERROR] at 
> org.apache.maven.lifecycle.internal.builder.singlethreaded.SingleThreadedBuilder.build(SingleThreadedBuilder.java:51)
> 2021-03-23T09:11:12.8145665Z [ERROR] at 
> org.apache.maven.lifecycle.internal.LifecycleStarter.execute(LifecycleStarter.java:120)
> 2021-03-23T09:11:12.8146407Z [ERROR] at 
> org.apache.maven.DefaultMaven.doExecute(DefaultMaven.java:355)
> 2021-03-23T09:11:12.8148835Z [ERROR] at 
> org.apache.maven.DefaultMaven.execute(DefaultMaven.java:155)
> 2021-03-23T09:11:12.8151299Z [ERROR] at 
> org.apache.maven.cli.MavenCli.execute(MavenCli.java:584)
> 2021-03-23T09:11:12.8152244Z [ERROR] at 
> org.apache.maven.cli.MavenCli.doMain(MavenCli.java:216)
> 2021-03-23T09:11:12.8152806Z [ERROR] at 
> org.apache.maven.cli.MavenCli.main(MavenCli.java:160)
> 2021-03-23T09:11:12.8155818Z [ERROR] at 
> sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
> 2021-03-23T09:11:12.8159757Z [ERROR] at 
> sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)
> 2021-03-23T09:11:12.8177288Z [ERROR] at 
> sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
> 2021-03-23T09:11:12.8178021Z [ERROR] at 
> java.lang.reflect.Method.invoke(Method.java:498)
> 2021-03-23T09:11:12.8179802Z [ERROR] at 
> org.codehaus.plexus.classworlds.launcher.Launcher.launchEnhanced(Launcher.java:289)
> 2021-03-23T09:11:12.8183929Z [ERROR] at 
> org.codehaus.plexus.classworlds.launcher.Launcher.launch(Launcher.java:229)
> 2021-03-23T09:11:12.8187563Z [ERROR] at 
> org.codehaus.plexus.classworlds.launcher.Launcher.mainWithExitCode(Launcher.java:415)
> 2021-03-23T09:11:12.8192413Z [ERROR] at 
> org.codehaus.plexus.classworlds.launcher.Launcher.main(Launcher.java:356)
> 2021-03-23T09:11:12.8196538Z [ERROR] Caused by: 
> org.apache.maven.surefire.booter.SurefireBooterForkException: Error occurred 
> in starting fork, check output in log
> 2021-03-23T09:11:12.8201660Z [ERROR] at 
> org.apache.maven.plugin.surefire.booterclient.ForkStarter.fork(ForkStarter.java:622)
> 2021-03-23T09:11:12.8203999Z [ERROR] at 
> org.apache.maven.plugin.surefire.booterclient.ForkStarter.access$600(ForkStarter.java:115)
> 2021-03-23T09:11:12.8204879Z [ERROR] at 
> org.apache.maven.plugin.surefire.booterclient.ForkStarter$1.call(ForkStarter.java:371)
> 2021-03-23T09:11:12.8205665Z [ERROR] at 
> org.apache.maven.plugin.surefire.booterclient.ForkStarter$1.call(ForkStarter.java:347)
> 2021-03-23T09:11:12.8206513Z [ERROR] at 
> java.util.concurrent.FutureTask.run(FutureTask.java:266)
> 2021-03-23T09:11:12.8207169Z [ERROR] at 
> java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
> 2021-03-23T09:11:12.8209376Z [ERROR] at 
> java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
> 2021-03-23T09:11:12.8209955Z [ERROR] at java.lang.Thread.run(Thread.java:748)
> 2021-03-23T09:11:12.8211107Z [ERROR] -> [Help 1]
> 2021-03-23T09:11:12.8211559Z [ERROR] 
> 2021-03-23T09:11:12.8309016Z [ERROR] To see the full stack trace of the 
> errors, re-run Maven with the -e switch.
> 2021-03-23T09:11:12.8310211Z [ERROR] Re-run Maven using the -X switch to 
> enable full debug logging.
> 2021-03-23T09:11:12.8311401Z [ERROR] 
> 2021-03-23T09:11:12.8311976Z [ERROR] For more information about the errors 
> and possible solutions, please read the following articles:
> 2021-03-23T09:11:12.8312635Z [ERROR] [Help 1] 
> http://cwiki.apache.org/confluence/display/MAVEN/MojoExecutionException
> 2021-03-23T09:11:12.8313300Z [ERROR] 
> 2021-03-23T09:11:12.8314248Z [ERROR] After correcting the problems, you can 
> resume the build with the command
> 2021-03-23T09:11:12.8315462Z [ERROR]   mvn <goals> -rf 
> :flink-statebackend-rocksdb_2.11
> 2021-03-23T09:11:12.8687280Z Process exited with EXIT CODE: 1.
> 2021-03-23T09:11:12.8687830Z Trying to KILL watchdog (359).
> 2021-03-23T09:11:12.8696302Z /__w/1/s/tools/ci/watchdog.sh: line 100:   359 
> Terminated              watchdog
> 2021-03-23T09:11:16.4147696Z Searching for .dump, .dumpstream and related 
> files in '/__w/1/s'
> 2021-03-23T09:11:21.6569652Z Moving 
> '/__w/1/s/flink-runtime/target/surefire-reports/2021-03-23T08-28-44_805-jvmRun2.dump'
>  to target directory ('/__w/_temp/debug_files/')
> 2021-03-23T09:11:21.6645588Z Moving 
> '/__w/1/s/flink-runtime/target/surefire-reports/2021-03-23T08-28-44_805-jvmRun1.dump'
>  to target directory ('/__w/_temp/debug_files/')
> 2021-03-23T09:11:21.6683479Z Moving 
> '/__w/1/s/flink-runtime/target/surefire-reports/2021-03-23T08-28-44_805.dumpstream'
>  to target directory ('/__w/_temp/debug_files/')
> 2021-03-23T09:11:21.6726888Z Moving 
> '/__w/1/s/flink-core/target/surefire-reports/2021-03-23T08-28-44_805.dumpstream'
>  to target directory ('/__w/_temp/debug_files/')
> 2021-03-23T09:11:21.6772031Z Moving 
> '/__w/1/s/flink-metrics/flink-metrics-core/target/surefire-reports/2021-03-23T08-28-44_805.dumpstream'
>  to target directory ('/__w/_temp/debug_files/')
> 2021-03-23T09:11:21.6807831Z Moving 
> '/__w/1/s/flink-java/target/surefire-reports/2021-03-23T08-28-44_805.dumpstream'
>  to target directory ('/__w/_temp/debug_files/')
> 2021-03-23T09:11:21.6847619Z Moving 
> '/__w/1/s/flink-runtime-web/target/surefire-reports/2021-03-23T08-28-44_805.dumpstream'
>  to target directory ('/__w/_temp/debug_files/')
> 2021-03-23T09:11:21.6882147Z Moving 
> '/__w/1/s/flink-state-backends/flink-statebackend-rocksdb/core.18126' to 
> target directory ('/__w/_temp/debug_files/')
> 2021-03-23T09:11:22.1251457Z Moving 
> '/__w/1/s/flink-state-backends/flink-statebackend-rocksdb/target/surefire-reports/2021-03-23T08-28-44_805.dumpstream'
>  to target directory ('/__w/_temp/debug_files/')
> 2021-03-23T09:11:22.1287286Z Moving 
> '/__w/1/s/flink-test-utils-parent/flink-test-utils/target/surefire-reports/2021-03-23T08-28-44_805.dumpstream'
>  to target directory ('/__w/_temp/debug_files/')
> 2021-03-23T09:11:22.1322850Z Moving 
> '/__w/1/s/flink-clients/target/surefire-reports/2021-03-23T08-28-44_805.dumpstream'
>  to target directory ('/__w/_temp/debug_files/')
> 2021-03-23T09:11:22.1360757Z Moving 
> '/__w/1/s/flink-streaming-java/java_pid5009.hprof' to target directory 
> ('/__w/_temp/debug_files/')
> 2021-03-23T09:11:22.1421701Z Moving 
> '/__w/1/s/flink-streaming-java/target/surefire-reports/2021-03-23T08-28-44_805-jvmRun2.dump'
>  to target directory ('/__w/_temp/debug_files/')
> 2021-03-23T09:11:22.1460325Z Moving 
> '/__w/1/s/flink-streaming-java/target/surefire-reports/2021-03-23T08-28-44_805-jvmRun1.dumpstream'
>  to target directory ('/__w/_temp/debug_files/')
> 2021-03-23T09:11:22.1492941Z Moving 
> '/__w/1/s/flink-streaming-java/target/surefire-reports/2021-03-23T08-28-44_805-jvmRun1.dump'
>  to target directory ('/__w/_temp/debug_files/')
> 2021-03-23T09:11:22.1523043Z Moving 
> '/__w/1/s/flink-streaming-java/target/surefire-reports/2021-03-23T08-28-44_805.dumpstream'
>  to target directory ('/__w/_temp/debug_files/')
> 2021-03-23T09:11:22.1554119Z Moving 
> '/__w/1/s/flink-optimizer/target/surefire-reports/2021-03-23T08-28-44_805.dumpstream'
>  to target directory ('/__w/_temp/debug_files/')
> 2021-03-23T09:11:22.1589618Z Moving 
> '/__w/1/s/flink-scala/target/surefire-reports/2021-03-23T08-28-44_805.dumpstream'
>  to target directory ('/__w/_temp/debug_files/')
> 2021-03-23T09:11:22.1638143Z Compressing debug files
> {code}
> I'm not sure if this includes one or two issues.
> One issue seems to be in the flink-streaming-java tests:
> {code}
> # Created at 2021-03-23T09:03:49.666
> Corrupted STDOUT by directly writing to native stream in forked JVM 1. Stream 
> 'java.lang.OutOfMemoryError: Java heap space'.
> java.lang.IllegalArgumentException: Stream stdin corrupted. Expected comma 
> after third character in command 'java.lang.OutOfMemoryError: Java heap 
> space'.
>       at 
> org.apache.maven.plugin.surefire.booterclient.output.ForkClient$OperationalData.<init>(ForkClient.java:507)
>       at 
> org.apache.maven.plugin.surefire.booterclient.output.ForkClient.processLine(ForkClient.java:210)
>       at 
> org.apache.maven.plugin.surefire.booterclient.output.ForkClient.consumeLine(ForkClient.java:177)
>       at 
> org.apache.maven.plugin.surefire.booterclient.output.ThreadedStreamConsumer$Pumper.run(ThreadedStreamConsumer.java:88)
>       at java.lang.Thread.run(Thread.java:748)
> # Created at 2021-03-23T09:03:49.667
> Corrupted STDOUT by directly writing to native stream in forked JVM 1. Stream 
> 'Dumping heap to java_pid5009.hprof ...'.
> java.lang.IllegalArgumentException: Stream stdin corrupted. Expected comma 
> after third character in command 'Dumping heap to java_pid5009.hprof ...'.
>       at 
> org.apache.maven.plugin.surefire.booterclient.output.ForkClient$OperationalData.<init>(ForkClient.java:507)
>       at 
> org.apache.maven.plugin.surefire.booterclient.output.ForkClient.processLine(ForkClient.java:210)
>       at 
> org.apache.maven.plugin.surefire.booterclient.output.ForkClient.consumeLine(ForkClient.java:177)
>       at 
> org.apache.maven.plugin.surefire.booterclient.output.ThreadedStreamConsumer$Pumper.run(ThreadedStreamConsumer.java:88)
>       at java.lang.Thread.run(Thread.java:748)
> # Created at 2021-03-23T09:03:49.693
> Corrupted STDOUT by directly writing to native stream in forked JVM 1. Stream 
> 'Heap dump file created [2834354 bytes in 0.027 secs]'.
> java.lang.IllegalArgumentException: Stream stdin corrupted. Expected comma 
> after third character in command 'Heap dump file created [2834354 bytes in 
> 0.027 secs]'.
>       at 
> org.apache.maven.plugin.surefire.booterclient.output.ForkClient$OperationalData.<init>(ForkClient.java:507)
>       at 
> org.apache.maven.plugin.surefire.booterclient.output.ForkClient.processLine(ForkClient.java:210)
>       at 
> org.apache.maven.plugin.surefire.booterclient.output.ForkClient.consumeLine(ForkClient.java:177)
>       at 
> org.apache.maven.plugin.surefire.booterclient.output.ThreadedStreamConsumer$Pumper.run(ThreadedStreamConsumer.java:88)
>       at java.lang.Thread.run(Thread.java:748)
> {code}
> There's a coredump included:
>  !image-2021-03-23-13-18-41-836.png! 
> Looks like the testInitialSizeComputation test is causing the failure (or one of the failures).
> The second issue, which is reported as a test failure, is:
> {code}
> # Created at 2021-03-23T09:09:39.376
> Picked up JAVA_TOOL_OPTIONS: -XX:+HeapDumpOnOutOfMemoryError
> # Created at 2021-03-23T09:09:39.535
> Picked up JAVA_TOOL_OPTIONS: -XX:+HeapDumpOnOutOfMemoryError
> # Created at 2021-03-23T09:11:10.037
> pure virtual method called
> # Created at 2021-03-23T09:11:10.037
> terminate called without an active exception
> # Created at 2021-03-23T09:11:12.170
> Aborted (core dumped)
> {code}
> From the coredump, I see the following:
> {code}
> (gdb) where
> #0  0x00007f9343508438 in __GI_raise (sig=sig@entry=6) at 
> ../sysdeps/unix/sysv/linux/raise.c:54
> #1  0x00007f934350a03a in __GI_abort () at abort.c:89
> #2  0x00007f9341b8e84d in __gnu_cxx::__verbose_terminate_handler() () from 
> /usr/lib/x86_64-linux-gnu/libstdc++.so.6
> #3  0x00007f9341b8c6b6 in ?? () from /usr/lib/x86_64-linux-gnu/libstdc++.so.6
> #4  0x00007f9341b8c701 in std::terminate() () from 
> /usr/lib/x86_64-linux-gnu/libstdc++.so.6
> #5  0x00007f9341b8d23f in __cxa_pure_virtual () from 
> /usr/lib/x86_64-linux-gnu/libstdc++.so.6
> #6  0x00007f92ed39c0d5 in rocksdb::DBImpl::CloseHelper() () from 
> /tmp/librocksdbjni8334465068904677424.so
> #7  0x00007f92ed3a717b in rocksdb::DBImpl::~DBImpl() () from 
> /tmp/librocksdbjni8334465068904677424.so
> #8  0x00007f92ed3a7451 in rocksdb::DBImpl::~DBImpl() () from 
> /tmp/librocksdbjni8334465068904677424.so
> #9  0x00007f932d801aa8 in ?? ()
> #10 0x0000000085677fe8 in ?? ()
> #11 0x00007f931c8da910 in ?? ()
> #12 0x00007f931c8da968 in ?? ()
> #13 0x00007f932d007ffd in ?? ()
> #14 0x0000000000000000 in ?? ()
> {code}



--
This message was sent by Atlassian Jira
(v8.3.4#803005)
