[ https://issues.apache.org/jira/browse/FLINK-18356?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17481671#comment-17481671 ]
Chesnay Schepler commented on FLINK-18356: ------------------------------------------ ??the maximum memory required is 4G/3.xG for each process. This is also [weird] since we have limit the heap to 2G?? {{-Xmx}} only controls the heap, and according to your table the heap does not exceed that. ??we have class leaks?? This seems pretty much confirmed at this point. If I remember correctly in a heap dump I looked at a while ago that Scala itself can cache class references. ??One possible hotfix is to not reuse the process first.?? I'd be fine with this as a band-aid for the time being because this is impairing CI quite significantly, but we do nevertheless need to figure out what the actual cause is. As is I'd think this leak should also be present in production. > Exit code 137 returned from process > ----------------------------------- > > Key: FLINK-18356 > URL: https://issues.apache.org/jira/browse/FLINK-18356 > Project: Flink > Issue Type: Bug > Components: Build System / Azure Pipelines, Tests > Affects Versions: 1.12.0, 1.13.0, 1.14.0, 1.15.0 > Reporter: Piotr Nowojski > Assignee: Dawid Wysakowicz > Priority: Blocker > Labels: pull-request-available, test-stability > Fix For: 1.15.0 > > Attachments: 1234.jpg > > > {noformat} > ============================= test session starts > ============================== > platform linux -- Python 3.7.3, pytest-5.4.3, py-1.8.2, pluggy-0.13.1 > cachedir: .tox/py37-cython/.pytest_cache > rootdir: /__w/3/s/flink-python > collected 568 items > pyflink/common/tests/test_configuration.py .......... [ > 1%] > pyflink/common/tests/test_execution_config.py ....................... [ > 5%] > pyflink/dataset/tests/test_execution_environment.py . > ##[error]Exit code 137 returned from process: file name '/bin/docker', > arguments 'exec -i -u 1002 > 97fc4e22522d2ced1f4d23096b8929045d083dd0a99a4233a8b20d0489e9bddb > /__a/externals/node/bin/node /__w/_temp/containerHandlerInvoker.js'. > Finishing: Test - python > {noformat} > https://dev.azure.com/apache-flink/apache-flink/_build/results?buildId=3729&view=logs&j=9cada3cb-c1d3-5621-16da-0f718fb86602&t=8d78fe4f-d658-5c70-12f8-4921589024c3 -- This message was sent by Atlassian Jira (v8.20.1#820001)