[ 
https://issues.apache.org/jira/browse/FLINK-30539?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17653578#comment-17653578
 ] 

Matthias Pohl edited comment on FLINK-30539 at 1/2/23 12:07 PM:
----------------------------------------------------------------

Thanks for sharing your opinion [~gaborgsomogyi].
{quote}By eliminating timeout you mean we should depend on CI timeout?{quote}
Yes, because the CI timeout (or, more precisely, the watcher process) would 
trigger a thread dump that reveals where the test is hanging. That said, 
removing the timeouts is not a proposal for fixing the test instability itself. 
It is meant to make the tests comply with our [coding 
conventions|https://flink.apache.org/contributing/code-style-and-quality-common.html#avoid-timeouts-in-junit-tests],
which would help us dig into what's going on.
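
To illustrate why the thread dump is the more useful signal, here is a minimal, self-contained sketch of what such a dump contains. It is not the CI watcher process and not Flink code: the class {{ThreadDumpSketch}}, the methods {{dumpAllThreads}} and {{hangingTestBody}}, and the thread name {{hanging-test}} are hypothetical names chosen for this example. The only real API assumed is the JDK's {{ThreadMXBean}}.

```java
import java.lang.management.ManagementFactory;
import java.lang.management.ThreadInfo;
import java.util.concurrent.CountDownLatch;

public class ThreadDumpSketch {

    /** Renders the stack traces of all live threads, similar in spirit to a jstack dump. */
    static String dumpAllThreads() {
        StringBuilder sb = new StringBuilder();
        ThreadInfo[] infos = ManagementFactory.getThreadMXBean().dumpAllThreads(false, false);
        for (ThreadInfo info : infos) {
            sb.append('"').append(info.getThreadName()).append("\" ")
              .append(info.getThreadState()).append('\n');
            for (StackTraceElement frame : info.getStackTrace()) {
                sb.append("    at ").append(frame).append('\n');
            }
            sb.append('\n');
        }
        return sb.toString();
    }

    /** Hypothetical stand-in for a test body that blocks until the latch is released. */
    static void hangingTestBody(CountDownLatch latch) {
        try {
            latch.await();
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
        }
    }

    public static void main(String[] args) throws Exception {
        CountDownLatch latch = new CountDownLatch(1);
        Thread testThread = new Thread(() -> hangingTestBody(latch), "hanging-test");
        testThread.setDaemon(true);
        testThread.start();

        // Wait until the thread is actually parked inside latch.await().
        while (testThread.getState() != Thread.State.WAITING) {
            Thread.sleep(10);
        }

        // The dump names the blocked thread and the frame it is stuck in.
        System.out.println(dumpAllThreads());
    }
}
```

A JUnit timeout would only report that the test exceeded 60s. The dump additionally lists the {{hangingTestBody}} frame of the blocked thread, which is the information needed to locate the hang.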



> YARNSessionCapacitySchedulerITCase.testDetachedPerJobYarnCluster and 
> testDetachedPerJobYarnClusterWithStreamingJob timing out
> -----------------------------------------------------------------------------------------------------------------------------
>
>                 Key: FLINK-30539
>                 URL: https://issues.apache.org/jira/browse/FLINK-30539
>             Project: Flink
>          Issue Type: Bug
>          Components: Deployment / YARN
>    Affects Versions: 1.16.0
>            Reporter: Matthias Pohl
>            Priority: Major
>              Labels: test-stability
>
> https://dev.azure.com/apache-flink/apache-flink/_build/results?buildId=44337&view=logs&j=298e20ef-7951-5965-0e79-ea664ddc435e&t=d4c90338-c843-57b0-3232-10ae74f00347&l=32023
> Both tests failed because they ran into the 60s timeout that was 
> defined for each of them. We should remove the timeouts so that a hanging 
> test produces a thread dump. It might be related to FLINK-24169.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)
