On Thu, Feb 20, 2020 at 11:47 AM Konzem, Kevin P <
[email protected]> wrote:

> test this by running 'while [ true ];do /bin/df -TP /performance;done' on
> two sessions on the same client. As soon as I start the second while loop,
> the outputs go from:
> Filesystem                 Type   1024-blocks   Used Available Capacity
> Mounted on
> 192.168.0.181@tcp:/perform lustre    71467728 100416  67664944       1%
> /performance
>
> to:
> Filesystem                 Type   1024-blocks  Used Available Capacity
> Mounted on
> 192.168.0.181@tcp:/perform lustre           0    -0        -0      50%
> /performance
>

Kevin,

I can confirm seeing this issue intermittently as well, and usually with a
re-run of df the results are once again reasonable.  It looks like you have
a more reliable reproducer though, which is good!  A support ticket was
opened with our vendor, and they said if we can capture a "strace" of it
for a bad run that might be helpful... but I haven't caught it in the act
yet.  With your reproducer, can you get that and open a Jira ticket to
track the problem?

As a workaround, try "lfs df" instead, it may take a different code path
that avoids the bug.

-Nathan
_______________________________________________
lustre-discuss mailing list
[email protected]
http://lists.lustre.org/listinfo.cgi/lustre-discuss-lustre.org

Reply via email to