Hi

It could be a bug, but most of the time, this is due to an open-unlinked file, 
typically a log file which is still in use and some processes keep writing to 
it until it fills the OSTs it is using.

Look for such files on your clients (use lsof). 

Aurélien


Le 03/09/2021 09:50, « lustre-discuss au nom de Alastair Basden » 
<[email protected] au nom de [email protected]> a 
écrit :

    CAUTION: This email originated from outside of the organization. Do not 
click links or open attachments unless you can confirm the sender and know the 
content is safe.



    Hi,

    We have a file system where each OST is a single SSD.

    One of those is reporting as 100% full (lfs df -h /snap8):
    snap8-OST004d_UUID          5.8T        2.0T        3.5T  37% /snap8[OST:77]
    snap8-OST004e_UUID          5.8T        5.5T        7.5G 100% /snap8[OST:78]
    snap8-OST004f_UUID          5.8T        2.0T        3.4T  38% /snap8[OST:79]

    However, I can't find any files on it:
    lfs find --ost snap8-OST004e /snap8/
    returns nothing.

    I guess that it has filled up, and that there is some bug or other that is
    now preventing proper behaviour - but I could be wrong.

    Does anyone have any suggestions?

    Essentially, I'd like to find some of the files and delete or migrate
    some, and thus return it to useful production.

    Cheers,
    Alastair.
    _______________________________________________
    lustre-discuss mailing list
    [email protected]
    http://lists.lustre.org/listinfo.cgi/lustre-discuss-lustre.org

_______________________________________________
lustre-discuss mailing list
[email protected]
http://lists.lustre.org/listinfo.cgi/lustre-discuss-lustre.org

Reply via email to