[ovirt-users] Re: Sometimes paused due to unknown storage error on gluster

Strahil Nikolov Sat, 28 Mar 2020 11:50:10 -0700

On March 28, 2020 7:26:33 PM GMT+02:00, Nir Soffer <[email protected]> wrote:
>On Sat, Mar 28, 2020 at 1:59 PM Strahil Nikolov <[email protected]>
>wrote:
>>
>> On March 28, 2020 11:03:54 AM GMT+02:00, Gianluca Cecchi
><[email protected]> wrote:
>> >On Sat, Mar 28, 2020 at 8:39 AM Strahil Nikolov
><[email protected]>
>> >wrote:
>> >
>> >> On March 28, 2020 3:21:45 AM GMT+02:00, Gianluca Cecchi <
>> >> [email protected]> wrote:
>> >>
>> >>
>> >[snip]
>> >
>> >>Actually it only happened with empty disk (thin provisioned) and
>> >sudden
>> >> >high I/O during the initial phase of install of the OS; it didn't
>> >> >happened
>> >> >then during normal operaton (even with 600MB/s of throughput).
>> >>
>> >
>> >[snip]
>> >
>> >
>> >> Hi Gianluca,
>> >>
>> >> Is it happening to machines with preallocated disks or on machines
>> >with
>> >> thin disks ?
>> >>
>> >> Best Regards,
>> >> Strahil Nikolov
>> >>
>> >
>> >thin provisioned. But as I have tro create many VMs with 120Gb of
>disk
>> >size
>> >of which probably only a part during time will be allocated, it
>would
>> >be
>> >unfeasible to make them all preallocated. I learned that thin is not
>> >good
>> >for block based storage domains and heavy I/O, but I would hope that
>it
>> >is
>> >not the same with file based storage domains...
>> >Thanks,
>> >Gianluca
>>
>> This is normal - gluster cannot allocate fast enough the needed
>shards (due to high IO),  so the qemu pauses  the VM until  storage  is
>available  again .
>
>I don't know glusterfs internals, but I think this is very unlikely.
>
>For block storage thin provisioning in vdsm, vdsm is responsible for
>allocating
>more space, but vdsm is not in the datapath, it is monitoring the
>allocation and
>allocate more data when free space reaches a limit. It has no way to
>block I/O
>before more space is available. Gluster is in the datapath and can
>block I/O until
>it can process it.
>
>Can you explain what is the source for this theory?
>
>> You can think about VDO (with deduplication ) as a  PV for the  Thin
>LVM and this way you can preallocate your VMs , while saving space
>(deduplication, zero-block elimination  and even compression).
>> Of  course, VDO will reduce  performance (unless  you have
>battery-backed write cache and compression is disabled),  but  tbe
>benefits will be alot more.
>>
>> Another approach is to increase the shard size - so gluster will
>create fewer  shards,  but allocation on disk will be higher.
>>
>> Best Regards,
>> Strahil Nikolov
>> _______________________________________________
>> Users mailing list -- [email protected]
>> To unsubscribe send an email to [email protected]
>> Privacy Statement: https://www.ovirt.org/privacy-policy.html
>> oVirt Code of Conduct:
>https://www.ovirt.org/community/about/community-guidelines/
>> List Archives:
>https://lists.ovirt.org/archives/list/[email protected]/message/77DYUF7A5D6BIAYGVCBDKRBX2YWWJDJ4/
>_______________________________________________
>Users mailing list -- [email protected]
>To unsubscribe send an email to [email protected]
>Privacy Statement: https://www.ovirt.org/privacy-policy.html
>oVirt Code of Conduct:
>https://www.ovirt.org/community/about/community-guidelines/
>List Archives:
>https://lists.ovirt.org/archives/list/[email protected]/message/2LC5HGDMXJPOMVIYABLM77BRWG6LYOZJ/


Hey Nir,
You are right ... This is just a theory based on my knowledge and it might not 
be valid.
We nees the libvirt logs to confirm or reject  the theory, but I'm convinced 
that is the reason.

Yet,  it's quite  possible.
Qemu tries to write to the qcow disk on gluster.
Gluster is creating shards based of the ofset, as it was not done initially 
(preallocated  disk  take the full size  on gluster  and all shards are created 
 immediately). This takes time and requires  to be done on all bricks.
As the shard size  is too small (default 64MB), gluster has to create the next 
shard almost immediately,  but if it can't do it as fast as qemu is filling 
it's qcow2  disk -  qemu will get an I/O error and we know what happens there.
Later gluster manages to create the shard(s) , and the VM is unpaused.

That's why the oVirt team made all gluster-based disks to be fully preallocated.

Best Regards,
Strahil Nikolov
_______________________________________________
Users mailing list -- [email protected]
To unsubscribe send an email to [email protected]
Privacy Statement: https://www.ovirt.org/privacy-policy.html
oVirt Code of Conduct: 
https://www.ovirt.org/community/about/community-guidelines/
List Archives: 
https://lists.ovirt.org/archives/list/[email protected]/message/QY6KU2ZCXTULX7GJBHLHGSN34NKBDMT3/

[ovirt-users] Re: Sometimes paused due to unknown storage error on gluster

Reply via email to