[ceph-users] ceph iscsi latency too high for esxi?

2020-10-04 Thread Golasowski Martin
Hi, does anyone here use CEPH iSCSI with VMware ESXi? It seems that we are hitting the 5 second timeout limit on software HBA in ESXi. It appears whenever there is increased load on the cluster, like deep scrub or rebalance. Is it normal behaviour in production? Or is there something special we

[ceph-users] Re: ceph iscsi latency too high for esxi?

2020-10-04 Thread Golasowski Martin
0638492 > Com. register: Amtsgericht Munich HRB 231263 > > Web: https://croit.io <https://croit.io/> > YouTube: https://goo.gl/PGE1Bx <https://goo.gl/PGE1Bx> > > > Am So., 4. Okt. 2020 um 14:37 Uhr schrieb Golasowski Martin > mailto:martin.golasow...@vsb.cz>

[ceph-users] Re: ceph iscsi latency too high for esxi?

2020-10-04 Thread Golasowski Martin
For clarity, the issue has been reported also before: https://www.spinics.net/lists/ceph-users/msg59798.html https://www.spinics.net/lists/target-devel/msg10469.html > On 4 Oct 2020, at 16:46, Steve Thompson wrote: > > On Sun, 4 Oct 2

[ceph-users] Re: ceph iscsi latency too high for esxi?

2020-10-04 Thread Golasowski Martin
and set up a second path to > the iscsi gateway from that. It may not solve the problem, but it might > lower the I/O on a single gateway enough that we won't see the problem > anymore (and hopefully our customers stop getting pissed off). > > Cheers, > Phil > >

[ceph-users] Erasure coded pool chunk count k

2021-10-04 Thread Golasowski Martin
Hello guys, how does one estimate number of chunks for erasure coded pool ( k = ? ) ? I see that number of m chunks determines the pool’s resiliency, however I did not find clear guideline how to determine k. Red Hat states that they support only the following combinations: k=8, m=3 k=8, m=4 k=

[ceph-users] CEPH iSCSI issue - ESXi command timeout

2020-10-01 Thread Golasowski Martin
Dear All, a week ago we had to reboot our ESXi nodes since our CEPH cluster sudennly stopped serving all I/O. We have identified a VM (vCenter appliance) which was swapping heavily and causing heavy load. However, since then we are experiencing strange issues, as if the cluster cannot handle an