[ceph-users] OSD will not start after heartbeatsuicide timeout, assert error from PGLog

2016-12-21 Thread Trygve Vea
istencies have occurred as a result of the first assert error. Is this a bug? Regards -- Trygve Vea Redpill Linpro AS ___ ceph-users mailing list ceph-users@lists.ceph.com http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com

Re: [ceph-users] Regarding loss of heartbeats

2016-11-29 Thread Trygve Vea
- Den 29.nov.2016 15:20 skrev Nick Fisk n...@fisk.me.uk: >> -Original Message- >> From: ceph-users [mailto:ceph-users-boun...@lists.ceph.com] On Behalf Of >> Trygve >> Vea >> Sent: 29 November 2016 14:07 >> To: ceph-users >> Subject:

[ceph-users] Regarding loss of heartbeats

2016-11-29 Thread Trygve Vea
anyone have any thoughts about this? Are we stumbling on a known, or unknown bug in Ceph? Regards -- Trygve Vea ___ ceph-users mailing list ceph-users@lists.ceph.com http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com

Re: [ceph-users] Missing heartbeats, OSD spending time reconnecting - possible bug?

2016-11-28 Thread Trygve Vea
- Den 11.nov.2016 14:35 skrev Wido den Hollander w...@42on.com: >> Op 11 november 2016 om 14:23 schreef Trygve Vea >> : >> >> >> Hi, >> >> We recently experienced a problem with a single OSD. This occurred twice. >> >> The problem man

[ceph-users] Missing heartbeats, OSD spending time reconnecting - possible bug?

2016-11-11 Thread Trygve Vea
of using the OSD partition for forensics (ordinary xfs filesystem, journal on ssd). Not an expert of the low-level behaviour of Ceph, but the logged reconnection-attempts from osd.14, and the complaining about missing heartbeats on osd.29 sounds to me like a bug. Have anyone else seen this beha

Re: [ceph-users] rgw / s3website, MethodNotAllowed on Jewel 10.2.3

2016-10-27 Thread Trygve Vea
- Den 27.okt.2016 23:13 skrev Robin H. Johnson robb...@gentoo.org: > On Wed, Oct 26, 2016 at 11:43:15AM +0200, Trygve Vea wrote: >> Hi! >> >> I'm trying to get s3website working on one of our Rados Gateway >> installations, and I'm having some problems fin

Re: [ceph-users] Significantly increased CPU footprint on OSDs after Hammer -> Jewel upgrade, OSDs occasionally wrongly marked as down

2016-10-26 Thread Trygve Vea
- Den 26.okt.2016 21:25 skrev Haomai Wang hao...@xsky.com: > On Thu, Oct 27, 2016 at 2:10 AM, Trygve Vea > wrote: >> - Den 26.okt.2016 16:37 skrev Sage Weil s...@newdream.net: >>> On Wed, 26 Oct 2016, Trygve Vea wrote: >>>> - Den 26.okt.2016 14:41

Re: [ceph-users] Significantly increased CPU footprint on OSDs after Hammer -> Jewel upgrade, OSDs occasionally wrongly marked as down

2016-10-26 Thread Trygve Vea
- Den 26.okt.2016 16:37 skrev Sage Weil s...@newdream.net: > On Wed, 26 Oct 2016, Trygve Vea wrote: >> - Den 26.okt.2016 14:41 skrev Sage Weil s...@newdream.net: >> > On Wed, 26 Oct 2016, Trygve Vea wrote: >> >> Hi, >> >> >> >> We h

Re: [ceph-users] Significantly increased CPU footprint on OSDs after Hammer -> Jewel upgrade, OSDs occasionally wrongly marked as down

2016-10-26 Thread Trygve Vea
- Den 26.okt.2016 15:36 skrev Haomai Wang hao...@xsky.com: > On Wed, Oct 26, 2016 at 9:09 PM, Trygve Vea > wrote: >> >> - Den 26.okt.2016 14:41 skrev Sage Weil s...@newdream.net: >> > On Wed, 26 Oct 2016, Trygve Vea wrote: >> >> Hi, >> >

Re: [ceph-users] Significantly increased CPU footprint on OSDs after Hammer -> Jewel upgrade, OSDs occasionally wrongly marked as down

2016-10-26 Thread Trygve Vea
- Den 26.okt.2016 14:41 skrev Sage Weil s...@newdream.net: > On Wed, 26 Oct 2016, Trygve Vea wrote: >> Hi, >> >> We have two Ceph-clusters, one exposing pools both for RGW and RBD >> (OpenStack/KVM) pools - and one only for RBD. >> >> After

[ceph-users] Significantly increased CPU footprint on OSDs after Hammer -> Jewel upgrade, OSDs occasionally wrongly marked as down

2016-10-26 Thread Trygve Vea
yone have been suffering from similar behaviour, if this is a bug (known or unknown). One detail to keep in mind is that the osds for the rgw pools store replicas on different physical sites. However, we have no reason to believe that saturation or high latency is a problem. Regards --

Re: [ceph-users] rgw / s3website, MethodNotAllowed on Jewel 10.2.3

2016-10-26 Thread Trygve Vea
gt;> Allowed', 'data': '> encoding="UTF-8"?>MethodNotAllowedtx00003-0058107a06-20d3274-default20d3274-default-default'} >> >> Has anyone have had any luck with this? > > does apache send $host variable to the backend ? > > something like "ProxyPreserveHost On" We're using ProxyPass to the unix fastcgi-socket - so it is already preserved, and this is verified working as we're frequently using the *.our.endpoint.org addressing method for ordinary buckets. Adding 'ProxyPreserveHost On' did not have any effect. Regards -- Trygve Vea Redpill Linpro AS ___ ceph-users mailing list ceph-users@lists.ceph.com http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com

[ceph-users] rgw / s3website, MethodNotAllowed on Jewel 10.2.3

2016-10-26 Thread Trygve Vea
#x27;: 'close', 'x-amz-request-id': 'tx3-0058107a06-20d3274-default', 'date': 'Wed, 26 Oct 2016 09:40:22 GMT', 'content-type': 'application/xml'}, 'reason': 'Method Not Allowed', 'da

[ceph-users] slow request, waiting for rw locks / subops from osd doing deep scrub of pg in rgw.buckets.index

2016-06-21 Thread Trygve Vea
e information I've provided here, can anyone shed some light on what this may be, and if it's a bug that is not fixed in HEAD; What information would be useful to include in a bug report? Regards -- Trygve Vea ___ ceph-users mailing list c