[ceph-users] Infernalis 9.2.1: the "rados df"ommand show wrong data

2016-03-04 Thread Mike Almateia
Hello Cephers! On my small cluster I see this: [root@c1 ~]# rados df pool name KB objects clones degraded unfound rdrd KB wrwr KB data 0000 06

Re: [ceph-users] Ceph RBD latencies

2016-03-04 Thread Christian Balzer
Hello, On Thu, 3 Mar 2016 23:26:13 + Adrian Saul wrote: > > > Samsung EVO... > > Which exact model, I presume this is not a DC one? > > > > If you had put your journals on those, you would already be pulling > > your hairs out due to abysmal performance. > > > > Also with Evo ones, I'd be w

Re: [ceph-users] Cache tier operation clarifications

2016-03-04 Thread Francois Lafont
Hello, On 04/03/2016 09:17, Christian Balzer wrote: > Unlike the subject may suggest, I'm mostly going to try and explain how > things work with cache tiers, as far as I understand them. > Something of a reference to point to. [...] I'm currently unqualified concerning cache tiering but I'm pret

[ceph-users] ceph-mon - mon daemon issues

2016-03-04 Thread M Ranga Swami Reddy
Hello, I have couple of questions on ceph-mon with mon daemon: Q1: Working command: /etc/init.d/ceph status mon Not working : status ceph-mon id=node-13 Why first command is working and why not the 2nd command nto working status ceph-mon id=node-13 status: Unknow

[ceph-users] Can I rebuild object maps while VMs are running ?

2016-03-04 Thread Christoph Adomeit
Hi there, I just updated our ceph-cluster to infernalis and now I want to enable the new image features. I wonder if I can add the features on the rbd images while the VMs are running. I want to do something like this: rbd feature enable $IMG exclusive-lock rbd feature enable $IMG object-map r

Re: [ceph-users] Problem: silently corrupted RadosGW objects caused by slow requests

2016-03-04 Thread Yehuda Sadeh-Weinraub
On Fri, Mar 4, 2016 at 7:26 AM, Ritter Sławomir wrote: >> From: Robin H. Johnson [mailto:robb...@gentoo.org] >> Sent: Friday, March 04, 2016 12:40 AM >> To: Ritter Sławomir >> Cc: ceph-us...@ceph.com; ceph-devel >> Subject: Re: [ceph-users] Problem: silently corrupted RadosGW objects caused >> by

Re: [ceph-users] Upgrade from Hammer LTS to Infernalis or wait for Jewel LTS?

2016-03-04 Thread Ken Dreyer
On Fri, Mar 4, 2016 at 1:53 AM, Luis Periquito wrote: > On Wed, Mar 2, 2016 at 9:32 AM, Mihai Gheorghe wrote: > From previous history the last 2 LTS versions are supported (currently > Firefly and Hammer). Note that Firefly reached end-of-life in January, and we're no longer issuing releases for

[ceph-users] Data inaccessable after single OSD down, default size is 3 min size is 1

2016-03-04 Thread Oliver Dzombic
Hi, we have here the effect, that single OSD's are getting down/out because it happens that they are sometimes too slow. osd_pool_default_size = 3 osd_pool_default_min_size = 1 pool 0 'rbd' replicated size 3 min_size 1 crush_ruleset 0 object_hash rjenkins pg_num 512 pgp_num 512 last_change 1539

Re: [ceph-users] Problem: silently corrupted RadosGW objects caused by slow requests

2016-03-04 Thread Ritter Sławomir
> From: Robin H. Johnson [mailto:robb...@gentoo.org] > Sent: Friday, March 04, 2016 12:40 AM > To: Ritter Sławomir > Cc: ceph-us...@ceph.com; ceph-devel > Subject: Re: [ceph-users] Problem: silently corrupted RadosGW objects caused > by slow requests > > On Thu, Mar 03, 2016 at 01:55:13PM +0100, R

Re: [ceph-users] Problem: silently corrupted RadosGW objects caused by slow requests

2016-03-04 Thread Ritter Sławomir
Thnx for contact. > > 2016-02-23 13:49:58.818640 osd.260 10.176.67.27:6800/688083 2119 : [WRN] 4 > > slow requests, 4 included below; oldest blocked for > 30.727096 secs > > 2016-02-23 13:49:58.818673 osd.260 10.176.67.27:6800/688083 2120 : [WRN] > > slow request 30.727096 seconds old, received at

Re: [ceph-users] PG's stuck inactive, stuck unclean, incomplete, imports cause osd segfaults

2016-03-04 Thread Philip S. Hempel
On 03/03/2016 03:52 PM, Philip S. Hempel wrote: Thanks, appreciate the help. That is where I have gotten as well, so if we have a developer out there that can help please let me know. There is budget to pay someone for the help. We are still looking for someone to help us, if possible. I beli

Re: [ceph-users] slow requests with rbd

2016-03-04 Thread Max A. Krasilnikov
Здравствуйте! On Fri, Mar 04, 2016 at 01:33:24PM +0100, honza801 wrote: > hi, > i have rbd0 mapped to client, xfs formatted. i'm putting a lot of data on it. > following messages appear in logs and 'ceph -s' output > osd.255 [WRN] 1 slow requests, 1 included below; oldest blocked for > > 51.72

[ceph-users] slow requests with rbd

2016-03-04 Thread Jan Krcmar
hi, i have rbd0 mapped to client, xfs formatted. i'm putting a lot of data on it. following messages appear in logs and 'ceph -s' output osd.255 [WRN] 1 slow requests, 1 included below; oldest blocked for > 51.726881 secs osd.255 [WRN] slow request 51.726881 seconds old, received at 2016-03-04 12

Re: [ceph-users] Cache tier operation clarifications

2016-03-04 Thread Shinobu Kinjo
Great feedback (at least for me). I would like to know if the behaviours you seeing are expected things or not. BTW I will do some test regarding to cache tier with my new toy. Cheers, S On Fri, Mar 4, 2016 at 5:17 PM, Christian Balzer wrote: > > Hello, > > Unlike the subject may suggest, I'm m

Re: [ceph-users] Upgrade from Hammer LTS to Infernalis or wait for Jewel LTS?

2016-03-04 Thread Mihai Gheorghe
Here is the roadmap http://docs.ceph.com/docs/master/releases/ EOL is estimated. Or this is what i think of estimated retirement. We are already running hammer. No issues here, except for cahce tier pool with the promotion bug. Don't think the fix was backported to hammer as the time of writing,

Re: [ceph-users] abort slow requests ?

2016-03-04 Thread Ben Hines
Thanks, working on fixing the peering objects. Going to attempt a recovery on the bad pgs tomorrow. The corrupt OSD which they were on was marked 'lost' so i expected it wouldn't try to peer with it anymore. Anyway I do have the data, at least. -Ben On Fri, Mar 4, 2016 at 1:04 AM, Luis Periquito

Re: [ceph-users] abort slow requests ?

2016-03-04 Thread Luis Periquito
you should really fix the peering objects. So far what I've seen in ceph is that it prefers data integrity over availability. So if it thinks that it can't keep all working properly it tends to stop (i.e. blocked requests), thus I don't believe there's a way to do this. On Fri, Mar 4, 2016 at 1:0

Re: [ceph-users] Fwd: List of SSDs

2016-03-04 Thread Shinobu Kinjo
On Mar 4, 2016 5:30 PM, "Christian Balzer" wrote: > > On Fri, 4 Mar 2016 16:09:17 +0900 Shinobu Kinjo wrote: > > > Comparing with these SSDs, > > > > S3710s > > S3610s > > SM863 > > 845DC Pro > > > > which one is more reasonable in terms of performance, cost or whatever? > > S3710s does not so

Re: [ceph-users] Upgrade from Hammer LTS to Infernalis or wait for Jewel LTS?

2016-03-04 Thread Luis Periquito
On Wed, Mar 2, 2016 at 9:32 AM, Mihai Gheorghe wrote: > Hi, > > I've got two questions! > > First. We are currently running Hammer in production. You are thinking of > upgrading to Infernalis. Should we upgrade now or wait for the next LTS, > Jewel? On ceph releases i can see Hammers EOL is estima

Re: [ceph-users] Fwd: List of SSDs

2016-03-04 Thread Christian Balzer
On Fri, 4 Mar 2016 16:09:17 +0900 Shinobu Kinjo wrote: > Comparing with these SSDs, > > S3710s > S3610s > SM863 > 845DC Pro > > which one is more reasonable in terms of performance, cost or whatever? > S3710s does not sound reasonable to me. > Apples and Oranges. I use S3700s (I would use 3

[ceph-users] Cache tier operation clarifications

2016-03-04 Thread Christian Balzer
Hello, Unlike the subject may suggest, I'm mostly going to try and explain how things work with cache tiers, as far as I understand them. Something of a reference to point to. Of course if you spot something that's wrong or have additional information, by all means please do comment. While the d

Re: [ceph-users] Help: pool not responding

2016-03-04 Thread Mario Giammarco
I have restarted each host using init scripts. Is there another way? 2016-03-03 21:51 GMT+01:00 Dimitar Boichev : > But the whole cluster or what ? > > Regards. > > *Dimitar Boichev* > SysAdmin Team Lead > AXSMarine Sofia > Phone: +359 889 22 55 42 > Skype: dimitar.boichev.axsmarine > E-mail: dim