Re: [ceph-users] Questions about an example of ceph infrastructure

2015-04-19 Thread Christian Balzer
Hello, On Mon, 20 Apr 2015 04:16:01 +0200 Francois Lafont wrote: > Hi, > > Christian Balzer wrote: > > > For starters, make that 5 MONs. > > It won't really help you with your problem of keeping a quorum when > > loosing a DC, but being able to loose more than 1 monitor will come in > > handy

Re: [ceph-users] OSDs failing on upgrade from Giant to Hammer

2015-04-19 Thread Samuel Just
I have a suspicion about what caused this. Can you restart one of the problem osds with debug osd = 20 debug filestore = 20 debug ms = 1 and attach the resulting log from startup to crash along with the osdmap binary (ceph osd getmap -o ). -Sam - Original Message - From: "Scott Laird"

Re: [ceph-users] Questions about an example of ceph infrastructure

2015-04-19 Thread Francois Lafont
Hi, Christian Balzer wrote: > For starters, make that 5 MONs. > It won't really help you with your problem of keeping a quorum when > loosing a DC, but being able to loose more than 1 monitor will come in > handy. > Note that MONs don't really need to be dedicated nodes, if you know what > you'r

Re: [ceph-users] OSDs failing on upgrade from Giant to Hammer

2015-04-19 Thread Scott Laird
Nope. Straight from 0.87 to 0.94.1. FWIW, at someone's suggestion, I just upgraded the kernel on one of the boxes from 3.14 to 3.18; no improvement. Rebooting didn't help, either. Still failing with the same error in the logs. On Sun, Apr 19, 2015 at 2:06 PM Robert LeBlanc wrote: > Did you up

Re: [ceph-users] OSDs failing on upgrade from Giant to Hammer

2015-04-19 Thread Robert LeBlanc
Did you upgrade from 0.92? If you did, did you flush the logs before upgrading? On Sun, Apr 19, 2015 at 1:02 PM, Scott Laird wrote: > I'm upgrading from Giant to Hammer (0.94.1), and I'm seeing a ton of OSDs > die (and stay dead) with this error in the logs: > > 2015-04-19 11:53:36.796847 7f61fa

[ceph-users] OSDs failing on upgrade from Giant to Hammer

2015-04-19 Thread Scott Laird
I'm upgrading from Giant to Hammer (0.94.1), and I'm seeing a ton of OSDs die (and stay dead) with this error in the logs: 2015-04-19 11:53:36.796847 7f61fa900900 -1 osd/OSD.h: In function 'OSDMapRef OSDService::get_map(epoch_t)' thread 7f61fa900900 time 2015-04-19 11:53:36.794951 osd/OSD.h: 716:

Re: [ceph-users] full ssd setup preliminary hammer bench

2015-04-19 Thread Alexandre DERUMIER
>>From the version number it looks buggy. I'm really interested what fixed the >>issue for you. I'll test with debian client with my new hardware to compare. Currently, client difference vs previous test is: - centos7.1 vs debian wheezy - librbd hammer vs giant - CPU E5-2687W @3.1GHZ vs CPU E5-

Re: [ceph-users] Questions about an example of ceph infrastructure

2015-04-19 Thread Christian Balzer
Hello, On Sun, 19 Apr 2015 06:22:44 +0200 Francois Lafont wrote: > Hi, > > We are thinking about a ceph infrastructure and I have questions. > Here is the conceived (but not yet implemented) infrastructure: > (please, be careful to read the schema with a monospace font ;)) > > >