, so writes do
get nicely coalesced.
All my other MONs (co-located on OSD storage nodes) are on S37x0 SSDs.
OTOH, the Supermicro DoMs look nice enough on paper with 1 DWPD:
https://www.supermicro.com/products/nfo/SATADOM.cfm
The 64GB model should do the trick in most scenarios.
Christian
> Steffe
This is especially important if you were to run a MON on those machines as well.
Christian
> Thanks,
> Ashley
>
> -----Original Message-----
> From: Christian Balzer [mailto:ch...@gol.com]
> Sent: 13 July 2016 01:12
> To: ceph-users@lists.ceph.com
> Cc: Wido den Hollander
Hi,
This is an OSD box running Hammer on Ubuntu 14.04 LTS with additional
systems administration tools:
> $ df -h | grep -v /var/lib/ceph/osd
> Filesystem Size Used Avail Use% Mounted on
> udev            5,9G  4,0K  5,9G   1% /dev
> tmpfs 1,2G 892K 1,2G 1% /run
> /dev/dm-1
> Sent: 13 July 2016 10:44
> To: Ashley Merrick ; ceph-users@lists.ceph.com;
> Christian Balzer
> Subject: RE: [ceph-users] SSD Journal
>
>
> > On 13 July 2016 at 11:34 Ashley Merrick wrote:
> >
> >
> > Hello,
> >
> > Looking at using 2 x 960GB SSD'
From: Wido den Hollander [mailto:w...@42on.com]
Sent: 13 July 2016 10:44
To: Ashley Merrick ; ceph-users@lists.ceph.com;
Christian Balzer
Subject: RE: [ceph-users] SSD Journal
> On 13 July 2016 at 11:34 Ashley Merrick wrote:
>
>
> Hello,
>
> Looking at using 2 x 960GB SSD's (SM863)
>
>
> From: Christian Balzer [mailto:ch...@gol.com]
> Sent: 13 July 2016 01:12
> To: ceph-users@lists.ceph.com
> Cc: Wido den Hollander ; Ashley Merrick
> Subject: Re: [ceph-users] SSD Journal
>
>
> Hello,
>
> On Tue, 12 Jul 2016 19:14:14 +0200 (CEST) Wido den Hollander
-----Original Message-----
From: Christian Balzer [mailto:ch...@gol.com]
Sent: 13 July 2016 01:12
To: ceph-users@lists.ceph.com
Cc: Wido den Hollander ; Ashley Merrick
Subject: Re: [ceph-users] SSD Journal
Hello,
On Tue, 12 Jul 2016 19:14:14 +0200 (CEST) Wido den Hollander wrote:
>
> > Op 12 juli 20
Hello,
On Tue, 12 Jul 2016 19:14:14 +0200 (CEST) Wido den Hollander wrote:
>
> > On 12 July 2016 at 15:31 Ashley Merrick wrote:
> >
> >
> > Hello,
> >
> > Looking at final stages of planning / setup for a CEPH Cluster.
> >
> > Per a Storage node looking @
> >
> > 2 x SSD OS / Journal
>
> On 12 July 2016 at 15:31 Ashley Merrick wrote:
>
>
> Hello,
>
> Looking at final stages of planning / setup for a CEPH Cluster.
>
> Per a Storage node looking @
>
> 2 x SSD OS / Journal
> 10 x SATA Disk
>
> Will have a small Raid 1 Partition for the OS, however not sure if best to do:
>
Hello,
Looking at the final stages of planning / setup for a Ceph cluster.
Per storage node we are looking at:
2 x SSD for OS / journal
10 x SATA disks
We will have a small RAID 1 partition for the OS; however, I'm not sure whether it is best to do:
5 x journals per SSD, or
10 x journals on a RAID 1 of the two SSDs
Is the "Performance"
4:16 PM
To: ceph-users@lists.ceph.com
Subject: [ceph-users] SSD Journal Performance Priorities
Ignoring the durability and network issues for now :) Are there any aspects of
a journal's performance that matter most for overall Ceph performance?
i.e. my initial thought is if I want to improve Ceph write p
Ignoring the durability and network issues for now :) Are there any
aspects of a journal's performance that matter most for overall Ceph
performance?
i.e. my initial thought is that if I want to improve Ceph write performance,
journal sequential write speed is what matters. Does random write speed factor
at all?
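The FileStore journal is written sequentially with O_DIRECT/O_DSYNC, so synchronous sequential write throughput is generally the number to look at rather than the headline random-write IOPS. A minimal sketch of how one might measure it with fio, assuming /dev/sdX is a scratch SSD whose contents may be destroyed (the device name is a placeholder):

  # WARNING: writes straight to the device and destroys its contents
  fio --name=journal-test --filename=/dev/sdX \
      --ioengine=libaio --direct=1 --sync=1 \
      --rw=write --bs=4k --iodepth=1 --numjobs=1 \
      --runtime=60 --time_based --group_reporting

Consumer SSDs that look fast in plain dd runs often collapse on this kind of sync write, which is exactly the behaviour that hurts journal performance.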
Hi,
unfortunately I'm not a dev, so it's gonna be someone else ripping the journal
out and trying.
But I came to understand that getting rid of the journal is not that easy a task.
To me, more important would be if the devs understood what I'm trying to say :-)
because without that any new develop
> Right now we run the journal as a partition on the data disk. I've built
> drives without journals and the write performance seems okay, but random IO
> performance is poor in comparison to what it should be.
Co-located journals have multiple issues:
o The disks are presented with double the write load, since every client
  write hits the journal and then the filestore on the same spindle
Jan,
I know that Sage has worked through a lot of this and spent a lot of
time on it, so I'm somewhat inclined to say that if he says it needs
to be there, then it needs to be there. I, however, have been known to
stare at the trees so much that I m
On 29/01/2016 16:25, Jan Schermer wrote:
>
> [...]
>
>
> But if I understand correctly, there is indeed a log of the recent
> modifications in the filestore which is used when a PG is recovering
> because another OSD is lagging behind (not when Ceph reports a full
> backfill
> On 29 Jan 2016, at 16:00, Lionel Bouton
> wrote:
>
> On 29/01/2016 01:12, Jan Schermer wrote:
>> [...]
>>> Second I'm not familiar with Ceph internals but OSDs must make sure that
>>> their PGs are synced so I was under the impression that the OSD content for
>>> a PG on the filesystem s
On 29/01/2016 01:12, Jan Schermer wrote:
> [...]
>> Second I'm not familiar with Ceph internals but OSDs must make sure that
>> their PGs are synced so I was under the impression that the OSD content for
>> a PG on the filesystem should always be guaranteed to be on all the other
>> active OS
here from other clients))
> Bam. You're done. You just mirrored what a hard drive does, because you
> mirrored that to a filesystem that mirrors that to a hard drive... No need of
> journals on top of filesystems with journals with data on filesystems with
> journals... My dat
technical terms.
Jan
Thanks & Regards
Somnath
From: ceph-users [mailto:ceph-users-boun...@lists.ceph.com] On Behalf Of Tyler
Bishop
Sent: Thursday, January 28, 2016 1:35 PM
To: Jan Schermer
Cc: ceph-users@lists.ceph.com
Subject: Re: [ceph-users] SSD Journal
> On 28 Jan 2016, at 23:19, Lionel Bouton
> wrote:
>
> On 28/01/2016 22:32, Jan Schermer wrote:
>> P.S. I feel very strongly that this whole concept is broken fundamentally. We
>> already have a journal for the filesystem which is time-proven, well-behaved
>> and above all fast. Instead the
ond of the multi-ms committing limbo
while data falls down through those dream layers :P
I really don't know how to explain that more. I bet if you ask on LKML, someone
like Theodore Ts'o would say "you're doing completely superfluous work" in more
technical terms.
Jan
From: ceph-users [mailto:ceph-users-boun...@lists.ceph.com] On Behalf Of Tyler
Bishop
Sent: Thursday, January 28, 2016 1:35 PM
To: Jan Schermer
Cc: ceph-users@lists.ceph.com
Subject: Re: [ceph-users] SSD Journal
What approach did sandisk take with this for jewel?
Tyler Bishop
Chief Technical Officer
513-299-7108
On 28/01/2016 22:32, Jan Schermer wrote:
> P.S. I feel very strongly that this whole concept is broken
> fundamentally. We already have a journal for the filesystem which is
> time-proven, well-behaved and above all fast. Instead there's this
> reinvented wheel which supposedly does it better in
From: "Jan Schermer"
To: "Tyler Bishop"
Cc: "Bill WONG" , ceph-users@lists.ceph.com
Sent: Thursday, January 28, 2016 4:32:54 PM
Subject: Re: [ceph-users] SSD Journal
You can't
> From: "Bill WONG"
> To: "ceph-users"
> Sent: Thursday, January 28, 2016 1:36:01 PM
> Subject: [ceph-users] SSD Journal
>
> Hi,
> i have
From: "Bill WONG"
To: "ceph-users"
Sent: Thursday, January 28, 2016 1:36:01 PM
Subject: [ceph-users] SSD Journal
Hi,
I have tested with an SSD journal with SATA and it works perfectly. Now I am
testing with a full SSD Ceph cluster; with a full SSD cluster, do I stil
Hi,
I have tested with an SSD journal with SATA and it works perfectly. Now I am
testing with a full SSD Ceph cluster; with a full SSD cluster, do I
still need to have an SSD as the journal disk?
[Assume I do not have PCIe SSD flash, which offers better performance than a
normal SSD disk.]
please give some
Hi everyone:
I plan to use SSD Journal to improve performance.
I have one 1.2T SSD disk per server.
What is the best practice for an SSD journal?
There are three choices for deploying the SSD journal:
1. All OSDs use the same SSD partition:
ceph-deploy osd create ceph-node:sdb:/de
For the first choice:
ceph-deploy osd create ceph-node:sdb:/dev/ssd ceph-node:sdc:/dev/ssd
I find ceph-deploy will create the partitions automatically, and each partition is 5G by
default.
So the first choice and the second choice are almost the same.
Compared to a filesystem, I prefer a block device to get
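The 5G default mentioned here comes from the 'osd journal size' option, so larger per-OSD journals just need that raised before the OSDs are prepared, with each data disk pointed at the shared journal device. A rough sketch of what that could look like with ceph-deploy (the 10240 MB size and the device names are placeholders):

  # ceph.conf on the admin node, before preparing the OSDs
  [osd]
  osd journal size = 10240    # journal partition size in MB

  # one journal partition per OSD, carved out of the shared SSD /dev/ssd
  ceph-deploy osd create ceph-node:sdb:/dev/ssd ceph-node:sdc:/dev/ssd

ceph-disk (which ceph-deploy calls) creates a new GPT partition of that size on the journal device for every OSD it prepares, which is why the first and second choices above end up looking much the same.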
On Tue, 9 Sep 2014 10:57:26 -0700 Craig Lewis wrote:
> On Sat, Sep 6, 2014 at 9:27 AM, Christian Balzer wrote:
>
> > On Sat, 06 Sep 2014 16:06:56 + Scott Laird wrote:
> >
> > > Backing up slightly, have you considered RAID 5 over your SSDs?
> > > Practically speaking, there's no performance
On Sat, Sep 6, 2014 at 9:27 AM, Christian Balzer wrote:
> On Sat, 06 Sep 2014 16:06:56 + Scott Laird wrote:
>
> > Backing up slightly, have you considered RAID 5 over your SSDs?
> > Practically speaking, there's no performance downside to RAID 5 when
> > your devices aren't IOPS-bound.
> >
>
On Sat, Sep 6, 2014 at 7:50 AM, Dan van der Ster
wrote:
>
> BTW, do you happen to know, _if_ we re-use an OSD after the journal has
> failed, are any object inconsistencies going to be found by a
> scrub/deep-scrub?
>
I haven't tested this, but I did something I *think* is similar. I deleted
an
stat.
Christian
> Regards,
> Quenten Grasso
>
> -----Original Message-----
> From: ceph-users [mailto:ceph-users-boun...@lists.ceph.com] On Behalf Of
> Christian Balzer
> Sent: Sunday, 7 September 2014 1:38 AM
> To: ceph-users
> Subject: Re: [ceph-users] SSD journal deployment experiences
Of
Christian Balzer
Sent: Sunday, 7 September 2014 1:38 AM
To: ceph-users
Subject: Re: [ceph-users] SSD journal deployment experiences
On Sat, 6 Sep 2014 14:50:20 + Dan van der Ster wrote:
> September 6 2014 4:01 PM, "Christian Balzer" wrote:
> > On Sat, 6 Sep 201
Hi Scott,
> On 06 Sep 2014, at 20:39, Scott Laird wrote:
>
> IOPS are weird things with SSDs. In theory, you'd see 25% of the write IOPS
> when writing to a 4-way RAID5 device, since you write to all 4 devices in
> parallel. Except that's not actually true--unlike HDs where an IOP is an
> I
IOPS are weird things with SSDs. In theory, you'd see 25% of the write
IOPS when writing to a 4-way RAID5 device, since you write to all 4 devices
in parallel. Except that's not actually true--unlike HDs where an IOP is
an IOP, SSD IOPS limits are really just a function of request size.
Because
RAID5... Hadn't considered it due to the IOPS penalty (it would get 1/4th of
the IOPS of separated journal devices, according to some online raid calc).
Compared to RAID10, I guess we'd get 50% more capacity, but lower performance.
After the anecdotes that the DCS3700 is very rarely failing, and
On Sat, 06 Sep 2014 16:06:56 + Scott Laird wrote:
> Backing up slightly, have you considered RAID 5 over your SSDs?
> Practically speaking, there's no performance downside to RAID 5 when
> your devices aren't IOPS-bound.
>
Well...
For starters with RAID5 you would lose 25% throughput in bo
Backing up slightly, have you considered RAID 5 over your SSDs?
Practically speaking, there's no performance downside to RAID 5 when your
devices aren't IOPS-bound.
On Sat Sep 06 2014 at 8:37:56 AM Christian Balzer wrote:
> On Sat, 6 Sep 2014 14:50:20 + Dan van der Ster wrote:
>
> > Septemb
On Sat, 6 Sep 2014 14:50:20 + Dan van der Ster wrote:
> September 6 2014 4:01 PM, "Christian Balzer" wrote:
> > On Sat, 6 Sep 2014 13:07:27 + Dan van der Ster wrote:
> >
> >> Hi Christian,
> >>
> >> Let's keep debating until a dev corrects us ;)
> >
> > For the time being, I give the
September 6 2014 4:01 PM, "Christian Balzer" wrote:
> On Sat, 6 Sep 2014 13:07:27 + Dan van der Ster wrote:
>
>> Hi Christian,
>>
>> Let's keep debating until a dev corrects us ;)
>
> For the time being, I give the recent:
>
> https://www.mail-archive.com/ceph-users@lists.ceph.com/msg1220
On Sat, 6 Sep 2014 13:07:27 + Dan van der Ster wrote:
> Hi Christian,
>
> Let's keep debating until a dev corrects us ;)
>
For the time being, I give the recent:
https://www.mail-archive.com/ceph-users@lists.ceph.com/msg12203.html
And not so recent:
http://www.spinics.net/lists/ceph-users/
Hi Christian,
Let's keep debating until a dev corrects us ;)
September 6 2014 1:27 PM, "Christian Balzer" wrote:
> On Fri, 5 Sep 2014 09:42:02 + Dan Van Der Ster wrote:
>
>>> On 05 Sep 2014, at 11:04, Christian Balzer wrote:
>>>
>>> On Fri, 5 Sep 2014 07:46:12 + Dan Van Der Ster wrot
On Fri, 5 Sep 2014 09:42:02 + Dan Van Der Ster wrote:
>
> > On 05 Sep 2014, at 11:04, Christian Balzer wrote:
> >
> > On Fri, 5 Sep 2014 07:46:12 + Dan Van Der Ster wrote:
> >>
> >>> On 05 Sep 2014, at 03:09, Christian Balzer wrote:
> >>>
> >>> On Thu, 4 Sep 2014 14:49:39 -0700 Craig
> On 05 Sep 2014, at 11:04, Christian Balzer wrote:
>
>
> Hello Dan,
>
> On Fri, 5 Sep 2014 07:46:12 + Dan Van Der Ster wrote:
>
>> Hi Christian,
>>
>>> On 05 Sep 2014, at 03:09, Christian Balzer wrote:
>>>
>>>
>>> Hello,
>>>
>>> On Thu, 4 Sep 2014 14:49:39 -0700 Craig Lewis wrote:
>
> On 05 Sep 2014, at 10:30, Nigel Williams wrote:
>
> On Fri, Sep 5, 2014 at 5:46 PM, Dan Van Der Ster
> wrote:
>>> On 05 Sep 2014, at 03:09, Christian Balzer wrote:
>>> You might want to look into cache pools (and dedicated SSD servers with
>>> fast controllers and CPUs) in your test cluster
Hello Dan,
On Fri, 5 Sep 2014 07:46:12 + Dan Van Der Ster wrote:
> Hi Christian,
>
> > On 05 Sep 2014, at 03:09, Christian Balzer wrote:
> >
> >
> > Hello,
> >
> > On Thu, 4 Sep 2014 14:49:39 -0700 Craig Lewis wrote:
> >
> >> On Thu, Sep 4, 2014 at 9:21 AM, Dan Van Der Ster
> >> wrote
On Fri, Sep 5, 2014 at 5:46 PM, Dan Van Der Ster
wrote:
>> On 05 Sep 2014, at 03:09, Christian Balzer wrote:
>> You might want to look into cache pools (and dedicated SSD servers with
>> fast controllers and CPUs) in your test cluster and for the future.
>> Right now my impression is that there i
Hi Christian,
> On 05 Sep 2014, at 03:09, Christian Balzer wrote:
>
>
> Hello,
>
> On Thu, 4 Sep 2014 14:49:39 -0700 Craig Lewis wrote:
>
>> On Thu, Sep 4, 2014 at 9:21 AM, Dan Van Der Ster
>> wrote:
>>
>>>
>>>
>>> 1) How often are DC S3700's failing in your deployments?
>>>
>>
>> None
On Thu, Sep 4, 2014 at 10:23 PM, Dan van der Ster wrote:
> Hi Martin,
>
> September 4 2014 10:07 PM, "Martin B Nielsen" wrote:
> > Hi Dan,
> >
> > We took a different approach (and our cluster is tiny compared to many
> others) - we have two pools;
> > normal and ssd.
> >
> > We use 14 disks in
Hello,
On Thu, 4 Sep 2014 14:49:39 -0700 Craig Lewis wrote:
> On Thu, Sep 4, 2014 at 9:21 AM, Dan Van Der Ster
> wrote:
>
> >
> >
> > 1) How often are DC S3700's failing in your deployments?
> >
>
> None of mine have failed yet. I am planning to monitor the wear level
> indicator, and preemp
On 05/09/14 10:05, Dan van der Ster wrote:
That's good to know. I would plan similarly for the wear out. But I want to
also prepare for catastrophic failures -- in the past we've had SSDs just
disappear like a device unplug. Those were older OCZ's though...
Yes - the Intel dc style drives s
Hi Craig,
September 4 2014 11:50 PM, "Craig Lewis" wrote:
> On Thu, Sep 4, 2014 at 9:21 AM, Dan Van Der Ster
> wrote:
>
>> 1) How often are DC S3700's failing in your deployments?
>
> None of mine have failed yet. I am planning to monitor the wear level
> indicator, and preemptively
> repl
On Thu, Sep 4, 2014 at 9:21 AM, Dan Van Der Ster
wrote:
>
>
> 1) How often are DC S3700's failing in your deployments?
>
None of mine have failed yet. I am planning to monitor the wear level
indicator, and preemptively replace any SSDs that go below 10%. Manually
flushing the journal, replacin
Hi Martin,
September 4 2014 10:07 PM, "Martin B Nielsen" wrote:
> Hi Dan,
>
> We took a different approach (and our cluster is tiny compared to many
> others) - we have two pools;
> normal and ssd.
>
> We use 14 disks in each osd-server; 8 platter and 4 ssd for ceph, and 2 ssd
> for OS/journ
Hi Dan,
We took a different approach (and our cluster is tiny compared to many
others) - we have two pools; normal and ssd.
We use 14 disks in each osd-server; 8 platter and 4 ssd for ceph, and 2 ssd
for OS/journals. We partitioned the two OS ssd as raid1 using about half
the space for the OS and
Hi Stefan,
September 4 2014 9:13 PM, "Stefan Priebe" wrote:
> Hi Dan, hi Robert,
>
> On 04.09.2014 21:09, Dan van der Ster wrote:
>
>> Thanks again for all of your input. I agree with your assessment -- in
>> our cluster we avg <3ms for a random (hot) 4k read already, but > 40ms
>> for a 4k
This is good to know. I just recompiled the CentOS7 3.10 kernel to enable
bcache (I doubt they patched bcache since they don't compile/enable it).
I've seen when I ran Ceph in VMs on my workstation that there were oops
with bcache, but doing the bcache device and the backend device even with
two co
Hi Dan, hi Robert,
On 04.09.2014 21:09, Dan van der Ster wrote:
Thanks again for all of your input. I agree with your assessment -- in
our cluster we avg <3ms for a random (hot) 4k read already, but > 40ms
for a 4k write. That's why we're adding the SSDs -- you just can't run a
proportioned RB
Thanks again for all of your input. I agree with your assessment -- in our
cluster we avg <3ms for a random (hot) 4k read already, but >40ms for a 4k write. That's why we're adding the SSDs -- you
just can't run a proportioned RBD service without them.
I'll definitely give bcache a try in my test setup, but more reading has kinda
tempere
You should be able to use any block device in a bcache device. Right now,
we are OK losing one SSD and it takes out 5 OSDs. We would rather have
twice the cache. Our opinion may change in the future. We wanted to keep
overhead as low as possible. I think we may spend the extra on heavier
du
I've just been reading the bcache docs. It's a pity the mirrored writes aren't
implemented yet. Do you know if you can use an md RAID1 as a cache dev? And is
the graceful failover from wb to writethrough actually working without data
loss?
Also, write behind sure would help the filestore, since
So far it has worked really well, we can raise/lower/disable/enable the
cache in realtime and watch how the load and traffic changes. There has
been some positive subjective results, but definitive results are still
forth coming. bcache on CentOS 7 was not easy, makes me wish we were
running Debian
Hi Robert,
That's actually a pretty good idea, since bcache would also accelerate the
filestore flushes and leveldb. I actually wonder if an SSD-only pool would even
be faster than such a setup... probably not.
We're using an ancient enterprise n distro, so it will be a bit of a headache
to ge
We are still pretty early on in our testing of how to best use SSDs as
well. What we are trying right now, for some of the reasons you mentioned
already, is to use bcache as a cache for both journal and data. We have 10
spindles in our boxes with 2 SSDs. We created two bcaches (one for each
SSD) an
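For anyone wanting to reproduce that layout, the bcache plumbing itself is short. A minimal sketch, assuming bcache-tools is installed, /dev/sdb is one of the spinning backing disks and /dev/sdk is the caching SSD; the device names, the writeback mode and the <cset-uuid> placeholder are assumptions rather than details from this poster's setup:

  make-bcache -B /dev/sdb        # register the backing device -> /dev/bcache0
  make-bcache -C /dev/sdk        # format the cache device, note the Set UUID it prints
  echo <cset-uuid> > /sys/block/bcache0/bcache/attach
  echo writeback > /sys/block/bcache0/bcache/cache_mode
  # the OSD filesystem (and its co-located journal) then lives on /dev/bcache0

With writeback enabled, journal and filestore writes both land on the SSD first, which is the acceleration effect being described above.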
Dear Cephalopods,
In a few weeks we will receive a batch of 200GB Intel DC S3700’s to augment our
cluster, and I’d like to hear your practical experience and discuss options how
best to deploy these.
We’ll be able to equip each of our 24-disk OSD servers with 4 SSDs, so they
will become 20 OSD
Hi Irek,
Good day to you.
Any updates/comments on below?
Looking forward to your reply, thank you.
Cheers.
On Tue, Apr 29, 2014 at 12:47 PM, Indra Pramana wrote:
> Hi Irek,
>
> Good day to you, and thank you for your e-mail.
>
> Is there a better way other than patching the kernel? I would
Hi Irek,
Good day to you, and thank you for your e-mail.
Is there a better way other than patching the kernel? I would like to avoid
having to compile a custom kernel for my OS. I read that I can disable
write-caching on the drive using hdparm:
hdparm -W0 /dev/sdf
hdparm -W0 /dev/sdg
I tested o
This is my article :).
Apply this patch to the kernel:
http://www.theirek.com/downloads/code/CMD_FLUSH.diff
After rebooting, run the following commands:
echo temporary write through > /sys/class/scsi_disk//cache_type
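The sysfs path above is missing its per-device component (one directory per disk, named by its SCSI H:C:T:L address). A small sketch of how the same setting is usually applied to every SCSI disk at once; note that it does not persist across reboots:

  # switch the kernel's view of each disk cache to write-through (non-persistent)
  for f in /sys/class/scsi_disk/*/cache_type; do
      echo "temporary write through" > "$f"
  done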
2014-04-28 15:44 GMT+04:00 Indra Pramana :
> Hi Irek,
>
> Thanks for the article. Do
Hi Irek,
Thanks for the article. Do you have any other web sources pertaining to the
same issue, which is in English?
Looking forward to your reply, thank you.
Cheers.
On Mon, Apr 28, 2014 at 7:40 PM, Irek Fasikhov wrote:
> Most likely you need to apply a patch to the kernel.
>
>
> http://ww
Most likely you need to apply a patch to the kernel.
http://www.theirek.com/blog/2014/02/16/patch-dlia-raboty-s-enierghoniezavisimym-keshiem-ssd-diskov
2014-04-28 15:20 GMT+04:00 Indra Pramana :
> Hi Udo and Irek,
>
> Good day to you, and thank you for your emails.
>
>
> >perhaps due IOs from t
Hi Udo and Irek,
Good day to you, and thank you for your emails.
>perhaps due IOs from the journal?
>You can test with iostat (like "iostat -dm 5 sdg").
Yes, I have shared the iostat result earlier on this same thread. At times
the utilisation of the 2 journal drives will hit 100%, especially wh
What model SSD do you have?
Which version of the kernel?
2014-04-28 12:35 GMT+04:00 Udo Lembke :
> Hi,
> perhaps due IOs from the journal?
> You can test with iostat (like "iostat -dm 5 sdg").
>
> on debian iostat is in the package sysstat.
>
> Udo
>
> On 28.04.2014 07:38, Indra Pramana wrote:
> > Hi
Hi,
perhaps due IOs from the journal?
You can test with iostat (like "iostat -dm 5 sdg").
on debian iostat is in the package sysstat.
Udo
On 28.04.2014 07:38, Indra Pramana wrote:
> Hi Craig,
>
> Good day to you, and thank you for your enquiry.
>
> As per your suggestion, I have created a 3r
Hi Craig,
Good day to you, and thank you for your enquiry.
As per your suggestion, I have created a 3rd partition on the SSDs and did
the dd test directly into the device, and the result is very slow.
root@ceph-osd-08:/mnt# dd bs=1M count=128 if=/dev/zero of=/dev/sdg3
conv=fdatasync oflag=d
I am not able to do a dd test on the SSDs since they are not mounted as
filesystems, but dd on the OSD (non-SSD) drives gives normal results.
Since you have free space on the SSDs, you could add a 3rd 10G partition
to one of the SSDs. Then you could put a filesystem on that partition,
or just dd
Hi,
On one of our test clusters, I have a node with 4 OSDs with SAS / non-SSD
drives (sdb, sdc, sdd, sde) and 2 SSD drives (sdf and sdg) for journals to
serve the 4 OSDs (2 each).
Model: ATA ST100FM0012 (scsi)
Disk /dev/sdf: 100GB
Sector size (logical/physical): 512B/4096B
Partition Table: gpt
N
Crystal clear, thanks for the tutorial :-)
On 03/12/2013 16:52, James Pearce wrote:
> It shouldn't happen, provided the sustained write rate doesn't exceed the
> sustained erase capabilities of the device I guess. Daily fstrim will not
> hurt though.
>
> Essentially the mapping between LBAs an
It shouldn't happen, provided the sustained write rate doesn't exceed
the sustained erase capabilities of the device I guess. Daily fstrim
will not hurt though.
Essentially the mapping between LBAs and physical cells isn't
persistent in SSDs (unlike LBA and physical sectors on an HDD).
Prov
On 03/12/2013 13:38, James Pearce wrote:
> Since the journal partitions are generally small, it shouldn't need to be.
>
> For example implement with substantial under-provisioning, either via HPA or
> simple partitions.
>
Does that mean the problem will happen much later or that it will never
On Tue, Dec 03, 2013 at 01:22:50PM +, James Pearce wrote:
>
> Daily cron task is though still a good idea - enabling discard mount
> option is generally counter-productive since trim is issued way too
> often, destroying performance (in my testing).
>
yes that's why we are using cron here.
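A minimal sketch of such a daily cron job, assuming the SSDs carry mounted filesystems as in the raid1 + lvm + xfs setup described later in this thread; the mount points are placeholders:

  #!/bin/sh
  # /etc/cron.daily/fstrim -- trim unused blocks once a day
  # instead of mounting the SSD filesystems with 'discard'
  fstrim -v /                   # if the root filesystem lives on the SSD
  fstrim -v /srv/ceph/journals  # placeholder for any other SSD-backed mount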
Most likely. When fully provisioned the device has a much smaller pool
of cells to manage (i.e. charge) in the background, hence once that pool
is exhausted the device has no option but to stall whilst it clears
(re-charges) a cell, which takes something like 2-5ms.
Daily cron task is though
On Tue, Dec 03, 2013 at 12:48:21PM +, James Pearce wrote:
> How much (%) is left unprovisioned on those (840s?) ? And were they
> trim'd/secure erased before deployment?
>
unfortunately, everything was provisioned (though there is free space
in the VG) due to lack of knowledge.
Nothing sp
How much (%) is left unprovisioned on those (840s?) ? And were they
trim'd/secure erased before deployment?
On 2013-12-03 12:45, Emmanuel Lacour wrote:
On Tue, Dec 03, 2013 at 12:38:54PM +, James Pearce wrote:
Since the journal partitions are generally small, it shouldn't need
to be.
h
On Tue, Dec 03, 2013 at 12:38:54PM +, James Pearce wrote:
> Since the journal partitions are generally small, it shouldn't need
> to be.
>
here with 2 journals (2 osds) on two ssd (samsung 850 pro, soft raid1 +
lvm + xfs) trim is just obligatory. We forgot to set it at cluster setup
and one w
Since the journal partitions are generally small, it shouldn't need to
be.
For example, implement them with substantial under-provisioning, either via
HPA or simple partitions.
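A rough sketch of the "simple partitions" approach, assuming a freshly secure-erased SSD and sgdisk; the two 20 GiB journal partitions and the roughly 20% left unallocated are illustrative numbers, not recommendations from this message:

  # allocate only what the journals need and leave the tail of the SSD
  # unpartitioned, so the controller always has never-written spare cells
  sgdisk --zap-all /dev/sdX
  sgdisk --new=1:0:+20G --change-name=1:"ceph journal" /dev/sdX
  sgdisk --new=2:0:+20G --change-name=2:"ceph journal" /dev/sdX
  # remaining space is deliberately left without any partition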
On 2013-12-03 12:18, Loic Dachary wrote:
Hi Ceph,
When an SSD partition is used to store a journal
https://github.com/c
Hi Ceph,
When an SSD partition is used to store a journal
https://github.com/ceph/ceph/blob/master/src/os/FileJournal.cc#L90
how is it trimmed ?
http://en.wikipedia.org/wiki/Trim_%28computing%29
Cheers
--
Loïc Dachary, Artisan Logiciel Libre