Re: [ceph-users] Bad Write-Performance on Ceph/Possible bottlenecks?

2014-07-04 Thread Daniel Schwager
> Try to create e.g. 20 (small) rbd devices, putting them all in a lvm vg, > creating a logical volume (Raid0) with > 20 stripes and e.g. stripeSize 1MB (better bandwith) or 4kb (better io) - or > use md-raid0 (it's maybe 10% faster - but not that flexible): BTW - we use this approach for VMware

Re: [ceph-users] snapshoting on btrfs vs xfs

2015-02-04 Thread Daniel Schwager
Hi Cristian, > We will try to report back, but I'm not sure our use case is relevant. > We are trying to use every dirty trick to speed up the VMs. we have the same use-case. > The second pool is for the tests machines and has the journal in ram, > so this part is very volatile. We don't really

Re: [ceph-users] certificate of `ceph.com' is not trusted!

2015-02-12 Thread Daniel Schwager
Hi, I think the root-CA (COMODO RSA Certification Authority) is not available on your Linux host? Using Google chrome connecting to https://ceph.com/ works fine. regards Danny > -Original Message- > From: ceph-users [mailto:ceph-users-boun...@lists.ceph.com] On Behalf Of > Dietmar Maur

[ceph-users] Big or small node ?

2013-11-19 Thread Daniel Schwager
Hallo, we are going to setup a 36-40TB (brutto) test setup in our company for disk2disk2tape backup. Now, we are faced to decide if we go the high- or low density ceph way. --- The big node (only one necessary): 1 x Supermicro, System 6027R-E1R12T with 2 x CPU E5-2620v2 (6 core (Hyper thread

Re: [ceph-users] How to replace a failed OSD

2013-11-20 Thread Daniel Schwager
Hi Robert, > What is the easiest way to replace a failed disk / OSD. > It looks like the documentation here is not really compatible with > ceph_deploy: > http://ceph.com/docs/master/rados/operations/add-or-rm-osds/ I found the following thread useful: http://www.spinics.net/lists/ceph-u

Re: [ceph-users] problem with delete or rename a pool

2013-11-28 Thread Daniel Schwager
Hi, > The problem is: now I want to delete or rename the pool '-help', maybe you will try using double-hyphen ("--") [1] , e.g. something (not tested) like ceph osd pool rename -- "-help" aaa ceph osd pool delete -- -help regards Danny [1] http://unix.stackexchange.com/questi

Re: [ceph-users] odd performance graph

2013-12-01 Thread Daniel Schwager
Hi, > > The low points are all ~35Mbytes/sec and the high points are all > > ~60Mbytes/sec. This is very reproducible. > > It occurred to me that just stopping the OSD's selectively would allow me to > see if there was a change when one > was ejected, but at no time was there a change to the gra

Re: [ceph-users] Replacing a disk: Best practices?

2014-10-15 Thread Daniel Schwager
Hi, > I recently had an OSD disk die, and I'm wondering what are the > current "best practices" for replacing it. I think I've thoroughly removed > the old disk, both physically and logically, but I'm having trouble figuring > out how to add the new disk into ceph. I did this today (one

Re: [ceph-users] Replacing a disk: Best practices?

2014-10-15 Thread Daniel Schwager
Loic, > > root@ceph-node3:~# smartctl -a /dev/sdd | less > > === START OF INFORMATION SECTION === > > Device Model: ST4000NM0033-9ZM170 > > Serial Number:Z1Z5LGBX > > .. > > admin@ceph-admin:~/cluster1$ emacs -nw ceph.conf >

[ceph-users] RBD - possible to query "used space" of images/clones ?

2014-11-04 Thread Daniel Schwager
Hi, is there a way to query the used space of a RBD image created with format 2 (used for kvm)? Also, if I create a linked clone base on this image, how do I get the additional, individual used space of this clone? In zfs, I can query these kind of information by calling "zfs info .." (2). "rb

[ceph-users] Statistic information about rbd bandwith/usage (from a rbd/kvm client)

2014-11-09 Thread Daniel Schwager
Hi, is there a possibility to see which rbd-device (used by a kvm hypervisor) produces high load on a ceph cluster? "ceph -w" shows only the total usage - but I don't know see which client or rbd-device is responsible for this load. best regards Danny smime.p7s Description: S/MIME cryptograp

Re: [ceph-users] osds fails to start with mismatch in id

2014-11-10 Thread Daniel Schwager
Hi Ramakrishna, we use the phy. path (containing the serial number) to a disk to prevent complexity and wrong mapping... This path will never change: /etc/ceph/ceph.conf [osd.16] devs = /dev/disk/by-id/scsi-SATA_ST4000NM0033-9Z_Z1Z0SDCY-part1 osd_jo

[ceph-users] Way to improve "rados get" bandwidth?

2014-11-24 Thread Daniel Schwager
Hi, I did some bandwidth tests with "rados put/get" (1), (2). I'm wondering, why "rados get" (read) bandwidth-results are not that good. If I run - 1 instance of rados get (2), the bandwidth is about 170MB/sec, with - 4 instances (2) in parallel, I get 4 times ~170MB/sec == ~ 680MB/sec Is the

Re: [ceph-users] Tip of the week: don't use Intel 530 SSD's for journals

2014-11-25 Thread Daniel Schwager
Hi. > > If you are like me, you have the journals for your OSD's with rotating > > media stored separately on an SSD. If you are even more like me, you > > happen to use Intel 530 SSD's in some of your hosts. If so, please do > > check your S.M.A.R.T. statistics regularly, because these SSD's real

Re: [ceph-users] Number of SSD for OSD journal

2014-12-15 Thread Daniel Schwager
Hallo Mike, > This is also have another way. > * for CONF 2,3 replace 200Gb SSD to 800Gb and add another 1-2 SSD to > each node. > * make tier1 read-write cache on SSDs > * also you can add journal partition on them if you wish - then data > will moving from SSD to SSD before let down on HDD > * o

[ceph-users] monitor can not rejoin the cluster

2014-01-30 Thread Daniel Schwager
Hi all, my monitor3 is not able to rejoin the cluster (containing mon1, mon2 and mon3 - running stable emperor). I try to recreate/inject a new monmap to all 3 mon's - but only mon1 and mon2 are up and joined. Now, enabling debugging on mon3, I got the following: 2014-01-30 08:51:03.823669 7f3

Re: [ceph-users] monitor can not rejoin the cluster

2014-01-30 Thread Daniel Schwager
OK - found the problem: > mon_status > { "name": "ceph-mon3", .. > "mons": [ > { "rank": 2, > "name": "mon.ceph-mon3", NAME is wrong > "addr": "192.168.135.33:6789\/0"}]}} > In the docu http://ceph.com/docs/master/man/8/monmaptool/ the creation of t

[ceph-users] udev names /dev/sd* - what happens if they change ?

2014-01-30 Thread Daniel Schwager
Hi, just a small question: Createing a new OSD i use e.g. ceph-deploy osd create ceph-node1:sdg:/dev/sdb5 Question: What happens if the mapping of my disks changes (e.g. because adding new disks to the server) sdg becomes sgh sdb becomes sdc Is this handled (how?) by ce

Re: [ceph-users] Meetup in Frankfurt, before the Ceph day

2014-02-05 Thread Daniel Schwager
Hi, > Anyone else from ceph community , willing to join. will also visit my first ceph day (-: See you all in Frankfurt! best regards Danny smime.p7s Description: S/MIME cryptographic signature ___ ceph-users mailing list ceph-users@lists.ceph.com ht

Re: [ceph-users] Crush Maps

2014-02-06 Thread Daniel Schwager
Hallo Bradley, additionally to your question, I'm interesting in the following: 5) can I change all 'type' Ids because adding a new type "host-slow" to distinguish between OSD's with journal on the same HDD / separate SSD? E.g. from type 0 osd type 1 host

Re: [ceph-users] RBD+KVM problems with sequential read

2014-02-07 Thread Daniel Schwager
setup a bitter value for read_ahead_kb ? I tested with 256 MB read ahead cache ( From: ceph-users-boun...@lists.ceph.com [mailto:ceph-users-boun...@lists.ceph.com] On Behalf Of ??? Sent: Friday, February 07, 2014 10:55 AM To: Konrad Gutkowski Cc: ceph-users@lists.ceph.com Subject: Re:

Re: [ceph-users] RBD+KVM problems with sequential read

2014-02-07 Thread Daniel Schwager
> I'm sorry, but I did not understand you :) Sorry (-: My finger touched the RETURN-key to fast... Try to setup a bigger value for the read ahead cache, maybe 256 MB? echo "262144">/sys/block/vda/queue/read_ahead_kb Try also "fio" performance tool - it will show more detailed information.

[ceph-users] change order of an rbd image ?

2014-02-15 Thread Daniel Schwager
Hi, I created a 1TB rbd-image formated with vmfs (vmware) for an ESX server - but with a wrong order (25 instead of 22 ...). The rbd man page tells me for export/import/cp, rbd will use the order of the source image. Is there a way to change the order of a rbd image by doing some conversion? Ok

Re: [ceph-users] Write performance from qemu rbd (Mysql)

2014-03-12 Thread Daniel Schwager
Hi Robert, > Are there any people running Mysql instances from qemu rbd? > The performance of the databases is not that great. try to setup bcache or flashcache on top of the rdb-device. With a Intel S37000 caching SSD-disk, we got the following fio results agains a bcache'd rbd device:

[ceph-users] firefly performance - SSD-only based caching pool ./. normal pool (journal on SSD)

2014-04-11 Thread Daniel Schwager
Hi guys, I'm interesting in the question, which setup will be faster: a) a pool with journals on separate SSD's, OSD's on normal disks OR b) a caching pool with SSDs only (osd+journal) in front of a normal pool (osd+journal on normal disks) A normal setup could be like this: - 5-node f

Re: [ceph-users] Automount Failovered Multi MDS CephFS

2016-08-03 Thread Daniel Schwager
Maybe something like this? 192.168.135.31:6789:/ /cephfs ceph name=cephfs,secretfile=/etc/ceph/client.cephfs,noatime 0 0 Best regards Daniel From: ceph-users [mailto:ceph-users-boun...@lists.ceph.com] On Behalf Of Lazuardi Nasution Sent: Wednesday, August 03, 2016 6:10 PM

Re: [ceph-users] Ceph + VMWare

2016-10-05 Thread Daniel Schwager
storage.dtnetcloud.com/d/684b330eea/ --- DT Netsolution GmbH - Taläckerstr. 30-D-70437 Stuttgart Geschäftsführer: Daniel Schwager, Stefan Hörz - HRB Stuttgart 19870 Tel: +49-711-849910-32, Fax: -932 - Mailto:daniel.schwa...@d

[ceph-users] Possible to change RBD-Caching settings while rbd device is in use ?

2015-12-11 Thread Daniel Schwager
Hallo ceph-users, is it possible to change the rbd caching settings [1] from a rbd devices which is already online, mapped and in use? I'm interesting in changing "READ-AHEAD SETTINGS" because of poor read performance of our iscsi/fc-target. But it's not (easy) possible to unmount the rbd dev

[ceph-users] pg is stuck stale (osd.21 still removed)

2016-01-08 Thread Daniel Schwager
Hi, we had a HW-problem with OSD.21 today. The OSD daemon was down and "smartctl" told me about some hardware errors. I decided to remove the HDD: ceph osd out 21 ceph osd crush remove osd.21 ceph auth del osd.21 ceph osd rm osd.21 But afterwards I saw t

Re: [ceph-users] pg is stuck stale (osd.21 still removed)

2016-01-08 Thread Daniel Schwager
One more - I tried to recreate the pg but now this pg this "stuck inactive": root@ceph-admin:~# ceph pg force_create_pg 34.225 pg 34.225 now creating, ok root@ceph-admin:~# ceph health detail HEALTH_WARN 49 pgs stale; 1 pgs stuck inactive; 49 pgs stuck stale; 1 pg

Re: [ceph-users] pg is stuck stale (osd.21 still removed)

2016-01-13 Thread Daniel Schwager
Hi ceph-users, any idea to fix my cluster? OSD.21 removed, but still some (staled) PG's pointing to OSD.21... I don't know how to proceed... Help is very welcome! Best regards Daniel > -Original Message- > From: Daniel Schwager > Sent: Friday, January 08, 2016 3:1

Re: [ceph-users] pg is stuck stale (osd.21 still removed) - SOLVED.

2016-01-13 Thread Daniel Schwager
Well, ok - I found the solution: ceph health detail HEALTH_WARN 50 pgs stale; 50 pgs stuck stale pg 34.225 is stuck inactive since forever, current state creating, last acting [] pg 34.225 is stuck unclean since forever, current state creating,

Re: [ceph-users] OSD behavior, in case of its journal disk (either HDD or SSD) failure

2016-01-25 Thread Daniel Schwager
Hi, > ok...OSD stop. Any reason why OSD stop ( I assume if journal disk > fails, OSD should work as no journal. Isn't it?) No. In my understanding - if a journal fails, all the attached (to this Journal HDD) OSD's fails also. E.g. if you have 4 OSD's with the 4 journals's located on one SSD-hard