Re: [ceph-users] Cephfs unaccessible

2013-04-23 Thread Marco Aroldi
Hi, this morning I have this situation: health HEALTH_WARN 1540 pgs backfill; 30 pgs backfill_toofull; 113 pgs backfilling; 43 pgs degraded; 38 pgs peering; 5 pgs recovering; 484 pgs recovery_wait; 38 pgs stuck inactive; 2180 pgs stuck unclean; recovery 2153828/21551430 degraded (9.994%); noup,n
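The degraded percentage in the health line above can be checked by hand. This is a minimal sketch that assumes the two numbers in "recovery 2153828/21551430 degraded" are a degraded count and a total count; the variable names are illustrative and not taken from Ceph.

```python
# Figures copied from the HEALTH_WARN line quoted above.
degraded = 2153828   # assumed: degraded object copies
total = 21551430     # assumed: total object copies

# Ratio expressed as a percentage.
pct = 100.0 * degraded / total

# Print with three decimals, the precision the status line uses.
print("%.3f%%" % pct)
```

The result rounds to the 9.994% shown in the status output, so roughly one copy in ten was still waiting on recovery or backfill.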

[ceph-users] Down OSD never goes out with v0.60

2013-04-23 Thread Aurélien Dunand
Hi, I've got a problem with Ceph 0.60: when an OSD fails, it goes down, but it never goes out, even after the down-out interval. This worked in 0.56.4. When I stop an OSD, it is still not marked out of the cluster after mon_osd_down_out_interval seconds: ceph osd tree # id weight

Re: [ceph-users] RadosGW and S3-compatible clients for PC and OSX

2013-04-23 Thread Igor Laskovy
Sorry for the delayed reply, I am not very familiar with Apache. For RGW I use one of the OSD nodes. This is a clean minimal installation of Ubuntu 12.04 with a Ceph deployment on it and no other services. I must say that I use the default apache2 package from the Ubuntu repository and have rgw print continue =

Re: [ceph-users] RadosGW and S3-compatible clients for PC and OSX

2013-04-23 Thread Yehuda Sadeh
On Tue, Apr 23, 2013 at 7:51 AM, Igor Laskovy wrote: > Sorry for the delayed reply, > > I am not very familiar with Apache. > For RGW I use one of the OSD nodes. This is a clean minimal installation of > Ubuntu 12.04 with a Ceph deployment on it and no other services. > I must say that I use the default apac

Re: [ceph-users] RadosGW and S3-compatible clients for PC and OSX

2013-04-23 Thread Igor Laskovy
In /etc/apache2/httpd.conf I have : ServerName osd01.ceph.labspace.studiogrizzly.com In /etc/apache2/sites-available/rgw.conf : FastCgiExternalServer /var/www/s3gw.fcgi -socket /tmp/radosgw.sock ServerName osd01.ceph.labspace.studiogrizzly.com ServerAdmin igor.lask...@gmail.com

[ceph-users] Corruption by missing blocks

2013-04-23 Thread Bryan Stillwell
I've run into an issue where after copying a file to my cephfs cluster the md5sums no longer match. I believe I've tracked it down to some parts of the file which are missing: $ obj_name=$(cephfs "title1.mkv" show_location -l 0 | grep object_name | sed -e "s/.*:\W*\([0-9a-f]*\)\.[0-9a-f]*/\1/") $

Re: [ceph-users] RadosGW and S3-compatible clients for PC and OSX

2013-04-23 Thread Yehuda Sadeh
On Tue, Apr 23, 2013 at 11:33 AM, Igor Laskovy wrote: > In /etc/apache2/httpd.conf I have : > ServerName osd01.ceph.labspace.studiogrizzly.com > > In /etc/apache2/sites-available/rgw.conf : > FastCgiExternalServer /var/www/s3gw.fcgi -socket /tmp/radosgw.sock > > > ServerName osd01.ceph

Re: [ceph-users] Corruption by missing blocks

2013-04-23 Thread Gregory Farnum
On Tue, Apr 23, 2013 at 11:38 AM, Bryan Stillwell wrote: > I've run into an issue where after copying a file to my cephfs cluster > the md5sums no longer match. I believe I've tracked it down to some > parts of the file which are missing: > > $ obj_name=$(cephfs "title1.mkv" show_location -l 0 |

Re: [ceph-users] RadosGW and S3-compatible clients for PC and OSX

2013-04-23 Thread Igor Laskovy
So, I am totally lost in this, but I did it, and now CrossFTP reports: [R1] Connect to osd01.ceph.labspace.studiogrizzly.com [R1] Current path: / [R1] Current path: / [R1] LIST / [R1] Request Error [ 404 Not Found Not Found The requested URL / was not found on this server. ]. On Tue, Apr 23, 2013 a

Re: [ceph-users] RadosGW and S3-compatible clients for PC and OSX

2013-04-23 Thread Igor Laskovy
OK, I removed the right lines. Now CrossFTP connects, but when I try to create a bucket it reports: [R1] S3 Error: -1 (null) error: Request Error: java.net.UnknownHostException: fdfdf.osd01.ceph.labspace.studiogrizzly.com; XML Error Message: null [R1] -1 (null) error: Request Error: java.net.UnknownHostE

Re: [ceph-users] RadosGW and S3-compatible clients for PC and OSX

2013-04-23 Thread Lorieri
I've run some tests again with s3cmd. You need a valid and accessible host_bucket key in the .s3cfg, for example: host_bucket = %(bucket)s.myhostname.com. If you don't have it, it does not allow you to use lowercase buckets. I believe it checks whether the bucket name is a valid DNS name, etc. for
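The host_bucket template above uses Python-style %(name)s substitution, which also explains why a client ends up resolving a hostname that starts with the bucket name. The sketch below is illustrative only. It assumes plain %-formatting expands the template and that the validity check resembles a single-DNS-label test. The helper names and the regular expression are hypothetical and are not taken from s3cmd.

```python
import re

# Template copied from the example above; "myhostname.com" is a placeholder.
HOST_BUCKET = "%(bucket)s.myhostname.com"

# Hypothetical check: one DNS label made of lowercase letters, digits and
# hyphens, 1 to 63 characters, not starting or ending with a hyphen.
_DNS_LABEL = re.compile(r"^[a-z0-9]([a-z0-9-]{0,61}[a-z0-9])?$")


def bucket_hostname(bucket, template=HOST_BUCKET):
    """Expand the host_bucket template for one bucket name."""
    return template % {"bucket": bucket}


def looks_like_dns_label(bucket):
    """Return True if the bucket name could be used as a hostname label."""
    return _DNS_LABEL.match(bucket) is not None


print(bucket_hostname("fdfdf"))
print(looks_like_dns_label("fdfdf"))
print(looks_like_dns_label("My_Bucket"))
```

With virtual-host-style addressing, the expanded name has to resolve in DNS (typically through a wildcard record), otherwise the client fails with an unknown-host error before the request reaches the gateway.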

Re: [ceph-users] Corruption by missing blocks

2013-04-23 Thread Bryan Stillwell
I've tried a few different ones: 1. cp to cephfs mounted filesystem on Ubuntu 12.10 (quantal) 2. rsync over ssh to cephfs mounted filesystem on Ubuntu 12.04.2 (precise) 3. scp to cephfs mounted filesystem on Ubuntu 12.04.2 (precise) It's fairly reproducible, so I can collect logs for you. Which

Re: [ceph-users] Corruption by missing blocks

2013-04-23 Thread Gregory Farnum
Sorry, I meant kernel client or ceph-fuse? Client logs would be enough to start with, I suppose: "debug client = 20" and "debug ms = 1" if using ceph-fuse. If using the kernel client, things get trickier; I'd have to look at what logging is available without the debugfs stuff being enabled. :/ -Greg

Re: [ceph-users] Corruption by missing blocks

2013-04-23 Thread Bryan Stillwell
I'm using the kernel client that's built into precise & quantal. I could give the ceph-fuse client a try and see if it has the same issue. I haven't used it before, so I'll have to do some reading first. Bryan On Tue, Apr 23, 2013 at 4:04 PM, Gregory Farnum wrote: > Sorry, I meant kernel clien

Re: [ceph-users] Corruption by missing blocks

2013-04-23 Thread Gregory Farnum
On Tue, Apr 23, 2013 at 3:37 PM, Bryan Stillwell wrote: > I'm using the kernel client that's built into precise & quantal. > > I could give the ceph-fuse client a try and see if it has the same > issue. I haven't used it before, so I'll have to do some reading > first. If you've got the time tha

Re: [ceph-users] Corruption by missing blocks

2013-04-23 Thread Bryan Stillwell
I'm testing this now, but while going through the logs I saw something that might have something to do with this: Apr 23 16:35:28 a1 kernel: [692455.496594] libceph: corrupt inc osdmap epoch 22146 off 102 (88021e0dc802 of 88021e0dc79c-88021e0dc802) Apr 23 16:35:28 a1 kernel: [692455.50

Re: [ceph-users] Corruption by missing blocks

2013-04-23 Thread Sage Weil
On Tue, 23 Apr 2013, Bryan Stillwell wrote: > I'm testing this now, but while going through the logs I saw something > that might have something to do with this: > > Apr 23 16:35:28 a1 kernel: [692455.496594] libceph: corrupt inc osdmap > epoch 22146 off 102 (88021e0dc802 of > 88021e0dc79c

Re: [ceph-users] Corruption by missing blocks

2013-04-23 Thread Bryan Stillwell
On Tue, Apr 23, 2013 at 5:24 PM, Sage Weil wrote: > > On Tue, 23 Apr 2013, Bryan Stillwell wrote: > > I'm testing this now, but while going through the logs I saw something > > that might have something to do with this: > > > > Apr 23 16:35:28 a1 kernel: [692455.496594] libceph: corrupt inc osdmap

Re: [ceph-users] Corruption by missing blocks

2013-04-23 Thread Sage Weil
On Tue, 23 Apr 2013, Bryan Stillwell wrote: > On Tue, Apr 23, 2013 at 5:24 PM, Sage Weil wrote: > > > > On Tue, 23 Apr 2013, Bryan Stillwell wrote: > > > I'm testing this now, but while going through the logs I saw something > > > that might have something to do with this: > > > > > > Apr 23 16:35

Re: [ceph-users] Corruption by missing blocks

2013-04-23 Thread Gregory Farnum
On Tue, Apr 23, 2013 at 4:45 PM, Sage Weil wrote: > On Tue, 23 Apr 2013, Bryan Stillwell wrote: >> On Tue, Apr 23, 2013 at 5:24 PM, Sage Weil wrote: >> > >> > On Tue, 23 Apr 2013, Bryan Stillwell wrote: >> > > I'm testing this now, but while going through the logs I saw something >> > > that migh

Re: [ceph-users] Corruption by missing blocks

2013-04-23 Thread Bryan Stillwell
On Tue, Apr 23, 2013 at 5:45 PM, Sage Weil wrote: > On Tue, 23 Apr 2013, Bryan Stillwell wrote: >> On Tue, Apr 23, 2013 at 5:24 PM, Sage Weil wrote: >> > >> > On Tue, 23 Apr 2013, Bryan Stillwell wrote: >> > > I'm testing this now, but while going through the logs I saw something >> > > that migh

Re: [ceph-users] Corruption by missing blocks

2013-04-23 Thread Bryan Stillwell
On Tue, Apr 23, 2013 at 5:54 PM, Gregory Farnum wrote: > On Tue, Apr 23, 2013 at 4:45 PM, Sage Weil wrote: >> On Tue, 23 Apr 2013, Bryan Stillwell wrote: >>> On Tue, Apr 23, 2013 at 5:24 PM, Sage Weil wrote: >>> > >>> > On Tue, 23 Apr 2013, Bryan Stillwell wrote: >>> > > I'm testing this now, bu

[ceph-users] Upcoming Stable Release and Wheezy

2013-04-23 Thread Steve Hindle
Hi All, The next stable releases of both Ceph and Debian are fast approaching. I'm just getting started with Ceph and was hoping to install it when I do the wheezy upgrades. As such, I have a couple of questions: 1.) Will the upcoming stable release have packages for Debian W

Re: [ceph-users] Upcoming Stable Release and Wheezy

2013-04-23 Thread Mike Lowe
2. With kernels older than 3.8, Btrfs will lose data with sparse files, so DO NOT USE IT. I've had trouble with Btrfs file deletion hanging my OSDs for up to 15 minutes with kernel 3.7 with the Btrfs sparse file patch applied. On Apr 23, 2013, at 8:20 PM, Steve Hindle wrote: > > Hi All, > > The

[ceph-users] rbd command error "librbd::ImageCtx: error finding header"

2013-04-23 Thread Dennis Chen
Hi list, I am using a Ceph cluster (version 0.56.4) with all nodes (mon, mds, osd...) deployed on RHEL 6; the client is based on Ubuntu 12.10. I am confused by a strange issue. According to Google it has been asked about before, but with no clear answer. The specific details as

Re: [ceph-users] rbd command error "librbd::ImageCtx: error finding header"

2013-04-23 Thread Michael Lowe
My initial reaction is that you should use -p because rbd defaults to the rbd pool. You are in effect trying to get info about mypool/odm-kvm-img from rbd/odm-kvm-img which doesn't exist. Sent from my iPad On Apr 23, 2013, at 11:24 PM, Dennis Chen wrote: > Hi list, > > I am using a ceph cl
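The mismatch described in this reply can be pictured as a default being filled in when no pool is given. This is a minimal sketch, not librbd code: the function name is hypothetical, and the only fact taken from the thread is that the fallback pool is called rbd.

```python
DEFAULT_POOL = "rbd"  # the pool used when none is specified, per the reply above


def image_spec(image, pool=None):
    """Return the pool/image pair that a lookup would target."""
    return "%s/%s" % (pool or DEFAULT_POOL, image)


# Without a pool, the lookup targets the default pool, where the image is absent.
print(image_spec("odm-kvm-img"))

# Naming the pool explicitly targets the image that was actually created.
print(image_spec("odm-kvm-img", pool="mypool"))
```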

Re: [ceph-users] rbd command error "librbd::ImageCtx: error finding header"

2013-04-23 Thread Dennis Chen
Hi, A different error msg after your suggestion -- root@~# rbd -p mypool --image odm-kvm-img info rbd: error opening image odm-kvm-img: (95) Operation not supported 2013-04-24 11:32:47.757778 7f49949f7780 -1 librbd: Error listing snapshots: (95) Operation not supported I create the "odm-kvm-im

Re: [ceph-users] rbd command error "librbd::ImageCtx: error finding header"

2013-04-23 Thread Dennis Chen
Specifying -p mypool produces another error message, which looks like: root@~# rbd --image odm-kvm-img -p mypool info 2013-04-24 10:44:29.442715 7f8d9134b780 -1 rbd: error opening image librbd: Error listing snapshots: (95) Operation not supportedodm-kvm-img : (95) Operation not supported BRs, Dennis

Re: [ceph-users] rbd command error "librbd::ImageCtx: error finding header"

2013-04-23 Thread Barry O'Rourke
Hi, This sounds a lot like https://bugzilla.redhat.com/show_bug.cgi?id=891993. Barry On 04/24/13 04:24, Dennis Chen wrote: Hi list, I am using a ceph cluster (version 0.56.4) with all nodes (mon, mds, osd...) deployed in the RHEL 6 distro, the client is based on Ubuntu 12.10. Now I am confus

[ceph-users] Rebuild the monitor infrastructure

2013-04-23 Thread Bryan Stansell
Sorry for a possibly silly new-user question, but I was wondering if there is any way to rebuild the monitor infrastructure in case of catastrophic failure. My simple case is a single monitor. If the data is lost because of hardware failure, etc., can it be recreated from scratch? The same cou

Re: [ceph-users] Rebuild the monitor infrastructure

2013-04-23 Thread Martin B Nielsen
Hi Bryan, I asked the same question a few months ago: http://lists.ceph.com/pipermail/ceph-users-ceph.com/2013-February/000221.html But basically, that is pretty bad; you'll be stuck on your own and would need to get in contact with Inktank - they might be able to help rebuild a monitor for you.