It's unlikely to be the issue, but you might check the times on your OSDs. cephx is clock-sensitive if you're off by more than an hour or two. -Greg
Software Engineer #42 @ http://inktank.com | http://ceph.com On Tue, Jun 17, 2014 at 8:30 AM, Fred Yang <frederic.y...@gmail.com> wrote: > What's strange is OSD rebalance obviously has no problem, it's just new > object can't be written since the new segments can't be distributed to new > OSDs. > > Here is the error from radosgw.log: > > 2014-06-17 10:34:01.568754 7fc7e83f4700 0 cephx: verify_reply couldn't > decrypt with error: error decoding block for decryption > 2014-06-17 10:34:01.568763 7fc7e83f4700 0 -- 172.17.9.218:0/1034041 >> > 10.122.134.204:6820/14745 pipe(0x1e24710 sd=11 :54045 s=1 pgs=0 cs=0 l=1 > c=0x1e23db0).failed verifying authorize reply > > So it appears OSD can authenticate with each other, but the key generated > between client and mon are only visible to existing OSDs, but not new OSDs > just added? > > I'm trying increase cephx debug level on mon but it seems hanging: > > # ceph tell mon.* injectarts '--debug-auth=5' > no valid command found; 10 closest matches: > config-key exists <key> > config-key list > config-key put <key> {<val>} > config-key del <key> > osd tier remove-overlay <poolname> > config-key get <key> > osd tier cache-mode <poolname> none|writeback|invalidate+forward|readonly > osd tier set-overlay <poolname> <poolname> > mon remove <name> > osd tier remove <poolname> <poolname> > mon.nysanlab04: Error EINVAL: invalid command > mon.nysanlab04: invalid command > 2014-06-17 11:08:20.995510 7ffcd078b700 0 -- 172.17.9.218:0/1001296 >> > 172.17.9.219:6789/0 pipe(0x7ffccc021fe0 sd=4 :0 s=1 pgs=0 cs=0 l=1 > c=0x7ffccc022240).fault > > Am I using the wrong syntax to increse debug level for auth in mon? Or > something wrong with the cluster? > > > On Mon, Jun 16, 2014 at 5:56 PM, John Wilkins <john.wilk...@inktank.com> > wrote: > >> Did you run ceph-deploy in the directory where you ran ceph-deploy new >> and ceph-deploy gatherkeys? That's where the monitor bootstrap key should >> be. >> >> >> On Mon, Jun 16, 2014 at 8:49 AM, Fred Yang <frederic.y...@gmail.com> >> wrote: >> >>> I'm adding three OSD nodes(36 osds in total) to existing 3-node >>> cluster(35 osds) using ceph-deploy, after disks prepared and OSDs >>> activated, the cluster re-balanced and shows all pgs active+clean: >>> >>> osdmap e820: 72 osds: 71 up, 71 in >>> pgmap v173328: 15920 pgs, 17 pools, 12538 MB data, 3903 objects >>> 30081 MB used, 39631 GB / 39660 GB avail >>> 15920 active+clean >>> >>> However, the object write start having issue since the new OSDs added to >>> cluster: >>> >>> 2014-06-16 11:36:36.421868 osd.35 [WRN] slow request 30.317529 seconds >>> old, received at 2014-06-16 11:36:06.104256: osd_op(client.5568.0:1502400 >>> default.5250.4_loadtest/512B_file [getxattrs,stat] 9.552a7900 e820) v4 >>> currently waiting for rw locks >>> >>> And from existing osd log, it seems it's having problem to authenticate >>> the new OSDs (10.122.134.204 is the IP of one of new OSD nodes) : >>> >>> 2014-06-16 11:38:25.281270 7f58562ce700 0 cephx: verify_reply couldn't >>> decrypt with error: error decoding block for decryption >>> 2014-06-16 11:38:25.281288 7f58562ce700 0 -- 172.17.9.218:6811/2047255 >>> >> 10.122.134.204:6831/17571 pipe(0x2891280 sd=90 :48493 s=1 pgs=3091 >>> cs=10 l=0 c=0x62d1840).failed verifying authorize reply >>> >>> >>> The cephx auth list shows good to me: >>> >>> exported keyring for osd.45 >>> [osd.45] >>> key = AQAoCp5TqBq/MhAANwclbs1nCgefNfxqqPnkZQ== >>> caps mon = "allow profile osd" >>> caps osd = "allow *" >>> >>> The key above does not match the keyring on osd.45. >>> >>> Anybody have any clue what might be the authentication issue here? I'm >>> running Ceph 0.72.2. >>> >>> Thanks in advance, >>> Fred >>> >>> _______________________________________________ >>> ceph-users mailing list >>> ceph-users@lists.ceph.com >>> http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com >>> >>> >> >> >> -- >> John Wilkins >> Senior Technical Writer >> Intank >> john.wilk...@inktank.com >> (415) 425-9599 >> http://inktank.com >> > > > _______________________________________________ > ceph-users mailing list > ceph-users@lists.ceph.com > http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com > >
_______________________________________________ ceph-users mailing list ceph-users@lists.ceph.com http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com