hmmm we are running infernalis... root@ceph1:~# ceph -v ceph version 9.2.0 (bb2ecea240f3a1d525bcb35670cb07bd1f0ca299) root@ceph1:~#
So you think everything should work with hammer ? As this is all just a proof of concept on some vm's so we know how to build into prod Regards, Daniel On Mon, Feb 1, 2016 at 11:22 PM, John Spray <jsp...@redhat.com> wrote: > The "assert path[-1] == 'type'" is the error you get when using the > calamari diamond branch with a >= infernalis version of Ceph (where > new fields were added to the perf schema output). No idea if anyone > has worked on updating Calamari+Diamond for latest ceph. > > John > > On Mon, Feb 1, 2016 at 12:09 PM, Daniel Rolfe <daniel.rolfe...@gmail.com> > wrote: > > I can see the is ok files are there > > > > root@ceph1:/var/run/ceph# ls -la > > total 0 > > drwxrwx--- 2 ceph ceph 80 Feb 1 10:51 . > > drwxr-xr-x 18 root root 640 Feb 1 10:52 .. > > srwxr-xr-x 1 ceph ceph 0 Feb 1 10:51 ceph-mon.ceph1.asok > > srwxr-xr-x 1 root root 0 Jan 27 15:08 ceph-osd.0.asok > > root@ceph1:/var/run/ceph# > > root@ceph1:/var/run/ceph# > > root@ceph1:/var/run/ceph# > > > > > > Running diamond in debug show the below > > > > [2016-02-01 10:55:23,774] [Thread-1] Collecting data from: > NetworkCollector > > [2016-02-01 10:56:23,484] [Thread-1] Collecting data from: CPUCollector > > [2016-02-01 10:56:23,487] [Thread-6] Collecting data from: > MemoryCollector > > [2016-02-01 10:56:23,489] [Thread-7] Collecting data from: > SockstatCollector > > [2016-02-01 10:56:23,768] [Thread-1] Collecting data from: CephCollector > > [2016-02-01 10:56:23,768] [Thread-1] gathering service stats for > > /var/run/ceph/ceph-mon.ceph1.asok > > [2016-02-01 10:56:24,094] [Thread-1] Traceback (most recent call last): > > File "/usr/lib/pymodules/python2.7/diamond/collector.py", line 412, in > > _run > > self.collect() > > File "/usr/share/diamond/collectors/ceph/ceph.py", line 464, in collect > > self._collect_service_stats(path) > > File "/usr/share/diamond/collectors/ceph/ceph.py", line 450, in > > _collect_service_stats > > self._publish_stats(counter_prefix, stats, schema, GlobalName) > > File "/usr/share/diamond/collectors/ceph/ceph.py", line 305, in > > _publish_stats > > assert path[-1] == 'type' > > AssertionError > > > > [2016-02-01 10:56:24,096] [Thread-8] Collecting data from: > > LoadAverageCollector > > [2016-02-01 10:56:24,098] [Thread-1] Collecting data from: > VMStatCollector > > [2016-02-01 10:56:24,099] [Thread-1] Collecting data from: > > DiskUsageCollector > > [2016-02-01 10:56:24,104] [Thread-9] Collecting data from: > > DiskSpaceCollector > > > > > > > > Check the md5 on the file returns the below: > > > > root@ceph1:/var/run/ceph# md5sum > /usr/share/diamond/collectors/ceph/ceph.py > > aeb3915f8ac7fdea61495805d2c99f33 > /usr/share/diamond/collectors/ceph/ceph.py > > root@ceph1:/var/run/ceph# > > > > > > > > I've found that replacing the ceph.py file with the below stops the > diamond > > error > > > > > > > https://raw.githubusercontent.com/BrightcoveOS/Diamond/master/src/collectors/ceph/ceph.py > > > > root@ceph1:/usr/share/diamond/collectors/ceph# md5sum ceph.py > > 13ac74ce0df39a5def879cb5fc530015 ceph.py > > > > > > [2016-02-01 11:14:33,116] [Thread-42] Collecting data from: > MemoryCollector > > [2016-02-01 11:14:33,117] [Thread-1] Collecting data from: CPUCollector > > [2016-02-01 11:14:33,123] [Thread-43] Collecting data from: > > SockstatCollector > > [2016-02-01 11:14:35,453] [Thread-1] Collecting data from: CephCollector > > [2016-02-01 11:14:35,454] [Thread-1] checking > > /var/run/ceph/ceph-mon.ceph1.asok > > [2016-02-01 11:14:35,552] [Thread-1] checking > /var/run/ceph/ceph-osd.0.asok > > [2016-02-01 11:14:35,685] [Thread-44] Collecting data from: > > LoadAverageCollector > > [2016-02-01 11:14:35,686] [Thread-1] Collecting data from: > VMStatCollector > > [2016-02-01 11:14:35,687] [Thread-1] Collecting data from: > > DiskUsageCollector > > [2016-02-01 11:14:35,692] [Thread-45] Collecting data from: > > DiskSpaceCollector > > > > > > But after all that it's still NOT working > > > > What diamond version are you running ? > > > > I'm running Diamond version 3.4.67 > > > > On Mon, Feb 1, 2016 at 11:01 PM, Daniel Rolfe <daniel.rolfe...@gmail.com > > > > wrote: > >> > >> I can see the is ok files are there > >> > >> root@ceph1:/var/run/ceph# ls -la > >> total 0 > >> drwxrwx--- 2 ceph ceph 80 Feb 1 10:51 . > >> drwxr-xr-x 18 root root 640 Feb 1 10:52 .. > >> srwxr-xr-x 1 ceph ceph 0 Feb 1 10:51 ceph-mon.ceph1.asok > >> srwxr-xr-x 1 root root 0 Jan 27 15:08 ceph-osd.0.asok > >> root@ceph1:/var/run/ceph# > >> root@ceph1:/var/run/ceph# > >> root@ceph1:/var/run/ceph# > >> > >> > >> Running diamond in debug show the below > >> > >> [2016-02-01 10:55:23,774] [Thread-1] Collecting data from: > >> NetworkCollector > >> [2016-02-01 10:56:23,484] [Thread-1] Collecting data from: CPUCollector > >> [2016-02-01 10:56:23,487] [Thread-6] Collecting data from: > MemoryCollector > >> [2016-02-01 10:56:23,489] [Thread-7] Collecting data from: > >> SockstatCollector > >> [2016-02-01 10:56:23,768] [Thread-1] Collecting data from: CephCollector > >> [2016-02-01 10:56:23,768] [Thread-1] gathering service stats for > >> /var/run/ceph/ceph-mon.ceph1.asok > >> [2016-02-01 10:56:24,094] [Thread-1] Traceback (most recent call last): > >> File "/usr/lib/pymodules/python2.7/diamond/collector.py", line 412, in > >> _run > >> self.collect() > >> File "/usr/share/diamond/collectors/ceph/ceph.py", line 464, in > collect > >> self._collect_service_stats(path) > >> File "/usr/share/diamond/collectors/ceph/ceph.py", line 450, in > >> _collect_service_stats > >> self._publish_stats(counter_prefix, stats, schema, GlobalName) > >> File "/usr/share/diamond/collectors/ceph/ceph.py", line 305, in > >> _publish_stats > >> assert path[-1] == 'type' > >> AssertionError > >> > >> [2016-02-01 10:56:24,096] [Thread-8] Collecting data from: > >> LoadAverageCollector > >> [2016-02-01 10:56:24,098] [Thread-1] Collecting data from: > VMStatCollector > >> [2016-02-01 10:56:24,099] [Thread-1] Collecting data from: > >> DiskUsageCollector > >> [2016-02-01 10:56:24,104] [Thread-9] Collecting data from: > >> DiskSpaceCollector > >> > >> > >> > >> Check the md5 on the file returns the below: > >> > >> root@ceph1:/var/run/ceph# md5sum > >> /usr/share/diamond/collectors/ceph/ceph.py > >> aeb3915f8ac7fdea61495805d2c99f33 > >> /usr/share/diamond/collectors/ceph/ceph.py > >> root@ceph1:/var/run/ceph# > >> > >> > >> > >> I've found that replacing the ceph.py file with the below stops the > >> diamond error > >> > >> > >> > >> > https://raw.githubusercontent.com/BrightcoveOS/Diamond/master/src/collectors/ceph/ceph.py > >> > >> root@ceph1:/usr/share/diamond/collectors/ceph# md5sum ceph.py > >> 13ac74ce0df39a5def879cb5fc530015 ceph.py > >> > >> > >> [2016-02-01 11:14:33,116] [Thread-42] Collecting data from: > >> MemoryCollector > >> [2016-02-01 11:14:33,117] [Thread-1] Collecting data from: CPUCollector > >> [2016-02-01 11:14:33,123] [Thread-43] Collecting data from: > >> SockstatCollector > >> [2016-02-01 11:14:35,453] [Thread-1] Collecting data from: CephCollector > >> [2016-02-01 11:14:35,454] [Thread-1] checking > >> /var/run/ceph/ceph-mon.ceph1.asok > >> [2016-02-01 11:14:35,552] [Thread-1] checking > >> /var/run/ceph/ceph-osd.0.asok > >> [2016-02-01 11:14:35,685] [Thread-44] Collecting data from: > >> LoadAverageCollector > >> [2016-02-01 11:14:35,686] [Thread-1] Collecting data from: > VMStatCollector > >> [2016-02-01 11:14:35,687] [Thread-1] Collecting data from: > >> DiskUsageCollector > >> [2016-02-01 11:14:35,692] [Thread-45] Collecting data from: > >> DiskSpaceCollector > >> > >> > >> But after all that it's still now working > >> > >> What diamond version are you running ? > >> > >> I'm running Diamond version 3.4.67 > >> > >> > >> On Mon, Feb 1, 2016 at 12:24 PM, hnuzhoulin <hnuzhoul...@gmail.com> > wrote: > >>> > >>> Yes,in my environment I fix it. > >>> BTW,I check the md5 of ceph collection file.It is correct. > >>> > >>> 在 Sun, 31 Jan 2016 22:46:42 +0800,Daniel Rolfe > >>> <daniel.rolfe...@gmail.com> 写道: > >>> > >>> Hi, thanks for the reply > >>> > >>> Just to confirm , did you manage to fix this issue ? > >>> > >>> I've restarted the whole ceph cluster a few times. > >>> > >>> Sent from my iPhone > >>> > >>> On 1 Feb 2016, at 1:26 AM, hnuzhoulin <hnuzhoul...@gmail.com> wrote: > >>> > >>> I just face the same problem. > >>> > >>> The problem is my cluster missing the asok files of mons although the > >>> cluster works well. > >>> > >>> so kill mon process and restart it may fix it.(using service command to > >>> restart mon daemon may do not work) > >>> > >>> > >>> 在 Sun, 31 Jan 2016 10:35:25 +0800,Daniel Rolfe > >>> <daniel.rolfe...@gmail.com> 写道: > >>> > >>> Seem to be having an issue with global ceph stats getting back to > >>> calamari > >>> > >>> Individual node and osd stats are working > >>> > >>> If anyone can point me into the right direction that would be great > >>> > >>> https://github.com/ceph/calamari/issues/384 > >>> > >>> > >>> > >>> > >>> > >>> > >>> > >>> -- > >>> ------------------------- > >>> hnuzhoul...@gmail.com > >>> > >>> > >>> > >>> > >>> -- > >>> ------------------------- > >>> hnuzhou...@gmail.com > >> > >> > > > > > > _______________________________________________ > > ceph-users mailing list > > ceph-users@lists.ceph.com > > http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com > > >
_______________________________________________ ceph-users mailing list ceph-users@lists.ceph.com http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com