Hi, Jason 十分抱歉还得打扰您,我把我们用的所有虚拟机全都重启了一次,然后将数据清理,重新启动集群,发现这个问题还是重在,我们用的是 2m-2s-async的,只有其中一个主,刚启动起来没有 broker.log的日志(为空),只有store.log的日志,里面全部都是(2018-09-04 09:47:29 WARN StoreScheduledThread1 - disk space will be full soon, but delete file failed. 2018-09-04 09:47:39 WARN StoreScheduledThread1 - disk space will be full soon, but delete file failed. 2018-09-04 09:47:49 WARN StoreScheduledThread1 - disk space will be full soon, but delete file failed.) 而其他的broker确是正常的,感觉十分奇怪。 十分抱歉再次打扰。
Best regards. Will. On 2018/09/03 11:58:48, Jason Joo <[email protected]> wrote: > 对,物理重启 > > 之前强删数据的时候碰过一次,删了shm也不管用,当时没去细查具体机制,索性做了重启快速恢复服务节点。 > > best regards, > > Jason > > > On Sep 3, 2018, at 18:39, [email protected] wrote: > > > > 刚想起来,我没有重启namesrv,但应该和namesrv无关吧,我只重启了broker并且清了数据。 > > > > On 2018/09/03 10:22:55, Jason Joo <[email protected]> wrote: > >> 应该是某处缓存没清,一般发生在强停后强删数据文件之后,一般重启节点(系统)即可解决 > >> > >> best regards, > >> > >> Jason > >> > >>> On Sep 3, 2018, at 18:04, [email protected] wrote: > >>> > >>> > >>> > >>> On 2018/09/03 10:01:33, [email protected] > >>> <[email protected]> wrote: > >>>> 具体日志如下: > >>>> 2018-09-03 17:53:18 WARN StoreScheduledThread1 - disk space will be full > >>>> soon, but delete file failed. > >>>> 2018-09-03 17:53:28 INFO StoreScheduledThread1 - physic disk maybe full > >>>> soon, so reclaim space, -1.0 > >>>> 2018-09-03 17:53:28 INFO StoreScheduledThread1 - begin to delete before > >>>> 24 hours file. timeup: false spacefull: true > >>>> manualDeleteFileSeveralTimes: 0 cleanAtOnce: false > >>>> INFO AllocateMappedFileService - /opt/push/rocketmq/store/commitlog > >>>> mkdir OK > >>>> 2018-07-31 16:55:51 INFO StoreScheduledThread1 - logics disk maybe full > >>>> soon, so reclaim space, -1.0 > >>>> 2018-07-31 16:55:51 INFO StoreScheduledThread1 - begin to delete before > >>>> 120 hours file. timeup: false spacefull: true > >>>> manualDeleteFileSeveralTimes: 0 cleanAtOnce: false > >>>> 2018-07-31 16:55:51 WARN StoreScheduledThread1 - disk space will be full > >>>> soon, but delete file failed. > >>>> > >>>> 把所有集群停止后把所有数据都清掉,再启动集群,还是会报这个错误,这个具体可能是什么原因呢?大大们帮忙看一下。 > >>>> > >>>> > >>>> > >>>> > >> > >> > >
