On Monday 19 October 2015 22:25:28 Sven Arvidsson wrote:
> On Mon, 2015-10-19 at 22:53 +0300, David Baron wrote:
> > OK, but these errors are marked by smart so may indicate disk
> > problems.
>
> That seems very, very odd, please post logs.

Here is the result of cat /var/log/syslog.1 | grep -i smart
(the CACHE_ERROR lines are the ones I am referring to):

Oct 19 11:19:10 dovidhalevi smartd[907]: Device: /dev/sda [SAT], FAILED SMART self-check. BACK UP DATA NOW!
Oct 19 11:19:10 dovidhalevi smartd[907]: Device: /dev/sda [SAT], Failed SMART usage Attribute: 5 Reallocated_Sector_Ct.
Oct 19 11:19:10 dovidhalevi smartd[907]: Device: /dev/sda [SAT], SMART Usage Attribute: 194 Temperature_Celsius changed from 102 to 107
Oct 19 11:30:48 dovidhalevi spamd[2105]: spamd: processing message <20151019083014.117268.3...@smartphoneexperts.com> for root:65534
Oct 19 11:30:53 dovidhalevi spamd[2105]: spamd: result: Y 6 - BOTNET,HTML_MESSAGE,LOTS_OF_MONEY,RCVD_IN_DNSWL_NONE,RDNS_NONE,T_DKIM_INVALID scantime=5.6,size=21174,user=root,uid=65534,required_score=5.0,rhost=localhost,raddr=::1,rport=56444,mid=<20151019083014.117268.3...@smartphoneexperts.com>,autolearn=no autolearn_force=no
Oct 19 11:49:10 dovidhalevi smartd[907]: Device: /dev/sda [SAT], FAILED SMART self-check. BACK UP DATA NOW!
Oct 19 11:49:10 dovidhalevi smartd[907]: Sending warning via /usr/share/smartmontools/smartd-runner to root ...
Oct 19 11:49:11 dovidhalevi smartd[907]: Warning via /usr/share/smartmontools/smartd-runner to root: successful
Oct 19 11:49:11 dovidhalevi smartd[907]: Device: /dev/sda [SAT], Failed SMART usage Attribute: 5 Reallocated_Sector_Ct.
Oct 19 11:49:11 dovidhalevi smartd[907]: Sending warning via /usr/share/smartmontools/smartd-runner to root ...
Oct 19 11:49:12 dovidhalevi smartd[907]: Warning via /usr/share/smartmontools/smartd-runner to root: successful
Oct 19 11:49:12 dovidhalevi smartd[907]: Device: /dev/sdb [SAT], SMART Usage Attribute: 194 Temperature_Celsius changed from 108 to 109
Oct 19 12:19:10 dovidhalevi smartd[907]: Device: /dev/sda [SAT], FAILED SMART self-check. BACK UP DATA NOW!
Oct 19 12:19:10 dovidhalevi smartd[907]: Device: /dev/sda [SAT], Failed SMART usage Attribute: 5 Reallocated_Sector_Ct.
Oct 19 12:19:10 dovidhalevi smartd[907]: Device: /dev/sda [SAT], SMART Prefailure Attribute: 7 Seek_Error_Rate changed from 200 to 100
Oct 19 12:19:10 dovidhalevi smartd[907]: Device: /dev/sda [SAT], SMART Usage Attribute: 194 Temperature_Celsius changed from 107 to 103
Oct 19 12:49:10 dovidhalevi smartd[907]: Device: /dev/sda [SAT], FAILED SMART self-check. BACK UP DATA NOW!
Oct 19 12:49:10 dovidhalevi smartd[907]: Device: /dev/sda [SAT], Failed SMART usage Attribute: 5 Reallocated_Sector_Ct.
Oct 19 12:49:10 dovidhalevi smartd[907]: Device: /dev/sda [SAT], SMART Usage Attribute: 194 Temperature_Celsius changed from 103 to 106
Oct 19 12:49:10 dovidhalevi smartd[907]: Device: /dev/sdb [SAT], SMART Usage Attribute: 194 Temperature_Celsius changed from 109 to 108
Oct 19 13:04:05 dovidhalevi systemd[1]: Started Self Monitoring and Reporting Technology (SMART) Daemon.
Oct 19 13:04:06 dovidhalevi smartd[904]: smartd 6.4 2014-10-07 r4002 [x86_64-linux-4.0.0-1-amd64] (local build)
Oct 19 13:04:06 dovidhalevi smartd[904]: Copyright (C) 2002-14, Bruce Allen, Christian Franke, www.smartmontools.org
Oct 19 13:04:06 dovidhalevi smartd[904]: Opened configuration file /etc/smartd.conf
Oct 19 13:04:06 dovidhalevi smartd[904]: Drive: DEVICESCAN, implied '-a' Directive on line 21 of file /etc/smartd.conf
Oct 19 13:04:06 dovidhalevi smartd[904]: Configuration file /etc/smartd.conf was parsed, found DEVICESCAN, scanning devices
Oct 19 13:04:06 dovidhalevi smartd[904]: Device: /dev/sda, type changed from 'scsi' to 'sat'
Oct 19 13:04:06 dovidhalevi smartd[904]: Device: /dev/sda [SAT], opened
Oct 19 13:04:06 dovidhalevi smartd[904]: Device: /dev/sda [SAT], WDC WD800JB-00ETA0, S/N:WD-WCAHL5220676, FW:77.07W77, 80.0 GB
Oct 19 13:04:06 dovidhalevi smartd[904]: Device: /dev/sda [SAT], found in smartd database: Western Digital Caviar SE
Oct 19 13:04:07 dovidhalevi smartd[904]: Device: /dev/sda [SAT], is SMART capable. Adding to "monitor" list.
Oct 19 13:04:07 dovidhalevi smartd[904]: Device: /dev/sda [SAT], state read from /var/lib/smartmontools/smartd.WDC_WD800JB_00ETA0-WD_WCAHL5220676.ata.state
Oct 19 13:04:07 dovidhalevi smartd[904]: Device: /dev/sdb, type changed from 'scsi' to 'sat'
Oct 19 13:04:07 dovidhalevi smartd[904]: Device: /dev/sdb [SAT], opened
Oct 19 13:04:07 dovidhalevi smartd[904]: Device: /dev/sdb [SAT], WDC WD10EFRX-68PJCN0, S/N:WD-WCC4J3154373, WWN:5-0014ee-25f442d96, FW:01.01A01, 1.00 TB
Oct 19 13:04:07 dovidhalevi smartd[904]: Device: /dev/sdb [SAT], found in smartd database: Western Digital Red (AF)
Oct 19 13:04:07 dovidhalevi smartd[904]: Device: /dev/sdb [SAT], is SMART capable. Adding to "monitor" list.
Oct 19 13:04:07 dovidhalevi smartd[904]: Device: /dev/sdb [SAT], state read from /var/lib/smartmontools/smartd.WDC_WD10EFRX_68PJCN0-WD_WCC4J3154373.ata.state
Oct 19 13:04:07 dovidhalevi smartd[904]: Monitoring 2 ATA and 0 SCSI devices
Oct 19 13:04:08 dovidhalevi smartd[904]: Device: /dev/sda [SAT], FAILED SMART self-check. BACK UP DATA NOW!
Oct 19 13:04:08 dovidhalevi smartd[904]: Device: /dev/sda [SAT], Failed SMART usage Attribute: 5 Reallocated_Sector_Ct.
Oct 19 13:04:08 dovidhalevi smartd[904]: Device: /dev/sda [SAT], SMART Usage Attribute: 194 Temperature_Celsius changed from 106 to 102
Oct 19 13:04:08 dovidhalevi smartd[904]: Device: /dev/sdb [SAT], SMART Prefailure Attribute: 3 Spin_Up_Time changed from 137 to 138
Oct 19 13:04:08 dovidhalevi smartd[904]: Device: /dev/sda [SAT], state written to /var/lib/smartmontools/smartd.WDC_WD800JB_00ETA0-WD_WCAHL5220676.ata.state
Oct 19 13:04:08 dovidhalevi smartd[904]: Device: /dev/sdb [SAT], state written to /var/lib/smartmontools/smartd.WDC_WD10EFRX_68PJCN0-WD_WCC4J3154373.ata.state
Oct 19 13:06:07 dovidhalevi kernel: [ 138.753114] nouveau E[ PFIFO][0000:01:00.0] CACHE_ERROR - ch 18 [smart-notifier[3593]] subc 7 mthd 0x0000 data 0xbeef3097
Oct 19 13:06:07 dovidhalevi kernel: [ 138.753207] nouveau E[ PFIFO][0000:01:00.0] CACHE_ERROR - ch 18 [smart-notifier[3593]] subc 7 mthd 0x0180 data 0xbeef0301
Oct 19 13:06:07 dovidhalevi kernel: [ 138.753292] nouveau E[ PFIFO][0000:01:00.0] CACHE_ERROR - ch 18 [smart-notifier[3593]] subc 7 mthd 0x0184 data 0xbeef0201
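
Since grep -i smart matches any line containing "smart", the listing mixes entries logged by smartd, by spamd (matched through smartphoneexperts.com) and by the kernel (matched through smart-notifier). A minimal Python sketch that groups such lines by the program that logged them may make this easier to see. It assumes the classic "Mon DD HH:MM:SS host program[pid]: message" syslog layout shown above; the name split_by_program and the regular expression are illustrative only and are not part of smartmontools.

```python
import re

# Assumed layout: "Mon DD HH:MM:SS host program[pid]: message".
# The "[pid]" part is optional (kernel lines do not carry one).
SYSLOG_RE = re.compile(
    r"^(?P<stamp>\w{3}\s+\d+\s[\d:]{8})\s(?P<host>\S+)\s"
    r"(?P<prog>[^\s\[:]+)(?:\[(?P<pid>\d+)\])?:\s(?P<msg>.*)$"
)


def split_by_program(lines):
    """Group syslog messages by the program that logged them.

    Hypothetical helper: returns a dict mapping program name to the
    list of message texts; lines that do not match are skipped.
    """
    groups = {}
    for line in lines:
        m = SYSLOG_RE.match(line.rstrip("\n"))
        if m:
            groups.setdefault(m.group("prog"), []).append(m.group("msg"))
    return groups


# Three lines copied from the log above, one per source.
sample = [
    "Oct 19 11:19:10 dovidhalevi smartd[907]: Device: /dev/sda [SAT], "
    "Failed SMART usage Attribute: 5 Reallocated_Sector_Ct.",
    "Oct 19 11:30:48 dovidhalevi spamd[2105]: spamd: processing message "
    "<20151019083014.117268.3...@smartphoneexperts.com> for root:65534",
    "Oct 19 13:06:07 dovidhalevi kernel: [ 138.753114] nouveau E[ PFIFO]"
    "[0000:01:00.0] CACHE_ERROR - ch 18 [smart-notifier[3593]] subc 7 "
    "mthd 0x0000 data 0xbeef3097",
]

groups = split_by_program(sample)
for prog, msgs in groups.items():
    print(prog, len(msgs))
```

To run it against the real file, something like split_by_program(open("/var/log/syslog.1")) should work, after which groups["smartd"] holds only the disk-related messages and groups["kernel"] the nouveau CACHE_ERROR lines.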