On Wed, Dec 4, 2024 at 11:43 PM Michael Stone <mst...@debian.org> wrote:

> On Wed, Dec 04, 2024 at 05:11:47PM +0000, Daniel Harris wrote:
> >Thanks for all your replies.
> >As far as I can tell there are no errors reported using fsck or smartctl
> or
> >nvme
> > and the firmware is the correct and newest version so no problems there.
> >
> >The following are the messages that appear but only taken from my phone
> and
> >copied from the photo (lots of scrolling errors repeating over).
> >I thought these new drives were supposed to last longer than older moving
> HDD
> >but obviously not
>
> Is this during boot? The messages indicate a corrupted journal, which
> generally means a device error, or maybe a device which lost power while
> writing. It should be possible to mount read-only without replaying the
> journal for recovery purposes, but it's basically unfixable.
>

So its not actually a crash.  On the 2 occasions it has happened, I have
been away from my computer for a while, and when I return and move the
mouse, I can see messages scrolling on a black screen (no X running).  I
can move to a new vt but I cannot log in.  When I try to log in I just get
the errors repeating on the screen.  After I do a hard reset everything
works perfectly. No errors anywhere.


> >I guess its time to buy a new drive : (
>
> Did you try "nvme smart-log /dev/nvme0" to look for issues?
>
>
seems normal to me

 Smart Log for NVME device:nvme0 namespace-id:ffffffff
critical_warning                        : 0
temperature                             : 31°C (304 Kelvin)
available_spare                         : 100%
available_spare_threshold               : 10%
percentage_used                         : 0%
endurance group critical warning summary: 0
Data Units Read                         : 807,634 (413.51 GB)
Data Units Written                      : 5,680,746 (2.91 TB)
host_read_commands                      : 6,573,734
host_write_commands                     : 75,990,191
controller_busy_time                    : 1,145
power_cycles                            : 618
power_on_hours                          : 197
unsafe_shutdowns                        : 21
media_errors                            : 0
num_err_log_entries                     : 0
Warning Temperature Time                : 0
Critical Composite Temperature Time     : 0
Temperature Sensor 1           : 31°C (304 Kelvin)
Temperature Sensor 2           : 38°C (311 Kelvin)
Thermal Management T1 Trans Count       : 0
Thermal Management T2 Trans Count       : 0
Thermal Management T1 Total Time        : 0
Thermal Management T2 Total Time        : 0


Thanks Dan

Reply via email to