Toni writes: > > > > Is Maxtorman correct about the 320 log entries? > > > My dealer told me a similar story, but I don't know where he had it > > > from. > > > > I guess the next step is to find out if Maxtorman is correct about this > > 320 log entries stuff, and if the SMART log entries as reported by > > smartmontools is the log to worry about, or if there is some other log. > > I don't have an account on /., and also feel incapable of actually > working on this problem, but someone who has and can, could probably > try to nag maxtorman about improving smartmontools to the point that > they do the right thing, or try to get him to connect one to somebody > else who can verify the issue and/or provide more technical details. > > If he can find a way to almost-anonymously post to /., he might be able > to give some hints to the smartmontools gyus, too. Then, we only need > them to integrate everything and make a new release.
It is easy to set up a slashdot account. Or you can post as "anonymous coward". He set up the Maxtorman account to post anonymously, he mentioned that he has another slashdot account that isn't anonymous. Problem I have is I can't find a way to send him a PM (private message). Most web forums have a facility for sending other users a PM. We can post a reply to the thread, but he would have to read the thread again to see it. Any slashdot wizards out there have an idea? Your suggestion of smartmontools is helpful, thank you. > Personally, I'd say that it'd be best if Seagate themselves would grab > the opportunity to partially make good on the issue, but I heavily > doubt that they "understand", or want to understand, what's it about > with FLOSS. It isn't even just FLOSS. Any non-x86 machine is out of luck. Proprietary Unix is out of luck. Anything embedded is out of luck. Even Mac is probably out of luck. And if the reboot to run the firmware installer bricks the drive(s) even wintel is out of luck. I don't understand the common corporate policy of keeping everything secret. All they are doing is hurting their previously loyal customers. It didn't used to be this way. Supposedly there was a broken test machine that didn't zero out some special area after writing a test pattern. So only drives that were tested on that machine are at risk. If we can find out what area this is (I assume it isn't in the normal space used for user storage) and how to zero it (if not already zero) there is no need to update the firmware. ---------- Raimo writes: > How can I know if I have a suspicious drive? Good question. Seagate has some web page that supposedly will tell you, but of course it is broken and doesn't work with all browsers. > Google for ST3808110AS gives me "Barracuda 7200.9 SATA 80-GB Hard Drive", > so I guess this one is not suspicious, but I have more disks, > in other servers. What if i find a 7200.10, 7200.11, ES or ES.2, > is that enough for me to suspect it? I haven't read anything about problems with 7200.10 or earlier. Toni reports that ES and ES.2 may be affected. ---------- Glenn writes: > Just a hypothetical situation, since we do not have the sourcecode of > the firmware: isn't it possible some kind of mathematical operation > is occuring on the number of log entries causing some kind of infinite > loop to occur or a division that leads to/by 0 that the software/hardware > is unable to handle? That could mean this problem could also manifest > itself on for example multiples of 320, so just putting the counter on > 321 may just be delaying the inevitable. And what happens if the counter > overflows and reaches 320 again? >From what I've read it sounds like the counter must be exactly 320 AND some location must have a test pattern rather than zero when you init (power up or reboot) the drive. From Maxtorman's description, the log is circular, so it will eventually wrap around to 320 again. So keeping the counter away from 320 is an okay short term workaround, but long term we want to either zero out the magic location or update the firmware. ---------- matheus writes: > but I also heard of new firmwares being worse than old > ones, from seagate first try to fix things. What I read is that the firmware itself was ok but the installer program would brick a previously working drive. But it didn't brick it as badly as the firmware bug, you can still update the firmware again once you get a proper update program. =============== There is supposed to be some document that explains all this, with enough details to create a fix. If anyone finds this document I need a copy please. If you have one or more of the suspect drives, if it running, try to keep it running and don't reboot. If it is powered down leave it powered down if possible until this all gets sorted out.