Hi, On 2018-04-08 16:27:57 -0700, Christophe Pettus wrote: > > On Apr 8, 2018, at 16:16, Andres Freund <and...@anarazel.de> wrote: > > We don't panic that way when getting IO > > errors during reads either, and they're more likely to be persistent > > than errors during writes (because remapping on storage layer can fix > > issues, but not during reads). > > There is a distinction to be drawn there, though, because we > immediately pass an error back to the client on a read, but a write > problem in this situation can be masked for an extended period of > time.
Only if you're "lucky" enough that your clients actually read that data, and then you're somehow able to figure out across the whole stack that these 0.001% of transactions that fail are due to IO errors. Or you also need to do log analysis. If you want to solve things like that you need regular reads of all your data, including verifications etc. Greetings, Andres Freund