On Wed, Jun 17, 2020 at 01:23:41PM -0700, David Christensen wrote:
> On 2020-06-17 12:26, Reco wrote:
> 
> > On Wed, Jun 17, 2020 at 12:10:51PM -0700, David Christensen wrote:
> > > 2.  AIUI dd(1) uses asynchronous (buffered) I/O unless told otherwise.
> > 
> > You seem to confuse asynchronous and cached I/O too.
> > 
> > > From Linux kernel POV, *asynchronous* I/O is a pair of
> > io_submit/io_getevents syscalls, and dd does not do these regardless of
> > the options that are provided.
> > What dd does is a *synchronous* I/O (read/write syscalls or
> > pread64/pwrite64 for older kernels).
> > 
> > Whenever the I/O is cached (an output file is opened without
> > O_DIRECT|O_DSYNC flags) or not is orthogonal to whenever it's sync or
> > async.
> 
> It would not surprise me if there are a dozen layers of chip registers, 
> caches, buffers, memory, etc., between a CPU register and whatever device 
> implements
> data storage in a recent to modern x86_64 computer. And, I can only wonder if 
> reading /dev/zero even comes from a CPU register -- it would not surprise me 
> if
> an MMU feature does.
> 
> 
> I was referring to the 'fdatasync', 'fsync', 'dsync', 'sync', and 'nocache' 
> options to dd(1).

You did. These all deal with the various caching aspects.
Several years ago I saw some patches to dd that implemented async I/O,
but AFAIK they were never accepted upstream. May have something with the
fact that the patches in question were written by Oracle employees.


> Given the terse manual page, and a unwillingness to crawl the dd(1)
> and/or kernel code, I can only guess at my understanding of what is
> happening.

There's strace for the lazy ones like you and me ;) One of my favorite
troubleshooting tools for many years.


> But, all of those options seem to imply some opposite of
> asynchronous or buffered I/O as I know it.

No, it's just the wording.

"sync" - you are blocked on each read/write syscall, and there's no way
around it.

"async" - you're not, if you're doing io_submit. You are if you're doing
io_getevents, but life's unfair.

"direct" - you're telling the kernel that you do not consider a write
completed unless it has landed at a persistent storage. You're asking a
kernel to read for you regardless of the state of the kernel's current
page cache.

"cached" - you don't care whenever your write lands to the persistent
storage. Likewise, you don't care if the read comes from the page cache
or the persistent storage.

Most of the I/O your everyday programs do is "cached sync". There are
exceptions to this rule, like apt (dsync writes), or firefox (dsync
writes for sqlite). But to have an async I/O, one usually have to link
against libaio, like mysql or postgres do.


> "The devil is in the details."

Totally agree.

Reco

Reply via email to