Am 21.02.2018 um 17:59 hat Eric Blake geschrieben: > On 02/21/2018 10:51 AM, Kevin Wolf wrote: > > Am 20.02.2018 um 23:24 hat Eric Blake geschrieben: > > > When reading a compressed image, we were allocating s->cluster_data > > > to 32*cluster_size + 512 (possibly over 64 megabytes, for an image > > > with 2M clusters). Let's check out the history: > > > > > > > Much later, in commit de82815d (v2.2), we noticed that a 64M > > > allocation is prone to failure, so we switched over to a graceful > > > memory allocation error message. But note that elsewhere in the > > > code, we do g_malloc(2 * cluster_size) without ever checking for > > > failure. > > > > > > > - } > > > - if (!s->cluster_cache) { > > > - s->cluster_cache = g_malloc(s->cluster_size); > > > + assert(!s->cluster_cache); > > > > Wouldn't it be better to assert (!!s->cluster_cache == > > !!s->cluster_data) unconditionally? > > > > Sure. > > > > + s->cluster_data = g_try_malloc(s->cluster_size); > > > > Why are you going from qemu_try_blockalign() to simple malloc here? This > > buffer is used with bdrv_read() (or bdrv_pread() after patch 1), so we > > should avoid unnecessary use of a bounce buffer. > > But does bdrv_pread() actually need to use a bounce buffer if we don't have > an aligned buffer to read into? Either the underlying protocol already > supports byte-aligned reads (direct into our buffer, regardless of > alignment, no bouncing required), or it already has do to a full sector read > into a bounce buffer anyways (and it doesn't matter whether we aligned our > buffer). blockalign() made sense when we had multiple clients for the > buffer, but ever since v1.1, when we have only a single client, and that > single client is most likely not going to read sector-aligned data in the > first place, aligning the buffer doesn't buy us anything.
Good point. To be honest, I don't even analyse each caller, but just consistently use qemu_try_blockalign() whenever a buffer is used for I/O. It's a simple rule of thumb that generally makes sense. So as you say, in this case it's unlikely, but possible that we can benefit from an aligned buffer. I guess my point is more about consistency than actual functionality then. But it's okay either way. > > > > > + s->cluster_cache = g_try_malloc(s->cluster_size); > > > > As you already said, either g_malloc() or check the result. I actually > > think that g_try_malloc() and checking the result is nicer, we still > > allocate up to 2 MB here. > > See my commit message comment - we have other spots in the code base that > blindly g_malloc(2 * s->cluster_size). Though is that a reason to do the same in new code or to phase out such allocations whenever you touch them? > And I intended (but sent the email without amending my commit) to use > g_malloc(). But as Berto has convinced me that an externally produced > image can convince us to read up to 4M (even though we don't need that > much to decompress), I suppose that the try_ variant plus checking is > reasonable (and care in NULL'ing out if one but not both allocations > succeed). Sounds good. Another thought I had is whether we should do per-request allocation for compressed clusters, too, instead of having per-BDS buffers. Kevin