On Sat, Nov 16, 2024 at 6:02 AM Michael <confabul...@kintzios.com> wrote:
>
> I assume (simplistically) with DM-SMRs the
> discard-garbage collection is managed wholly by the onboard drive controller,
> while with HM-SMRs the OS will signal the drive to start trimming when the
> workload is low in order to distribute the timing overheads to the system's
> idle time.

I'll admit I haven't looked into the details as I have no need for SMR
and there aren't any good FOSS solutions for using it that I'm aware
of (just a few that might be slightly less terrible).  However, this
doesn't seem correct for two reasons:

First, I'm not sure why HM-SMR would even need a discard function.
The discard command is used to tell a drive that a block is safe to
overwrite without preservation.  A host-managed SMR drive doesn't need
to know what data is disposable and what data is not.  It simply needs
to write data when the host instructs it to do so, destroying other
data in the process, and it is the host's job to not destroy anything
it cares about.  If a write requires a prior read, then the host needs
to first do the read, then adjust the written data appropriately so
that nothing is lost.

Second, there is no reason that any drive of any kind (SMR or SSD)
NEEDS to do discard/trim operations when the drive is idle, because
discard/trim is entirely a metadata operation that doesn't require IO
with the drive data itself.  Now, some drives might CHOOSE to
implement it that way, but they don't have to.  On an SSD, a discard
command does not mean that the drive needs to erase or move any data
at all.  It just means that if there is a subsequent erase that would
impact that block, it isn't necessary to first read the data and
re-write it afterwards.  A discard could be implemented entirely in
non-volatile metadata storage, such as with a bitmap.  For a DM-SMR
using flash for this purpose would make a lot of sense - you wouldn't
need much of it.

This is probably why you have endless arguing online about whether
discard/trim is helpful for SSDs. It completely depends on how the
drive implements the command.  The drives I've owned can discard
blocks without any impact on IO, but I've heard some have a terrible
impact on IO.  It is just like how you can complete the same sort
operation in seconds or hours depending on how dumb your sorting
algorithm is.

In any case, to really take advantage of SMR the OS needs to
understand exactly how to structure its writes so as to not take a
penalty, and that requires information about the implementation of the
storage that isn't visible in a DM-SMR.  Sure, some designs will do
better on SMR even without this information, but I don't think they'll
ever be all that efficient.  It is no different from putting f2fs on a
flash drive with a brain-dead discard implementation - even if the OS
does all its discards in nice consolidated contiguous operations it
doesn't mean that the drive will handle that in milliseconds instead
of just blocking all IO for an hour - sure, the drive COULD do the
operation quickly, but that doesn't mean that the firmware designers
didn't just ignore the simplest use case in favor of just optimizing
around the assumption that NTFS is the only filesystem in the world.

-- 
Rich

Reply via email to