Re: [PATCH v6 2/4] migration: enable multifd and postcopy together

Fabiano Rosas Tue, 18 Feb 2025 06:23:33 -0800

Prasad Pandit <ppan...@redhat.com> writes:

> Hello Fabiano,
>
> On Tue, 18 Feb 2025 at 03:20, Fabiano Rosas <faro...@suse.de> wrote:
>> > -static bool migration_should_start_incoming(bool main_channel)
>> > +static bool migration_has_main_and_multifd_channels(void)
>> >  {
> ...
>> > +    /* main channel and all multifd channels are established */
>> >      return true;
>> >  }
>>
>> How will this avoid peeking the preempt channel? You're assuming preempt
>> is mutually exclusive with multifd it seems. Otherwise you could get the
>> preempt channel creation racing with multifd channels creation.
>
> * IIUC postcopy preempt channel is created towards the end of the
> migration when the Postcopy phase starts. At that time, the 'main' and
> 'multifd' channels are already established and working. Having the
> 'main' and when multifd is enabled 'multifd' channels in place is a
> requirement for starting new migration. So when both the 'main' and
> 'multifd' channels are established, the new incoming connection is
> seen as the 'postcopy' one; And it falls to the 'else' block of the
> 'if' conditional ->  if (!migration_has_main_and_multifd_channels()) {
>


Do you concede that this code has a hidden assumption? Either that
migrate_multifd() != migrate_postcopy_preempt() or that multifd channels
must be set up before postcopy preempt channel? Because that is enough
for us to have to do something about it. Either restructuring or a
comment explaining.

> * When does postcopy preempt channel creation race with the multifd
> channel creation?
>

Well, taking that this is the destination side, I don't think we can
predict when the other end of the migration will open a channel, can we?
For instance, postcopy_do_resume() has this comment:

    /*
     * If preempt is enabled, re-establish the preempt channel.  Note that
     * we do it after resume prepare to make sure the main channel will be
     * created before the preempt channel.  E.g. with weak network, the
     * dest QEMU may get messed up with the preempt and main channels on
     * the order of connection setup.  This guarantees the correct order.
     */

It looks like if the main channel can race, so do the multifd channels,
no?

In any case, I'm fine with just documenting any assumption for now.

>> > @@ -989,13 +983,12 @@ void migration_ioc_process_incoming(QIOChannel *ioc, 
>> > Error **errp)
>> > +    if (!migration_has_main_and_multifd_channels()) {
>> I'm not entirely sure we need these checks here. They will happen anyway
>> later. Could this be replaced by migration_needs_multiple_sockets()
>> instead?
>
> * migration_needs_multiple_sockets() => return migrate_multifd() ||
> migrate_postcopy_preempt();
>
> * That logical OR should have been AND, no? It returns true even when
> one of them is true. That's not multiple types (multifd/postcopy) of
> sockets. I don't think it helps much.
>

Nope, this is just saying whether a single channel is expected, or more
than one. That's why I think it would be a good gate for this peeking
code. Since postcopy preempt could be a peekable channel, it's
misleading to put it all behind QIO_CHANNEL_FEATURE_READ_MSG_PEEK
only. This is a time-bomb for the next person to refactor this code.

> * Let's try this:
>     - First differentiation: peekable Vs non-peekable channels

As I said above, this is not entirely what we're doing.

>     - Peekable channels
>          - Main
>          - Multifd
>     - Non-peekable channels
>          - Postcopy preempt
>          - TLS
>          - File/mapped_ram
>
>     if (qio_channel_has_feature(ioc, QIO_CHANNEL_FEATURE_READ_MSG_PEEK))
>     {
>             if (channel_magic == cpu_to_be32(QEMU_VM_FILE_MAGIC)) {
>                 channel = CH_MAIN;
>             } else if (channel_magic == cpu_to_be32(MULTIFD_MAGIC)) {
>                 channel = CH_MULTIFD;
>     } else {
>             if (!strcmp(ioc->name, "migration-tls-incoming")
>                 || !strcmp(ioc->name, "migration-file-incoming"))) {
>                channel = CH_MULTIFD;
>             } else {
>                channel = CH_POSTCOPY;
>             }
>     }
>
> * With above, the 'main' channel shall have to send 'magic value' even
> for reconnection during the postcopy recovery phase. If all channels
> were consistent and sent a magic value, this code would be much
> simpler and we may not have to care/worry about the _order_ in which
> these connections are made.
>

Right, but that's not what we have today. Changing this requires
figuring out how to keep the stream compatible when channels now start
sending extra stuff at the start. It's not trivial. There's also
mapped-ram which is asynchronous and there might be something special to
be done about the TLS handshake, I'm not sure.

>    if (channel == 'main')
>        process_main_channel()
>    else if (channel == 'multifd')
>        process_multifd_channel()
>    else if (channel == 'tls')
>        process_tls_channel()
>     else if (channel == 'file')
>        process_file_channel()
>     else if (channel == 'postcopy')
>        process_postcopy_channel()
>
>> And I'd put this whole channel discovery business in channel.c since
>> it's encoding several assumptions about channels. Some helpers used here
>> might need to be exported, but that's ok.
>>
>> Also, please make a separate patch, we need to be really confident that
>> changing the discovery code around won't introduce any regression, and
>> if it does, we'll want it separate from the postcopy+multifd
>> enablement. It's ok if you have the patch assume that multifd+postcopy
>> will happen later in the series.
>
> * TBH, I think we have complicated this whole thing with multiple
> channel types, their inconsistent behaviour and dependence on the
> _order_ in which connections are made. Do we really need channel
> types? Could we consider rationalising them?
>

Well, aside from preempt, they're *not* dependent on the order. That's
the point of having to do all of this dance. In fact we might be better
off if we could serialize the connections somehow.

I havent't followed this series closely, could you point me to the
discussion that led to the channels concept being introduced?

>> > +        if (qio_channel_has_feature(ioc, 
>> > QIO_CHANNEL_FEATURE_READ_MSG_PEEK)) {
>> >          /*
>> >           * With multiple channels, it is possible that we receive channels
>> >           * out of order on destination side, causing incorrect mapping of
>> > @@ -1006,42 +999,58 @@ void migration_ioc_process_incoming(QIOChannel 
>> > *ioc, Error **errp)
>> >           * tls handshake while initializing main channel so with tls this
>> >           * issue is not possible.
>> >           */
>>
>> This comment block needs to be indented properly.
>>
>> > -        ret = migration_channel_read_peek(ioc, (void *)&channel_magic,
>> > -                                          sizeof(channel_magic), errp);
>> > +            ret = migration_channel_read_peek(ioc, (void *)&channel_magic,
>> > +                                              sizeof(channel_magic), 
>> > errp);
>> > +            if (ret != 0) {
>> > +                return;
>> > +            }
>> >
>> > -        if (ret != 0) {
>> > -            return;
>> > +            if (channel_magic == cpu_to_be32(QEMU_VM_FILE_MAGIC)) {
>> > +                channel = CH_MAIN;
>> > +            } else if (channel_magic == cpu_to_be32(MULTIFD_MAGIC)) {
>> > +                channel = CH_MULTIFD;
>> > +            } else if (!mis->from_src_file
>> > +                        && mis->state == 
>> > MIGRATION_STATUS_POSTCOPY_PAUSED) {
>>
>> The usual style is to keep the && on the previous line.
>>
>> > +                /* reconnect default channel for postcopy recovery */
>>
>> s/default/main/
>
> * Okay, will fix these.
>
>
>> > +                channel = CH_MAIN;
>> > +            } else {
>> > +                error_report("%s: unknown channel magic: %u",
>> > +                                __func__, channel_magic);
>> > +                return;
>>
>> This needs to set errp instead of reporting.
>
> * Okay.
>
>> > +            }
>> > +        } else if (mis->from_src_file
>> > +                && (!strcmp(ioc->name, "migration-tls-incoming")
>> > +                || !strcmp(ioc->name, "migration-file-incoming"))) {
>> > +            channel = CH_MULTIFD;
>>
>> This is quite misleading. These channels are used without multifd as
>> well. For instance, file-backed fd migration goes past this because
>> !mis->from_src_file but it still uses the file channel.
>>
>> I agree with Peter that checking for channel names is not ideal. I don't
>> see an alternative at the moment (hiding the assumption is of course not
>> a fix). Maybe check migrate_multifd() here and explain in a comment that
>> at the moment, the non-peekable channels happen to be used with multifd
>> only.
>
> * The first paragraph says these channels are used without
> migrate_multifd(); And the second paragraph says they are used with
> migrate_multifd() only....??

Yes. They *can* be used without multifd. The comment would explain that
at that point in the code, these are the only types possible. So as to
not mislead future readers that whenever tls/file, then multifd must be
used.

> ===
> } else {
>         /* Multiple connections */
>         assert(migration_needs_multiple_sockets());
>         if (migrate_multifd()) {
>             multifd_recv_new_channel(ioc, &local_err);
>         } else {
>             assert(migrate_postcopy_preempt());
>             f = qemu_file_new_input(ioc);
>             postcopy_preempt_new_channel(mis, f);
>         }

See? Multifd mutually exclusive with postcopy preempt. You carried that
assumption (well done), but made it more subtle (not good), since
if/else is by definition showing the relationship between the two while
migration_has_main_and_multifd_channels() makes it hidden under the
multifd check allowing the last return true to happen.

If we're enabling multifd along with postcopy, we need to be aware that
the relationship with preempt might not hold true anymore.

> ===
> * IIUC in the current code, when migrate_multifd() is true, these
> channels get processed via - multifd_recv_new_channel(). And when
> migrate_multifd() is false, it falls to postcopy_preempt_new_channel()
> part.

This relies on the fact that the management layer will certainly have
set multifd on both sides of the migration and that multifd will not be
used with postcopy.

>Not sure if/how that works really. It is possible that currently
> these (tls/file) channels are used only with migrate_multifd() enabled
> and so are processed with multifd_recv_new_channel() function. The
> current patch handles them the same way.
>

That's the entire point I'm making when I ask to not omit the else
clauses. The tls and file channels can be the main channel as well, but
that is never explicit in the code. That uint8 channel = CH_MAIN up
there is doing some heavy-lifting.

> * About checking channel names, in the non-peekable category above,
> how do we differentiate between 'TLS', 'File' and 'Postcopy' channels?
> Without magic values, we don't have much choice really.

Indeed.

> And seeing
> those names in the code also tells the reader that 'TLS' and 'File'
> channels are processed as CH_MULTIFD via - multifd_recv_new_channel().

We shouldn't rely on the underlying iochannel type. That's why they
expose the QIO_CHANNEL_FEATUREs. It's fine if we double check at that
point that the channels are either file/tls, but that cannot be the
algorithm. The current code can work without relying on it after all and
I don't think your postcopy enablement affects that particular part.

>
>> No else clause here and in the rest of the patch makes this is as opaque
>> as the previous version, IMO. We need to define what's supposed to
>> happen whenever the conditionals don't match. Is it an error,
>> g_assert_not_reached(), a fallthrough? Better to set CH_MAIN explicitly
>> wherenever that's the case.
>
> * I'd say let's treat unmatched conditions as an error and return.
>
>> > -        default_channel = (channel_magic == 
>> > cpu_to_be32(QEMU_VM_FILE_MAGIC));
>> > -    } else {
>> > -        default_channel = !mis->from_src_file;
>> > +    } else if (mis->from_src_file) {
>> > +        channel = CH_POSTCOPY;
>> >      }
>>
>> Same here.
>
> * Here final else means the main channel (mis->from_src_file) is not
> established, so it defaults to CH_MAIN.
>

Yes, but let's make it explicit. We have this thing in our minds now. In
6 months time no one remembers anything anymore.

>>
>> You could check CH_POSTCOPY first in the block above this one and return
>> early.
>>
>
> * Okay.
>
> * My request is the same - let's think about having consistent channel
> behaviour. The restructuring and overhauling of the channel processing
> part could be done separately, outside the current scope of enabling
> multifd+postcopy together with this series. Let's not try to fix
> everything in one go, in one series.
>

Ok, no one is asking for this to be done in one go. You're free to split
any side efforts into a new series and proceed with that. Having to do
prerequisite work is pretty common.

Do you think this series can work without touching the channel discovery
code? As I said earlier, I'm missing a bit of context, but to me it
seems it cannot.

If instead of this refactoring you want to start working on a model for
consistent channel advertisement, then that's fine. But we'll have to
put this series on hold (which is also fine). It also looks like it
could be considerably more work, although I haven't looked at it in
detail. Granted, it's work that makes sense, instead of the heuristics
we have today.

>
> Thank you.
> ---
>   - Prasad

Re: [PATCH v6 2/4] migration: enable multifd and postcopy together

Reply via email to