On Thu, Nov 16, 2023 at 02:14:54PM +0100, Thomas Huth wrote:
> QOM names currently don't have any enforced naming rules. This
> can be problematic, e.g. when they are used on the command line
> for the "-device" option (where the comma is used to separate
> properties). To avoid that such problematic type names come in
> again, let's restrict the set of acceptable characters during the
> type registration.
> 
> Ideally, we'd apply here the same rules as for QAPI, i.e. all type
> names should begin with a letter, and contain only ASCII letters,
> digits, hyphen, and underscore. However, we already have so many
> pre-existing types like:
> 
>     486-x86_64-cpu
>     cfi.pflash01
>     power5+_v2.1-spapr-cpu-core
>     virt-2.6-machine
>     pc-i440fx-3.0-machine
> 
> ... so that we have to allow "." and "+" for now, too. While the
> dot is used in a lot of places, the "+" can fortunately be limited
> to two classes of legacy names ("power" and "Sun-UltraSparc" CPUs).
> 
> We also cannot enforce the rule that names must start with a letter
> yet, since there are lot of types that start with a digit. Still,
> at least limiting the first characters to the alphanumerical range
> should be way better than nothing.
> 
> Signed-off-by: Thomas Huth <th...@redhat.com>
> ---
>  qom/object.c | 41 +++++++++++++++++++++++++++++++++++++++++
>  1 file changed, 41 insertions(+)
> 
> diff --git a/qom/object.c b/qom/object.c
> index 95c0dc8285..571ef68950 100644
> --- a/qom/object.c
> +++ b/qom/object.c
> @@ -138,9 +138,50 @@ static TypeImpl *type_new(const TypeInfo *info)
>      return ti;
>  }
>  
> +static bool type_name_is_valid(const char *name)
> +{
> +    const int slen = strlen(name);
> +
> +    g_assert(slen > 1);
> +
> +    /*
> +     * Ideally, the name should start with a letter - however, we've got
> +     * too many names starting with a digit already, so allow digits here,
> +     * too (except '0' which is not used yet)
> +     */
> +    if (!g_ascii_isalnum(name[0]) || name[0] == '0') {
> +        return false;
> +    }
> +
> +    for (int i = 1; i < slen; i++) {
> +        if (name[i] != '-' && name[i] != '_' && name[i] != '.' &&
> +            !g_ascii_isalnum(name[i])) {
> +            if (name[i] == '+') {
> +                if (i == 6 && !strncmp(name, "power", 5)) {
> +                    /* It's a legacy name like "power5+" */
> +                    continue;
> +                }
> +                if (i >= 17 && !strncmp(name, "Sun-UltraSparc", 14)) {
> +                    /* It's a legacy name like "Sun-UltraSparc-IV+" */
> +                    continue;
> +                }
> +            }
> +            return false;
> +        }
> +    }

Replace this big loop with strspn, which has an asm optimized impl
in glibc

      ALPHA_LC "abcdefghijklmnopqrstuvwxyz"
      ALPHA_UC "ABCDEFGHIJKLMNOPQRSTUVWXYZ"
      OTHER "0123456789-_."

      return (strspn(name, ALPHA_UC ALPHA_LC OTHER) == slen) ||
          (g_str_has_prefix(name, "power") && slen > 6 && name[6] == '+') ||
          (g_str_has_prefix(name, "Sun-UltraSparc") && slen > 17 && name[17] == 
'+');


With regards,
Daniel
-- 
|: https://berrange.com      -o-    https://www.flickr.com/photos/dberrange :|
|: https://libvirt.org         -o-            https://fstop138.berrange.com :|
|: https://entangle-photo.org    -o-    https://www.instagram.com/dberrange :|


Reply via email to