Re: [PATCH] libcpp, c++: Optimize initializers using #embed in C++

Jason Merrill Tue, 23 Jul 2024 18:38:34 -0700

On 7/17/24 3:47 AM, Jakub Jelinek wrote:

Hi!


This patch on top of the
https://gcc.gnu.org/pipermail/gcc-patches/2024-June/655012.html
https://gcc.gnu.org/pipermail/gcc-patches/2024-June/655013.html
https://gcc.gnu.org/pipermail/gcc-patches/2024-July/657049.html
patches which just introduce non-optimized support for the C23 feature
and two extensions to it actually optimizes it and on top of the
https://gcc.gnu.org/pipermail/gcc-patches/2024-July/657053.html
patch which adds optimizations to C & middle-end adds similar
optimizations to the C++ FE.
The first hunk enables use of CPP_EMBED token even for C++, not just
C; the preprocessor guarantees there is always a CPP_NUMBER CPP_COMMA
before CPP_EMBED and CPP_COMMA CPP_NUMBER after it which simplifies
parsing (unless #embed is more than 2GB, in that case it could be
CPP_NUMBER CPP_COMMA CPP_EMBED CPP_COMMA CPP_EMBED CPP_COMMA CPP_EMBED
CPP_COMMA CPP_NUMBER etc. with each CPP_EMBED covering at most INT_MAX
bytes).
Similarly to the C patch, this patch parses it into RAW_DATA_CST tree
in the braced initializers (and from there peels into INTEGER_CSTs unless
it is an initializer of an std::byte array or integral array with CHAR_BIT
element precision), parses CPP_EMBED in cp_parser_expression into just
the last INTEGER_CST in it because I think users don't need millions of
-Wunused-value warnings because they did useless
   int a = (
   #embed "megabyte.dat"
   );
and so most of the inner INTEGER_CSTs would be there just for the warning,
and in the rest of contexts like template argument list, function argument
list, attribute argument list, ...) parse it into a sequence of INTEGER_CSTs
(I wrote a range/iterator classes to simplify that).

My dumb
cat embed-11.c
constexpr unsigned char a[] = {
#embed "cc1plus"
};
const unsigned char *b = a;
testcase where cc1plus is 492329008 bytes long when configured
--enable-checking=yes,rtl,extra against recent binutils with .base64 gas
support results in:
time ./xg++ -B ./ -S -O2 embed-11.c

real    0m4.350s
user    0m2.427s
sys     0m0.830s
time ./xg++ -B ./ -c -O2 embed-11.c

real    0m6.932s
user    0m6.034s
sys     0m0.888s
(compared to running out of memory or very long compilation).
On a shorter inclusion,
cat embed-12.c
constexpr unsigned char a[] = {
#embed "xg++"
};
const unsigned char *b = a;
where xg++ is 15225904 bytes long, this takes using GCC with the #embed
patchset except for this patch:
time ~/src/gcc/obj36/gcc/xg++ -B ~/src/gcc/obj36/gcc/ -S -O2 embed-12.c

real    0m33.190s
user    0m32.327s
sys     0m0.790s
and with this patch:
time ./xg++ -B ./ -S -O2 embed-12.c

real    0m0.118s
user    0m0.090s
sys     0m0.028s

The patch doesn't change anything on what the first patch in the series
introduces even for C++, namely that #embed is expanded (actually or as if)
into a sequence of literals like
127,69,76,70,2,1,1,3,0,0,0,0,0,0,0,0,2,0,62,0,1,0,0,0,80,211,64,0,0,0,0,0,64,0,0,0,0,0,0,0,8,253
and so each element has int type.
That is how I believe it is in C23, and the different versions of the
C++ P1967 paper specified there some casts, P1967R12 in particular
"Otherwise, the integral constant expression is the value of std::fgetc’s 
return is cast
to unsigned char."
but please see
https://github.com/llvm/llvm-project/pull/97274#issuecomment-2230929277
comment and whether we really want the preprocessor to preprocess it for
C++ as (or as-if)
static_cast<unsigned char>(127),static_cast<unsigned char>(69),static_cast<unsigned 
char>(76),static_cast<unsigned char>(70),static_cast<unsigned char>(2),...
i.e. 9 tokens per byte rather than 2, or
(unsigned char)127,(unsigned char)69,...
or
((unsigned char)127),((unsigned char)69),...
etc.

The discussion at that link suggests that the author is planning topropose removing the cast.

@@ -6895,16 +6918,68 @@ reshape_init_array_1 (tree elt_type, tre
      {
        tree elt_init;
        constructor_elt *old_cur = d->cur;
+      const char *old_ptr = NULL;
+
+      if (TREE_CODE (d->cur->value) == RAW_DATA_CST)
+       old_ptr = RAW_DATA_POINTER (d->cur->value);

Let's call this variable old_raw_data_ptr for clarity, here and inreshape_init_class.

if (d->cur->index)

        CONSTRUCTOR_IS_DESIGNATED_INIT (new_init) = true;
        check_array_designated_initializer (d->cur, index);
-      elt_init = reshape_init_r (elt_type, d,
-                                /*first_initializer_p=*/NULL_TREE,
-                                complain);
+      if (TREE_CODE (d->cur->value) == RAW_DATA_CST
+         && (TREE_CODE (elt_type) == INTEGER_TYPE
+             || (TREE_CODE (elt_type) == ENUMERAL_TYPE
+                 && TYPE_CONTEXT (TYPE_MAIN_VARIANT (elt_type)) == std_node
+                 && strcmp (TYPE_NAME_STRING (TYPE_MAIN_VARIANT (elt_type)),
+                            "byte") == 0))

Maybe is_byte_access_type? Or finally factor out a function to testspecifically for std::byte, it's odd that we don't have one yet.

@@ -7158,6 +7244,7 @@ reshape_init_class (tree type, reshape_i
             is initialized by the designated-initializer-list { D }, where D
             is the designated- initializer-clause naming a member of the
             anonymous union member."  */
+         gcc_checking_assert (TREE_CODE (d->cur->value) != RAW_DATA_CST);

Is there a test of trying to use #embed as a designated initializer? Idon't see one.

@@ -7358,9 +7461,16 @@ reshape_init_r (tree type, reshape_iter
         valid aggregate initialization.  */
        && !first_initializer_p
        && (same_type_ignoring_top_level_qualifiers_p (type, TREE_TYPE (init))
-         || can_convert_arg (type, TREE_TYPE (init), init, LOOKUP_NORMAL,
-                             complain)))
+         || can_convert_arg (type, TREE_TYPE (init),
+                             TREE_CODE (init) == RAW_DATA_CST
+                             ? build_int_cst (integer_type_node,
+                                              *(const unsigned char *)
+                                              RAW_DATA_POINTER (init))
+                             : init,
+                             LOOKUP_NORMAL, complain)))
      {
+      if (tree raw_init = cp_maybe_split_raw_data (d))
+       return raw_init;
        d->cur++;
        return init;

This split-or-++ pattern seems to repeat a lot in reshape_init_r, couldwe factor it out to avoid problems with people forgetting one or theother? Maybe consume_init (d) or d->consume_init ()?


Jason

Re: [PATCH] libcpp, c++: Optimize initializers using #embed in C++

Reply via email to