On 14 January 2018 at 17:27, Amit Khandekar <amitdkhan...@gmail.com> wrote:
> On 13 January 2018 at 02:56, Robert Haas <robertmh...@gmail.com> wrote:
> > I guess I'm inclined to keep mt_per_sub_plan_maps for the case where
> > there are no partitions, but not use it when partitions are present.
> > What do you think about that?
>
> Even where partitions are present, in the usual case where there are no 
> transition tables we won't require per-leaf map at all [1]. So I think we 
> should keep mt_per_sub_plan_maps only for the case where per-leaf map is not 
> allocated. And we will not allocate mt_per_sub_plan_maps when 
> mt_per_leaf_maps is needed. In other words, exactly one of the two maps will 
> be allocated.
>
> This is turning out to be close to what's already there in the last patch 
> versions: use a single map array, and an offsets array. The difference is : 
> in the patch I am using the *same* variable for the two maps. Where as, now 
> we are talking about two different array variables for maps, but only 
> allocating one of them.
>
> Are you ok with this ? I think the thing you were against was to have a 
> common *variable* for two purposes. But above, I am saying we have two 
> variables but assign a map array to only *one* of them and leave the other 
> unused.
>
> ---------
>
> Regarding the on-demand map allocation ....
> Where mt_per_sub_plan_maps is allocated, we won't have the on-demand 
> allocation: all the maps will be allocated initially. The reason is becaues 
> the map_is_required array is only per-leaf. Or else, again, we need to keep 
> another map_is_required array for per-subplan. May be we can support the 
> on-demand stuff for subplan maps also, but only as a separate change after we 
> are done with update-partition-key.
>
>
> ---------
>
> Regarding mt_per_leaf_tupconv_required, I am thinking we can make it a bool, 
> and name it : mt_per_leaf_map_not_required. When it is true for a given 
> index, it means, we have already called convert_tuples_by_name() and it 
> returned NULL; i.e. it means we are sure that map is not required. A false 
> value means we need to call convert_tuples_by_name() if it is NULL, and then 
> set mt_per_leaf_map_not_required to (map == NULL).
>
> Instead of a bool array, , we can instead make it a Bitmapset. But I think 
> access would become slower as compared to array, particularly because it is 
> going to be a heavily used function.

I went ahead and did the above changes. I haven't yet merged these
changes in the main patch. Instead, I have attached it as an
incremental patch to be applied on the main v36 patch. The incremental
patch is not yet quite polished, and quite a bit of cosmetic changes
might be required, plus testing. But am posting it in case I have some
early feedback. Details :

The per-subplan map array variable is kept in ModifyTableState :
-       TupleConversionMap **mt_childparent_tupconv_maps;
-       /* Per plan/partition map for tuple conversion from child to root */
-       bool            mt_is_tupconv_perpart;  /* Is the above map
per-partition ? */
+       TupleConversionMap **mt_per_subplan_tupconv_maps;
+       /* Per plan map for tuple conversion from child to root */
 } ModifyTableState;

The per-leaf array variable and the not_required array is kept in
PartitionTupleRouting :
-       TupleConversionMap **partition_tupconv_maps;
+       TupleConversionMap **parent_child_tupconv_maps;
+       TupleConversionMap **child_parent_tupconv_maps;
+       bool       *child_parent_tupconv_map_not_reqd;
As you can see above, all the arrays are per-partition. So removed the
per-leaf tag in these arrays. Instead, renamed the existing
partition_tupconv_maps to parent_child_tupconv_maps, and the new
per-leaf array to child_parent_tupconv_maps

Have two separate functions ExecSetupChildParentMapForLeaf() and
ExecSetupChildParentMapForSubplan() since most of their code is
different. And now because of this, we can re-use
ExecSetupChildParentMapForLeaf() in both copy.c and nodeModifyTable.c.

Even inserts/copy will benefit from the on-demand map allocation. This
is because now there is a function TupConvMapForLeaf() that is called
in both copy.c and ExecInsert(). This is the function that does
on-demand allocation.

Attached the incremental patch conversion_map_changes.patch that has
the above changes. It is to be applied over the latest main patch
(update-partition-key_v36.patch).

Attachment: conversion_map_changes.patch
Description: Binary data

Reply via email to