Re: Multi Inserts in CREATE TABLE AS - revived patch

Luc Vlaming Thu, 26 Nov 2020 04:04:55 -0800

On 26-11-2020 12:36, Bharath Rupireddy wrote:

Few things:
IIUC Andres mentioned similar kinds of APIs earlier in [1].
[1] -https://www.postgresql.org/message-id/20200924024128.kyk3r5g7dnu3fxxx%40alap3.anarazel.de<https://www.postgresql.org/message-id/20200924024128.kyk3r5g7dnu3fxxx%40alap3.anarazel.de>
I would like to add some more info to one of the API:

typedef struct MultiInsertStateData
{
MemoryContext micontext; /* A temporary memory context formulti insert. */
     BulkInsertStateData *bistate;   /* Bulk insert state. */
     TupleTableSlot      **mislots; /* Array of buffered slots. */
     uint32              nslots; /* Total number of buffered slots. */
int64 nbytes; /* Flush buffers if the total tuple size>= nbytes. */ int32 nused; /* Number of current buffered slots for amulti insert batch. */ int64 nsize; /* Total tuple size for a multi insertbatch. */
} MultiInsertStateData;
/* Creates a temporary memory context, allocates theMultiInsertStateData, BulkInsertStateData and initializes other members. */ void (*begin_multi_insert) (Relation rel,MultiInsertStateData **mistate, uint32 nslots, uint64 nbytes);
/* Buffers the input slot into mistate slots, computes the size of thetuple, and adds it total buffer tuple size, if this size crossesmistate->nbytes, flush the buffered tuples into table. For heapam,existing heap_multi_insert can be used. Once the buffer is flushed, thenthe micontext can be reset and buffered slots can be cleared. *If nbytesi.e. total tuple size of the batch is not given, tuple size is notcalculated, tuples are buffered until all the nslots are filled and thenflushed.* */ void (*do_multi_insert) (Relation rel, MultiInsertStateData*mistate, struct TupleTableSlot *slot, CommandId cid, int options);
/* Flush the buffered tuples if any. For heapam, existingheap_multi_insert can be used. Deletes temporary memory context anddeallocates mistate. */ void (*end_multi_insert) (Relation rel, MultiInsertStateData*mistate, CommandId cid, int options);
With Regards,
Bharath Rupireddy.
EnterpriseDB: http://www.enterprisedb.com <http://www.enterprisedb.com>


Looks all good to me, except for the nbytes part.

Could you explain to me what use case that supports? IMHO the tableamcan best decide itself that its time to flush, based on itsimplementation that e.g. considers how many pages to flush at a time andsuch, etc? This means also that most of the fields ofMultiInsertStateData can be private as each tableam would return aderivative of that struct (like with the destreceivers).

One thing I'm wondering is in which memory context the slots end upbeing allocated. I'd assume we would want to keep the slots aroundbetween flushes. If they are in the temporary context this might proveproblematic however?


Regards,
Luc

Re: Multi Inserts in CREATE TABLE AS - revived patch

Reply via email to