Re: [DISCUSS] PIP-186: Introduce two phase deletion protocol based on system topic

Asaf Mesika Thu, 02 Feb 2023 06:34:46 -0800

Heesung points out really good points, which actually made me add one more:


Can you please describe all the triggers for ledger deletion? Some are user
facing, some not.


On Mon, Jan 30, 2023 at 8:19 PM Heesung Sohn
<[email protected]> wrote:

> Hi,
>
>
> I assume the deletion APIs are async(when a user requests deletion, pulsar
> first returns success to the user if the request is persisted. Then, Pulsar
> asynchronously runs the deletion flow)
>
> - Have we considered a metadata store to persist and dedup deletion
> requests instead of the system topic? Why is the system topic the better
> choice than a metadata store for this problem?
>
> - How does Pulsar deduplicate deletion requests(error out to users) while
> the deletion request is running?
>
> - How do users track async deletion flow status? (do we expose any
> describeDeletion API to show the deletion status?)
>
>
> Thanks,
> Heesung
>
>
>
>
> On Mon, Jan 30, 2023 at 6:10 AM Yan Zhao <[email protected]> wrote:
>
> > > Couples of notes:
> > >
> > > 1.
> > >
> > > > In the LedgerDeletionService  start, it will create  a producer to
> send
> > > > pending delete ledger.
> > > > When delete a ledger,  a PendingDeleteLedgerInfo msg with 1 min delay
> > (the
> > > > delay is for consumer side, if send it immediately, maybe the
> metadata
> > > > din't change when consumer receive it). After the send operation
> > succeeds,
> > > >  then to operate metadata. If send msg failed, we think this deletion
> > > > operation failed, and didn't operate metadata.
> > >
> > >
> > > This section is completely unclear to me. Can you please provide step
> by
> > > step exactly what will happen in the workflow?
> > > Who does what and where (node)?
> >
> > In PulsarService, it defines ledgerDeletionService. The
> > ledgerDeletionService will create a producer and a consumer, which looks
> > like topicPoliciesService. The PulsarService passes it to ManagedLedger.
> > When pulsar wants to delete a ledger, ManagedLedger uses
> > ledgerDeletionService to send a message, the message content contains the
> > waiting delete ledger info. After sending success, delete the ledger id
> > from the metadata store.
> > The consumer receives the message, it will use PulsarClient to send a
> > delete command to the corresponding broker, the broker receives delete
> > command, and do the actual delete operation.
> >
> > In https://github.com/apache/pulsar/issues/16569, these are some
> pictures
> > for the workflow.
> >
> >
> > >
> > > 2.
> > >
> > > > /**
> > > >      * The ledger component . managed-ledger, managed-cursor and
> > > > schema-storage.
> > > >      */
> > > >     private LedgerComponent ledgerComponent;
> > >
> > >
> > > Why is this needed?
> > > What do you mean by a component of a ledger? Is the ledger divided into
> > > components?
> >
> > It's the ledger source, (MANAGED_LEDGER,MANAGED_CURSOR,SCHEMA_STORAGE)
> > When the broker wants to delete a ledger, we will check if the bookkeeper
> > metadata matches or not. In the pulsar, it will mark the ledger source
> when
> > creating a new ledger. See LedgerMetadataUtils.
> >
> > >
> > > 3.
> > >
> > > >   /**
> > > >      * The ledger type. ledger or offload-ledger.
> > > >      */
> > > >     private LedgerType ledgerType;
> > >
> > >
> > > I don't understand why you need this type.
> > It marks the ledger as a normal ledger or an offload ledger, broker need
> > it to determine whether to delete bookkeeper data or offload data.
> >
> > >
> > > 4.
> > >
> > > > private MLDataFormats.ManagedLedgerInfo.LedgerInfo context;
> > >
> > >
> > > Context is a very generic word, but your type is so specific. Can you
> > > please explain why you need this for?
> > >
> > > Are you sure you want to tie one data structure into the other - just
> > > validating.
> > It's for the offloaded ledger, when we want to delete the offload ledger,
> > we need offloadedContextUuid, here we can simplify it to
> offloadContextUuid.
> >
> > >
> > > 5.
> > >
> > > >   /**
> > > >      * Extent properties.
> > > >      */
> > > >     private Map<String, String> properties = new HashMap<>();
> > >
> > >
> > > Why is this needed?
> > It's for extended.
> >
> > >
> > >
> > > 6.
> > >
> > > > When receiving a pending delete ledger msg, we will check if the
> topic
> > > > still exists. If the topic exists, send a delete command
> > > > (PendingDelteLedger) to the broker which owns the topic. In the
> > broker, it
> > > > will check if the ledger is still in the metadata store, if the
> ledger
> > in
> > > > the metadata store means the ledger is still in use, give up to
> delete
> > this
> > > > ledge
> > >
> > >
> > > I don't understand this workflow. You say you check if it's in the
> > metadata
> > > store, and if it is , then it is used - what will make it unused?
> > > Can you explain the starting point? How does deletion work in general?
> > > When? What happens? ... I understand there are time based triggers, and
> > > sometimes used based triggers. They are somehow marked in metadata.
> > In https://github.com/apache/pulsar/issues/16569. The first step section
> > and second step section process flow picture are detailed.
> >
> > >
> > > 7.
> > >
> > > If we delete successfully, the consumer will ack this msg. If delete
> > fails,
> > > > reconsume this msg 10 min later.
> > >
> > >
> > > Where did you define 10min?
> > > Why 10 min?
> > If delete fails, that means the storage system occur some problems. I
> > guess the storage system will recovery in 10 mins.
> >
> > In https://github.com/apache/pulsar/issues/16569, we define
> > reconsumeLaterOfTopicTwoPhaseDeletionInSeconds in the
> ServiceConfiguration,
> > it's configurable.
> > private int reconsumeLaterOfTopicTwoPhaseDeletionInSeconds = 600;
> >
> > >
> > > You mentioned you are working with a client which has retries
> configured.
> > > Retry is client side based, ack one message while producing another,
> > > transaction free. Are you prepared to handle a case where you acked but
> > > failed to produce the message, hence you lost it completely?
> > >
> > The pulsarClient only sends a new message that succeeds, then ack the
> > origin message, so didn't care in this case.
> >
> > > 8.
> > >
> > > > If we want to delete a topic, we should send the delete ledger msg to
> > > > system topic and remove ledger id from metadata one by one, after all
> > the
> > > > ledger has been deleted, then delete the topic. If any ledger operate
> > > > failed, we think this delete topic operation failed and return the
> left
> > > > ledger id in the response.
> > >
> > > I couldn't understand. Can you please clarify this section. How exactly
> > > topic deletion is modified to use the features described in this pip?
> > >
> > We need to ensure that all ledgers are deleted before the topic is
> > deleted, otherwise, there will be orphan ledgers.
> >
> > > 9.>
> > > Regarding configuration. I suggest we prefix all config keys with the
> > > feature name so we can easily separate them.
> > That's fine.
> >
> > >
> > > 10.
> > > Backward compatibility - I would make sure to document in the docs
> > exactly
> > > what to do in case we need to rollback (cook book).
> > Well.
> >
> > >
> > > 11.
> > > General comment - You're basically implementing a bespoke workflow
> using
> > a
> > > topic to save the state of where you are in the topic.
> > > Is this the only place in Pulsar (delete ledger) that an action is
> > composed
> > > of several steps ?
> > > If the answer is no to this, wouldn't it be better to have a small
> > utility
> > > which is in charge of moving through the workflow steps? It can even
> be a
> > > simple state enum, where you move your state from a to b to c to d and
> it
> > > is persisted.
> > We need to persist in the middle steps, and we didn't want to operate the
> > metadata store continually, so used pulsar to persist it.
> >
> >
> > > 12. Monitoring
> > > Some actions here can take a long time. We're basically relying on logs
> > to
> > > monitor where we are in the flow?
> > yes, we didn't trace the ledger deletion steps. we only use stats to
> > record whether the delete operation succeeds or not.
> >
>

Re: [DISCUSS] PIP-186: Introduce two phase deletion protocol based on system topic

Reply via email to