Hi, yubiao, I think it's an implementation detail that shouldn't be exposed to the user at all. Yours sincerely, Xiangying Meng
On Mon, Aug 15, 2022 at 8:02 PM Yubiao Feng <yubiao.f...@streamnative.io.invalid> wrote: > Hi Xiangying > > Thank you for your reply. Sorry, I have one more question: > > > If these operations are failed at operation 2, the old snapshots will be > covered by the new large snapshot when compact due to they have the same > sequence ID. > > Can the sequence id generation strategy be added to the doc? > > On Mon, Aug 15, 2022 at 6:35 PM Xiangying Meng <xiangy...@apache.org> > wrote: > > > Hi, yubiao, > > First of all, thanks for the attention and questions. Then for your three > > questions: > > 1. > > > Does the merge take place in memory or in BK? > > The snapshot will merge in BK. For specific details, you can see detailed > > instructions in the* ### Merge snapshot section.* > > 2. > > >How do we ensure the atomicity of the two writes, I suggest adding a > check > > We do not guarantee their atomicity. The position of the snapshot is > > generally unchanged, so the previous index is also valid. If the index > > write fails after a snapshot is written, the final result is that the > > snapshot write fails this time. There will be no other worse results, and > > no dirty data will be introduced due to compression. > > 3. > > >Clean up unused aborts data > > Snapshot cleanup can be found in *####take snapshot ##### How*. > > The cleanup of the index is done automatically by the compressor. I will > > add it at *### Snapshot index topic.* > > > > yours sincerely, > > Xiangying Meng > > > > > > > > > > On Mon, Aug 15, 2022 at 3:56 PM Yubiao Feng > > <yubiao.f...@streamnative.io.invalid> wrote: > > > > > Hi Xiangying > > > > > > I think Multiple-snapshots for TB is a good idea. And I have these > > > questions: > > > > > > > > > > The number of the transactions in a snapshot can be configured, and > we > > > hope it is small, then we can merge the small snapshots into a large > > > snapshot when it reaches a configured number. > > > > > > Does the merge take place in memory or in BK? > > > > > > - If we merge small-snapshot in memory, can we just use large-snapshot? > > > - If we merge small-snapshot in BK, how to do it? > > > > > > > > > > > > > The index is written after each multiple-snapshot is written. > > > > > > Snapshot and index are stored in different topics, right? > > > > > > How do we ensure the atomicity of the two writes, I suggest adding a > > check > > > mechanism that snapshot not recorded in the index is invalid. > > > > > > > > > > > > > #### Clean up unused aborts data > > > > > > Now, this section only has instructions for clear snapshots. > > > I think we should add this: how to delete/override the index data. > > > > > > Thanks > > > Yubiao Feng > > > > > > On Thu, Aug 4, 2022 at 10:27 AM Xiangying Meng <xiangy...@apache.org> > > > wrote: > > > > > > > Hi, Pulsar community, > > > > I`d like to start a discussion about transaction multiple-snapshot. > > > > In order to get rid of the capacity limitation of the bookkeeper > entry, > > > we > > > > plan to use multiple snapshots. More details can be found here > > > > <https://github.com/apache/pulsar/issues/16913>. > > > > > > > > Yours sincerely, > > > > Xiangying Meng > > > > > > > > > >