Re: Question about the Log Compaction

2022-03-02 Thread Jun Rao
Hi, Liang,

Currently, we store the MD5 hash of the record key in OffsetMap. Since MD5 has a large output domain (16 bytes, i.e. 128 bits), we assume that no collisions occur there.

Thanks,
Jun

On Wed, Mar 2, 2022 at 1:20 AM 阮良 wrote:
> Hi all
>
> I am confused about the Log Compaction logic,use OffsetMap
> to deduplicating
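
To make the reply above concrete, here is a minimal Python sketch of the idea, not Kafka's actual implementation. The class `Md5OffsetMap` and the function `compact` are hypothetical names chosen for illustration; the sketch only assumes what the reply states, namely that the map is keyed by the 16-byte MD5 digest of the record key and remembers the latest offset seen for that digest.

```python
import hashlib


class Md5OffsetMap:
    """Toy stand-in for an offset map: MD5(key) -> latest offset seen."""

    def __init__(self):
        self._latest = {}

    def put(self, key: bytes, offset: int) -> None:
        # Only the 16-byte digest is stored, never the key itself.
        digest = hashlib.md5(key).digest()
        self._latest[digest] = max(offset, self._latest.get(digest, -1))

    def get(self, key: bytes) -> int:
        # Returns -1 when the key's digest has not been seen.
        return self._latest.get(hashlib.md5(key).digest(), -1)


def compact(records, offset_map):
    """Keep a record only if no later offset is known for its key's digest."""
    return [(k, o) for k, o in records if o >= offset_map.get(k)]


records = [(b"k1", 0), (b"k2", 1), (b"k1", 2)]
offset_map = Md5OffsetMap()
for key, offset in records:
    offset_map.put(key, offset)

compacted = compact(records, offset_map)
```

In this sketch the older record for `k1` (offset 0) is dropped and the two latest records survive. Two distinct keys would shadow each other only if their MD5 digests were identical, which is the case the reply assumes does not happen.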

Question about the Log Compaction

2022-03-02 Thread 阮良
Hi all,

I am confused about the Log Compaction logic, which uses OffsetMap to deduplicate the log. In my opinion, when there is a hash collision, data may be lost.

E.g.:
Record1(key1, offset1)
Record2(key2, offset2)
Condition: hash(key1) == hash(key2) && (offset1 < offset2)
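
The scenario in the question can be reproduced with a short Python sketch. Everything here is an assumption for illustration: `weak_hash` is a deliberately poor, hypothetical hash with a two-value domain so that a collision is guaranteed, and `compact_by_hash` is a toy deduplication pass, not Kafka's cleaner.

```python
def weak_hash(key: bytes) -> int:
    # Hypothetical hash with only two possible values, so collisions are easy.
    return len(key) % 2


def compact_by_hash(records, hash_fn):
    """Deduplicate by hash_fn(key), keeping only the highest offset per hash."""
    latest = {}
    for key, offset in records:
        h = hash_fn(key)
        latest[h] = max(offset, latest.get(h, -1))
    return [(k, o) for k, o in records if o >= latest[hash_fn(k)]]


# Record1(key1, offset1) and Record2(key2, offset2) with offset1 < offset2.
records = [(b"key1", 1), (b"key2", 2)]

# Both keys have length 4, so weak_hash collides and Record1 is discarded.
survivors = compact_by_hash(records, weak_hash)

# With a collision-free hash (here: the key itself) both records survive.
survivors_exact = compact_by_hash(records, lambda k: k)
```

With the colliding hash only `(b"key2", 2)` remains, which is the data loss described above; whether it can occur in practice depends on how likely a collision is for the hash actually used.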