Hey Jun, Thanks for this information. I am not aware of this difference between the purge and delete. Given this difference, I will prefer to the existing name of the purge.
Ismael, please let me if you are strong about using delete. Thanks, Dong On Thu, Mar 2, 2017 at 9:40 AM, Jun Rao <j...@confluent.io> wrote: > Hi, Dong, > > It seems that delete means removing everything while purge means removing a > portion. So, it seems that it's better to be able to distinguish the two? > > Thanks, > > Jun > > On Wed, Mar 1, 2017 at 1:57 PM, Dong Lin <lindon...@gmail.com> wrote: > > > Hi all, > > > > I have updated the KIP to include a script that allows user to purge data > > by providing a map from partition to offset. I think this script may be > > convenience and useful, e.g., if user simply wants to purge all data of > > given partitions from command line. I am wondering if anyone object this > > script or has suggestions on the interface. > > > > Besides, Ismael commented in the pull request that it may be better to > > rename PurgeDataBefore() to DeleteDataBefore() and rename PurgeRequest to > > DeleteRequest. I think it may be a good idea because kafka-topics.sh > > already use "delete" as an option. Personally I don't have strong > > preference between "purge" and "delete". I am wondering if anyone object > to > > this change. > > > > Thanks, > > Dong > > > > > > > > On Wed, Mar 1, 2017 at 9:46 AM, Dong Lin <lindon...@gmail.com> wrote: > > > > > Hi Ismael, > > > > > > I actually mean log_start_offset. I realized that it is a better name > > > after I start implementation because "logStartOffset" is already used > in > > > Log.scala and LogCleanerManager.scala. So I changed it from > > > log_begin_offset to log_start_offset in the patch. But I forgot to > update > > > the KIP and specify it in the mailing thread. > > > > > > Thanks for catching this. Let me update the KIP to reflect this change. > > > > > > Dong > > > > > > > > > On Wed, Mar 1, 2017 at 6:15 AM, Ismael Juma <ism...@juma.me.uk> wrote: > > > > > >> Hi Dong, > > >> > > >> When you say "logStartOffset", do you mean "log_begin_offset "? I > could > > >> only find the latter in the KIP. If so, would log_start_offset be a > > better > > >> name? > > >> > > >> Ismael > > >> > > >> On Tue, Feb 28, 2017 at 4:26 AM, Dong Lin <lindon...@gmail.com> > wrote: > > >> > > >> > Hi Jun and everyone, > > >> > > > >> > I would like to change the KIP in the following way. Currently, if > any > > >> > replica if offline, the purge result for a partition will > > >> > be NotEnoughReplicasException and its low_watermark will be 0. The > > >> > motivation for this approach is that we want to guarantee that the > > data > > >> > before purgedOffset has been deleted on all replicas of this > partition > > >> if > > >> > purge result indicates success. > > >> > > > >> > But this approach seems too conservative. It should be sufficient in > > >> most > > >> > cases to just tell user success and set low_watermark to minimum > > >> > logStartOffset of all live replicas in the PurgeResponse if > > >> logStartOffset > > >> > of all live replicas have reached purgedOffset. This is because for > an > > >> > offline replicas to become online and be elected leader, it should > > have > > >> > received one FetchReponse from the current leader which should tell > it > > >> to > > >> > purge beyond purgedOffset. The benefit of doing this change is that > we > > >> can > > >> > allow purge operation to succeed when some replica is offline. > > >> > > > >> > Are you OK with this change? If so, I will go ahead to update the > KIP > > >> and > > >> > implement this behavior. > > >> > > > >> > Thanks, > > >> > Dong > > >> > > > >> > > > >> > > > >> > On Tue, Jan 17, 2017 at 10:18 AM, Dong Lin <lindon...@gmail.com> > > wrote: > > >> > > > >> > > Hey Jun, > > >> > > > > >> > > Do you have time to review the KIP again or vote for it? > > >> > > > > >> > > Hey Ewen, > > >> > > > > >> > > Can you also review the KIP again or vote for it? I have discussed > > >> with > > >> > > Radai and Becket regarding your concern. We still think putting it > > in > > >> > Admin > > >> > > Client seems more intuitive because there is use-case where > > >> application > > >> > > which manages topic or produces data may also want to purge data. > It > > >> > seems > > >> > > weird if they need to create a consumer to do this. > > >> > > > > >> > > Thanks, > > >> > > Dong > > >> > > > > >> > > On Thu, Jan 12, 2017 at 9:34 AM, Mayuresh Gharat < > > >> > > gharatmayures...@gmail.com> wrote: > > >> > > > > >> > >> +1 (non-binding) > > >> > >> > > >> > >> Thanks, > > >> > >> > > >> > >> Mayuresh > > >> > >> > > >> > >> On Wed, Jan 11, 2017 at 1:03 PM, Dong Lin <lindon...@gmail.com> > > >> wrote: > > >> > >> > > >> > >> > Sorry for the duplicated email. It seems that gmail will put > the > > >> > voting > > >> > >> > email in this thread if I simply replace DISCUSS with VOTE in > the > > >> > >> subject. > > >> > >> > > > >> > >> > On Wed, Jan 11, 2017 at 12:57 PM, Dong Lin < > lindon...@gmail.com> > > >> > wrote: > > >> > >> > > > >> > >> > > Hi all, > > >> > >> > > > > >> > >> > > It seems that there is no further concern with the KIP-107. > At > > >> this > > >> > >> point > > >> > >> > > we would like to start the voting process. The KIP can be > found > > >> at > > >> > >> > > https://cwiki.apache.org/confluence/display/KAFKA/KIP-107 > > >> > >> > > %3A+Add+purgeDataBefore%28%29+API+in+AdminClient. > > >> > >> > > > > >> > >> > > Thanks, > > >> > >> > > Dong > > >> > >> > > > > >> > >> > > > >> > >> > > >> > >> > > >> > >> > > >> > >> -- > > >> > >> -Regards, > > >> > >> Mayuresh R. Gharat > > >> > >> (862) 250-7125 > > >> > >> > > >> > > > > >> > > > > >> > > > >> > > > > > > > > >