bq. an older timestamp that allowed

I guess you meant 'than allowed'

Cheers

On Tue, Nov 21, 2017 at 2:57 PM, Matthias J. Sax <matth...@confluent.io>
wrote:

> This is possible, but I think you don't need the time-based index for it :)
>
> You will just buffer up all messages for a 5 minute sliding-window and
> maintain all message sorted by timestamp in this window. Each time the
> window "moves" you write the oldest records that "drop out" of the
> window to the topic. If you get a record with an older timestamp that
> allowed, you don't insert in into the window but drop it.
>
> The timestamp index is useful if you want to seek to a specific offset
> base on timestamp. But I don't think you need this for your use case.
>
>
>
> -Matthias
>
> On 11/21/17 1:39 PM, Ray Ruvinskiy wrote:
> > I’ve been reading https://cwiki.apache.org/confluence/display/KAFKA/KIP-
> 33+-+Add+a+time+based+log+index and trying to determine whether I can use
> the time-based index as an efficient way to sort a stream of messages into
> timestamp (CreateTime) order.
> >
> > I am dealing with a number of sources emitting messages that are then
> processed in a distributed fashion and written to a Kafka topic. During
> this processing, the original order of the messages is not strictly
> maintained. Each message has an embedded timestamp. I’d like to be able to
> sort these messages back into timestamp order, allowing for a certain
> lateness interval, before processing them further. For example, supposing
> the lateness interval is 5 minutes, at time T I’d like to consume from the
> topic all messages with timestamp up to (T - 5 minutes), in timestamp
> order. The assumption is that a message should be no more than 5 minutes
> late; if it is more than 5 minutes late, it can be discarded. Is this
> something that can be done with the time-based index?
> >
> > Thanks,
> >
> > Ray
> >
>
>

Reply via email to