It's probably useful for a client to know whether its requests are throttled or not (e.g., for monitoring and alerting). From that perspective, option B (delay the requests and return an error) seems better.
Thanks, Jun On Wed, Mar 4, 2015 at 3:51 PM, Aditya Auradkar < aaurad...@linkedin.com.invalid> wrote: > Posted a KIP for quotas in kafka. > https://cwiki.apache.org/confluence/display/KAFKA/KIP-13+-+Quotas > > Appreciate any feedback. > > Aditya >