[ https://issues.apache.org/jira/browse/KAFKA-3554?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15241270#comment-15241270 ]
Ismael Juma commented on KAFKA-3554:
------------------------------------

Thanks for filing this [~becket_qin]. I agree that this is important, and it came up in the following PR:

https://github.com/apache/kafka/pull/1221

I changed the payload to be randomly generated with a smallish int range, which I think is good enough for that PR. However, it would be great to improve it as suggested in this JIRA.

> Generate actual data with specific compression ratio in the ProducerPerformance tool.
> -------------------------------------------------------------------------------------
>
>                 Key: KAFKA-3554
>                 URL: https://issues.apache.org/jira/browse/KAFKA-3554
>             Project: Kafka
>          Issue Type: Improvement
>    Affects Versions: 0.9.0.1
>            Reporter: Jiangjie Qin
>            Assignee: Dong Lin
>             Fix For: 0.10.1.0
>
>
> Currently the ProducerPerformance tool always generates the payload with the same bytes. This does not work well for testing compressed data, because the payload is extremely compressible no matter how big it is.
> We can make some changes to make it more useful for compressed messages.
> Currently I am generating the payload from integers in a given range. By adjusting the range of the integers, we can get different compression ratios.
> API-wise, we can either let the user specify the integer range or the expected compression ratio (we will do some probing to find the corresponding range for the user).

--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
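To make the integer-range idea from the issue description concrete, here is a minimal sketch. It is not the code from the PR or from ProducerPerformance: the class name `PayloadSketch`, the helpers `randomIntPayload` and `gzipRatio`, the use of 4-byte ints, and the choice of GZIP as the codec are all assumptions made for illustration.

```java
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.UncheckedIOException;
import java.nio.ByteBuffer;
import java.util.Random;
import java.util.zip.GZIPOutputStream;

// Hypothetical illustration; not part of Kafka.
public class PayloadSketch {

    // Builds a payload of 4-byte ints drawn uniformly from [0, range).
    // The size is rounded down to a multiple of 4; range must be positive.
    static byte[] randomIntPayload(int size, int range, long seed) {
        Random random = new Random(seed);
        ByteBuffer buffer = ByteBuffer.allocate(size - size % 4);
        while (buffer.remaining() >= 4) {
            buffer.putInt(random.nextInt(range));
        }
        return buffer.array();
    }

    // Returns compressed size divided by original size, using GZIP
    // (an assumed codec; any supported codec could be probed the same way).
    static double gzipRatio(byte[] payload) {
        ByteArrayOutputStream out = new ByteArrayOutputStream();
        try (GZIPOutputStream gzip = new GZIPOutputStream(out)) {
            gzip.write(payload);
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        return (double) out.size() / payload.length;
    }

    public static void main(String[] args) {
        // A wider integer range should give a less compressible payload.
        for (int range : new int[] {2, 256, 65536, Integer.MAX_VALUE}) {
            byte[] payload = randomIntPayload(64 * 1024, range, 42L);
            System.out.printf("range=%d ratio=%.3f%n", range, gzipRatio(payload));
        }
    }
}
```

The probing mentioned in the description could, under the same assumptions, search over `range` until `gzipRatio` is close to the requested value.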