[ 
https://issues.apache.org/jira/browse/KAFKA-3554?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15241270#comment-15241270
 ] 

Ismael Juma commented on KAFKA-3554:
------------------------------------

Thanks for filing this, [~becket_qin]. I agree that this is important; it 
came up in the following PR:

https://github.com/apache/kafka/pull/1221

I changed the payload to be randomly generated from a smallish int range, which 
I think is good enough for that PR. However, it would be great to improve it as 
suggested in this JIRA.
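
For illustration only, here is a minimal sketch of what "randomly generated from a smallish int range" could look like. It is not the code from the PR: the class name `BoundedIntPayload`, the method `generate`, and the choice to write big-endian ints and zero-pad the tail are all assumptions made for this example.

```java
import java.nio.ByteBuffer;
import java.util.Random;

// Hypothetical helper for illustration; not part of Kafka's ProducerPerformance.
final class BoundedIntPayload {

    // Builds a payload of exactly recordSize bytes made of ints drawn
    // uniformly from [0, range). A smaller range means more repetition
    // and therefore a more compressible payload.
    static byte[] generate(int recordSize, int range, Random random) {
        if (recordSize < 0 || range <= 0) {
            throw new IllegalArgumentException("recordSize must be >= 0 and range must be > 0");
        }
        // ByteBuffer.allocate zero-fills, so any trailing bytes that do not
        // fit a whole int (recordSize not a multiple of 4) stay zero.
        ByteBuffer buffer = ByteBuffer.allocate(recordSize);
        while (buffer.remaining() >= Integer.BYTES) {
            buffer.putInt(random.nextInt(range));
        }
        return buffer.array();
    }
}
```

A caller would pick `range` by hand here, e.g. `BoundedIntPayload.generate(1024, 16, new Random())`; the value 16 is arbitrary.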

> Generate actual data with specific compression ratio in the 
> ProducerPerformance tool.
> -------------------------------------------------------------------------------------
>
>                 Key: KAFKA-3554
>                 URL: https://issues.apache.org/jira/browse/KAFKA-3554
>             Project: Kafka
>          Issue Type: Improvement
>    Affects Versions: 0.9.0.1
>            Reporter: Jiangjie Qin
>            Assignee: Dong Lin
>             Fix For: 0.10.1.0
>
>
> Currently, ProducerPerformance always generates the payload with the same 
> bytes. This does not work well for testing compressed data, because the 
> payload is extremely compressible no matter how big it is.
> We can make some changes to make the tool more useful for compressed messages. 
> Currently I am generating a payload containing integers from a given range. 
> By adjusting the range of the integers, we can get different compression 
> ratios. 
> API-wise, we can either let the user specify the integer range or the expected 
> compression ratio (we would do some probing to find the corresponding range for 
> the user).
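
The probing idea in the description could be sketched roughly as follows. This is an assumption-laden illustration, not the implementation proposed in the JIRA: the class `CompressionRatioProbe` and its methods are hypothetical, GZIP stands in for whichever codec the producer is configured with, "compression ratio" is taken to mean compressed size divided by uncompressed size, and only power-of-two ranges are tried.

```java
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.UncheckedIOException;
import java.nio.ByteBuffer;
import java.util.Random;
import java.util.zip.GZIPOutputStream;

// Hypothetical sketch; names and probing strategy are assumptions.
final class CompressionRatioProbe {

    // Payload of `size` bytes filled with ints drawn uniformly from [0, range).
    static byte[] payload(int size, int range, Random random) {
        ByteBuffer buffer = ByteBuffer.allocate(size);
        while (buffer.remaining() >= Integer.BYTES) {
            buffer.putInt(random.nextInt(range));
        }
        return buffer.array();
    }

    // Compressed size divided by original size, using GZIP as a stand-in codec.
    static double gzipRatio(byte[] data) {
        if (data.length == 0) {
            throw new IllegalArgumentException("data must not be empty");
        }
        ByteArrayOutputStream out = new ByteArrayOutputStream();
        try (GZIPOutputStream gzip = new GZIPOutputStream(out)) {
            gzip.write(data);
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        // The stream is closed here, so the GZIP trailer has been written.
        return (double) out.size() / data.length;
    }

    // Tries ranges 1, 2, 4, ..., 2^30 and returns the first one whose sample
    // payload compresses to at least targetRatio of its original size.
    // Falls back to 2^30 if no tried range reaches the target.
    static int probeRange(double targetRatio, int sampleSize, long seed) {
        for (int bits = 0; bits <= 30; bits++) {
            int range = 1 << bits;
            double ratio = gzipRatio(payload(sampleSize, range, new Random(seed)));
            if (ratio >= targetRatio) {
                return range;
            }
        }
        return 1 << 30;
    }
}
```

With this shape, a user-facing option could accept either the range directly or a target ratio that is first passed through something like `probeRange`. The measured ratio depends on the codec and the sample size, so a real tool would likely probe with the same compression type and record size it is about to benchmark.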



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
