> On Thu, Oct 10, 2013 at 3:57 PM, Bruno D. Rodrigues <
> bruno.rodrig...@litux.org> wrote:
> 
>> My personal newbie experience, which is surely completely wrong and
>> miss-configured, got me up to 70MB/sec, either with controlled 1K messages
>> (hence 70Kmsg/sec) as well as with more random data (test data from 100
>> bytes to a couple MB). First I thought the 70MB were the hard disk limit,
>> but when I got the same result both with a proper linux server with a 10K
>> disk, as well as with a Mac mini with a 5400rpm disk, I got confused.
>> 
>> The mini has 2G, the linux server has 8 or 16, can'r recall at the moment.
>> 
>> The test was performed both with single and multi producers and consumers.
>> One producer = 70MB, two producers = 35MB each and so forth. Running
>> standalone instances on each server, same value. Running both together in 2
>> partition 2 replica crossed mode, same result.
>> 
>> As far as I understood, more memory just means more kernel buffer space to
>> speed up the lack of disk speed, as kafka seems to not really depend on
>> memory for the queueing.

A 11/10/2013, às 17:28, Guozhang Wang <wangg...@gmail.com> escreveu:

> Hello,
> 
> In most cases of Kafka, network bottleneck will be hit before the disk
> bottleneck. So maybe you want to check your network capacity to see if it
> has been saturated.

They are all connected to Gbit ethernet cards and proper network routers. I can 
easily get way above 950Mbps up and down between each machine and even between 
multiple machines. Gbit is 128MB/s. 70MB/s is 560Kbps. So far so good, 56% 
network capacity is a goodish value. But then I enable snappy, get the same 
70MB on the input and output side, and 20MB/sec on the network, so it surely 
isn't network limits. It's also not on the input or output side - the input 
reads a pre-processed MMaped file that reads at 150MB/sec without cache (SSD) 
up to 3GB/sec when loaded into memory. The output simply counts the messages 
and size of them.

One weird thing is that the kafka process seems to not cross the 100% cpu on 
the top or equivalent command. Top shows 100% for each CPU, so a multi-threaded 
process should go up to 400% (both the linux and mac mini are 2 CPU with 
hiperthreading, so "almost" 4 cpus).


Attachment: signature.asc
Description: Message signed with OpenPGP using GPGMail

Reply via email to