Re: what happened in case of single disk failure

2020-03-11 Thread 张祥
Hi, Peter, following what we talked about before, I want to understand what will happen when one broker goes down, I would say it will be very similar to what happens under disk failure, except that the rules apply to all the partitions on that broker instead of only one malfunctioned disk. Am I ri

Re: what happened in case of single disk failure

2020-03-11 Thread Peter Bukowinski
Yes, that’s correct. While a broker is down: all topic partitions assigned to that broker will be under-replicated topic partitions with an unmet minimum ISR count will be offline leadership of partitions meeting the minimum ISR count will move to the next in-sync replica in the replica list if n

log.dirs and SSDs

2020-03-11 Thread Eugen Dueck
So log.dirs should contain only one entry per HDD disk, to avoid random seeks. What about SSDs? Can throughput be increased by specifying multiple directories on the same SSD?

Re: log.dirs and SSDs

2020-03-11 Thread Peter Bukowinski
> On Mar 11, 2020, at 4:28 PM, Eugen Dueck wrote: > > So log.dirs should contain only one entry per HDD disk, to avoid random seeks. > What about SSDs? Can throughput be increased by specifying multiple > directories on the same SSD? Given a constant number of partitions, I don’t see any advan

Re: log.dirs and SSDs

2020-03-11 Thread Eugen Dueck
So there is not e.g. a single thread responsible per directory in log.dirs that could become a bottleneck relative to SSD throughput of GB/s? This is in fact the case for Apache Pulsar, and the openmessaging benchmark uses 4 directories on the same SSD to increase throughput. __

Re: log.dirs and SSDs

2020-03-11 Thread Peter Bukowinski
Couldn’t the same be accomplished by increasing the num.io.threads broker setting? > On Mar 11, 2020, at 5:15 PM, Eugen Dueck wrote: > > So there is not e.g. a single thread responsible per directory in log.dirs > that could become a bottleneck relative to SSD throughput of GB/s? > > This is

Re: log.dirs and SSDs

2020-03-11 Thread Eugen Dueck
I'm asking the questions here! 🙂 So is that the way to tune the broker if it does not achieve disk throughput? 差出人: Peter Bukowinski 送信日時: 2020年3月12日 9:38 Couldn’t the same be accomplished by increasing the num.io.threads broker setting? > On Mar 11, 2020, at 5

Re: what happened in case of single disk failure

2020-03-11 Thread 张祥
Thanks, very helpful ! Peter Bukowinski 于2020年3月12日周四 上午5:48写道: > Yes, that’s correct. While a broker is down: > > all topic partitions assigned to that broker will be under-replicated > topic partitions with an unmet minimum ISR count will be offline > leadership of partitions meeting the minim

Re: log.dirs and SSDs

2020-03-11 Thread Eugen Dueck
The reason I'm asking is because the documentation is quite brief: num.io.threads: The number of threads that the server uses for processing requests, which may include disk I/O And none of the google hits revealed any more details. I'm asking the questions her

Re: log.dirs and SSDs

2020-03-11 Thread Peter Bukowinski
Hah :) I think this deserves an experiment. I’d try setting up some tests with one, two, four, and eight log directories per disk and running some performance tests. I’d be interested to see your results. > On Mar 11, 2020, at 5:45 PM, Eugen Dueck wrote: > > I'm asking the questions here! 🙂 >

Re: log.dirs and SSDs

2020-03-11 Thread Eugen Dueck
Thanks! Will let this list know if and when I run a log.dirs vs. num.io.threads test. Hah :) I think this deserves an experiment. I’d try setting up some tests with one, two, four, and eight log directories per disk and running some performance tests. I’d be i

Sequential writes make Kafka fast, or so they say

2020-03-11 Thread Eugen Dueck
A question about something that was always in the back of my mind. According to Jay Kreps > The first [reason that Kafka is so fast despite writing to disk] is that > Kafka does only sequential file I/O. I wonder how true this statement is, because Kafka uses 3 segments per partition. so even