Re: new log.dirs property (as opposed to log.dir)

2013-08-15 Thread Jay Kreps
My assumption is that you have something like log.dirs=/mnt/diska,/mnt/diskb,/mnt/diskc Let's say that diskc goes bad. The expected behavior is that the kafka broker gets an error on write and kills itself. When that happens you have two options. Either you remove the drive and add a new one a

Re: new log.dirs property (as opposed to log.dir)

2013-08-15 Thread Vadim Keylis
Good morning Jay. When you say delete directory from the list, did you mean from file system? Can I see through JMX which partitions online and which one are not? Thanks On Thu, Aug 15, 2013 at 10:04 AM, Jason Rosenberg wrote: > Thanks Jay, I'll do some testing with this and report back. > > J

Re: new log.dirs property (as opposed to log.dir)

2013-08-15 Thread Jason Rosenberg
Thanks Jay, I'll do some testing with this and report back. Jason On Thu, Aug 15, 2013 at 7:10 AM, Jay Kreps wrote: > I believe either should work. The broker has a record of what it should > have in zk and will recreate any missing logs. Try it to make sure though. > > Sent from my iPhone > >

Re: new log.dirs property (as opposed to log.dir)

2013-08-15 Thread Jay Kreps
I believe either should work. The broker has a record of what it should have in zk and will recreate any missing logs. Try it to make sure though. Sent from my iPhone On Aug 15, 2013, at 12:52 AM, Jason Rosenberg wrote: > Ok, that makes sense that the broker will shut itself down. > > If we b

Re: new log.dirs property (as opposed to log.dir)

2013-08-15 Thread Jason Rosenberg
Ok, that makes sense that the broker will shut itself down. If we bring it back up, can this be with an altered set of log.dirs? Will the destroyed partitions get rebuilt on a new log.dir? Or do we have to bring it back up with a new or repaired disk, matching the old log.dir, in order for those

Re: new log.dirs property (as opposed to log.dir)

2013-08-14 Thread Jay Kreps
If you get a disk error that results in an IOException the broker will shut itself down. You would then have the option of replacing the disk or deleting that data directory from the list. When the broker is brought back up the intact partitions will quickly catch up and be online; the destroyed

Re: new log.dirs property (as opposed to log.dir)

2013-08-14 Thread Jason Rosenberg
I'm getting ready to try out this configuration (use multiple disks, no RAID, per broker). One concern is the procedure for recovering if there is a disk failure. If a disk fails, will the broker go offline, or will it continue serving partitions on its remaining good disks? And if so, is there

Re: new log.dirs property (as opposed to log.dir)

2013-06-20 Thread Jason Rosenberg
yeah, that would work! On Thu, Jun 20, 2013 at 1:20 PM, Jay Kreps wrote: > Yeah we didn't go as far as adding weighting or anything like that--I > think we'd be open to a patch that did that as long as it was > optional. In the short term you can obviously add multiple directories > on the same

Re: new log.dirs property (as opposed to log.dir)

2013-06-20 Thread Jay Kreps
Yeah we didn't go as far as adding weighting or anything like that--I think we'd be open to a patch that did that as long as it was optional. In the short term you can obviously add multiple directories on the same disk to increase its share. -Jay On Thu, Jun 20, 2013 at 12:59 PM, Jason Rosenberg

Re: new log.dirs property (as opposed to log.dir)

2013-06-20 Thread Jason Rosenberg
This sounds like a great idea, to just disks as "just a bunch of disks" or JBOD.hdfs works well this way. Do all the disks need to be the same size, to use them evenly? Since it will allocate partitions randomly? It would be nice if you had 2 disks, with one twice as large as the other, if t

Re: new log.dirs property (as opposed to log.dir)

2013-06-20 Thread Jay Kreps
The intention is to allow the use of multiple disks without RAID or logical volume management. We have found that there are a lot of downsides to RAID--in particular a huge throughput hit. Since we already have a parallelism model due to partitioning and a fault tolerance model with replication RAI

Re: new log.dirs property (as opposed to log.dir)

2013-06-20 Thread Jun Rao
No, our replica assignment logic won't allow that during topic creation. Thanks, Jun On Wed, Jun 19, 2013 at 11:01 PM, Jason Rosenberg wrote: > is it possible for a partition to have multiple replicas on different > directories on the same broker? (hopefully no!) > > > On Wed, Jun 19, 2013 a

Re: new log.dirs property (as opposed to log.dir)

2013-06-19 Thread Jason Rosenberg
is it possible for a partition to have multiple replicas on different directories on the same broker? (hopefully no!) On Wed, Jun 19, 2013 at 10:47 PM, Jun Rao wrote: > It takes a comma separated list and partition replicas are randomly > distributed to the list. > > Thanks, > > Jun > > > On W

Re: new log.dirs property (as opposed to log.dir)

2013-06-19 Thread Jun Rao
It takes a comma separated list and partition replicas are randomly distributed to the list. Thanks, Jun On Wed, Jun 19, 2013 at 10:25 PM, Jason Rosenberg wrote: > In the 0.8 config, log.dir is now log.dirs. It looks like the singular > log.dir is still supported, but under the covers the pr

new log.dirs property (as opposed to log.dir)

2013-06-19 Thread Jason Rosenberg
In the 0.8 config, log.dir is now log.dirs. It looks like the singular log.dir is still supported, but under the covers the property is log.dirs. I'm curious, does this take a comma separated list of directories? The new config page just says: "The directories in which the log data is kept" Als