Hi Divij, Thanks for taking a look!
1. In order to retrieve the sizes, the plan is to use getTotalSpace() and getUsableSpace() from java.nio.file.FileStore. The implementations may vary depending on the filesystem but these calls typically don't depend on the size of storage but instead just return metadata the filesystem maintains. 2. I'm not an expert on KIP-405, so correct me if I'm wrong. As far as I understand brokers will still have local log dirs and remote volumes are not counted as log dirs. KIP-405 does not mention updating the DescribeLogDirs API. So I don't think this KIP needs to do anything special to be compatible with KIP-405. On the other hand, I wonder if KIP-405 should update DescribeLogDirs to provide details about the location of replicas. 3. Counting files can be a slow operation as it requires exploring all paths recursively to find all files. Administrators should definitively monitor file descriptors via host metrics but I'm not sure it's something we want to expose via the Kafka API. As mentioned it could be slow to compute and files are not really a Kafka concept. 4. DescribeLogDirs is usually a low volume API. This change should not significantly affect the latency of this API. Thanks, Mickael On Thu, Apr 7, 2022 at 1:41 PM Divij Vaidya <divijvaidy...@gmail.com> wrote: > > Hi Mickael > > Thanks for starting this. It is a very useful feature. > > Some initial thoughts (I am new to Kafka so please excuse if these are > naive suggestions): > 1. What is the impact on latency of the DescribeLogDirs API due to this > change? Would calculating the totalSpace from each logdir be a bottleneck > for the API? What if we are talking about a large storage size in the order > of hundred (or tens) of GBs? > 2. How does this fit in with RemoteStorage (KIP-405)? I think integration > with KIP-405 is worth discussing in the scope of this KIP. My > recommendation will be to add a new API in the RLMM > (RemoteLogMetadataManager) called GetLogSize() and leave it upto the remote > storage to perform a concrete implementation for this > interface. DescribeLogDirs could call this interface internally to provide > the relevant information. > 3. Do you think adding the number of files in the directory as part of the > API response will be useful as well? e.g. a use case where this information > will be useful is to monitor/alarm the situations when the number of files > are dangerously reaching the max value of file descriptors configured at > the OS. > 4. Please add an API latency perf test as part of the release criteria for > this change. We want to avoid regression. > > Regards, > Divij Vaidya > > > > On Thu, Apr 7, 2022 at 11:17 AM Mickael Maison <mickael.mai...@gmail.com> > wrote: > > > Hi, > > > > I wrote a small KIP to expose the total and usable space of logdirs > > via the DescribeLogDirs API: > > > > https://cwiki.apache.org/confluence/display/KAFKA/KIP-827%3A+Expose+logdirs+total+and+usable+space+via+Kafka+API > > > > Please take a look and let me know if you have any feedback. > > > > Thanks, > > Mickael > >