yihua commented on code in PR #13007:
URL: https://github.com/apache/hudi/pull/13007#discussion_r2019386700


##########
hudi-io/src/main/java/org/apache/hudi/storage/StorageSchemes.java:
##########
@@ -86,6 +88,11 @@ public enum StorageSchemes {
   // Hopsworks File System
   HOPSFS("hopsfs", false, true);
 
+  // list files may bring pressure to storage with centralized meta service 
like HDFS.
+  // when we want to get only part of files under a directory rather than all 
files, use getStatus may be more friendly than listStatus.
+  // here is a trade-off between rpc times and throughput of storage meta 
service
+  private static final Set<String> LIST_STATUS_FRIENDLY_SCHEMES = new 
HashSet<>(Arrays.asList(FILE.scheme, S3.scheme, S3A.scheme, GCS.scheme));

Review Comment:
   OK, I see your point.  If this is not going to be widely used, I agree that 
we can use the static variables.



##########
hudi-io/src/main/java/org/apache/hudi/storage/StorageSchemes.java:
##########
@@ -86,6 +88,11 @@ public enum StorageSchemes {
   // Hopsworks File System
   HOPSFS("hopsfs", false, true);
 
+  // list files may bring pressure to storage with centralized meta service 
like HDFS.
+  // when we want to get only part of files under a directory rather than all 
files, use getStatus may be more friendly than listStatus.
+  // here is a trade-off between rpc times and throughput of storage meta 
service
+  private static final Set<String> LIST_STATUS_FRIENDLY_SCHEMES = new 
HashSet<>(Arrays.asList(FILE.scheme, S3.scheme, S3A.scheme, GCS.scheme));

Review Comment:
   OK, I see your point.  If this is not going to be widely used, I agree that 
we can use the static variable.



-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@hudi.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org

Reply via email to