Github user JoshRosen commented on a diff in the pull request:

    https://github.com/apache/spark/pull/10835#discussion_r50646732
  
    --- Diff: core/src/main/scala/org/apache/spark/deploy/SparkHadoopUtil.scala ---
    @@ -370,6 +370,14 @@ object SparkHadoopUtil {
     
       val SPARK_YARN_CREDS_COUNTER_DELIM = "-"
     
    +  /**
    +   * Number of records to update input metrics when reading from HadoopRDDs.
    +   *
    +   * Each update is potentially expensive because we need to use reflection to access the
    +   * Hadoop FileSystem API of interest (only available in 2.5), so we should do this sparingly.
    +   */
    +  private[spark] val UPDATE_INPUT_METRICS_INTERVAL_RECORDS = 1000
    --- End diff --
    
    I noticed that `HadoopRDD` has a field named `RECORDS_BETWEEN_BYTES_READ_METRIC_UPDATES` which isn't used anywhere...
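
For context, the doc comment in the diff describes refreshing input metrics only once every fixed number of records because each refresh is costly. A minimal sketch of that pattern follows; it is not the code from this pull request. `PeriodicMetricsSketch`, `MeteredIterator`, and `updateBytesRead` are hypothetical names, and only the interval value of 1000 comes from the diff.

```scala
// Illustrative sketch only: all names here are hypothetical, not Spark APIs.
object PeriodicMetricsSketch {
  // Interval value copied from the diff; the constant's real home is SparkHadoopUtil.
  val UPDATE_INPUT_METRICS_INTERVAL_RECORDS = 1000

  /**
   * Wraps an iterator and calls `updateBytesRead` once every `interval` records,
   * so that an expensive metrics refresh is not paid on every record.
   */
  final class MeteredIterator[T](
      underlying: Iterator[T],
      interval: Int,
      updateBytesRead: () => Unit) extends Iterator[T] {

    private var recordsRead = 0L

    override def hasNext: Boolean = underlying.hasNext

    override def next(): T = {
      val record = underlying.next()
      recordsRead += 1
      // Refresh only when the record count reaches a multiple of the interval.
      if (recordsRead % interval == 0) {
        updateBytesRead()
      }
      record
    }
  }
}
```

With 2500 records and an interval of 1000, the callback in this sketch fires twice (after records 1000 and 2000). A real reader would presumably also refresh once more when the iterator is exhausted so the trailing records are counted; that step is omitted here.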



