xiarixiaoyao commented on pull request #5137: URL: https://github.com/apache/hudi/pull/5137#issuecomment-1079711532
@xushiyan @YannByron @leesf @alexeykudinkin could you pls help me review this pr, thanks its a serious bug before pacth: 295553 ms after patch: 5279 ms ``` val dfx = spark.range(0, 50000000).toDF("id") .withColumn("c1", lit("dsfsdfsafsasdfa")) .withColumn("c2", lit(12.99d)) .withColumn("c3", lit(1)) val avroSchemax = AvroConversionUtils.convertStructTypeToAvroSchema(dfx.schema, "record", "my") val sparkSchema = dfx.schema spark.sparkContext.getConf.registerAvroSchemas(avroSchemax) val testRDD = HoodieSparkUtils.createRdd(dfx,"record", "my", Some(avroSchemax)) // warm up dfx.count() spark.time(testRDD.foreach(f => f)) ``` -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: commits-unsubscr...@hudi.apache.org For queries about this service, please contact Infrastructure at: us...@infra.apache.org