yihua commented on code in PR #13498:
URL: https://github.com/apache/hudi/pull/13498#discussion_r2183801889


##########
hudi-client/hudi-spark-client/src/main/scala/org/apache/hudi/BaseSparkInternalRowReaderContext.java:
##########
@@ -110,6 +112,14 @@ public HoodieRecord<InternalRow> constructHoodieRecord(BufferedRecord<InternalRo
     return new HoodieSparkRecord(hoodieKey, row, HoodieInternalRowUtils.getCachedSchema(schema), false);
   }
 
+  @Override
+  public InternalRow constructEngineRecord(Schema schema, List<Object> values) {
+    if (schema.getFields().size() != values.size()) {
+      throw new IllegalArgumentException("Schema field count and values size must match.");
+    }
+    return new GenericInternalRow(values.toArray());

Review Comment:
   Let's microbenchmark this part of the code and then decide.  For memory 
efficiency, the binary-represented row objects are stored in the spillable 
map, so some other place might already convert the engine record to the 
binary format; check the existing code on this.
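
A rough sketch of what such a microbenchmark could look like, in plain Java with no Spark or Hudi dependency. Everything here is an assumption for illustration: `RowConstructionBench`, `toGenericRow`, and `toBinaryRow` are hypothetical names, the `Object[]` copy only stands in for wrapping values in a generic row, and the fixed-width `ByteBuffer` encoding only stands in for a binary row format. It is not how Spark or Hudi actually encodes rows. A real measurement would call the actual row classes and would preferably run under a harness such as JMH.

```java
import java.nio.ByteBuffer;
import java.util.ArrayList;
import java.util.List;

public class RowConstructionBench {

    // Stand-in for the generic, object-backed row: just copies the values into an array.
    static Object[] toGenericRow(List<Object> values) {
        return values.toArray();
    }

    // Stand-in for a binary row: encodes each value as a fixed-width 8-byte long.
    // Assumes every value is a Long; a real binary format handles many types.
    static byte[] toBinaryRow(List<Object> values) {
        ByteBuffer buf = ByteBuffer.allocate(values.size() * Long.BYTES);
        for (Object v : values) {
            buf.putLong((Long) v);
        }
        return buf.array();
    }

    // Runs the body the given number of times and returns the elapsed nanoseconds.
    static long timeNanos(Runnable body, int iterations) {
        long start = System.nanoTime();
        for (int i = 0; i < iterations; i++) {
            body.run();
        }
        return System.nanoTime() - start;
    }

    public static void main(String[] args) {
        List<Object> values = new ArrayList<>();
        for (long i = 0; i < 20; i++) {
            values.add(i);
        }

        int warmup = 200_000;
        int measured = 1_000_000;
        long[] sink = new long[1]; // accumulate results so the JIT cannot discard the work

        Runnable generic = () -> sink[0] += toGenericRow(values).length;
        Runnable binary = () -> sink[0] += toBinaryRow(values).length;

        // Warm up both paths before timing so the JIT has compiled them.
        timeNanos(generic, warmup);
        timeNanos(binary, warmup);

        long genericNanos = timeNanos(generic, measured);
        long binaryNanos = timeNanos(binary, measured);

        System.out.printf("generic row: %.1f ns/op%n", (double) genericNanos / measured);
        System.out.printf("binary row:  %.1f ns/op%n", (double) binaryNanos / measured);
        System.out.println("sink = " + sink[0]);
    }
}
```

The point of the comparison is the one raised above: if the record ends up in binary form anyway before it is stored, the cost of building the intermediate generic row should be weighed against converting directly.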



-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
