Github user fhueske commented on a diff in the pull request:

    https://github.com/apache/flink/pull/3330#discussion_r101775264
  
    --- Diff: 
flink-libraries/flink-table/src/main/scala/org/apache/flink/table/codegen/CodeGenerator.scala
 ---
    @@ -1463,21 +1465,23 @@ class CodeGenerator(
         */
       def addReusableFunction(function: UserDefinedFunction): String = {
         val classQualifier = function.getClass.getCanonicalName
    -    val fieldTerm = s"function_${classQualifier.replace('.', '$')}"
    +    val functionSerializedData = serialize(function)
    +    val fieldTerm =
    +      s"""
    +         |function_${classQualifier.replace('.', 
'$')}_${DigestUtils.md5Hex(functionSerializedData)}
    --- End diff --
    
    I think encoding the serialized object as String is a good idea. I like the 
implementation. 
    
    Regarding the name collisions. I did a test with two identical scalar 
functions and everything worked well because `reusableMemberStatements` is a 
hash set that deduplicates the terms.
    
    Btw. what else do we need to do for scalar UDFs? They are injected with the 
same method and hence should be serialized as well. Isn't the only missing 
thing to add more tests? If yes, we could do that in this PR. 
    
    What do you think @sunjincheng121 and @wuchong?


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

Reply via email to