Hi,

I'm using Flink 1.17.1 streaming API, on YARN.

My app first stuck at process func serialization. I know Avro Schema
is not serializable so I removed all references from my process
functions. Now it passes first round, but stuck again at the following
error:

org.apache.flink.client.program.ProgramInvocationException: The main
method caused an error: Error in serialization.
        at 
org.apache.flink.client.program.PackagedProgram.callMainMethod(PackagedProgram.java:372)
        at 
org.apache.flink.client.program.PackagedProgram.invokeInteractiveModeForExecution(PackagedProgram.java:222)
        at 
org.apache.flink.client.ClientUtils.executeProgram(ClientUtils.java:105)
        at 
org.apache.flink.client.cli.CliFrontend.executeProgram(CliFrontend.java:851)
        at org.apache.flink.client.cli.CliFrontend.run(CliFrontend.java:245)
        at 
org.apache.flink.client.cli.CliFrontend.parseAndRun(CliFrontend.java:1095)
        at 
org.apache.flink.client.cli.CliFrontend.lambda$mainInternal$9(CliFrontend.java:1189)
        at java.base/java.security.AccessController.doPrivileged(Native Method)
        at java.base/javax.security.auth.Subject.doAs(Subject.java:423)
        at 
org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1899)
        at 
org.apache.flink.runtime.security.contexts.HadoopSecurityContext.runSecured(HadoopSecurityContext.java:41)
        at 
org.apache.flink.client.cli.CliFrontend.mainInternal(CliFrontend.java:1189)
        at org.apache.flink.client.cli.CliFrontend.main(CliFrontend.java:1157)
Caused by: org.apache.flink.util.FlinkRuntimeException: Error in serialization.
        at 
org.apache.flink.streaming.api.graph.StreamingJobGraphGenerator.createJobGraph(StreamingJobGraphGenerator.java:326)
        at 
org.apache.flink.streaming.api.graph.StreamingJobGraphGenerator.createJobGraph(StreamingJobGraphGenerator.java:160)
        at 
org.apache.flink.streaming.api.graph.StreamGraph.getJobGraph(StreamGraph.java:1029)
        at 
org.apache.flink.client.StreamGraphTranslator.translateToJobGraph(StreamGraphTranslator.java:56)
        at 
org.apache.flink.client.FlinkPipelineTranslationUtil.getJobGraph(FlinkPipelineTranslationUtil.java:43)
        at 
org.apache.flink.client.deployment.executors.PipelineExecutorUtils.getJobGraph(PipelineExecutorUtils.java:61)
        at 
org.apache.flink.client.deployment.executors.AbstractSessionClusterExecutor.execute(AbstractSessionClusterExecutor.java:72)
        at 
org.apache.flink.streaming.api.environment.StreamExecutionEnvironment.executeAsync(StreamExecutionEnvironment.java:2197)
        at 
org.apache.flink.client.program.StreamContextEnvironment.executeAsync(StreamContextEnvironment.java:189)
        at 
org.apache.flink.client.program.StreamContextEnvironment.execute(StreamContextEnvironment.java:118)
        at 
org.apache.flink.streaming.api.environment.StreamExecutionEnvironment.execute(StreamExecutionEnvironment.java:2058)
        at org.example.FlinkCDCMulti.startFlinkJob(FlinkCDCMulti.java:164)
        at org.example.FlinkCDCMulti.main(FlinkCDCMulti.java:104)
        at 
java.base/jdk.internal.reflect.NativeMethodAccessorImpl.invoke0(Native
Method)
        at 
java.base/jdk.internal.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)
        at 
java.base/jdk.internal.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
        at java.base/java.lang.reflect.Method.invoke(Method.java:566)
        at 
org.apache.flink.client.program.PackagedProgram.callMainMethod(PackagedProgram.java:355)
        ... 12 more
Caused by: java.util.concurrent.ExecutionException:
org.apache.flink.streaming.runtime.tasks.StreamTaskException: Could
not serialize object for key serializedUDF.
        at 
java.base/java.util.concurrent.CompletableFuture.reportGet(CompletableFuture.java:395)
        at 
java.base/java.util.concurrent.CompletableFuture.get(CompletableFuture.java:2005)
        at 
org.apache.flink.streaming.api.graph.StreamingJobGraphGenerator.createJobGraph(StreamingJobGraphGenerator.java:322)
        ... 29 more
Caused by: org.apache.flink.streaming.runtime.tasks.StreamTaskException:
Could not serialize object for key serializedUDF.
        at 
org.apache.flink.streaming.api.graph.StreamConfig.lambda$serializeAllConfigs$1(StreamConfig.java:198)
        at java.base/java.util.HashMap.forEach(HashMap.java:1337)
        at 
org.apache.flink.streaming.api.graph.StreamConfig.serializeAllConfigs(StreamConfig.java:192)
        at 
org.apache.flink.streaming.api.graph.StreamConfig.lambda$triggerSerializationAndReturnFuture$0(StreamConfig.java:169)
        at 
java.base/java.util.concurrent.CompletableFuture$UniAccept.tryFire(CompletableFuture.java:714)
        at 
java.base/java.util.concurrent.CompletableFuture$Completion.run(CompletableFuture.java:478)
        at 
java.base/java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1128)
        at 
java.base/java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:628)
        at java.base/java.lang.Thread.run(Thread.java:829)
Caused by: java.io.NotSerializableException: org.apache.avro.Schema$RecordSchema
        at 
java.base/java.io.ObjectOutputStream.writeObject0(ObjectOutputStream.java:1175)
        at 
java.base/java.io.ObjectOutputStream.writeArray(ObjectOutputStream.java:1369)
        at 
java.base/java.io.ObjectOutputStream.writeObject0(ObjectOutputStream.java:1165)
        at 
java.base/java.io.ObjectOutputStream.defaultWriteFields(ObjectOutputStream.java:1543)
        at 
java.base/java.io.ObjectOutputStream.writeSerialData(ObjectOutputStream.java:1500)
        at 
java.base/java.io.ObjectOutputStream.writeOrdinaryObject(ObjectOutputStream.java:1423)
        at 
java.base/java.io.ObjectOutputStream.writeObject0(ObjectOutputStream.java:1169)
        at 
java.base/java.io.ObjectOutputStream.defaultWriteFields(ObjectOutputStream.java:1543)
        at 
java.base/java.io.ObjectOutputStream.writeSerialData(ObjectOutputStream.java:1500)
        at 
java.base/java.io.ObjectOutputStream.writeOrdinaryObject(ObjectOutputStream.java:1423)
        at 
java.base/java.io.ObjectOutputStream.writeObject0(ObjectOutputStream.java:1169)
        at 
java.base/java.io.ObjectOutputStream.defaultWriteFields(ObjectOutputStream.java:1543)
        at 
java.base/java.io.ObjectOutputStream.writeSerialData(ObjectOutputStream.java:1500)
        at 
java.base/java.io.ObjectOutputStream.writeOrdinaryObject(ObjectOutputStream.java:1423)
        at 
java.base/java.io.ObjectOutputStream.writeObject0(ObjectOutputStream.java:1169)
        at 
java.base/java.io.ObjectOutputStream.defaultWriteFields(ObjectOutputStream.java:1543)
        at 
java.base/java.io.ObjectOutputStream.writeSerialData(ObjectOutputStream.java:1500)
        at 
java.base/java.io.ObjectOutputStream.writeOrdinaryObject(ObjectOutputStream.java:1423)
        at 
java.base/java.io.ObjectOutputStream.writeObject0(ObjectOutputStream.java:1169)
        at 
java.base/java.io.ObjectOutputStream.defaultWriteFields(ObjectOutputStream.java:1543)
        at 
java.base/java.io.ObjectOutputStream.writeSerialData(ObjectOutputStream.java:1500)
        at 
java.base/java.io.ObjectOutputStream.writeOrdinaryObject(ObjectOutputStream.java:1423)
        at 
java.base/java.io.ObjectOutputStream.writeObject0(ObjectOutputStream.java:1169)
        at 
java.base/java.io.ObjectOutputStream.writeObject(ObjectOutputStream.java:345)
        at 
org.apache.flink.util.InstantiationUtil.serializeObject(InstantiationUtil.java:554)
        at 
org.apache.flink.util.InstantiationUtil.writeObjectToConfig(InstantiationUtil.java:486)
        at 
org.apache.flink.streaming.api.graph.StreamConfig.lambda$serializeAllConfigs$1(StreamConfig.java:195)
        ... 8 more

The strange thing is, this only happens in session mode. There is no
problem in application mode, even when I have Avro Schema references!
I have many tables to sync but I'm using an older version of MongoDB,
so to save resources I had to use session mode.

In session mode this is not even debuggable because I do not know
which class or where it had reference to the org.apache.avro.Schema
class. I do have some local variables of Avro schema type but since
they are local I think they are fine. I was not able to locate exactly
which UDF or configuration was causing this issue.

Please advise. Thanks!

-Dai

Reply via email to