I would recommend reaching out to the Beam project instead, because this seems like an issue that is entirely on their side.

As for the listed workaround, they suggest modifying the wordcount example so that it also bundles the listed Jackson dependencies.
Maybe you could use a different example instead?
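
If you want to narrow it down yourself first: a "Provider ... not a subtype" ServiceConfigurationError usually means two copies of a library were loaded by different classloaders. I have not confirmed that this is what is happening here. Below is a minimal sketch, under that assumption, that lists which jars in a directory register a Jackson Module through a service file. The function name and the directory you pass in are placeholders of mine, not something from Beam or Dataproc.

```python
import sys
import zipfile
from pathlib import Path

# Standard ServiceLoader registration file for Jackson modules.
SERVICE_FILE = "META-INF/services/com.fasterxml.jackson.databind.Module"


def find_module_providers(lib_dir):
    """Map each jar under lib_dir to the Jackson Module providers it registers.

    lib_dir is whatever directory you want to inspect (hypothetical; adjust
    it to wherever the jars on your cluster actually live).
    """
    found = {}
    for jar in sorted(Path(lib_dir).rglob("*.jar")):
        try:
            with zipfile.ZipFile(jar) as zf:
                if SERVICE_FILE not in zf.namelist():
                    continue
                text = zf.read(SERVICE_FILE).decode("utf-8")
        except zipfile.BadZipFile:
            # Skip files that are named *.jar but are not valid archives.
            continue
        # Service files allow '#' comments and blank lines; drop both.
        providers = [line.split("#", 1)[0].strip() for line in text.splitlines()]
        providers = [p for p in providers if p]
        if providers:
            found[str(jar)] = providers
    return found


if __name__ == "__main__" and len(sys.argv) > 1:
    for jar, providers in find_module_providers(sys.argv[1]).items():
        print(jar, providers)
```

If JaxbAnnotationModule shows up in more than one jar, that would be worth mentioning when you contact the Beam project.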

On 08/07/2021 18:10, Joey Tran wrote:
Hi!

I'm just trying to get my bearings with Dataproc and Flink/Beam. I get an error when trying to run the wordcount example <https://cloud.google.com/dataproc/docs/concepts/components/flink#portable_beam_jobs> with a long-running YARN session <https://cloud.google.com/dataproc/docs/concepts/components/flink>. The error looks like:

    Traceback (most recent call last):
      File "wordcount.py", line 99, in <module>
        run()
      File "wordcount.py", line 94, in run
        output | 'Write' >> WriteToText(known_args.output)
      File "/home/jtran/.local/lib/python3.8/site-packages/apache_beam/pipeline.py", line 586, in __exit__
        self.result.wait_until_finish()
      File "/home/jtran/.local/lib/python3.8/site-packages/apache_beam/runners/portability/portable_runner.py", line 599, in wait_until_finish
        raise self._runtime_exception
    RuntimeError: Pipeline BeamApp-jtran-0708141941-5fcff870_effe0238-2afb-4707-a981-133bc46618fd failed in state FAILED: java.util.ServiceConfigurationError: com.fasterxml.jackson.databind.Module: Provider com.fasterxml.jackson.module.jaxb.JaxbAnnotationModule not a subtype

Versions: Flink 1.12, Dataproc image 2.0, Beam 2.30.


I found some mention of this error when running the example on EMR, but nothing for Dataproc (https://issues.apache.org/jira/browse/BEAM-10430). The workaround is unclear to me. Am I doing something wrong in setting this up? Any advice would be greatly appreciated. Thanks in advance!

