Command for starting a customer expansion service is available here:
https://beam.apache.org/documentation/sdks/python-multi-language-pipelines/#choose-an-expansion-service

Also you can customize the environment set by this expansion service in the
expanded transforms using the environementType and environmentConfig
PipelineOptions available here:
https://github.com/apache/beam/blob/fdccad20f2af4f4af84b55529acae4b9d0004a01/sdks/java/core/src/main/java/org/apache/beam/sdk/options/PortablePipelineOptions.java#L54

But the end-to-end execution also depends on whether the runner you are
using supports the environment type/config you specify in the expansion
service.

Thanks,
Cham




On Mon, Jul 18, 2022 at 2:54 PM Ahmet Altay <al...@google.com> wrote:

> Adding a few relevant folks who could help answer this question: @John
> Casey <johnjca...@google.com> @Chamikara Jayalath <chamik...@google.com> 
> @Robert
> Bradshaw <rober...@google.com>
>
> Lydian, if you have any other information please share an update.
>
> Ahmet
>
> On Tue, Jun 7, 2022 at 12:49 AM Lydian <lydia...@gmail.com> wrote:
>
>> Hi Folks,
>>
>> I am trying to set up the Beam environment to run our Python pipeline
>> which reads data from Kafka.  According to some previous thread
>> <https://lists.apache.org/thread/kz47y88t6zr9k4z043mx3wnb9mz5dqpq>, it
>> seems like the Java SDK harness doesn't work with PROCESS environment_type,
>> and thus I can only use either Docker or External.  Given that I need to
>> deploy the job to K8s, and there are some security concerns which prevent
>> me from using the DinD approach. It seems like my best solution is to start
>> a sidecar container which starts the Java extension service in Flink Task
>> manager.  However, I am not sure what's the command to start the java
>> extension service which supports the approach.
>>
>> It looks like in the Docker environment, it is trying to call
>> ```
>> /opt/apache/beam/boot --id=1-2 --provision_endpoint=localhost:33025
>> ```
>> But the script requires me to provide `id` and `provision_endpoint`,
>> which I am not sure what I should do if I want to set up an external
>> service. Wondering if someone can help me with this?
>>
>> for context, I am using beam 2.38.0 with flink 1.13, the job is deployed
>> to k8s using lyft/flinkk8soperator
>> <https://github.com/lyft/flinkk8soperator>
>>
>> Thanks!
>>
>

Reply via email to