So, can I open an issue and raise a PR for the updated TensorRT handler?

On Sat, 20 Sept 2025 at 08:46, XQ Hu <[email protected]> wrote:

> This is what Beam tests use
> https://github.com/apache/beam/blob/master/sdks/python/test-suites/containers/tensorrt_runinference/tensor_rt.dockerfile#L17
>
> nvcr.io/nvidia/tensorrt:23.05-py3
>
> From the latest doc:
> https://docs.nvidia.com/deeplearning/tensorrt/latest/_static/python-api/infer/Core/Engine.html#icudaengine
> and https://github.com/NVIDIA/TensorRT/issues/4216, num_io_tensors should
> be used now.
>
> You can either use the TensorRT version Beam supports now, or you can
> define your own TensorRTEngine model handler against the new TensorRT API.
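>
> Roughly, the binding loop in the handler moves to the TensorRT 10.x tensor
> API. A small illustrative sketch of that change (not the exact code in
> Beam's tensorrt_inference.py):
>
> import tensorrt as trt
>
> def describe_io_tensors(engine):
>     """List engine I/O tensors with the TensorRT 10.x API."""
>     inputs, outputs = [], []
>     for i in range(engine.num_io_tensors):   # was: range(engine.num_bindings)
>         name = engine.get_tensor_name(i)     # was: engine.get_binding_name(i)
>         mode = engine.get_tensor_mode(name)  # TensorIOMode.INPUT or OUTPUT
>         shape = tuple(engine.get_tensor_shape(name))
>         dtype = engine.get_tensor_dtype(name)
>         if mode == trt.TensorIOMode.INPUT:
>             inputs.append((name, shape, dtype))
>         else:
>             outputs.append((name, shape, dtype))
>     return inputs, outputs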
>
>
> On Sat, Sep 20, 2025 at 1:10 AM Sai Shashank <[email protected]>
> wrote:
>
>> So finally, I was able to resolve the Docker image issue, but now I see
>> this error:
>>
>> Traceback (most recent call last):
>>   File "apache_beam/runners/common.py", line 1562, in
>> apache_beam.runners.common.DoFnRunner._invoke_lifecycle_method
>>   File "apache_beam/runners/common.py", line 602, in
>> apache_beam.runners.common.DoFnInvoker.invoke_setup
>>   File
>> "/usr/local/lib/python3.12/dist-packages/apache_beam/ml/inference/base.py",
>> line 1882, in setup
>>     self._model = self._load_model()
>>   File
>> "/usr/local/lib/python3.12/dist-packages/apache_beam/ml/inference/base.py",
>> line 1848, in _load_model
>>     model = self._shared_model_handle.acquire(load, tag=self._cur_tag)
>>   File
>> "/usr/local/lib/python3.12/dist-packages/apache_beam/ml/inference/tensorrt_inference.py",
>> line 132, in __init__
>>     for i in range(self.engine.num_bindings):
>> AttributeError: 'tensorrt.tensorrt.ICudaEngine' object has no attribute
>> 'num_bindings'
>>
>> I have seen this error where the TensorRT version is older than 10.x.
>> Is there an updated TensorRT handler, or am I doing something wrong?
>>
>> On Fri, 19 Sept 2025 at 09:15, XQ Hu <[email protected]> wrote:
>>
>>> GPU driver: DRIVER_VERSION=535.261.03
>>> From the log, the driver was installed correctly (make sure this version
>>> can be used with your TensorRT build).
>>>
>>> "Error syncing pod, skipping" err="failed to \"StartContainer\" for
>>> \"sdk-0-0\" with ErrImagePull: \"failed to pull and unpack image \\\"
>>> us-east4-docker.pkg.dev/anbc-dev-suspecting/suspecting-docker/tensorrt_ss:latest\\\":
>>> failed to extract layer
>>> sha256:a848022a4558c435b349317630b139960b44ae09f218ab7f93f764ba4661607d:
>>> write
>>> /var/lib/containerd/io.containerd.snapshotter.v1.gcfs/snapshotter/snapshots/55/fs/usr/local/cuda-13.0/targets/x86_64-linux/lib/libcusparseLt.so.0.8.0.4:
>>> no space left on device: unknown\""
>>> pod="default/df-inference-pipeline-4-09182012-1a3n-harness-rvxc"
>>> podUID="3a9b74645db23e77932d981439f1d3cc"
>>>
>>> The Dataflow worker cannot unpack the image: no space left on device.
>>>
>>> Try
>>> https://cloud.google.com/dataflow/docs/guides/configure-worker-vm#disk-size
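>>>
>>> For example, a rough sketch of raising the worker boot disk through
>>> pipeline options (the 100 GB value and the project/bucket names are just
>>> placeholders):
>>>
>>> from apache_beam.options.pipeline_options import PipelineOptions
>>>
>>> options = PipelineOptions(
>>>     runner="DataflowRunner",
>>>     project="your-project",                 # placeholder
>>>     region="us-east4",
>>>     temp_location="gs://your-bucket/tmp",   # placeholder
>>>     disk_size_gb=100,                       # larger boot disk so the image can be unpacked
>>>     sdk_container_image="us-east4-docker.pkg.dev/anbc-dev-suspecting/suspecting-docker/tensorrt_ss:latest",
>>> )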
>>>
>>>
>>>
>>>
>>> On Thu, Sep 18, 2025 at 11:27 PM Sai Shashank <[email protected]>
>>> wrote:
>>>
>>>> So, I was able to start the Dataflow job
>>>> (2025-09-18_20_12_41-10973298801093076892), but now it is hitting this
>>>> error: 2025-09-18 23:21:25.401 EDT
>>>> SDK harnesses are not healthy after 5 minutes, status: Waiting for 4 of
>>>> 4 SDK Harnesses to register. I have seen this error when there is an
>>>> environment mismatch. As you advised, I tried running the DirectRunner
>>>> inside the Docker image, and it ran perfectly. Do you have any tips to
>>>> correct this error?
>>>>
>>>>
>>>> On Wed, 17 Sept 2025 at 09:42, XQ Hu <[email protected]> wrote:
>>>>
>>>>> From the worker log,
>>>>>
>>>>> "Failed to read pods from URL" err="invalid pod:
>>>>> [spec.containers[3].image: Invalid value: \"
>>>>> us-east4-docker.pkg.dev/anbc-dev-suspecting/suspecting-docker/tensorrt_ss:latest\":
>>>>> must not have leading or trailing whitespace]"
>>>>>
>>>>> 2025-09-16_17_52_06-10817935125972705087
>>>>>
>>>>> Looks like you specified the image URL with a leading whitespace.
>>>>> Remove it and give it a try.
>>>>>
>>>>> And if you have any further questions about GPUs, I highly recommend
>>>>> you start a VM with an L4 GPU, pull your image, SSH into it, and run
>>>>> your pipeline locally with DirectRunner. That will confirm all your
>>>>> code works.
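>>>>>
>>>>> For instance, on that VM you could run the same pipeline code and only
>>>>> swap the runner, roughly like this (assuming the DoFns and
>>>>> model_handler from the script you shared):
>>>>>
>>>>> import apache_beam as beam
>>>>> from apache_beam.ml.inference.base import RunInference
>>>>> from apache_beam.options.pipeline_options import PipelineOptions
>>>>>
>>>>> local_options = PipelineOptions(runner="DirectRunner")
>>>>> with beam.Pipeline(options=local_options) as p:
>>>>>     _ = (
>>>>>         p
>>>>>         | beam.Create(["gs://your-bucket/sample.pdf"])  # placeholder input
>>>>>         | beam.ParDo(DownloadPDFFromGCS())
>>>>>         | beam.ParDo(LoadPDFPages())
>>>>>         | beam.ParDo(PreprocessImage())
>>>>>         | RunInference(model_handler=model_handler)
>>>>>     )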
>>>>>
>>>>>
>>>>> On Wed, Sep 17, 2025 at 9:25 AM XQ Hu <[email protected]> wrote:
>>>>>
>>>>>> I saw it. Let me follow up internally.
>>>>>>
>>>>>> On Tue, Sep 16, 2025 at 10:10 PM Sai Shashank <
>>>>>> [email protected]> wrote:
>>>>>>
>>>>>>> I have already opened one, but I am opening one more: the case
>>>>>>> number is 63121285.
>>>>>>>
>>>>>>> On Tue, Sep 16, 2025 at 10:05 PM XQ Hu <[email protected]> wrote:
>>>>>>>
>>>>>>>> Can you open a Cloud support ticket? That would give us permission
>>>>>>>> to access your job.
>>>>>>>>
>>>>>>>> On Tue, Sep 16, 2025, 9:57 PM Sai Shashank <
>>>>>>>> [email protected]> wrote:
>>>>>>>>
>>>>>>>>> Hey, can we connect on my office mail? I could share more details
>>>>>>>>> there, like pipeline options, and since I work at CVS that would
>>>>>>>>> keep it under compliance too.
>>>>>>>>>
>>>>>>>>> On Tue, 16 Sept 2025 at 21:54, Sai Shashank <
>>>>>>>>> [email protected]> wrote:
>>>>>>>>>
>>>>>>>>>> The code works without the custom TensorRT image. With the custom
>>>>>>>>>> image, I only get this:
>>>>>>>>>>
>>>>>>>>>> ject to benefit from faster worker startup and autoscaling. If you 
>>>>>>>>>> experience container-startup related issues, pass the 
>>>>>>>>>> "disable_image_streaming" experiment to disable image streaming for 
>>>>>>>>>> the job.
>>>>>>>>>> INFO:apache_beam.runners.dataflow.dataflow_runner:2025-09-17T00:52:10.327Z:
>>>>>>>>>>  JOB_MESSAGE_BASIC: Worker configuration: g2-standard-4 in 
>>>>>>>>>> us-east4-c.
>>>>>>>>>> INFO:apache_beam.runners.dataflow.dataflow_runner:2025-09-17T00:52:11.480Z:
>>>>>>>>>>  JOB_MESSAGE_BASIC: Executing operation [10]: Create 
>>>>>>>>>> URIs/Impulse+[10]: Create URIs/FlatMap(<lambda at 
>>>>>>>>>> core.py:3994>)+[10]: Create URIs/Map(decode)+[10]: Download 
>>>>>>>>>> PDFs+[10]: Load PDF Pages+[10]: Preprocess Images+[10]: Run 
>>>>>>>>>> Inference/BatchElements/ParDo(_GlobalWindowsBatchingDoFn)+[10]: Run 
>>>>>>>>>> Inference/BeamML_RunInference
>>>>>>>>>> INFO:apache_beam.runners.dataflow.dataflow_runner:2025-09-17T00:52:11.543Z:
>>>>>>>>>>  JOB_MESSAGE_BASIC: Starting 1 workers in us-east4...
>>>>>>>>>> INFO:apache_beam.runners.dataflow.dataflow_runner:Job 
>>>>>>>>>> 2025-09-16_17_52_06-10817935125972705087 is in state 
>>>>>>>>>> JOB_STATE_RUNNING
>>>>>>>>>> WARNING:google_auth_httplib2:httplib2 transport does not support 
>>>>>>>>>> per-request timeout. Set the timeout when constructing the 
>>>>>>>>>> httplib2.Http instance.
>>>>>>>>>> WARNING:google_auth_httplib2:httplib2 transport does not support 
>>>>>>>>>> per-request timeout. Set the timeout when constructing the 
>>>>>>>>>> httplib2.Http instance.
>>>>>>>>>> WARNING:google_auth_httplib2:httplib2 transport does not support 
>>>>>>>>>> per-request timeout. Set the timeout when constructing the 
>>>>>>>>>> httplib2.Http instance.
>>>>>>>>>> WARNING:google_auth_httplib2:httplib2 transport does not support 
>>>>>>>>>> per-request timeout. Set the timeout when constructing the 
>>>>>>>>>> httplib2.Http instance.
>>>>>>>>>>
>>>>>>>>>> WARNING:google_auth_httplib2:httplib2 transport does not support
>>>>>>>>>> per-request timeout. Set the timeout when constructing the 
>>>>>>>>>> httplib2.Http
>>>>>>>>>> instance.
>>>>>>>>>>
>>>>>>>>>> These messages just keep recurring, and after an hour this pops
>>>>>>>>>> up: Workflow
>>>>>>>>>> failed. Causes: The Dataflow job appears to be stuck because no 
>>>>>>>>>> worker
>>>>>>>>>> activity has been seen in the last 1h. For more information, see
>>>>>>>>>> https://cloud.google.com/dataflow/docs/guides/common-errors#error-syncing-pod.
>>>>>>>>>> You can also get help with Cloud Dataflow at
>>>>>>>>>> https://cloud.google.com/dataflow/support.
>>>>>>>>>>
>>>>>>>>>>
>>>>>>>>>> On Tue, 16 Sept 2025 at 21:35, XQ Hu <[email protected]> wrote:
>>>>>>>>>>
>>>>>>>>>>> Does the code work without using TensorRT? Any logs?
>>>>>>>>>>>
>>>>>>>>>>> On Tue, Sep 16, 2025 at 9:28 PM Sai Shashank <
>>>>>>>>>>> [email protected]> wrote:
>>>>>>>>>>>
>>>>>>>>>>>> #!/usr/bin/env python3
>>>>>>>>>>>> """
>>>>>>>>>>>> Apache Beam pipeline for processing PDFs with Triton server and
>>>>>>>>>>>> saving results to BigQuery.
>>>>>>>>>>>> This pipeline combines functionality from
>>>>>>>>>>>> test_triton_document.py, create_bigquery_tables.py,
>>>>>>>>>>>> and save_to_bigquery.py into a single workflow.
>>>>>>>>>>>> """
>>>>>>>>>>>>
>>>>>>>>>>>> import os
>>>>>>>>>>>> import sys
>>>>>>>>>>>> import json
>>>>>>>>>>>> import uuid
>>>>>>>>>>>> import argparse
>>>>>>>>>>>> import logging
>>>>>>>>>>>> import tempfile
>>>>>>>>>>>> import datetime
>>>>>>>>>>>> import requests
>>>>>>>>>>>> import numpy as np
>>>>>>>>>>>> import cv2
>>>>>>>>>>>> from PIL import Image
>>>>>>>>>>>> import fitz  # PyMuPDF
>>>>>>>>>>>> from pathlib import Path
>>>>>>>>>>>> from typing import Dict, List, Tuple, Any, Optional, Iterator
>>>>>>>>>>>>
>>>>>>>>>>>> # Apache Beam imports
>>>>>>>>>>>> import apache_beam as beam
>>>>>>>>>>>> from apache_beam.options.pipeline_options import PipelineOptions, SetupOptions
>>>>>>>>>>>> from apache_beam.ml.inference.base import RemoteModelHandler, PredictionResult, RunInference
>>>>>>>>>>>> from apache_beam.ml.inference.tensorrt_inference import TensorRTEngineHandlerNumPy
>>>>>>>>>>>> from apache_beam.ml.inference.utils import _convert_to_result
>>>>>>>>>>>> from apache_beam.io.gcp.bigquery import WriteToBigQuery
>>>>>>>>>>>> from apache_beam.io.filesystems import FileSystems
>>>>>>>>>>>> from apache_beam.io.gcp.gcsio import GcsIO
>>>>>>>>>>>>
>>>>>>>>>>>> # Google Cloud imports
>>>>>>>>>>>> from google.cloud import storage
>>>>>>>>>>>> from google.cloud import bigquery
>>>>>>>>>>>>
>>>>>>>>>>>> # Set up logging
>>>>>>>>>>>> logging.basicConfig(level=logging.INFO)
>>>>>>>>>>>> logger = logging.getLogger(__name__)
>>>>>>>>>>>>
>>>>>>>>>>>> # DocLayNet classes
>>>>>>>>>>>> CLASS_ID_TO_NAME = {
>>>>>>>>>>>>     0: 'Caption',
>>>>>>>>>>>>     1: 'Footnote',
>>>>>>>>>>>>     2: 'Formula',
>>>>>>>>>>>>     3: 'List-item',
>>>>>>>>>>>>     4: 'Page-footer',
>>>>>>>>>>>>     5: 'Page-header',
>>>>>>>>>>>>     6: 'Picture',
>>>>>>>>>>>>     7: 'Section-header',
>>>>>>>>>>>>     8: 'Table',
>>>>>>>>>>>>     9: 'Text',
>>>>>>>>>>>>     10: 'Title'
>>>>>>>>>>>> }
>>>>>>>>>>>> class DownloadPDFFromGCS(beam.DoFn):
>>>>>>>>>>>>     """Download a PDF from Google Cloud Storage."""
>>>>>>>>>>>>
>>>>>>>>>>>>     def __init__(self, temp_dir=None):
>>>>>>>>>>>>         self.temp_dir = temp_dir or tempfile.gettempdir()
>>>>>>>>>>>>
>>>>>>>>>>>>     def process(self, gcs_uri):
>>>>>>>>>>>>         try:
>>>>>>>>>>>>             # Parse GCS URI
>>>>>>>>>>>>             if not gcs_uri.startswith("gs://"):
>>>>>>>>>>>>                 raise ValueError(f"Invalid GCS URI: {gcs_uri}")
>>>>>>>>>>>>
>>>>>>>>>>>>             # Remove gs:// prefix and split into bucket and
>>>>>>>>>>>> blob path
>>>>>>>>>>>>             path_parts = gcs_uri[5:].split("/", 1)
>>>>>>>>>>>>             bucket_name = path_parts[0]
>>>>>>>>>>>>             blob_path = path_parts[1]
>>>>>>>>>>>>
>>>>>>>>>>>>             # Get filename from blob path
>>>>>>>>>>>>             filename = os.path.basename(blob_path)
>>>>>>>>>>>>             local_path = os.path.join(self.temp_dir, filename)
>>>>>>>>>>>>
>>>>>>>>>>>>             # Create temp directory if it doesn't exist
>>>>>>>>>>>>             os.makedirs(self.temp_dir, exist_ok=True)
>>>>>>>>>>>>
>>>>>>>>>>>>             try:
>>>>>>>>>>>>                 # Download using Beam's GcsIO
>>>>>>>>>>>>                 with FileSystems.open(gcs_uri, 'rb') as
>>>>>>>>>>>> gcs_file:
>>>>>>>>>>>>                     with open(local_path, 'wb') as local_file:
>>>>>>>>>>>>                         local_file.write(gcs_file.read())
>>>>>>>>>>>>
>>>>>>>>>>>>                 logger.info(f"Downloaded {gcs_uri} to
>>>>>>>>>>>> {local_path}")
>>>>>>>>>>>>
>>>>>>>>>>>>                 # Return a dictionary with the local path and
>>>>>>>>>>>> original URI
>>>>>>>>>>>>                 yield {
>>>>>>>>>>>>                     'local_path': local_path,
>>>>>>>>>>>>                     'gcs_uri': gcs_uri,
>>>>>>>>>>>>                     'filename': filename
>>>>>>>>>>>>                 }
>>>>>>>>>>>>             except Exception as e:
>>>>>>>>>>>>                 logger.error(f"Error reading from GCS:
>>>>>>>>>>>> {str(e)}")
>>>>>>>>>>>>                 # Try alternative download method
>>>>>>>>>>>>                 logger.info(f"Trying alternative download
>>>>>>>>>>>> method for {gcs_uri}")
>>>>>>>>>>>>
>>>>>>>>>>>>                 # For testing with local files
>>>>>>>>>>>>                 if os.path.exists(gcs_uri.replace("gs://", "")):
>>>>>>>>>>>>                     local_path = gcs_uri.replace("gs://", "")
>>>>>>>>>>>>                     logger.info(f"Using local file:
>>>>>>>>>>>> {local_path}")
>>>>>>>>>>>>                     yield {
>>>>>>>>>>>>                         'local_path': local_path,
>>>>>>>>>>>>                         'gcs_uri': gcs_uri,
>>>>>>>>>>>>                         'filename': os.path.basename(local_path)
>>>>>>>>>>>>                     }
>>>>>>>>>>>>                 else:
>>>>>>>>>>>>                     # Try using gsutil command
>>>>>>>>>>>>                     import subprocess
>>>>>>>>>>>>                     try:
>>>>>>>>>>>>                         subprocess.run(["gsutil", "cp",
>>>>>>>>>>>> gcs_uri, local_path], check=True)
>>>>>>>>>>>>                         logger.info(f"Downloaded {gcs_uri} to
>>>>>>>>>>>> {local_path} using gsutil")
>>>>>>>>>>>>                         yield {
>>>>>>>>>>>>                             'local_path': local_path,
>>>>>>>>>>>>                             'gcs_uri': gcs_uri,
>>>>>>>>>>>>                             'filename': filename
>>>>>>>>>>>>                         }
>>>>>>>>>>>>                     except Exception as e2:
>>>>>>>>>>>>                         logger.error(f"Failed to download using
>>>>>>>>>>>> gsutil: {str(e2)}")
>>>>>>>>>>>>
>>>>>>>>>>>>         except Exception as e:
>>>>>>>>>>>>             logger.error(f"Error downloading {gcs_uri}:
>>>>>>>>>>>> {str(e)}")
>>>>>>>>>>>> class LoadPDFPages(beam.DoFn):
>>>>>>>>>>>>     """Load PDF pages as images."""
>>>>>>>>>>>>
>>>>>>>>>>>>     def __init__(self, dpi=200):
>>>>>>>>>>>>         self.dpi = dpi
>>>>>>>>>>>>
>>>>>>>>>>>>     def process(self, element):
>>>>>>>>>>>>         doc = None
>>>>>>>>>>>>         try:
>>>>>>>>>>>>             # Make sure we have all required fields
>>>>>>>>>>>>             if not isinstance(element, dict):
>>>>>>>>>>>>                 logger.error(f"Expected dictionary, got
>>>>>>>>>>>> {type(element)}")
>>>>>>>>>>>>                 return
>>>>>>>>>>>>
>>>>>>>>>>>>             if 'local_path' not in element:
>>>>>>>>>>>>                 logger.error("Missing 'local_path' in element")
>>>>>>>>>>>>                 return
>>>>>>>>>>>>
>>>>>>>>>>>>             local_path = element['local_path']
>>>>>>>>>>>>             gcs_uri = element.get('gcs_uri', '')
>>>>>>>>>>>>
>>>>>>>>>>>>             # Extract filename from local_path if not provided
>>>>>>>>>>>>             filename = element.get('filename',
>>>>>>>>>>>> os.path.basename(local_path))
>>>>>>>>>>>>
>>>>>>>>>>>>             logger.info(f"Loading PDF: {local_path}, filename:
>>>>>>>>>>>> {filename}")
>>>>>>>>>>>>
>>>>>>>>>>>>             # Check if file exists and is accessible
>>>>>>>>>>>>             if not os.path.exists(local_path):
>>>>>>>>>>>>                 logger.error(f"File not found: {local_path}")
>>>>>>>>>>>>                 return
>>>>>>>>>>>>
>>>>>>>>>>>>             if not os.access(local_path, os.R_OK):
>>>>>>>>>>>>                 logger.error(f"File not readable: {local_path}")
>>>>>>>>>>>>                 return
>>>>>>>>>>>>
>>>>>>>>>>>>             # Open the PDF
>>>>>>>>>>>>             try:
>>>>>>>>>>>>                 doc = fitz.open(local_path)
>>>>>>>>>>>>                 if doc.is_closed:
>>>>>>>>>>>>                     logger.error(f"Failed to open PDF:
>>>>>>>>>>>> {local_path}")
>>>>>>>>>>>>                     return
>>>>>>>>>>>>             except Exception as e:
>>>>>>>>>>>>                 logger.error(f"Error opening PDF {local_path}:
>>>>>>>>>>>> {str(e)}")
>>>>>>>>>>>>                 return
>>>>>>>>>>>>
>>>>>>>>>>>>             # Process each page
>>>>>>>>>>>>             page_count = len(doc)
>>>>>>>>>>>>             logger.info(f"Processing {page_count} pages from
>>>>>>>>>>>> {local_path}")
>>>>>>>>>>>>
>>>>>>>>>>>>             for i in range(page_count):
>>>>>>>>>>>>                 try:
>>>>>>>>>>>>                     if doc.is_closed:
>>>>>>>>>>>>                         logger.error(f"Document was closed
>>>>>>>>>>>> unexpectedly while processing page {i}")
>>>>>>>>>>>>                         break
>>>>>>>>>>>>
>>>>>>>>>>>>                     page = doc[i]
>>>>>>>>>>>>                     if page is None:
>>>>>>>>>>>>                         logger.error(f"Failed to get page {i}
>>>>>>>>>>>> from document")
>>>>>>>>>>>>                         continue
>>>>>>>>>>>>
>>>>>>>>>>>>                     # Use a higher resolution for better quality
>>>>>>>>>>>>                     scale = self.dpi / 72.0
>>>>>>>>>>>>                     mat = fitz.Matrix(scale, scale)
>>>>>>>>>>>>
>>>>>>>>>>>>                     try:
>>>>>>>>>>>>                         pix = page.get_pixmap(matrix=mat,
>>>>>>>>>>>> alpha=False)
>>>>>>>>>>>>                     except Exception as e:
>>>>>>>>>>>>                         logger.error(f"Error getting pixmap for
>>>>>>>>>>>> page {i}: {str(e)}")
>>>>>>>>>>>>                         continue
>>>>>>>>>>>>
>>>>>>>>>>>>                     # Check pixmap dimensions
>>>>>>>>>>>>                     if pix.height <= 0 or pix.width <= 0 or
>>>>>>>>>>>> pix.n <= 0:
>>>>>>>>>>>>                         logger.error(f"Invalid pixmap
>>>>>>>>>>>> dimensions: {pix.width}x{pix.height}x{pix.n}")
>>>>>>>>>>>>                         continue
>>>>>>>>>>>>
>>>>>>>>>>>>                     # Convert to numpy array
>>>>>>>>>>>>                     try:
>>>>>>>>>>>>                         arr = np.frombuffer(pix.samples,
>>>>>>>>>>>> dtype=np.uint8).reshape(pix.height, pix.width, pix.n)
>>>>>>>>>>>>                     except Exception as e:
>>>>>>>>>>>>                         logger.error(f"Error converting pixmap
>>>>>>>>>>>> to numpy array: {str(e)}")
>>>>>>>>>>>>                         continue
>>>>>>>>>>>>
>>>>>>>>>>>>                     # Convert BGR to RGB if needed
>>>>>>>>>>>>                     if pix.n == 3:  # RGB
>>>>>>>>>>>>                         try:
>>>>>>>>>>>>                             arr = cv2.cvtColor(arr,
>>>>>>>>>>>> cv2.COLOR_BGR2RGB)
>>>>>>>>>>>>                         except Exception as e:
>>>>>>>>>>>>                             logger.error(f"Error converting BGR
>>>>>>>>>>>> to RGB: {str(e)}")
>>>>>>>>>>>>                             continue
>>>>>>>>>>>>
>>>>>>>>>>>>                     # Store original size for later use
>>>>>>>>>>>>                     original_size = (arr.shape[0], arr.shape[1])
>>>>>>>>>>>>
>>>>>>>>>>>>                     # Create page info
>>>>>>>>>>>>                     page_info = {
>>>>>>>>>>>>                         'page_num': i,
>>>>>>>>>>>>                         'image': arr,
>>>>>>>>>>>>                         'original_size': original_size,
>>>>>>>>>>>>                         'local_path': local_path,
>>>>>>>>>>>>                         'gcs_uri': gcs_uri,
>>>>>>>>>>>>                         'filename': filename
>>>>>>>>>>>>                     }
>>>>>>>>>>>>
>>>>>>>>>>>>                     # Use document ID and page number as key
>>>>>>>>>>>>                     doc_id = os.path.splitext(filename)[0]
>>>>>>>>>>>>                     key = f"{doc_id}_{i}"
>>>>>>>>>>>>
>>>>>>>>>>>>                     yield (key, page_info)
>>>>>>>>>>>>                 except Exception as e:
>>>>>>>>>>>>                     import traceback
>>>>>>>>>>>>                     logger.error(f"Error processing page {i}:
>>>>>>>>>>>> {str(e)}")
>>>>>>>>>>>>                     logger.error(traceback.format_exc())
>>>>>>>>>>>>
>>>>>>>>>>>>             logger.info(f"Loaded {len(doc)} pages from
>>>>>>>>>>>> {local_path}")
>>>>>>>>>>>>
>>>>>>>>>>>>         except Exception as e:
>>>>>>>>>>>>             import traceback
>>>>>>>>>>>>             logger.error(f"Error loading PDF: {str(e)}")
>>>>>>>>>>>>             logger.error(traceback.format_exc())
>>>>>>>>>>>>         finally:
>>>>>>>>>>>>             # Make sure to close the document only if it was
>>>>>>>>>>>> successfully opened
>>>>>>>>>>>>             if doc is not None:
>>>>>>>>>>>>                 try:
>>>>>>>>>>>>                     if not doc.is_closed:
>>>>>>>>>>>>                         doc.close()
>>>>>>>>>>>>                 except Exception as e:
>>>>>>>>>>>>                     logger.debug(f"Error closing document:
>>>>>>>>>>>> {str(e)}")
>>>>>>>>>>>>
>>>>>>>>>>>> class PreprocessImage(beam.DoFn):
>>>>>>>>>>>>     """Preprocess image for Triton server."""
>>>>>>>>>>>>
>>>>>>>>>>>>     def __init__(self, size=1024):
>>>>>>>>>>>>         self.size = size
>>>>>>>>>>>>
>>>>>>>>>>>>     def letterbox(self, img, new_shape=1024,
>>>>>>>>>>>> color=(114,114,114)):
>>>>>>>>>>>>         """Resize and pad image to target size."""
>>>>>>>>>>>>         h, w = img.shape[:2]
>>>>>>>>>>>>         r = min(new_shape / h, new_shape / w)
>>>>>>>>>>>>         nh, nw = int(round(h * r)), int(round(w * r))
>>>>>>>>>>>>         pad_h, pad_w = new_shape - nh, new_shape - nw
>>>>>>>>>>>>         top = pad_h // 2
>>>>>>>>>>>>         bottom = pad_h - top
>>>>>>>>>>>>         left = pad_w // 2
>>>>>>>>>>>>         right = pad_w - left
>>>>>>>>>>>>         img = cv2.resize(img, (nw, nh),
>>>>>>>>>>>> interpolation=cv2.INTER_LINEAR)
>>>>>>>>>>>>         img = cv2.copyMakeBorder(img, top, bottom, left, right,
>>>>>>>>>>>> cv2.BORDER_CONSTANT, value=color)
>>>>>>>>>>>>         return img, r, left, top
>>>>>>>>>>>>
>>>>>>>>>>>>     def process(self, element):
>>>>>>>>>>>>         try:
>>>>>>>>>>>>             if not isinstance(element, tuple) or len(element)
>>>>>>>>>>>> != 2:
>>>>>>>>>>>>                 logger.error(f"Expected (key, value) tuple, got
>>>>>>>>>>>> {type(element)}")
>>>>>>>>>>>>                 return
>>>>>>>>>>>>
>>>>>>>>>>>>             key, page_info = element
>>>>>>>>>>>>
>>>>>>>>>>>>             if not isinstance(page_info, dict):
>>>>>>>>>>>>                 logger.error(f"Expected dictionary for
>>>>>>>>>>>> page_info, got {type(page_info)}")
>>>>>>>>>>>>                 return
>>>>>>>>>>>>
>>>>>>>>>>>>             if 'image' not in page_info:
>>>>>>>>>>>>                 logger.error("Missing 'image' in page_info")
>>>>>>>>>>>>                 return
>>>>>>>>>>>>
>>>>>>>>>>>>             # Create a new dictionary to avoid modifying the
>>>>>>>>>>>> input
>>>>>>>>>>>>             new_page_info = dict(page_info)
>>>>>>>>>>>>
>>>>>>>>>>>>             # Apply letterbox resize
>>>>>>>>>>>>             img = new_page_info['image']
>>>>>>>>>>>>             lb, r, left, top = self.letterbox(img,
>>>>>>>>>>>> new_shape=self.size)
>>>>>>>>>>>>
>>>>>>>>>>>>             # Convert to float32 and normalize to [0,1]
>>>>>>>>>>>>             x = lb.astype(np.float32) / 255.0
>>>>>>>>>>>>
>>>>>>>>>>>>             # Convert to CHW format
>>>>>>>>>>>>             x = np.transpose(x, (2, 0, 1))
>>>>>>>>>>>>
>>>>>>>>>>>>             # Add batch dimension
>>>>>>>>>>>>             batched_img = np.expand_dims(x, axis=0)
>>>>>>>>>>>>
>>>>>>>>>>>>             # Update page info
>>>>>>>>>>>>             new_page_info['preprocessed_image'] = batched_img
>>>>>>>>>>>>             new_page_info['letterbox_info'] = (r, left, top)
>>>>>>>>>>>>
>>>>>>>>>>>>             yield (key, new_page_info)
>>>>>>>>>>>>
>>>>>>>>>>>>         except Exception as e:
>>>>>>>>>>>>             import traceback
>>>>>>>>>>>>             logger.error(f"Error preprocessing image: {str(e)}")
>>>>>>>>>>>>             logger.error(traceback.format_exc())
>>>>>>>>>>>>
>>>>>>>>>>>>
>>>>>>>>>>>>
>>>>>>>>>>>> class ExtractBoxes(beam.DoFn):
>>>>>>>>>>>>     """Extract bounding boxes from Triton response."""
>>>>>>>>>>>>
>>>>>>>>>>>>     def __init__(self, conf_th=0.25, iou_th=0.7,
>>>>>>>>>>>> model_size=1024):
>>>>>>>>>>>>         self.conf_th = conf_th
>>>>>>>>>>>>         self.iou_th = iou_th
>>>>>>>>>>>>         self.model_size = model_size
>>>>>>>>>>>>
>>>>>>>>>>>>     def _nms(self, boxes, scores, iou_th=0.7):
>>>>>>>>>>>>         """Non-Maximum Suppression"""
>>>>>>>>>>>>         if len(boxes) == 0:
>>>>>>>>>>>>             return []
>>>>>>>>>>>>
>>>>>>>>>>>>         boxes = boxes.astype(np.float32)
>>>>>>>>>>>>         x1, y1, x2, y2 = boxes.T
>>>>>>>>>>>>         areas = (x2 - x1) * (y2 - y1)
>>>>>>>>>>>>         order = scores.argsort()[::-1]
>>>>>>>>>>>>
>>>>>>>>>>>>         keep = []
>>>>>>>>>>>>         while order.size > 0:
>>>>>>>>>>>>             i = order[0]
>>>>>>>>>>>>             keep.append(i)
>>>>>>>>>>>>
>>>>>>>>>>>>             xx1 = np.maximum(x1[i], x1[order[1:]])
>>>>>>>>>>>>             yy1 = np.maximum(y1[i], y1[order[1:]])
>>>>>>>>>>>>             xx2 = np.minimum(x2[i], x2[order[1:]])
>>>>>>>>>>>>             yy2 = np.minimum(y2[i], y2[order[1:]])
>>>>>>>>>>>>
>>>>>>>>>>>>             w = np.maximum(0.0, xx2 - xx1)
>>>>>>>>>>>>             h = np.maximum(0.0, yy2 - yy1)
>>>>>>>>>>>>             inter = w * h
>>>>>>>>>>>>
>>>>>>>>>>>>             iou = inter / (areas[i] + areas[order[1:]] - inter
>>>>>>>>>>>> + 1e-9)
>>>>>>>>>>>>             inds = np.where(iou <= iou_th)[0]
>>>>>>>>>>>>             order = order[inds + 1]
>>>>>>>>>>>>
>>>>>>>>>>>>         return keep
>>>>>>>>>>>>
>>>>>>>>>>>>     def process(self, page_info):
>>>>>>>>>>>>         try:
>>>>>>>>>>>>             triton_response = page_info['triton_response']
>>>>>>>>>>>>             original_size = page_info['original_size']
>>>>>>>>>>>>             r, left, top = page_info['letterbox_info']
>>>>>>>>>>>>
>>>>>>>>>>>>             if "outputs" not in triton_response or not
>>>>>>>>>>>> triton_response["outputs"]:
>>>>>>>>>>>>                 logger.error("Invalid response from Triton
>>>>>>>>>>>> server")
>>>>>>>>>>>>                 return []
>>>>>>>>>>>>
>>>>>>>>>>>>             out_meta = triton_response["outputs"][0]
>>>>>>>>>>>>             shape = out_meta["shape"]
>>>>>>>>>>>>             data = np.array(out_meta["data"],
>>>>>>>>>>>> dtype=np.float32).reshape(shape)
>>>>>>>>>>>>
>>>>>>>>>>>>             logger.info(f"Output shape: {shape}")
>>>>>>>>>>>>
>>>>>>>>>>>>             # For YOLO output [B, C, P] where C is channels
>>>>>>>>>>>> (box coords + objectness + classes)
>>>>>>>>>>>>             B, C, P = shape
>>>>>>>>>>>>
>>>>>>>>>>>>             # Assuming 4 box coordinates + class probabilities
>>>>>>>>>>>> (no objectness)
>>>>>>>>>>>>             has_objectness = False
>>>>>>>>>>>>             num_classes = C - 5 if has_objectness else C - 4
>>>>>>>>>>>>
>>>>>>>>>>>>             # Extract data
>>>>>>>>>>>>             xywh = data[:, 0:4, :]
>>>>>>>>>>>>             if has_objectness:
>>>>>>>>>>>>                 obj = data[:, 4:5, :]
>>>>>>>>>>>>                 cls = data[:, 5:5 + num_classes, :]
>>>>>>>>>>>>             else:
>>>>>>>>>>>>                 obj = None
>>>>>>>>>>>>                 cls = data[:, 4:4 + num_classes, :]
>>>>>>>>>>>>
>>>>>>>>>>>>             # Process batch item (we only have one)
>>>>>>>>>>>>             b = 0
>>>>>>>>>>>>             h, w = original_size
>>>>>>>>>>>>
>>>>>>>>>>>>             xywh_b = xywh[b].T  # (P,4)
>>>>>>>>>>>>             if obj is not None:
>>>>>>>>>>>>                 obj_b = obj[b].T.squeeze(1)  # (P,)
>>>>>>>>>>>>             else:
>>>>>>>>>>>>                 obj_b = np.ones((P,), dtype=np.float32)
>>>>>>>>>>>>             cls_b = cls[b].T  # (P,nc)
>>>>>>>>>>>>
>>>>>>>>>>>>             # Get scores and labels
>>>>>>>>>>>>             scores_all = (obj_b[:, None] * cls_b) if obj is not
>>>>>>>>>>>> None else cls_b
>>>>>>>>>>>>             labels = scores_all.argmax(axis=1)
>>>>>>>>>>>>             scores = scores_all.max(axis=1)
>>>>>>>>>>>>
>>>>>>>>>>>>             # Filter by confidence threshold
>>>>>>>>>>>>             keep = scores >= self.conf_th
>>>>>>>>>>>>             if not np.any(keep):
>>>>>>>>>>>>                 logger.info(f"No detections above threshold
>>>>>>>>>>>> {self.conf_th}")
>>>>>>>>>>>>                 return []
>>>>>>>>>>>>
>>>>>>>>>>>>             xywh_k = xywh_b[keep]
>>>>>>>>>>>>             scores_k = scores[keep]
>>>>>>>>>>>>             labels_k = labels[keep]
>>>>>>>>>>>>
>>>>>>>>>>>>             # xywh -> xyxy in model space
>>>>>>>>>>>>             cx, cy, ww, hh = xywh_k.T
>>>>>>>>>>>>             xyxy_model = np.stack([cx - ww / 2, cy - hh / 2, cx
>>>>>>>>>>>> + ww / 2, cy + hh / 2], axis=1)
>>>>>>>>>>>>
>>>>>>>>>>>>             # Apply NMS per class
>>>>>>>>>>>>             final_boxes = []
>>>>>>>>>>>>             final_scores = []
>>>>>>>>>>>>             final_labels = []
>>>>>>>>>>>>
>>>>>>>>>>>>             for c in np.unique(labels_k):
>>>>>>>>>>>>                 idxs = np.where(labels_k == c)[0]
>>>>>>>>>>>>                 if idxs.size == 0:
>>>>>>>>>>>>                     continue
>>>>>>>>>>>>                 keep_idx = self._nms(xyxy_model[idxs],
>>>>>>>>>>>> scores_k[idxs], iou_th=self.iou_th)
>>>>>>>>>>>>                 final_boxes.append(xyxy_model[idxs][keep_idx])
>>>>>>>>>>>>                 final_scores.append(scores_k[idxs][keep_idx])
>>>>>>>>>>>>                 final_labels.append(np.full(len(keep_idx), c,
>>>>>>>>>>>> dtype=int))
>>>>>>>>>>>>
>>>>>>>>>>>>             if not final_boxes:
>>>>>>>>>>>>                 logger.info("No detections after NMS")
>>>>>>>>>>>>                 return []
>>>>>>>>>>>>
>>>>>>>>>>>>             xyxy_model = np.vstack(final_boxes)
>>>>>>>>>>>>             scores_k = np.concatenate(final_scores)
>>>>>>>>>>>>             labels_k = np.concatenate(final_labels)
>>>>>>>>>>>>
>>>>>>>>>>>>             # Map boxes from model space to original image space
>>>>>>>>>>>>             xyxy_orig = xyxy_model.copy()
>>>>>>>>>>>>
>>>>>>>>>>>>             # Remove padding
>>>>>>>>>>>>             xyxy_orig[:, [0, 2]] -= left
>>>>>>>>>>>>             xyxy_orig[:, [1, 3]] -= top
>>>>>>>>>>>>
>>>>>>>>>>>>             # Scale back to original size
>>>>>>>>>>>>             xyxy_orig /= r
>>>>>>>>>>>>
>>>>>>>>>>>>             # Clip to image boundaries
>>>>>>>>>>>>             xyxy_orig[:, 0::2] = np.clip(xyxy_orig[:, 0::2], 0,
>>>>>>>>>>>> w - 1)
>>>>>>>>>>>>             xyxy_orig[:, 1::2] = np.clip(xyxy_orig[:, 1::2], 0,
>>>>>>>>>>>> h - 1)
>>>>>>>>>>>>
>>>>>>>>>>>>             # Format as requested: x_min, y_min, x_max, y_max,
>>>>>>>>>>>> class, probability
>>>>>>>>>>>>             boxes = []
>>>>>>>>>>>>             for (x1, y1, x2, y2), label, score in
>>>>>>>>>>>> zip(xyxy_orig, labels_k, scores_k):
>>>>>>>>>>>>                 class_name = CLASS_ID_TO_NAME.get(int(label))
>>>>>>>>>>>>                 box_info = {
>>>>>>>>>>>>                     "page": page_info['page_num'],
>>>>>>>>>>>>                     "x_min": float(x1),
>>>>>>>>>>>>                     "y_min": float(y1),
>>>>>>>>>>>>                     "x_max": float(x2),
>>>>>>>>>>>>                     "y_max": float(y2),
>>>>>>>>>>>>                     "class": int(label),
>>>>>>>>>>>>                     "class_name": class_name,
>>>>>>>>>>>>                     "probability": float(score),
>>>>>>>>>>>>                     "filename": page_info['filename'],
>>>>>>>>>>>>                     "local_path": page_info['local_path'],
>>>>>>>>>>>>                     "gcs_uri": page_info['gcs_uri']
>>>>>>>>>>>>                 }
>>>>>>>>>>>>                 boxes.append(box_info)
>>>>>>>>>>>>
>>>>>>>>>>>>             logger.info(f"Extracted {len(boxes)} boxes from
>>>>>>>>>>>> page {page_info['page_num']}")
>>>>>>>>>>>>
>>>>>>>>>>>>             return boxes
>>>>>>>>>>>>
>>>>>>>>>>>>         except Exception as e:
>>>>>>>>>>>>             logger.error(f"Error extracting boxes: {str(e)}")
>>>>>>>>>>>>             return []
>>>>>>>>>>>>
>>>>>>>>>>>> class PrepareForBigQuery(beam.DoFn):
>>>>>>>>>>>>     """Prepare data for BigQuery insertion."""
>>>>>>>>>>>>
>>>>>>>>>>>>     def process(self, box_info):
>>>>>>>>>>>>         try:
>>>>>>>>>>>>             # Generate UUIDs for primary keys
>>>>>>>>>>>>             v_note_id = str(uuid.uuid4())
>>>>>>>>>>>>             page_ocr_id = str(uuid.uuid4())
>>>>>>>>>>>>             class_prediction_id = str(uuid.uuid4())
>>>>>>>>>>>>
>>>>>>>>>>>>             # Create timestamp
>>>>>>>>>>>>             processing_time =
>>>>>>>>>>>> datetime.datetime.now().strftime("%Y-%m-%d %H:%M:%S")
>>>>>>>>>>>>
>>>>>>>>>>>>             # Create ocr_results row
>>>>>>>>>>>>             ocr_results_row = {
>>>>>>>>>>>>                 "v_note_id": v_note_id,
>>>>>>>>>>>>                 "filename": box_info['filename'],
>>>>>>>>>>>>                 "file_path": box_info['gcs_uri'],
>>>>>>>>>>>>                 "processing_time": processing_time,
>>>>>>>>>>>>                 "file_type": "pdf"
>>>>>>>>>>>>             }
>>>>>>>>>>>>
>>>>>>>>>>>>             # Create page_ocr row
>>>>>>>>>>>>             page_ocr_row = {
>>>>>>>>>>>>                 "page_ocr_id": page_ocr_id,
>>>>>>>>>>>>                 "v_note_id": v_note_id,
>>>>>>>>>>>>                 "page_number": box_info['page']
>>>>>>>>>>>>             }
>>>>>>>>>>>>
>>>>>>>>>>>>             # Create class_prediction row
>>>>>>>>>>>>             class_prediction_row = {
>>>>>>>>>>>>                 "class_prediction_id": class_prediction_id,
>>>>>>>>>>>>                 "page_ocr_id": page_ocr_id,
>>>>>>>>>>>>                 "xmin": box_info['x_min'],
>>>>>>>>>>>>                 "ymin": box_info['y_min'],
>>>>>>>>>>>>                 "xmax": box_info['x_max'],
>>>>>>>>>>>>                 "ymax": box_info['y_max'],
>>>>>>>>>>>>                 "class": box_info['class_name'] if
>>>>>>>>>>>> box_info['class_name'] else str(box_info['class']),
>>>>>>>>>>>>                 "confidence": box_info['probability']
>>>>>>>>>>>>             }
>>>>>>>>>>>>
>>>>>>>>>>>>             # Return all three rows with table names
>>>>>>>>>>>>             return [
>>>>>>>>>>>>                 ('ocr_results', ocr_results_row),
>>>>>>>>>>>>                 ('page_ocr', page_ocr_row),
>>>>>>>>>>>>                 ('class_prediction', class_prediction_row)
>>>>>>>>>>>>             ]
>>>>>>>>>>>>
>>>>>>>>>>>>         except Exception as e:
>>>>>>>>>>>>             logger.error(f"Error preparing for BigQuery:
>>>>>>>>>>>> {str(e)}")
>>>>>>>>>>>>             return []
>>>>>>>>>>>>
>>>>>>>>>>>> model_handler = TensorRTEngineHandlerNumPy(
>>>>>>>>>>>>   min_batch_size=1,
>>>>>>>>>>>>   max_batch_size=1,
>>>>>>>>>>>>   engine_path="gs://temp/yolov11l-doclaynet.engine",
>>>>>>>>>>>> )
>>>>>>>>>>>>
>>>>>>>>>>>>
>>>>>>>>>>>> with beam.Pipeline(options=options) as pipeline:  # 'options' (PipelineOptions) is defined elsewhere in the full script
>>>>>>>>>>>>
>>>>>>>>>>>>         # Create PCollection from input URIs
>>>>>>>>>>>>         pdf_uris = (
>>>>>>>>>>>>             pipeline
>>>>>>>>>>>>             | "Create URIs" >> beam.Create(["tmp.pdf"])
>>>>>>>>>>>>         )
>>>>>>>>>>>>
>>>>>>>>>>>>         # Download PDFs
>>>>>>>>>>>>         local_pdfs = (
>>>>>>>>>>>>             pdf_uris
>>>>>>>>>>>>             | "Download PDFs" >>
>>>>>>>>>>>> beam.ParDo(DownloadPDFFromGCS())
>>>>>>>>>>>>         )
>>>>>>>>>>>>
>>>>>>>>>>>>          # Load PDF pages
>>>>>>>>>>>>         pdf_pages = (
>>>>>>>>>>>>             local_pdfs
>>>>>>>>>>>>             | "Load PDF Pages" >> beam.ParDo(LoadPDFPages())
>>>>>>>>>>>>             #| "Flatten Pages" >> beam.FlatMap(lambda x: x)
>>>>>>>>>>>>         )
>>>>>>>>>>>>
>>>>>>>>>>>>         # Preprocess images
>>>>>>>>>>>>         preprocessed_pages = (
>>>>>>>>>>>>             pdf_pages
>>>>>>>>>>>>             | "Preprocess Images" >>
>>>>>>>>>>>> beam.ParDo(PreprocessImage())
>>>>>>>>>>>>         )
>>>>>>>>>>>>         inference_results = (
>>>>>>>>>>>>             preprocessed_pages
>>>>>>>>>>>>             | "Run Inference" >>
>>>>>>>>>>>> RunInference(model_handler=model_handler)
>>>>>>>>>>>>         )
>>>>>>>>>>>>
>>>>>>>>>>>> On Tue, 16 Sept 2025 at 21:23, XQ Hu <[email protected]> wrote:
>>>>>>>>>>>>
>>>>>>>>>>>>> Can you share your commands and outputs?
>>>>>>>>>>>>>
>>>>>>>>>>>>> On Tue, Sep 16, 2025 at 9:02 PM Sai Shashank <
>>>>>>>>>>>>> [email protected]> wrote:
>>>>>>>>>>>>>
>>>>>>>>>>>>>> Okay, I have changed the Docker image to now RUN the python
>>>>>>>>>>>>>> command, but it is still halting without any errors or
>>>>>>>>>>>>>> warnings.
>>>>>>>>>>>>>>
>>>>>>>>>>>>>> On Tue, 16 Sept 2025 at 17:38, XQ Hu via dev <
>>>>>>>>>>>>>> [email protected]> wrote:
>>>>>>>>>>>>>>
>>>>>>>>>>>>>>> The CMD is not necessary, as it will be overridden by the
>>>>>>>>>>>>>>> ENTRYPOINT, just as your comment says.
>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>> If you ssh into your Docker container like `docker run --rm
>>>>>>>>>>>>>>> -it --entrypoint=/bin/bash $CUSTOM_CONTAINER_IMAGE`, can you
>>>>>>>>>>>>>>> run python and some Beam pipelines with a direct runner in
>>>>>>>>>>>>>>> the container? This can help confirm the environment works
>>>>>>>>>>>>>>> fine.
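>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>> Even something tiny is enough for that check, for example (a
>>>>>>>>>>>>>>> sketch, not tied to your pipeline):
>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>> import apache_beam as beam
>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>> # Minimal smoke test; the DirectRunner is the default runner.
>>>>>>>>>>>>>>> with beam.Pipeline() as p:
>>>>>>>>>>>>>>>     (
>>>>>>>>>>>>>>>         p
>>>>>>>>>>>>>>>         | beam.Create([1, 2, 3])
>>>>>>>>>>>>>>>         | beam.Map(lambda x: x * x)
>>>>>>>>>>>>>>>         | beam.Map(print)
>>>>>>>>>>>>>>>     )
>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>> # And a quick check that TensorRT imports in the same environment:
>>>>>>>>>>>>>>> # python -c "import tensorrt; print(tensorrt.__version__)"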
>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>> I have one old Dockerfile that used to work with the old
>>>>>>>>>>>>>>> Beam:
>>>>>>>>>>>>>>> https://github.com/google/dataflow-ml-starter/blob/main/tensor_rt.Dockerfile
>>>>>>>>>>>>>>> .
>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>> On Tue, Sep 16, 2025 at 4:56 PM Sai Shashank <
>>>>>>>>>>>>>>> [email protected]> wrote:
>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>> ---------- Forwarded message ---------
>>>>>>>>>>>>>>>> From: Sai Shashank <[email protected]>
>>>>>>>>>>>>>>>> Date: Tue, Sep 16, 2025 at 4:27 PM
>>>>>>>>>>>>>>>> Subject: TensorRT inference not starting
>>>>>>>>>>>>>>>> To: <[email protected]>
>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>> Hey Everyone,
>>>>>>>>>>>>>>>>                          I was trying to use TensorRT within
>>>>>>>>>>>>>>>> Apache Beam on Dataflow, but somehow Dataflow didn't start;
>>>>>>>>>>>>>>>> it did not even give me worker logs. Below is the Dockerfile
>>>>>>>>>>>>>>>> I use to create the custom image. At first I thought it was
>>>>>>>>>>>>>>>> a version mismatch, but that usually gives me a harness
>>>>>>>>>>>>>>>> error.
>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>> ARG BUILD_IMAGE=nvcr.io/nvidia/tensorrt:25.08-py3
>>>>>>>>>>>>>>>> FROM ${BUILD_IMAGE}
>>>>>>>>>>>>>>>> ENV PATH="/usr/src/tensorrt/bin:${PATH}"
>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>> WORKDIR /workspace
>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>> RUN apt-get update -y && apt-get install -y python3-venv
>>>>>>>>>>>>>>>> RUN pip install --no-cache-dir apache-beam[gcp]==2.67.0
>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>> COPY --from=apache/beam_python3.10_sdk:2.67.0
>>>>>>>>>>>>>>>> /opt/apache/beam /opt/apache/beam
>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>> # Install additional dependencies
>>>>>>>>>>>>>>>> RUN pip install --upgrade pip \
>>>>>>>>>>>>>>>>     && pip install torch \
>>>>>>>>>>>>>>>>     && pip install torchvision \
>>>>>>>>>>>>>>>>     && pip install "pillow>=8.0.0" \
>>>>>>>>>>>>>>>>     && pip install "transformers>=4.18.0" \
>>>>>>>>>>>>>>>>     && pip install cuda-python \
>>>>>>>>>>>>>>>>     && pip install opencv-python==4.7.0.72 \
>>>>>>>>>>>>>>>>     && pip install PyMuPDF==1.22.5 \
>>>>>>>>>>>>>>>>     && pip install requests==2.31.0
>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>> # Set the default command to run the inference script
>>>>>>>>>>>>>>>> # This will be overridden by the Apache Beam boot script
>>>>>>>>>>>>>>>> CMD ["python", "/workspace/inference.py"]
>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>> # Use the Apache Beam boot script as the entrypoint
>>>>>>>>>>>>>>>> ENTRYPOINT ["/opt/apache/beam/boot"]
>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>>
