Thanks Anand! I left a few comments, but overall I think this is a great, well-constructed proposal. It is a good way to quickly bring a suite of preprocessing operations to Beam.

Thanks,
Danny

On Tue, May 9, 2023 at 12:52 PM Anand Inguva via dev <dev@beam.apache.org> wrote:

> Hi all,
>
> In Apache Beam, we plan to introduce a *beam.MLTransform* for carrying
> out common ML centric processing tasks.
>
> Using the tensorflow_transform as the backend, we will introduce several
> data processing transforms in Beam. These can be easily utilized by simply
> wrapping them with the beam.MLTransform. This approach not only simplifies
> the process but also creates a pathway for implementing a comprehensive set
> of ML-centric data processing transforms in Apache Beam.
>
> Please go through the design doc
> <https://docs.google.com/document/d/1rQkSm_8tseLqDQaLohtlCGqt5pvMaP0XIpPi5UD0LCQ/edit#>[1]
> for details and share your feedback.
>
> Thanks,
> Anand
>
> [1]
> https://docs.google.com/document/d/1rQkSm_8tseLqDQaLohtlCGqt5pvMaP0XIpPi5UD0LCQ/edit#