Re: Question on printing out a PCollection

2021-04-30 Thread Robert Bradshaw
Sorry, no Java versions of this stuff (though it may be possible to use cross-language to invoke your Java pipeline from Python and get the benefits that way). On Fri, Apr 30, 2021 at 11:30 AM Tao Li wrote: > > Thanks @Ning Kang. > > @Robert Bradshaw I assume you are referring to > https://beam.

Re: Question on printing out a PCollection

2021-04-30 Thread Tao Li
Thanks @Ning Kang. @Robert Bradshaw I assume you are referring to https://beam.apache.org/releases/pydoc/2.22.0/apache_beam.runners.interactive.interactive_beam.html. Is there a java version for it? On 4/30/21, 11:00 AM, "Robert Bradshaw" wrote: You can also use interactive Beam's coll

Re: Question on printing out a PCollection

2021-04-30 Thread Robert Bradshaw
You can also use interactive Beam's collect, to get the PCollection as a Dataframe, and then print it or do whatever else with it as you like. On Fri, Apr 30, 2021 at 10:24 AM Ning Kang wrote: > > Hi Tao, > > The `show()` API works with any IPython notebook runtimes, including Colab, > Jupyter L

Re: Question on printing out a PCollection

2021-04-30 Thread Ning Kang
Hi Tao, The `show()` API works with any IPython notebook runtimes, including Colab, Jupyter Lab and pre-lab Jupyter Notebooks, as long as you have `%pip install apache-beam[interactive]`. Additionally, the `show_graph()` API needs GraphViz binary installed, details can be found in the README

Re: Question on printing out a PCollection

2021-04-30 Thread Brian Hulette
+Ning Kang +Sam Rohde On Thu, Apr 29, 2021 at 6:13 PM Tao Li wrote: > Hi Beam community, > > > > The notebook console from Google Cloud defines a show() API to display a > PCollection which is very neat: > https://cloud.google.com/dataflow/docs/guides/interactive-pipeline-development > > > > I

Question on printing out a PCollection

2021-04-29 Thread Tao Li
Hi Beam community, The notebook console from Google Cloud defines a show() API to display a PCollection which is very neat: https://cloud.google.com/dataflow/docs/guides/interactive-pipeline-development If we are using a regular jupyter notebook to run beam app, how can we print out a PCollect