Hi Tao,

The `show()` API works with any IPython notebook runtimes, including Colab,
Jupyter Lab and pre-lab Jupyter Notebooks, as long as you have `%pip
install apache-beam[interactive]`.

Additionally, the `show_graph()` API needs GraphViz binary installed,
details can be found in the README
<https://github.com/apache/beam/tree/master/sdks/python/apache_beam/runners/interactive#pre-requisites>
.

If you've created an Apache Beam notebook instance on Google Cloud, there
is an example notebook "Examples/Visualize_Data.ipynb" demonstrating how to
visualize data of PCollections with different libraries:

   - Native Interactive Beam Visualization
   - Pandas DataFrame
   - Matplotlib
   - Seaborn
   - Bokeh
   - D3.js

Hope this helps!

Ning

On Fri, Apr 30, 2021 at 9:24 AM Brian Hulette <bhule...@google.com> wrote:

> +Ning Kang <ni...@google.com> +Sam Rohde <sro...@google.com>
>
> On Thu, Apr 29, 2021 at 6:13 PM Tao Li <t...@zillow.com> wrote:
>
>> Hi Beam community,
>>
>>
>>
>> The notebook console from Google Cloud defines a show() API to display a
>> PCollection which is very neat:
>> https://cloud.google.com/dataflow/docs/guides/interactive-pipeline-development
>>
>>
>>
>> If we are using a regular jupyter notebook to run beam app, how can we
>> print out a PCollection easily? What’s the best practice? Thanks!
>>
>>
>>
>

Reply via email to