[jira] [Commented] (BEAM-12803) SqlTransform doesn't work on python 3.9

Brian Hulette (Jira) Mon, 20 Sep 2021 09:11:06 -0700


    [ 
https://issues.apache.org/jira/browse/BEAM-12803?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17417711#comment-17417711
 ]


Brian Hulette commented on BEAM-12803:
--------------------------------------

Is it possible there's a typo somewhere? It looks like your code correctly 
references {{test_tbl}} everywhere, but the Calcite error says 'test_tabl'.

We do have some continuous tests that verify the tagged PCollection input case: 
https://github.com/apache/beam/blob/8072cc0bcfd4eee08a95902e13b9bf1dc2338693/sdks/python/apache_beam/transforms/sql_test.py#L131
 so I'd be surprised if there's an issue there.
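
One quick way to rule a typo in or out is to compare the table names a query references against the tags passed to SqlTransform. The sketch below is only an illustration and is not part of Beam: {{find_unknown_tables}} is a hypothetical helper, the query string is made up (the failing query is not shown in this thread), and the regex is a rough match on FROM/JOIN clauses rather than a real SQL parser.

```python
import difflib
import re


def find_unknown_tables(query, tags):
    """Map each table name in a FROM/JOIN clause that is not a known tag
    to the closest known tag (or None if nothing is similar).

    Hypothetical helper for illustration only; it uses a rough regex,
    not a SQL parser, and compares names case-sensitively.
    """
    referenced = re.findall(
        r"\b(?:FROM|JOIN)\s+([A-Za-z_][A-Za-z0-9_]*)",
        query,
        flags=re.IGNORECASE,
    )
    known = sorted(set(tags))
    unknown = {}
    for name in referenced:
        if name in known:
            continue
        # difflib suggests the most similar known tag, if any is close enough.
        close = difflib.get_close_matches(name, known, n=1)
        unknown[name] = close[0] if close else None
    return unknown


# Assumed example: the pipeline tags its input as 'test_tbl',
# but the query misspells it as 'test_tabl'.
suspects = find_unknown_tables("SELECT id FROM test_tabl", ["test_tbl"])
print(suspects)
```

An empty result means every table name the regex found matches one of the tags; any entry in the result is worth checking by hand before suspecting the transform itself.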

> SqlTransform doesn't work on python 3.9
> ---------------------------------------
>
>                 Key: BEAM-12803
>                 URL: https://issues.apache.org/jira/browse/BEAM-12803
>             Project: Beam
>          Issue Type: Bug
>          Components: sdk-py-core
>            Reporter: sean teeling
>            Assignee: Brian Hulette
>            Priority: P2
>
> Working example below -(Is there no way to paste pre-formatted code into 
> jira?!)- (EDIT: I added the appropriate "code" block)
> {code:python}
> import itertools
> import csv
> import io
> import apache_beam as beam
> from apache_beam.dataframe.io import read_csv
> from apache_beam.transforms.sql import SqlTransform
> def parse_csv(val):
>   def lower_headers(iterator):
>     return itertools.chain([next(iterator).lower()], iterator)
>   return csv.DictReader(lower_headers(io.TextIOWrapper(val.open())))
> class BeamTransformBuilder():
>   def build(self, pipeline):
>     practices = (
>         pipeline
>           | beam.io.fileio.MatchFiles("data.csv")
>           | beam.io.fileio.ReadMatches()
>           | beam.Reshuffle()
>           | beam.FlatMap(parse_csv)
>           | beam.Map(lambda x: beam.Row(id="test-id"))
>           | SqlTransform("""
>                 SELECT
>                 id
>                 FROM PCOLLECTION""")
>         )
>     practices | beam.Map(print)
> def main():
>   builder = BeamTransformBuilder()
>   with beam.Pipeline('DirectRunner') as p:
>     builder.build(p)
> if __name__ == '__main__':
>   main()
> {code}
>  
>  Results in the error:
>  
> {code:java}
>   File 
> "/usr/local/lib/python3.9/site-packages/apache_beam/typehints/schemas.py", 
> line 185, in typing_to_runner_api
>     element_type = typing_to_runner_api(_get_args(type_)[0])
> IndexError: tuple index out of range
> {code}
>  
>  
> Tested on Python 3.9.6. 
>  
> Annoyingly, it is difficult to test this out on other python versions. 
> There's no documentation for how to set up a Docker container using 
> DirectRunner and running it locally. There's barely any documentation on what 
> python versions are supported. And using pyenv, and pip install apache-beam 
> requires a lot of other downloads that have conflicts when other versions are 
> already installed.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)
