[ https://issues.apache.org/jira/browse/ARROW-4823?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Rok Mihevc updated ARROW-4823: ------------------------------ External issue URL: https://github.com/apache/arrow/issues/21340 > [Python] read_csv shouldn't close file handles it doesn't own > ------------------------------------------------------------- > > Key: ARROW-4823 > URL: https://issues.apache.org/jira/browse/ARROW-4823 > Project: Apache Arrow > Issue Type: Bug > Components: Python > Affects Versions: 0.12.1 > Reporter: Dave Hirschfeld > Assignee: Wes McKinney > Priority: Minor > Labels: csv, pull-request-available > Fix For: 0.14.0 > > Time Spent: 20m > Remaining Estimate: 0h > > If a file-handle is passed into `read_csv` it is automatically closed: > > {{In [47]: csv = > io.BytesIO(b'''issue_date_utc,variable_name,station_name,station_id,value_date_utc,value}} > {{ ...: 2019-02-26 22:00:00,TEMPERATURE,ARCHERFIELD,040211,2019-02-27 > 03:00,29.1}} > {{ ...: ''')}} > {{In [48]: pa.csv.read_csv(csv, convert_options=opts)}} > {{Out[48]: }} > {{pyarrow.Table}} > {{issue_date_utc: timestamp[ns]}} > {{variable_name: string}} > {{station_name: string}} > {{station_id: int64}} > {{value_date_utc: string}} > {{value: double}} > {{In [49]: csv.seek(0)}} > {{Traceback (most recent call last):}} > {{ File "<ipython-input-50-0644e6e50712>", line 1, in <module>}} > {{ csv.seek(0)}} > {{ValueError: I/O operation on closed file.}} > > This behaviour is in contrast to pandas which leaves the file handle open. > Since the function didn't create the file handle I don't think it should > close it. -- This message was sent by Atlassian Jira (v8.20.10#820010)