[ https://issues.apache.org/jira/browse/ARROW-4858?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Rok Mihevc updated ARROW-4858: ------------------------------ External issue URL: https://github.com/apache/arrow/issues/21369 > [Flight][Python] Enable custom FlightDataStream in Python > --------------------------------------------------------- > > Key: ARROW-4858 > URL: https://issues.apache.org/jira/browse/ARROW-4858 > Project: Apache Arrow > Issue Type: Improvement > Components: FlightRPC, Python > Reporter: David Li > Assignee: David Li > Priority: Minor > Labels: flight, pull-request-available > Fix For: 0.13.0 > > Time Spent: 2h > Remaining Estimate: 0h > > We should be able to provide a custom data stream as the result of Flight > do_get in Python. In particular, when returning data produced on the fly, or > when returning a large Pandas DataFrame, it'd be nice to provide data in > chunks as it becomes available, rather than having to copy everything into a > Table first. > On the Python side, a FlightDataStream wrapper that accepts RecordBatches > from a Python generator should suffice. -- This message was sent by Atlassian Jira (v8.20.10#820010)