Gregory Hayes created ARROW-5131: ------------------------------------ Summary: Add Azure Datalake Filesystem Gen1 Wrapper for pyarrow Key: ARROW-5131 URL: https://issues.apache.org/jira/browse/ARROW-5131 Project: Apache Arrow Issue Type: Wish Components: Python Affects Versions: 0.12.1 Reporter: Gregory Hayes
The current pyarrow package can only read parquet files that have been written to Gen1 Azure Datalake using the fastparquet engine. This only works if the dask-adlfs package is explicitly installed and imported. I've added a method to the dask-adlfs package, found [here|https://github.com/dask/dask-adlfs], and issued a PR for that change. To support this capability, added an ADLFSWrapper to filesystem.py file. -- This message was sent by Atlassian JIRA (v7.6.3#76005)