[ https://issues.apache.org/jira/browse/ARROW-11555?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17281076#comment-17281076 ]
Uwe Korn commented on ARROW-11555: ---------------------------------- This has {{arrow-cpp}} from defaults, this is missing HDFS support. You have a lot packages from defaults, I suggest to recreate the environment with solely using {{conda-forge}} as the package source. The {{arrow-cpp}} / {{pyarrow}} packages are currently in a broken state. > The pyarrow installation is not built with support for 'HadoopFileSystem' > ------------------------------------------------------------------------- > > Key: ARROW-11555 > URL: https://issues.apache.org/jira/browse/ARROW-11555 > Project: Apache Arrow > Issue Type: Bug > Components: Python > Affects Versions: 3.0.0 > Environment: Ubuntu 20.04 LTS > conda 4.8.3 > Python 3.7.7 > pyarrow=3.0.0 > Reporter: Nikolai Janakiev > Assignee: Uwe Korn > Priority: Minor > > When running: > {code:java} > import pyarrow as pa > pa.fs.HadoopFileSystem("node-master", port=54310) > {code} > I get the following error: > {code:java} > ImportError: The pyarrow installation is not built with support for > 'HadoopFileSystem'{code} > Installed on Ubuntu 20.04 LTS and via: > {code:java} > conda install -c conda-forge "pyarrow=3.0.0"{code} > But when I run the following command, I am able to connect to my HDFS cluster: > {code:java} > hdfs = pa.hdfs.connect('node-master', port=54310) > {code} -- This message was sent by Atlassian Jira (v8.3.4#803005)