Maybe a possible fix:
https://stackoverflow.com/questions/31495657/development-build-of-pandas-giving-importerror-c-extension-hashtable-not-bui


Hyukjin Kwon wrote
> Hi all,
> 
> I am seeing flaky Python tests time to time and if I am not mistaken
> mostly
> in amp-jenkins-worker-05:
> 
> 
> ======================================================================
> ERROR: test_filtered_frame (pyspark.sql.tests.ArrowTests)
> ----------------------------------------------------------------------
> Traceback (most recent call last):
>   File
> "/home/anaconda/envs/py3k/lib/python3.4/site-packages/pandas/__init__.py",
> line 25, in 
> <module>
>     from pandas import hashtable, tslib, lib
> ImportError: cannot import name 'hashtable'
> 
> During handling of the above exception, another exception occurred:
> 
> Traceback (most recent call last):
>   File
> "/home/jenkins/workspace/SparkPullRequestBuilder/python/pyspark/sql/tests.py",
> line 3057, in test_filtered_frame
>     pdf = df.filter("i < 0").toPandas()
>   File
> "/home/jenkins/workspace/SparkPullRequestBuilder/python/pyspark/sql/dataframe.py",
> line 1727, in toPandas
>     import pandas as pd
>   File
> "/home/anaconda/envs/py3k/lib/python3.4/site-packages/pandas/__init__.py",
> line 31, in 
> <module>
>     "the C extensions first.".format(module))
> ImportError: C extension: 'hashtable' not built. If you want to import
> pandas from the source directory, you may need to run 'python setup.py
> build_ext --inplace --force' to build the C extensions first.
> 
> ======================================================================
> ERROR: test_null_conversion (pyspark.sql.tests.ArrowTests)
> ----------------------------------------------------------------------
> ...
> 
> ======================================================================
> ERROR: test_pandas_round_trip (pyspark.sql.tests.ArrowTests)
> ----------------------------------------------------------------------
> ...
> 
> ======================================================================
> ERROR: test_toPandas_arrow_toggle (pyspark.sql.tests.ArrowTests)
> ----------------------------------------------------------------------
> ...
> 
> 
> I sounds environment problem apparently due to missing hashtable (which I
> believe should have been compiled and importable properly).
> 
> I suspect few possibilities such as a bug somewhere or unsuccessful manual
> build from Pandas source but I am unable to reproduce this and check this.
> So, yes. This is rather my guess.
> 
> 
> Does anyone know if this is an environment problem and how to fix this?





-----
Liang-Chi Hsieh | @viirya 
Spark Technology Center 
http://www.spark.tc/ 
--
View this message in context: 
http://apache-spark-developers-list.1001551.n3.nabble.com/Question-Flaky-tests-pyspark-sql-tests-ArrowTests-tests-in-Jenkins-worker-5-tp22085p22086.html
Sent from the Apache Spark Developers List mailing list archive at Nabble.com.

---------------------------------------------------------------------
To unsubscribe e-mail: dev-unsubscr...@spark.apache.org

Reply via email to