George Sakkis created ARROW-5825:
------------------------------------

             Summary: [Python] Exceptions swallowed in ParquetManifest._visit_directories
                 Key: ARROW-5825
                 URL: https://issues.apache.org/jira/browse/ARROW-5825
             Project: Apache Arrow
          Issue Type: Bug
          Components: Python
            Reporter: George Sakkis


{{ParquetManifest._visit_directories}} uses a {{ThreadPoolExecutor}} to visit 
partitioned parquet datasets concurrently. It waits for the submitted tasks to 
finish but doesn't check whether the respective futures have failed. This is 
quite tricky to detect and debug, as an exception is either raised later as a 
side effect or (perhaps worse) passes silently.
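
A minimal sketch of the failure mode, using only the standard library. This is not the pyarrow code: {{visit}}, {{visit_all_unchecked}} and {{visit_all_checked}} are hypothetical names made up for illustration. It shows that waiting on futures does not surface worker exceptions, while calling {{result()}} on each future re-raises them.

```python
from concurrent.futures import ThreadPoolExecutor, wait


def visit(path):
    # Hypothetical stand-in for the per-directory work; fails on one input.
    if path == "bad":
        raise ValueError("cannot read " + path)
    return path


def visit_all_unchecked(paths):
    # Pattern described in this report: wait for completion, but never
    # inspect the futures, so a worker exception goes unnoticed.
    with ThreadPoolExecutor(max_workers=2) as pool:
        futures = [pool.submit(visit, p) for p in paths]
        wait(futures)
    return futures


def visit_all_checked(paths):
    # One possible fix (an assumption, not a committed patch):
    # Future.result() re-raises any exception raised in the worker.
    with ThreadPoolExecutor(max_workers=2) as pool:
        futures = [pool.submit(visit, p) for p in paths]
        return [f.result() for f in futures]
```

With these definitions, {{visit_all_unchecked(["a", "bad"])}} returns normally even though one task failed, whereas {{visit_all_checked(["a", "bad"])}} raises the {{ValueError}} in the calling thread.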

Observed on 0.12.1, but it appears to affect the latest master too.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
