Re: beam rebuilds numpy on pipeline run

2020-10-15 Thread Ross Vandegrift
On Fri, 2020-10-09 at 19:10 +, Ross Vandegrift wrote: > Starting today, running a beam pipeline triggers a large reinstallation of > python modules. For some reason, it forces full rebuilds from source - > since > beam depends on numpy, this takes a long time. I opened a support ticket with G

Re: beam rebuilds numpy on pipeline run

2020-10-09 Thread Brian Hulette
+Valentyn Tymofieiev This sounds like it's related to ARROW-8983 (pyarrow takes a long time to download after 0.16.0), discussed on the arrow dev list [2]. I'm not sure what would've triggered this to start happening for you today though. [1] https://issues.apache.org/jira/browse/ARROW-8983 [2]