[build system] restarting jenkins now

2020-08-14 Thread shane knapp ☠
there isn't much activity right now, and i'd like to restart jenkins quickly as it's consuming a lot of memory on the head node. shouldn't be more than a couple of minutes downtime... if something goes awry i'll send an email here. if you don't hear from me again, please carry on. :) -- Shane

Re: Out of scope RDDs not getting cleaned up

2020-08-14 Thread jainbhavya53
Log file mentioned in the post --> FuturesTImeoutException.txt

Out of scope RDDs not getting cleaned up

2020-08-14 Thread jainbhavya53
Hi, I am using Spark 2.1 and I am leveraging Spark Streaming for my data pipeline. In my case the batch size is 3 minutes, and we persist a couple of RDDs while processing a batch; after processing, we rely on Spark's ContextCleaner to clean out RDDs which are no longer in scope. So we have s

Re: SPIP: Catalog API for view metadata

2020-08-14 Thread Walaa Eldin Moustafa
Wenchen, agreed with what you said. I was referring to situations where the underlying table schema evolves (say by introducing a nested field in a Struct), and also what you mentioned in cases of SELECT *. The Hive metastore handling of those does not automatically update view schema (even though

Re: SPIP: Catalog API for view metadata

2020-08-14 Thread Wenchen Fan
A view should have a fixed schema, like a table. It should either be inferred from the query when creating the view, or be specified by the user manually, like CREATE VIEW v(a, b) AS SELECT ... Users can still alter the view schema manually. Basically a view is just a named SQL query, which mostly has fix
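
For illustration only, here is a minimal sketch of the "schema specified by the user" case from the message above. It uses SQLite through Python's standard sqlite3 module as a stand-in, not Spark or the Hive metastore, so it says nothing about how the proposed catalog API behaves. The table t, its columns x, y, z, and the helper view_columns are hypothetical names chosen for the example.

```python
import sqlite3


def view_columns(conn):
    """Return the column names that the view v exposes (hypothetical helper)."""
    cur = conn.execute("SELECT * FROM v")
    return [d[0] for d in cur.description]


conn = sqlite3.connect(":memory:")

# Hypothetical underlying table.
conn.execute("CREATE TABLE t (x INTEGER, y TEXT)")
conn.execute("INSERT INTO t VALUES (1, 'one')")

# The view's column names are given explicitly, as in CREATE VIEW v(a, b) AS SELECT ...
# (SQLite accepts a column list on CREATE VIEW from version 3.9.0 onward).
conn.execute("CREATE VIEW v(a, b) AS SELECT x, y FROM t")

columns_before = view_columns(conn)
rows = conn.execute("SELECT a, b FROM v").fetchall()

# Evolve the underlying table; the view names its columns explicitly,
# so the new column does not appear in it.
conn.execute("ALTER TABLE t ADD COLUMN z REAL")
columns_after = view_columns(conn)

print(columns_before, rows, columns_after)
```

Because the view selects named columns rather than SELECT *, its schema stays the same when the table gains a column. That is the fixed-schema behaviour the message argues for, shown here only in SQLite.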