Hi,
I was exploring the possibility of CTAS with spark-sql (SPARK-1.3.1) for saving
the big results into CSV formatted files for offline viewing. These are the two
things that I did
1. CREATE TABLE IF NOT EXISTS csv_dump27 ROW FORMAT DELIMITED FIELDS
TERMINATED BY ',' LINES TERMINATED BY '\
Hi Guys,
I built a ML pipeline that includes multilayer perceptron
classifier, I got the following error message when I tried to save the
pipeline model. It seems like MLPC model can not be saved which means I have
no ways to save the trained model. Is there any way to save the mode
Dan,
You could probably just register a JVM shutdown hook yourself:
https://docs.oracle.com/javase/7/docs/api/java/lang/Runtime.html#addShutdownHook(java.lang.Thread
)
This at least would let you close the connections when the application as a
whole has completed (in standalone) or when your exec
I'm seeing the same. :(
On Fri, Mar 18, 2016 at 10:57 AM Ted Yu wrote:
> I tried again this morning :
>
> $ wget
> https://s3.amazonaws.com/spark-related-packages/spark-1.6.1-bin-hadoop2.6.tgz
> --2016-03-18 07:55:30--
> https://s3.amazonaws.com/spark-related-packages/spark-1.6.1-bin-hadoop2.6.t
Name:
Zaloni's Bedrock & Mica
URL: http://www.zaloni.com/products/
Description:
Zaloni's data
lake
management platform (Bedrock) and self-service data preparation solution
(Mica) leverage Spark for
fast execution of transformations and data exploration.
It looks like the Scala 2.10 Jenkins build is working:
https://amplab.cs.berkeley.edu/jenkins/view/Spark%20QA%20Compile/job/spark-master-compile-sbt-scala-2.10/
Can you share more details about how you're compiling with 2.10 (e.g. which
commands you ran, git SHA, etc)?
On Wed, Mar 16, 2016 at 11:
Hi Reynold, thanks for the info.
On Thu, Mar 17, 2016 at 2:18 PM, Reynold Xin wrote:
> If one really feels strongly that we should go through all the overhead to
> setup an ASF subproject for these modules that won't work with the new
> structured streaming, and want to spearhead to setup separat
Any objections? Please articulate your use case. SparkEnv is a weird one
because it was documented as "private" but not marked as so in class
visibility.
* NOTE: This is not intended for external use. This is exposed for Shark
and may be made private
* in a future release.
I do see Hive
After further thought, I think following both of your suggestions- adding a
shutdown hook and making the threads non-daemon- may have the result I'm
looking for. I'll check and see if there are other reasons not to use
daemon threads in our networking internals. More generally though, what do
y'a
On Wed, Mar 16, 2016 at 3:29 PM, Mridul Muralidharan
wrote:
> b) Shuffle manager (to get shuffle reader)
>
What's the use case for shuffle manager/reader? This seems like using super
internal APIs in applications.
10 matches
Mail list logo