An observation Once compiled with MVN the job submit works as follows:
+ /usr/lib/spark-1.5.2-bin-hadoop2.6/bin/spark-submit --packages com.databricks:spark-csv_2.11:1.3.0 --class ImportCSV --master spark:// 50.140.197.217:7077 --executor-memory=12G --executor-cores=12 --num-executors=2 *target/scala-1.0.jar* With sbt it takes this form + /usr/lib/spark-1.5.2-bin-hadoop2.6/bin/spark-submit --packages com.databricks:spark-csv_2.11:1.3.0 --class ImportCSV --master spark:// 50.140.197.217:7077 --executor-memory=12G --executor-cores=12 --num-executors=2 *target/scala-2.10/importcsv_2.10-1.0.jar* They both return the same results. However, why mvnjar file name is different (may be a naive question!)? thanks Dr Mich Talebzadeh LinkedIn * https://www.linkedin.com/profile/view?id=AAEAAAAWh2gBxianrbJd6zP6AcPCCdOABUrV8Pw <https://www.linkedin.com/profile/view?id=AAEAAAAWh2gBxianrbJd6zP6AcPCCdOABUrV8Pw>* http://talebzadehmich.wordpress.com On 15 March 2016 at 22:43, Mich Talebzadeh <mich.talebza...@gmail.com> wrote: > Many thanks Ted and thanks for heads up Jakob > > Just these two changes to dependencies > > <dependency> > <groupId>org.apache.spark</groupId> > <artifactId>spark-core*_2.10*</artifactId> > <version>1.5.1</version> > </dependency> > <dependency> > <groupId>org.apache.spark</groupId> > <artifactId>spark-sql*_2.10*</artifactId> > <version>1.5.1</version> > </dependency> > > > [DEBUG] endProcessChildren: artifact=spark:scala:jar:1.0 > [INFO] > ------------------------------------------------------------------------ > [INFO] BUILD SUCCESS > [INFO] > ------------------------------------------------------------------------ > [INFO] Total time: 01:04 min > [INFO] Finished at: 2016-03-15T22:55:08+00:00 > [INFO] Final Memory: 32M/1089M > [INFO] > ------------------------------------------------------------------------ > > Dr Mich Talebzadeh > > > > LinkedIn * > https://www.linkedin.com/profile/view?id=AAEAAAAWh2gBxianrbJd6zP6AcPCCdOABUrV8Pw > <https://www.linkedin.com/profile/view?id=AAEAAAAWh2gBxianrbJd6zP6AcPCCdOABUrV8Pw>* > > > > http://talebzadehmich.wordpress.com > > > > On 15 March 2016 at 22:18, Jakob Odersky <ja...@odersky.com> wrote: > >> Hi Mich, >> probably unrelated to the current error you're seeing, however the >> following dependencies will bite you later: >> <artifactId>spark-hive_2.10</artifactId> >> <artifactId>spark-csv_2.11</artifactId> >> the problem here is that you're using libraries built for different >> Scala binary versions (the numbers after the underscore). The simple >> fix here is to specify the Scala binary version you're project builds >> for (2.10 in your case, however note that version is EOL, you should >> upgrade to scala 2.11.8 if possible). >> >> On a side note, sbt takes care of handling correct scala versions for >> you (the double %% actually is a shorthand for appending >> "_scalaBinaryVersion" to your dependency). It also enables you to >> build and publish your project seamlessly against multiple versions. I >> would strongly recommend to use it in Scala projects. >> >> cheers, >> --Jakob >> >> >> >> On Tue, Mar 15, 2016 at 3:08 PM, Mich Talebzadeh >> <mich.talebza...@gmail.com> wrote: >> > Hi, >> > >> > I normally use sbt and using this sbt file works fine for me >> > >> > cat ImportCSV.sbt >> > name := "ImportCSV" >> > version := "1.0" >> > scalaVersion := "2.10.4" >> > libraryDependencies += "org.apache.spark" %% "spark-core" % "1.5.1" >> > libraryDependencies += "org.apache.spark" %% "spark-sql" % "1.5.1" >> > libraryDependencies += "org.apache.spark" %% "spark-hive" % "1.5.1" >> > libraryDependencies += "com.databricks" % "spark-csv_2.11" % "1.3.0" >> > >> > This is my first trial using Mavan and pom >> > >> > >> > my pom.xml file looks like this but throws error at build >> > >> > >> > [DEBUG] com.univocity:univocity-parsers:jar:1.5.1:compile >> > [INFO] >> > ------------------------------------------------------------------------ >> > [INFO] BUILD FAILURE >> > [INFO] >> > ------------------------------------------------------------------------ >> > [INFO] Total time: 1.326 s >> > [INFO] Finished at: 2016-03-15T22:17:29+00:00 >> > [INFO] Final Memory: 14M/455M >> > [INFO] >> > ------------------------------------------------------------------------ >> > [ERROR] Failed to execute goal on project scala: Could not resolve >> > dependencies for project spark:scala:jar:1.0: The following artifacts >> could >> > not be resolved: org.apache.spark:spark-core:jar:1.5.1, >> > org.apache.spark:spark-sql:jar:1.5.1: Failure to find >> > org.apache.spark:spark-core:jar:1.5.1 in >> > https://repo.maven.apache.org/maven2 was cached in the local >> repository, >> > resolution will not be reattempted until the update interval of central >> has >> > elapsed or updates are forced -> [Help 1] >> > >> > >> > My pom file is >> > >> > >> > cat pom.xml >> > <project xmlns="http://maven.apache.org/POM/4.0.0" >> > xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" >> > xsi:schemaLocation="http://maven.apache.org/POM/4.0.0 >> > http://maven.apache.org/maven-v4_0_0.xsd"> >> > <modelVersion>4.0.0</modelVersion> >> > <groupId>spark</groupId> >> > <version>1.0</version> >> > <name>${project.artifactId}</name> >> > >> > <properties> >> > <maven.compiler.source>1.7</maven.compiler.source> >> > <maven.compiler.target>1.7</maven.compiler.target> >> > <encoding>UTF-8</encoding> >> > <scala.version>2.10.4</scala.version> >> > <maven-scala-plugin.version>2.15.2</maven-scala-plugin.version> >> > </properties> >> > >> > <dependencies> >> > <dependency> >> > <groupId>org.scala-lang</groupId> >> > <artifactId>scala-library</artifactId> >> > <version>2.10.2</version> >> > </dependency> >> > <dependency> >> > <groupId>org.apache.spark</groupId> >> > <artifactId>spark-core</artifactId> >> > <version>1.5.1</version> >> > </dependency> >> > <dependency> >> > <groupId>org.apache.spark</groupId> >> > <artifactId>spark-sql</artifactId> >> > <version>1.5.1</version> >> > </dependency> >> > <dependency> >> > <groupId>org.apache.spark</groupId> >> > <artifactId>spark-hive_2.10</artifactId> >> > <version>1.5.0</version> >> > </dependency> >> > <dependency> >> > <groupId>com.databricks</groupId> >> > <artifactId>spark-csv_2.11</artifactId> >> > <version>1.3.0</version> >> > </dependency> >> > </dependencies> >> > >> > <build> >> > <sourceDirectory>src/main/scala</sourceDirectory> >> > <plugins> >> > <plugin> >> > <groupId>org.scala-tools</groupId> >> > <artifactId>maven-scala-plugin</artifactId> >> > <version>${maven-scala-plugin.version}</version> >> > <executions> >> > <execution> >> > <goals> >> > <goal>compile</goal> >> > </goals> >> > </execution> >> > </executions> >> > <configuration> >> > <jvmArgs> >> > <jvmArg>-Xms64m</jvmArg> >> > <jvmArg>-Xmx1024m</jvmArg> >> > </jvmArgs> >> > </configuration> >> > </plugin> >> > <plugin> >> > <groupId>org.apache.maven.plugins</groupId> >> > <artifactId>maven-shade-plugin</artifactId> >> > <version>1.6</version> >> > <executions> >> > <execution> >> > <phase>package</phase> >> > <goals> >> > <goal>shade</goal> >> > </goals> >> > <configuration> >> > <filters> >> > <filter> >> > <artifact>*:*</artifact> >> > <excludes> >> > <exclude>META-INF/*.SF</exclude> >> > <exclude>META-INF/*.DSA</exclude> >> > <exclude>META-INF/*.RSA</exclude> >> > </excludes> >> > </filter> >> > </filters> >> > <transformers> >> > <transformer >> > >> implementation="org.apache.maven.plugins.shade.resource.ManifestResourceTransformer"> >> > <mainClass>com.group.id.Launcher1</mainClass> >> > </transformer> >> > </transformers> >> > </configuration> >> > </execution> >> > </executions> >> > </plugin> >> > </plugins> >> > </build> >> > >> > <artifactId>scala</artifactId> >> > </project> >> > >> > >> > I am sure I have omitted something? >> > >> > >> > Thanks >> > >> > >> > Dr Mich Talebzadeh >> > >> > >> > >> > LinkedIn >> > >> https://www.linkedin.com/profile/view?id=AAEAAAAWh2gBxianrbJd6zP6AcPCCdOABUrV8Pw >> > >> > >> > >> > http://talebzadehmich.wordpress.com >> > >> > >> > >