Hi,

I have made a number of code changes to make the SimpleApp work.
Your error messages are quite good.

package co.uk.backbutton.sparksimpleapp;
/**
 *
 * @author zahid
 */
/* SimpleApp.java */

import org.apache.spark.SparkConf;
import org.apache.spark.api.java.JavaRDD;
import org.apache.spark.api.java.JavaSparkContext;

public class SimpleApp {
  public static void main(String[] args) {
    String filename =
"/home/zahid/spark/spark-3.0.0-preview2-bin-hadoop2.7/README.md"; // Should
be some file on your system
//     SparkSession spark = SparkSession.builder().appName("Simple
Application").getOrCreate();


     SparkConf sparkConf = new SparkConf()
             .setAppName("Simple Application")
             .setMaster("local")
             .set("spark.executor.memory","2g");

        // start a spark context
        JavaSparkContext sc = new JavaSparkContext(sparkConf);

        JavaRDD<String> inputFile = sc.textFile(filename);

   /* Dataset <String> logData = sparkConf.read()
            .textFile(logFile)
            .cache();
    */

       long numAs =  inputFile.filter(s -> s.contains("a")).count();
       long numBs = inputFile.filter(s -> s.contains("b")).count();

//     long numAs = logData.filter(s -> s.contains("a")).count();
//     long numBs = logData.filter(s -> s.contains("b")).count();

    System.out.println("Lines with a: " + numAs + ", lines with b: " +
numBs);

//     spark.stop();
  }
}


cd /home/zahid/NetBeansProjects/SparkSimpleApp; JAVA_HOME=/home/zahid/jdk13
/home/zahid/netbeans/java/maven/bin/mvn "-Dexec.args=-classpath %classpath
co.uk.backbutton.sparksimpleapp.SimpleApp"
-Dexec.executable=/home/zahid/jdk13/bin/java process-classes
org.codehaus.mojo:exec-maven-plugin:1.5.0:exec
Scanning for projects...

------------------------------------------------------------------------
Building Simple Project 1.0-SNAPSHOT
------------------------------------------------------------------------

--- maven-resources-plugin:2.6:resources (default-resources) @
SparkSimpleApp ---
Using 'UTF-8' encoding to copy filtered resources.
skip non existing resourceDirectory
/home/zahid/NetBeansProjects/SparkSimpleApp/src/main/resources

--- maven-compiler-plugin:3.1:compile (default-compile) @ SparkSimpleApp ---
Changes detected - recompiling the module!
Compiling 1 source file to
/home/zahid/NetBeansProjects/SparkSimpleApp/target/classes

--- exec-maven-plugin:1.5.0:exec (default-cli) @ SparkSimpleApp ---
WARNING: An illegal reflective access operation has occurred
WARNING: Illegal reflective access by org.apache.spark.unsafe.Platform
(file:/home/zahid/.m2/repository/org/apache/spark/spark-unsafe_2.11/2.4.4/spark-unsafe_2.11-2.4.4.jar)
to method java.nio.Bits.unaligned()
WARNING: Please consider reporting this to the maintainers of
org.apache.spark.unsafe.Platform
WARNING: Use --illegal-access=warn to enable warnings of further illegal
reflective access operations
WARNING: All illegal access operations will be denied in a future release
Using Spark's default log4j profile:
org/apache/spark/log4j-defaults.properties
20/03/01 00:15:52 INFO SparkContext: Running Spark version 2.4.4
20/03/01 00:15:52 WARN NativeCodeLoader: Unable to load native-hadoop
library for your platform... using builtin-java classes where applicable
20/03/01 00:15:52 INFO SparkContext: Submitted application: Simple
Application
20/03/01 00:15:52 INFO SecurityManager: Changing view acls to: zahid
20/03/01 00:15:52 INFO SecurityManager: Changing modify acls to: zahid
20/03/01 00:15:52 INFO SecurityManager: Changing view acls groups to:
20/03/01 00:15:52 INFO SecurityManager: Changing modify acls groups to:
20/03/01 00:15:52 INFO SecurityManager: SecurityManager: authentication
disabled; ui acls disabled; users  with view permissions: Set(zahid);
groups with view permissions: Set(); users  with modify permissions:
Set(zahid); groups with modify permissions: Set()
20/03/01 00:15:52 INFO Utils: Successfully started service 'sparkDriver' on
port 34347.
20/03/01 00:15:52 INFO SparkEnv: Registering MapOutputTracker
20/03/01 00:15:52 INFO SparkEnv: Registering BlockManagerMaster
20/03/01 00:15:52 INFO BlockManagerMasterEndpoint: Using
org.apache.spark.storage.DefaultTopologyMapper for getting topology
information
20/03/01 00:15:52 INFO BlockManagerMasterEndpoint:
BlockManagerMasterEndpoint up
20/03/01 00:15:52 INFO DiskBlockManager: Created local directory at
/tmp/blockmgr-4e60ffbd-a384-434f-9033-63506d90acf7
20/03/01 00:15:52 INFO MemoryStore: MemoryStore started with capacity 987.6
MB
20/03/01 00:15:52 INFO SparkEnv: Registering OutputCommitCoordinator
20/03/01 00:15:53 INFO Utils: Successfully started service 'SparkUI' on
port 4040.
20/03/01 00:15:53 INFO SparkUI: Bound SparkUI to 0.0.0.0, and started at
http://192.168.0.42:4040
20/03/01 00:15:53 INFO Executor: Starting executor ID driver on host
localhost
20/03/01 00:15:53 INFO Utils: Successfully started service
'org.apache.spark.network.netty.NettyBlockTransferService' on port 40923.
20/03/01 00:15:53 INFO NettyBlockTransferService: Server created on
192.168.0.42:40923
20/03/01 00:15:53 INFO BlockManager: Using
org.apache.spark.storage.RandomBlockReplicationPolicy for block replication
policy
20/03/01 00:15:53 INFO BlockManagerMaster: Registering BlockManager
BlockManagerId(driver, 192.168.0.42, 40923, None)
20/03/01 00:15:53 INFO BlockManagerMasterEndpoint: Registering block
manager 192.168.0.42:40923 with 987.6 MB RAM, BlockManagerId(driver,
192.168.0.42, 40923, None)
20/03/01 00:15:53 INFO BlockManagerMaster: Registered BlockManager
BlockManagerId(driver, 192.168.0.42, 40923, None)
20/03/01 00:15:53 INFO BlockManager: Initialized BlockManager:
BlockManagerId(driver, 192.168.0.42, 40923, None)
20/03/01 00:15:53 INFO MemoryStore: Block broadcast_0 stored as values in
memory (estimated size 107.2 KB, free 987.5 MB)
20/03/01 00:15:53 INFO MemoryStore: Block broadcast_0_piece0 stored as
bytes in memory (estimated size 20.4 KB, free 987.5 MB)
20/03/01 00:15:53 INFO BlockManagerInfo: Added broadcast_0_piece0 in memory
on 192.168.0.42:40923 (size: 20.4 KB, free: 987.6 MB)
20/03/01 00:15:53 INFO SparkContext: Created broadcast 0 from textFile at
SimpleApp.java:31
20/03/01 00:15:53 INFO FileInputFormat: Total input paths to process : 1
20/03/01 00:15:53 INFO SparkContext: Starting job: count at
SimpleApp.java:38
20/03/01 00:15:53 INFO DAGScheduler: Got job 0 (count at SimpleApp.java:38)
with 1 output partitions
20/03/01 00:15:53 INFO DAGScheduler: Final stage: ResultStage 0 (count at
SimpleApp.java:38)
20/03/01 00:15:53 INFO DAGScheduler: Parents of final stage: List()
20/03/01 00:15:53 INFO DAGScheduler: Missing parents: List()
20/03/01 00:15:53 INFO DAGScheduler: Submitting ResultStage 0
(MapPartitionsRDD[2] at filter at SimpleApp.java:38), which has no missing
parents
20/03/01 00:15:53 INFO MemoryStore: Block broadcast_1 stored as values in
memory (estimated size 4.2 KB, free 987.5 MB)
20/03/01 00:15:53 INFO MemoryStore: Block broadcast_1_piece0 stored as
bytes in memory (estimated size 2.5 KB, free 987.5 MB)
20/03/01 00:15:53 INFO BlockManagerInfo: Added broadcast_1_piece0 in memory
on 192.168.0.42:40923 (size: 2.5 KB, free: 987.6 MB)
20/03/01 00:15:53 INFO SparkContext: Created broadcast 1 from broadcast at
DAGScheduler.scala:1161
20/03/01 00:15:53 INFO DAGScheduler: Submitting 1 missing tasks from
ResultStage 0 (MapPartitionsRDD[2] at filter at SimpleApp.java:38) (first
15 tasks are for partitions Vector(0))
20/03/01 00:15:53 INFO TaskSchedulerImpl: Adding task set 0.0 with 1 tasks
20/03/01 00:15:53 INFO TaskSetManager: Starting task 0.0 in stage 0.0 (TID
0, localhost, executor driver, partition 0, PROCESS_LOCAL, 7927 bytes)
20/03/01 00:15:53 INFO Executor: Running task 0.0 in stage 0.0 (TID 0)
20/03/01 00:15:53 INFO HadoopRDD: Input split:
file:/home/zahid/spark/spark-3.0.0-preview2-bin-hadoop2.7/README.md:0+4666
20/03/01 00:15:53 INFO Executor: Finished task 0.0 in stage 0.0 (TID 0).
832 bytes result sent to driver
20/03/01 00:15:53 INFO TaskSetManager: Finished task 0.0 in stage 0.0 (TID
0) in 86 ms on localhost (executor driver) (1/1)
20/03/01 00:15:53 INFO TaskSchedulerImpl: Removed TaskSet 0.0, whose tasks
have all completed, from pool
20/03/01 00:15:53 INFO DAGScheduler: ResultStage 0 (count at
SimpleApp.java:38) finished in 0.162 s
20/03/01 00:15:53 INFO DAGScheduler: Job 0 finished: count at
SimpleApp.java:38, took 0.194265 s
20/03/01 00:15:53 INFO SparkContext: Starting job: count at
SimpleApp.java:39
20/03/01 00:15:53 INFO DAGScheduler: Got job 1 (count at SimpleApp.java:39)
with 1 output partitions
20/03/01 00:15:53 INFO DAGScheduler: Final stage: ResultStage 1 (count at
SimpleApp.java:39)
20/03/01 00:15:53 INFO DAGScheduler: Parents of final stage: List()
20/03/01 00:15:53 INFO DAGScheduler: Missing parents: List()
20/03/01 00:15:53 INFO DAGScheduler: Submitting ResultStage 1
(MapPartitionsRDD[3] at filter at SimpleApp.java:39), which has no missing
parents
20/03/01 00:15:53 INFO MemoryStore: Block broadcast_2 stored as values in
memory (estimated size 4.2 KB, free 987.5 MB)
20/03/01 00:15:53 INFO MemoryStore: Block broadcast_2_piece0 stored as
bytes in memory (estimated size 2.5 KB, free 987.5 MB)
20/03/01 00:15:53 INFO BlockManagerInfo: Added broadcast_2_piece0 in memory
on 192.168.0.42:40923 (size: 2.5 KB, free: 987.6 MB)
20/03/01 00:15:53 INFO SparkContext: Created broadcast 2 from broadcast at
DAGScheduler.scala:1161
20/03/01 00:15:53 INFO DAGScheduler: Submitting 1 missing tasks from
ResultStage 1 (MapPartitionsRDD[3] at filter at SimpleApp.java:39) (first
15 tasks are for partitions Vector(0))
20/03/01 00:15:53 INFO TaskSchedulerImpl: Adding task set 1.0 with 1 tasks
20/03/01 00:15:53 INFO TaskSetManager: Starting task 0.0 in stage 1.0 (TID
1, localhost, executor driver, partition 0, PROCESS_LOCAL, 7927 bytes)
20/03/01 00:15:53 INFO Executor: Running task 0.0 in stage 1.0 (TID 1)
20/03/01 00:15:53 INFO HadoopRDD: Input split:
file:/home/zahid/spark/spark-3.0.0-preview2-bin-hadoop2.7/README.md:0+4666
20/03/01 00:15:53 INFO Executor: Finished task 0.0 in stage 1.0 (TID 1).
875 bytes result sent to driver
20/03/01 00:15:53 INFO TaskSetManager: Finished task 0.0 in stage 1.0 (TID
1) in 19 ms on localhost (executor driver) (1/1)
20/03/01 00:15:53 INFO TaskSchedulerImpl: Removed TaskSet 1.0, whose tasks
have all completed, from pool
20/03/01 00:15:53 INFO DAGScheduler: ResultStage 1 (count at
SimpleApp.java:39) finished in 0.032 s
20/03/01 00:15:53 INFO DAGScheduler: Job 1 finished: count at
SimpleApp.java:39, took 0.035838 s
Lines with a: 65, lines with b: 33
20/03/01 00:15:53 INFO SparkContext: Invoking stop() from shutdown hook
20/03/01 00:15:53 INFO SparkUI: Stopped Spark web UI at
http://192.168.0.42:4040
20/03/01 00:15:53 INFO MapOutputTrackerMasterEndpoint:
MapOutputTrackerMasterEndpoint stopped!
20/03/01 00:15:53 INFO MemoryStore: MemoryStore cleared
20/03/01 00:15:53 INFO BlockManager: BlockManager stopped
20/03/01 00:15:53 INFO BlockManagerMaster: BlockManagerMaster stopped
20/03/01 00:15:53 INFO
OutputCommitCoordinator$OutputCommitCoordinatorEndpoint:
OutputCommitCoordinator stopped!
20/03/01 00:15:53 INFO SparkContext: Successfully stopped SparkContext
20/03/01 00:15:53 INFO ShutdownHookManager: Shutdown hook called
20/03/01 00:15:53 INFO ShutdownHookManager: Deleting directory
/tmp/spark-f955ef19-35a5-43b3-9e05-4fe0cc25590d
------------------------------------------------------------------------
BUILD SUCCESS
------------------------------------------------------------------------
Total time: 3.547 s
Finished at: 2020-03-01T00:15:53+00:00
Final Memory: 39M/132M
------------------------------------------------------------------------


Backbutton.co.uk
¯\_(ツ)_/¯
♡۶Java♡۶RMI ♡۶
Make Use Method {MUM}
makeuse.org
<http://www.backbutton.co.uk>

Reply via email to