Hi all,

I am doing some operations on a DataSet<Tuple3<Integer,Integer,Double>> … (see code below). When I run my program on a cluster with 3 machines, I can see in the web client that only my master is executing the program. Do I have to specify somewhere that all machines have to participate? Usually the cluster executes in parallel.
Any suggestions?

Best regards,
Lydia

DataSet<Tuple3<Integer, Integer, Double>> matrixA = readMatrix(env, input);
DataSet<Tuple3<Integer, Integer, Double>> initial = matrixA.groupBy(0).sum(2);

// normalize by maximum value
initial = initial.cross(initial.max(2)).map(new normalizeByMax());

matrixA.join(initial).where(1).equalTo(0)
       .map(new ProjectJoinResultMapper())
       .groupBy(0, 1).sum(2);