Hi all,

I am running a job that looks like this:
---------------------------------------------------------
val data = env.readTextFile("hdfs:///....") // DataSet[(String, Array[String])]
val dataVec = computeDataVect(data) // DataSet[(String, Int, Array[(Int, Double)])]
val rescomm = computeCosSims(dataVec) // DataSet[(String, Array[(String, Double)])]

When I run it on the YARN cluster, the job completes successfully but the result is wrong. When I run it locally, in Eclipse on my computer, the result is correct.

So I split the job into two separate runs. First:
val data = env.readTextFile("hdfs:///....") // DataSet[(String, Array[String])]
val dataVec = computeDataVect(data) // DataSet[(String, Int, Array[(Int, Double)])]
dataVec.writeAsText("hdfs:///vec") // the vector written here is correct

Second:
val readVec = env.readTextFile("hdfs:///vec").map(...) // DataSet[(String, Int, Array[(Int, Double)])]
val rescomm = computeCosSims(readVec) // DataSet[(String, Array[(String, Double)])]

With the job split this way, the result on the cluster is correct and matches the local run in Eclipse.
---------------------------------------------------------
Can someone help me solve this problem?