Hi all,

I am running the following job:

---------------------------------------------------------
val data = env.readTextFile("hdfs:///....")  // DataSet[(String,Array[String])]
val dataVec = computeDataVect(data)          // DataSet[(String,Int,Array[(Int,Double)])]
val rescomm = computeCosSims(dataVec)        // DataSet[(String,Array[(String,Double)])]

When I run it on the YARN cluster, the job completes successfully but the result is wrong. When I run it locally, in Eclipse on my computer, the result is correct.

So I split the job into two separate runs on the cluster.

First run:

val data = env.readTextFile("hdfs:///....")  // DataSet[(String,Array[String])]
val dataVec = computeDataVect(data)          // DataSet[(String,Int,Array[(Int,Double)])]
dataVec.writeAsText("hdfs:///vec")           // the vector is correct

Second run:

val readVec = env.readTextFile("hdfs:///vec").map(...)  // DataSet[(String,Int,Array[(Int,Double)])]
val rescomm = computeCosSims(readVec)                   // DataSet[(String,Array[(String,Double)])]

This time the result is correct: it is the same as the result I get locally in Eclipse.
----------------------------------
Can someone help me solve this problem?
