Re: File not found exceptions on S3 while running spark jobs

2020-07-17 Thread Hulio andres
Most likely directory write permission not permission. The app user doesn't have permission to write files to that directory. > Sent: Friday, July 17, 2020 at 6:03 PM > From: "Nagendra Darla" > To: "Hulio andres" > Cc: user@spark.apache.org > Subject: R

Re: File not found exceptions on S3 while running spark jobs

2020-07-17 Thread Slava Rodionov
te: > >> >> https://examples.javacodegeeks.com/java-io-filenotfoundexception-how-to-solve-file-not-found-exception/ >> >> Are you a programmer ? >> >> Regards, >> >> Hulio >> >> >> >> > Sent: Friday, July 17, 2020 at 2:41 A

Re: File not found exceptions on S3 while running spark jobs

2020-07-17 Thread Nagendra Darla
error with spark jobs which create / updates / deletes lots of files on S3 buckets. On Thu, Jul 16, 2020 at 10:28 PM Hulio andres wrote: > > https://examples.javacodegeeks.com/java-io-filenotfoundexception-how-to-solve-file-not-found-exception/ > > Are you a programmer ? > >

Re: File not found exceptions on S3 while running spark jobs

2020-07-16 Thread Hulio andres
https://examples.javacodegeeks.com/java-io-filenotfoundexception-how-to-solve-file-not-found-exception/ Are you a programmer ? Regards, Hulio > Sent: Friday, July 17, 2020 at 2:41 AM > From: "Nagendra Darla" > To: user@spark.apache.org > Subject: File not found

File not found exceptions on S3 while running spark jobs

2020-07-16 Thread Nagendra Darla
Hello All, I am converting existing parquet table (size: 50GB) into Delta format. It took around 1hr 45 mins to convert. And I see that there are lot of FileNotFoundExceptions in the logs Caused by: java.io.FileNotFoundException: No such file or directory: s3a://old-data/delta-data/PL1/output/deno

Re: File Not Found: /tmp/spark-events in Spark 3.0

2020-07-05 Thread ArtemisDev
<18183124...@163.com>; *Date:* Thu, Jul 2, 2020 08:39 PM *To:* "user"; *Subject:* Re: File Not Found: /tmp/spark-events in Spark 3.0 Hi, First, the /tmp/spark-events is the default storage location of spark eventLog, but the log is stored only when you set the 'spark.eventLog.en

Re: File Not Found: /tmp/spark-events in Spark 3.0

2020-07-02 Thread Xin Jinhan
Hi, First, the '/tmp/spark-events' is the default storage location of spark eventLog, but the log will be stored in it only when the 'spark.eventLog.enabled' is true, which your spark 2.4.6 may set to false. So you can try to set false and the error may disappear. Second, I suggest enable eventL

Re: File Not Found: /tmp/spark-events in Spark 3.0

2020-07-02 Thread Zero
This could be the result of you not setting the location of eventLog properly. By default, it's/TMP/Spark-Events, and since the files in the/TMP directory are cleaned up regularly, you could have this problem. -- Original -- From: "Xin Jinhan"<18183124...@163.com

Re: File Not Found: /tmp/spark-events in Spark 3.0

2020-07-02 Thread Xin Jinhan
Hi, First, the /tmp/spark-events is the default storage location of spark eventLog, but the log is stored only when you set the 'spark.eventLog.enabled=true', which maybe your spark 2.4.6 set to false. So you can just set it to false and the error will disappear. Second, I suggest to open the e

Re: File Not Found: /tmp/spark-events in Spark 3.0

2020-06-30 Thread Jeff Evans
luster (Spark 3.0 with multiple workers without hadoop), we have > encountered a Spark interpreter exception caused by a I/O File Not Found > exception due to the non-existence of the /tmp/spark-events directory. > We had to create the /tmp/spark-events directory manually in order to > r

File Not Found: /tmp/spark-events in Spark 3.0

2020-06-29 Thread ArtemisDev
While launching a spark job from Zeppelin against a standalone spark cluster (Spark 3.0 with multiple workers without hadoop), we have encountered a Spark interpreter exception caused by a I/O File Not Found exception due to the non-existence of the /tmp/spark-events directory.  We had to

Re: SparkStraming job break with shuffle file not found

2018-03-28 Thread Lucas Kacher
I have been running into this as well, but I am using S3 for checkpointing so I chalked it up to network partitioning with s3-isnt-hdfs as my storage location. But it seems that you are indeed using hdfs, so I wonder if there is another underlying issue. On Wed, Mar 28, 2018 at 8:21 AM, Jone Zhang

SparkStraming job break with shuffle file not found

2018-03-28 Thread Jone Zhang
The spark streaming job running for a few days,then fail as below What is the possible reason? *18/03/25 07:58:37 ERROR yarn.ApplicationMaster: User class threw exception: org.apache.spark.SparkException: Job aborted due to stage failure: Task 16 in stage 80018.0 failed 4 times, most recent failur

Re: Spark Job crash due to File Not found when shuffle intermittently

2017-07-25 Thread Martin Peng
cool~ Thanks Kang! I will check and let you know. Sorry for delay as there is an urgent customer issue today. Best Martin 2017-07-24 22:15 GMT-07:00 周康 : > * If the file exists but is a directory rather than a regular file, does > * not exist but cannot be created, or cannot be opened for any ot

Re: Spark Job crash due to File Not found when shuffle intermittently

2017-07-24 Thread 周康
* If the file exists but is a directory rather than a regular file, does * not exist but cannot be created, or cannot be opened for any other * reason then a FileNotFoundException is thrown. After searching into FileOutputStream i saw this annotation.So you can check executor node first(may be no

Re: Spark Job crash due to File Not found when shuffle intermittently

2017-07-24 Thread 周康
You can also check whether space left in the executor node enough to store shuffle file or not. 2017-07-25 13:01 GMT+08:00 周康 : > First,spark will handle task fail so if job ended normally , this error > can be ignore. > Second, when using BypassMergeSortShuffleWriter, it will first write data >

Re: Spark Job crash due to File Not found when shuffle intermittently

2017-07-24 Thread 周康
First,spark will handle task fail so if job ended normally , this error can be ignore. Second, when using BypassMergeSortShuffleWriter, it will first write data file then write an index file. You can check "Failed to delete temporary index file at" or "fail to rename file" in related executor node'

Re: Spark Job crash due to File Not found when shuffle intermittently

2017-07-24 Thread Martin Peng
Is there anyone at share me some lights about this issue? Thanks Martin 2017-07-21 18:58 GMT-07:00 Martin Peng : > Hi, > > I have several Spark jobs including both batch job and Stream jobs to > process the system log and analyze them. We are using Kafka as the pipeline > to connect each jobs. >

Spark Job crash due to File Not found when shuffle intermittently

2017-07-21 Thread Martin Peng
Hi, I have several Spark jobs including both batch job and Stream jobs to process the system log and analyze them. We are using Kafka as the pipeline to connect each jobs. Once upgrade to Spark 2.1.0 + Spark Kafka Streaming 010, I found some of the jobs(both batch or streaming) are thrown below e

RE: spark-submit: file not found exception occurs

2017-06-16 Thread LisTree Team
you may use hdfs file not local file under yarn. Original Message Subject: spark-submit: file not found exception occurs From: Shupeng Geng <shupeng.g...@envisioncn.com> Date: Thu, June 15, 2017 8:14 pm To: "user@spark.apache.org" <user@spark.apache.org>,

spark-submit: file not found exception occurs

2017-06-15 Thread Shupeng Geng
Hi, everyone, An annoying problem occurs to me. When submitting a spark job, the jar file not found exception is thrown as follows: does not existread "main" java.io.FileNotFoundException: File file:/home/algo/shupeng/eeop_bridger/EeopBridger-1.0-SNAPSHOT.jar

Re: File not found exception while reading from folder using textFileStream

2016-05-18 Thread Saisai Shao
>From my understanding, we should copy the file into another folder and move to source folder after copy is finished, otherwise we will read the half-copied data or meet the issue as you mentioned above. On Wed, May 18, 2016 at 8:32 PM, Ted Yu wrote: > The following should handle the situation y

Re: File not found exception while reading from folder using textFileStream

2016-05-18 Thread Ted Yu
The following should handle the situation you encountered: diff --git a/streaming/src/main/scala/org/apache/spark/streaming/dstream/FileInputDStream.scala b/streaming/src/main/scala/org/apache/spark/streaming/dstream/FileInputDStream.sca index ed93058..f79420b 100644 --- a/streaming/src/main/scala

File not found exception while reading from folder using textFileStream

2016-05-18 Thread Yogesh Vyas
Hi, I am trying to read the files in a streaming way using Spark Streaming. For this I am copying files from my local folder to the source folder from where spark reads the file. After reading and printing some of the files, it gives the following error: Caused by: org.apache.hadoop.ipc.RemoteExce

Re: File not found error running query in spark-shell

2015-12-16 Thread Jakob Odersky
For future reference, this should be fixed with PR #10337 ( https://github.com/apache/spark/pull/10337) On 16 December 2015 at 11:01, Jakob Odersky wrote: > Yeah, the same kind of error actually happens in the JIRA. It actually > succeeds but a load of exceptions are thrown. Subsequent runs don'

Re: File not found error running query in spark-shell

2015-12-16 Thread Jakob Odersky
Yeah, the same kind of error actually happens in the JIRA. It actually succeeds but a load of exceptions are thrown. Subsequent runs don't produce any errors anymore. On 16 December 2015 at 10:55, Ted Yu wrote: > The first run actually worked. It was the amount of exceptions preceding > the resu

Re: File not found error running query in spark-shell

2015-12-16 Thread Ted Yu
The first run actually worked. It was the amount of exceptions preceding the result that surprised me. I want to see if there is a way of getting rid of the exceptions. Thanks On Wed, Dec 16, 2015 at 10:53 AM, Jakob Odersky wrote: > When you re-run the last statement a second time, does it wor

Re: File not found error running query in spark-shell

2015-12-16 Thread Jakob Odersky
When you re-run the last statement a second time, does it work? Could it be related to https://issues.apache.org/jira/browse/SPARK-12350 ? On 16 December 2015 at 10:39, Ted Yu wrote: > Hi, > I used the following command on a recently refreshed checkout of master > branch: > > ~/apache-maven-3.3.

File not found error running query in spark-shell

2015-12-16 Thread Ted Yu
Hi, I used the following command on a recently refreshed checkout of master branch: ~/apache-maven-3.3.3/bin/mvn -Phive -Phive-thriftserver -Pyarn -Phadoop-2.4 -Dhadoop.version=2.7.0 package -DskipTests I was then running simple query in spark-shell: Seq( (83, 0, 38), (26, 0, 79),

Re: Required file not found: sbt-interface.jar

2015-11-02 Thread Ted Yu
ala-2.10.4 > > > > [INFO] --- scala-maven-plugin:3.2.2:compile (scala-compile-first) @ > spark-launcher_2.10 --- > [INFO] Using zinc server for incremental compilation > [error] Required file not found: sbt-interface.jar > [error] See zinc -help for in

Required file not found: sbt-interface.jar

2015-11-02 Thread Todd
Hi, I am trying to build spark 1.5.1 in my environment, but encounter the following error complaining Required file not found: sbt-interface.jar: The error message is below and I am building with: ./make-distribution.sh --name spark-1.5.1-bin-2.6.0 --tgz --with-tachyon -Phadoop-2.6

Re: Checkpoint file not found

2015-08-03 Thread Anand Nalya
Hi, Its an application that maintains some state from the DStream using updateStateByKey() operation. It then selects some of the records from current batch using some criteria over current values and the state and carries over the remaining values to next batch. Following is the pseudo code : va

Re: Checkpoint file not found

2015-08-03 Thread Tathagata Das
Can you tell us more about streaming app? DStream operation that you are using? On Sun, Aug 2, 2015 at 9:14 PM, Anand Nalya wrote: > Hi, > > I'm writing a Streaming application in Spark 1.3. After running for some > time, I'm getting following execption. I'm sure, that no other process is > modi

Checkpoint file not found

2015-08-02 Thread Anand Nalya
Hi, I'm writing a Streaming application in Spark 1.3. After running for some time, I'm getting following execption. I'm sure, that no other process is modifying the hdfs file. Any idea, what might be the cause of this? 15/08/02 21:24:13 ERROR scheduler.DAGSchedulerEventProcessLoop: DAGSchedulerEv

Re: Re: Application jar file not found exception when submitting application

2015-07-06 Thread bit1...@163.com
: Application jar file not found exception when submitting application Before running your script, could you confirm that "/data/software/spark-1.3.1-bin-2.4.0/applications/pss.am.core-1.0-SNAPSHOT-shaded.jar" exists? You might forget to build this jar. Best Regards, Shixiong Zhu 2015-07-06

Re: Application jar file not found exception when submitting application

2015-07-06 Thread Shixiong Zhu
Before running your script, could you confirm that " /data/software/spark-1.3.1-bin-2.4.0/applications/pss.am.core-1.0-SNAPSHOT-shaded.jar" exists? You might forget to build this jar. Best Regards, Shixiong Zhu 2015-07-06 18:14 GMT+08:00 bit1...@163.com : > Hi, > I have following shell script th

Application jar file not found exception when submitting application

2015-07-06 Thread bit1...@163.com
Hi, I have following shell script that will submit the application to the cluster. But whenever I start the application, I encounter FileNotFoundException, after retrying for serveral times, I can successfully submit it! SPARK=/data/software/spark-1.3.1-bin-2.4.0 APP_HOME=/data/software/spark-

Re: saveAsTextFiles file not found exception

2014-08-12 Thread Chen Song
Thanks for putting this together, Andrew. On Tue, Aug 12, 2014 at 2:11 AM, Andrew Ash wrote: > Hi Chen, > > Please see the bug I filed at > https://issues.apache.org/jira/browse/SPARK-2984 with the > FileNotFoundException on _temporary directory issue. > > Andrew > > > On Mon, Aug 11, 2014 at 1

Re: saveAsTextFiles file not found exception

2014-08-11 Thread Andrew Ash
Hi Chen, Please see the bug I filed at https://issues.apache.org/jira/browse/SPARK-2984 with the FileNotFoundException on _temporary directory issue. Andrew On Mon, Aug 11, 2014 at 10:50 PM, Andrew Ash wrote: > Not sure which stalled HDFS client issue your'e referring to, but there > was one

Re: saveAsTextFiles file not found exception

2014-08-11 Thread Andrew Ash
Not sure which stalled HDFS client issue your'e referring to, but there was one fixed in Spark 1.0.2 that could help you out -- https://github.com/apache/spark/pull/1409. I've still seen one related to Configuration objects not being threadsafe though so you'd still need to keep speculation on to

Re: saveAsTextFiles file not found exception

2014-08-11 Thread Chen Song
Andrew that is a good finding. Yes, I have speculative execution turned on, becauseI saw tasks stalled on HDFS client. If I turned off speculative execution, is there a way to circumvent the hanging task issue? On Mon, Aug 11, 2014 at 11:13 AM, Andrew Ash wrote: > I've also been seeing simil

Re: saveAsTextFiles file not found exception

2014-08-11 Thread Andrew Ash
I've also been seeing similar stacktraces on Spark core (not streaming) and have a theory it's related to spark.speculation being turned on. Do you have that enabled by chance? On Mon, Aug 11, 2014 at 8:10 AM, Chen Song wrote: > Bill > > Did you get this resolved somehow? Anyone has any insigh

Re: saveAsTextFiles file not found exception

2014-08-11 Thread Chen Song
Bill Did you get this resolved somehow? Anyone has any insight into this problem? Chen On Mon, Aug 11, 2014 at 10:30 AM, Chen Song wrote: > The exception was thrown out in application master(spark streaming driver) > and the job shut down after this exception. > > > On Mon, Aug 11, 2014 at 10

Re: saveAsTextFiles file not found exception

2014-08-11 Thread Chen Song
The exception was thrown out in application master(spark streaming driver) and the job shut down after this exception. On Mon, Aug 11, 2014 at 10:29 AM, Chen Song wrote: > I got the same exception after the streaming job runs for a while, The > ERROR message was complaining about a temp file no

Re: saveAsTextFiles file not found exception

2014-08-11 Thread Chen Song
I got the same exception after the streaming job runs for a while, The ERROR message was complaining about a temp file not being found in the output folder. 14/08/11 08:05:08 ERROR JobScheduler: Error running job streaming job 140774430 ms.0 java.io.FileNotFoundException: File hdfs://hadoopc/u

Re: saveAsTextFiles file not found exception

2014-07-25 Thread Bill Jay
I just saw another error after my job was run for 2 hours: org.apache.hadoop.ipc.RemoteException(org.apache.hadoop.hdfs.server.namenode.LeaseExpiredException): No lease on /apps/data/vddil/real-time/checkpoint/temp: File does not exist. Holder DFSClient_NONMAPREDUCE_327993456_13 does not have any

Re: saveAsTextFiles file not found exception

2014-07-25 Thread Tathagata Das
Can you give a stack trace and logs of the exception? Its hard to say anything without any associated stack trace and logs. TD On Fri, Jul 25, 2014 at 1:32 PM, Bill Jay wrote: > Hi, > > I am running a Spark Streaming job that uses saveAsTextFiles to save > results into hdfs files. However, it

saveAsTextFiles file not found exception

2014-07-25 Thread Bill Jay
Hi, I am running a Spark Streaming job that uses saveAsTextFiles to save results into hdfs files. However, it has an exception after 20 batches result-140631234/_temporary/0/task_201407251119__m_03 does not exist. When the job is running, I do not change any file in the folder. Does

Re: file not found

2014-05-27 Thread jaranda
Thanks for the heads up, I also experienced this issue. -- View this message in context: http://apache-spark-user-list.1001560.n3.nabble.com/file-not-found-tp1854p6438.html Sent from the Apache Spark User List mailing list archive at Nabble.com.

Re: File present but file not found exception

2014-05-19 Thread Koert Kuipers
lightly >> increased. >> >> But anyways, this is a pretty silly problem, but could not get over. >> >> I have a file in my localFS, but when i try to create an RDD out of it, >> tasks fails with file not found exception is thrown at the log files. >> >> *

File present but file not found exception

2014-05-15 Thread Sai Prasanna
Hi Everyone, I think all are pretty busy, the response time in this group has slightly increased. But anyways, this is a pretty silly problem, but could not get over. I have a file in my localFS, but when i try to create an RDD out of it, tasks fails with file not found exception is thrown at

Re: File present but file not found exception

2014-05-12 Thread Sai Prasanna
problem, but could not get over. > > I have a file in my localFS, but when i try to create an RDD out of it, > tasks fails with file not found exception is thrown at the log files. > > *var file = sc.textFile("file:///home/sparkcluster/spark/input.txt");* > *file.top(1);

Re: File present but file not found exception

2014-05-11 Thread Koert Kuipers
t anyways, this is a pretty silly problem, but could not get over. > > I have a file in my localFS, but when i try to create an RDD out of it, > tasks fails with file not found exception is thrown at the log files. > > *var file = sc.textFile("file:///home/sparkclu