Re: File not found exceptions on S3 while running spark jobs

2020-07-17 Thread Hulio andres
Most likely directory write permission not permission. The app user doesn't have permission to write files to that directory. > Sent: Friday, July 17, 2020 at 6:03 PM > From: "Nagendra Darla" > To: "Hulio andres" > Cc: user@spark.apache.org > Subject: R

Re: File not found exceptions on S3 while running spark jobs

2020-07-17 Thread Slava Rodionov
Hi those are only my thoughts, not a solution, hope they may help you. First of all, we need a full stacktrace not just an exception to make a conclusion. I see you're using s3a. Where do you run your job? Is that EMR? Normally you need to make S3 more consistent first to make it usable. This mean

Re: File not found exceptions on S3 while running spark jobs

2020-07-17 Thread Nagendra Darla
Hi, Thanks I know about FileNotFound Exception. This error is with S3 buckets which has a delay in showing newly created files. These files eventually shows up after some time. These errors are coming up while running a parquet table into Delta table. My question is more around avoiding this er

Re: File not found exceptions on S3 while running spark jobs

2020-07-16 Thread Hulio andres
https://examples.javacodegeeks.com/java-io-filenotfoundexception-how-to-solve-file-not-found-exception/ Are you a programmer ? Regards, Hulio > Sent: Friday, July 17, 2020 at 2:41 AM > From: "Nagendra Darla" > To: user@spark.apache.org > Subject: File not found exceptions on S3 while runni

Re: File Not Found: /tmp/spark-events in Spark 3.0

2020-07-05 Thread ArtemisDev
<18183124...@163.com>; *Date:* Thu, Jul 2, 2020 08:39 PM *To:* "user"; *Subject:* Re: File Not Found: /tmp/spark-events in Spark 3.0 Hi, First, the /tmp/spark-events is the default storage location of spark eventLog, but the log is stored only when you set the 'spark.eventLog.en

Re: File Not Found: /tmp/spark-events in Spark 3.0

2020-07-02 Thread Xin Jinhan
Hi, First, the '/tmp/spark-events' is the default storage location of spark eventLog, but the log will be stored in it only when the 'spark.eventLog.enabled' is true, which your spark 2.4.6 may set to false. So you can try to set false and the error may disappear. Second, I suggest enable eventL

Re: File Not Found: /tmp/spark-events in Spark 3.0

2020-07-02 Thread Zero
This could be the result of you not setting the location of eventLog properly. By default, it's/TMP/Spark-Events, and since the files in the/TMP directory are cleaned up regularly, you could have this problem. -- Original -- From: "Xin Jinhan"<18183124...@163.com

Re: File Not Found: /tmp/spark-events in Spark 3.0

2020-07-02 Thread Xin Jinhan
Hi, First, the /tmp/spark-events is the default storage location of spark eventLog, but the log is stored only when you set the 'spark.eventLog.enabled=true', which maybe your spark 2.4.6 set to false. So you can just set it to false and the error will disappear. Second, I suggest to open the e

Re: File Not Found: /tmp/spark-events in Spark 3.0

2020-06-30 Thread Jeff Evans
This should only be needed if the spark.eventLog.enabled property was set to true. Is it possible the job configuration is different between your two environments? On Mon, Jun 29, 2020 at 9:21 AM ArtemisDev wrote: > While launching a spark job from Zeppelin against a standalone spark > cluster

Re: File not found exception while reading from folder using textFileStream

2016-05-18 Thread Saisai Shao
>From my understanding, we should copy the file into another folder and move to source folder after copy is finished, otherwise we will read the half-copied data or meet the issue as you mentioned above. On Wed, May 18, 2016 at 8:32 PM, Ted Yu wrote: > The following should handle the situation y

Re: File not found exception while reading from folder using textFileStream

2016-05-18 Thread Ted Yu
The following should handle the situation you encountered: diff --git a/streaming/src/main/scala/org/apache/spark/streaming/dstream/FileInputDStream.scala b/streaming/src/main/scala/org/apache/spark/streaming/dstream/FileInputDStream.sca index ed93058..f79420b 100644 --- a/streaming/src/main/scala

Re: File not found error running query in spark-shell

2015-12-16 Thread Jakob Odersky
For future reference, this should be fixed with PR #10337 ( https://github.com/apache/spark/pull/10337) On 16 December 2015 at 11:01, Jakob Odersky wrote: > Yeah, the same kind of error actually happens in the JIRA. It actually > succeeds but a load of exceptions are thrown. Subsequent runs don'

Re: File not found error running query in spark-shell

2015-12-16 Thread Jakob Odersky
Yeah, the same kind of error actually happens in the JIRA. It actually succeeds but a load of exceptions are thrown. Subsequent runs don't produce any errors anymore. On 16 December 2015 at 10:55, Ted Yu wrote: > The first run actually worked. It was the amount of exceptions preceding > the resu

Re: File not found error running query in spark-shell

2015-12-16 Thread Ted Yu
The first run actually worked. It was the amount of exceptions preceding the result that surprised me. I want to see if there is a way of getting rid of the exceptions. Thanks On Wed, Dec 16, 2015 at 10:53 AM, Jakob Odersky wrote: > When you re-run the last statement a second time, does it wor

Re: File not found error running query in spark-shell

2015-12-16 Thread Jakob Odersky
When you re-run the last statement a second time, does it work? Could it be related to https://issues.apache.org/jira/browse/SPARK-12350 ? On 16 December 2015 at 10:39, Ted Yu wrote: > Hi, > I used the following command on a recently refreshed checkout of master > branch: > > ~/apache-maven-3.3.

Re: file not found

2014-05-27 Thread jaranda
Thanks for the heads up, I also experienced this issue. -- View this message in context: http://apache-spark-user-list.1001560.n3.nabble.com/file-not-found-tp1854p6438.html Sent from the Apache Spark User List mailing list archive at Nabble.com.