Without spaces was the first thing I tried. The information in the pdf file inspired me to try the space.
On Fri, Jan 12, 2024 at 10:23 PM Koert Kuipers <ko...@tresata.com> wrote: > try it without spaces? > export SPARK_LOCAL_DIRS="/tmp,/share/xxxx" > > On Fri, Jan 12, 2024 at 5:00 PM Andrew Petersen <aapet...@ncsu.edu.invalid> > wrote: > >> Hello Spark community >> >> SPARK_LOCAL_DIRS or >> spark.local.dir >> is supposed to accept a list. >> >> I want to list one local (fast) drive, followed by a gpfs network drive, >> similar to what is done here: >> >> https://cug.org/proceedings/cug2016_proceedings/includes/files/pap129s2-file1.pdf >> "Thus it is preferable to bias the data towards faster storage by >> including multiple directories on the faster devices (e.g., SPARK LOCAL >> DIRS=/tmp/spark1, /tmp/spark2, /tmp/spark3, /lus/scratch/sparkscratch/)." >> The purpose of this is to get both benefits of speed and avoiding "out of >> space" errors. >> >> However, for me, Spark is only considering the 1st directory on the list: >> export SPARK_LOCAL_DIRS="/tmp, /share/xxxx" >> >> I am using Spark 3.4.1. Does anyone have any experience getting this to >> work? If so can you suggest a simple example I can try and tell me which >> version of Spark you are using? >> >> Regards >> Andrew >> >> >> >> >> I am trying to use 2 local drives >> >> -- >> Andrew Petersen, PhD >> Advanced Computing, Office of Information Technology >> 2620 Hillsborough Street >> datascience.oit.ncsu.edu >> > > CONFIDENTIALITY NOTICE: This electronic communication and any files > transmitted with it are confidential, privileged and intended solely for > the use of the individual or entity to whom they are addressed. If you are > not the intended recipient, you are hereby notified that any disclosure, > copying, distribution (electronic or otherwise) or forwarding of, or the > taking of any action in reliance on the contents of this transmission is > strictly prohibited. Please notify the sender immediately by e-mail if you > have received this email by mistake and delete this email from your system. > > Is it necessary to print this email? If you care about the environment > like we do, please refrain from printing emails. It helps to keep the > environment forested and litter-free. -- Andrew Petersen, PhD Advanced Computing, Office of Information Technology 2620 Hillsborough Street datascience.oit.ncsu.edu