We just moved Pspp to Kubernetes containers where we use it to extract csvs 
from sav files. The sav files are about 1gb and each csv is about 150mb.

We’ve watched the file system as it does it and over 7gb of the file system is 
used while writing 150mb. I assume the SAVE command is doing lots of seeks and 
insertions in the file magnifying the file system usage. Any options to limit 
this behavior?

Here is the script we are using
GET FILE = "{}"

SAVE TRANSLATE
  /OUTFILE="{}"
  /TYPE=CSV
  /FIELDNAMES
  /REPLACE
  /KEEP={}
  /MISSING=RECODE
  /CELLS=LABELS.
Cheers
Dave

Reply via email to