On 03/16/2012 11:57 PM, Justin Shepherd wrote:
> 
> 
> On Mar 16, 2012, at 12:26, "Pádraig Brady" <p...@draigbrady.com> wrote:
> 
>> On 03/16/2012 04:11 PM, Jay Pipes wrote:
>>> Hi Stackers,
>>>
>>> So, in diagnosing a few things on TryStack yesterday, I ran into an 
>>> interesting problem with snapshotting that I'm hoping to get some advice on.
>>>
>>> == The Problem ==
>>>
>>
>>> QEMU was unhelpfully returning a vague error message of "error while 
>>> writing".
>>
>> That could be improved.
>> As an aside, since qemu-img is mainly dealing with large files,
>> it would be a prime candidate to call fallocate() from
>> to get good layout for the files and immediate feedback
>> if there isn't enough space.
>>
>> On a related note, I've a patch pending for after RC1
>> that should auto clean any of these partially written files:
>> https://review.openstack.org/#change,5442
>>
>>> As it turns out, the base operating system we install on our compute nodes 
>>> in TryStack has a (very) small root partition
>>
>>> == Possible Solutions ==
>>>
>>> So, there are a number of solutions that we can work on here, and I'm 
>>> wondering what the preference would be. Here are the solutions I have come 
>>> up with, along with a no-brainer improvement to Nova that would help in 
>>> diagnosing this problem:
>>>
>>> The no-brainer: Detect before attempting a snapshot that there is enough 
>>> space on a device to perform the operation, and if not, throw a useful 
>>> error message up the stack
>>
>> The space can change while writing, so you could still get the same error 
>> above.
>>
>>>
>>> Solutions to the disk space problem:
>>>
>>> (1) Silly Jay, change the damn size of the root partition in your PXE base 
>>> OS install!
>>>
>>> Now, I'm no expert in creating customized base disk images, but from 
>>> looking at the build_pxe_env.sh script in devstack [1], it seems pretty 
>>> trivial to change the ramdisk_size parameter in the startup options to 
>>> something larger than 2109600. We could do this and reimage the compute 
>>> nodes one by one.
>>>
>>> (2) Make the location in which the snapshot is made configurable.
>>>
>>> Right now, as mentioned above, tempfile.mkdtemp() is used, which creates a 
>>> directory in the user's TMPDIR (typically /tmp, which is usually on the 
>>> root partition).
>>>
>>> We could add an option (--libvirt-snapshot-dir?) that would allow 
>>> nova-compute to override where that snapshot is built.
>>>
>>> (3) Change the user (running nova-compute) TMPDIR setting to something 
>>> different than /tmp on the root partition).
>>
>> I'd lean towards (3).
>> That's something that depends on the environment (as you've nicely 
>> demonstrated),
>> and also for security reasons the admin should be able to set TMPDIR.
>> That's the standard way to do it, and it works already (hopefully).
> 
> Actually I would argue that the best way to accomplish this would be option 
> #2. That way an admin/operator has control over the location. Not 
> manipulating this by messing around with a users environment variable.

Well one can set the TMPDIR in the init script for the service.
That's a fairly standard mechanism.

(2) is good though if you would ever want to separate
--libvirt-snapshot-dir from, $TMPDIR

Now I can definitely see the need for changing TMPDIR from /tmp
for Jay's reasons and /tmp being tmpfs by default on debian for example:
http://lists.debian.org/debian-devel/2011/11/msg00281.html
I'm not sure if you'd need to separate them?
Though I'm always biased towards avoiding new config variables.
I suppose one could argue you might want /tmp for small fast accesses,
and something large and separate for manipulating large files.

Now that I look at the existing nova uses of tmp dirs
to store/stage large images, I see existing config vars:

FLAGS.xenapi_sr_base_path  # xens default Storage Repo
FLAGS.image_decryption_dir # nova/image/s3.py

So if you were following that you would implement (2) with:

FLAGS.libvirt_snapshot_dir

There might be opportunity to merge all three to:

FLAGS.nova_image_staging_dir

cheers,
Pádraig.

_______________________________________________
Mailing list: https://launchpad.net/~openstack
Post to     : openstack@lists.launchpad.net
Unsubscribe : https://launchpad.net/~openstack
More help   : https://help.launchpad.net/ListHelp

Reply via email to