Matthew Ahrens wrote:
Joseph Barbey wrote:
Robert Milkowski wrote:
JB> So, normally, when the script runs, all snapshots finish in maybe a minute
JB> total. However, on Sundays, it continues to take longer and longer. On
JB> 2/25 it took 30 minutes, and this last Sunday, it took 2:11. The only
JB> special thing about Sunday's snapshots is that they are the first
JB> ones created since the full backup (using NetBackup) on Saturday. All
JB> other backups are incrementals.
hmmmmm do you have atime property set to off?
Maybe you spend most of the time destroying snapshots due to a much
larger delta caused by atime updates? You can possibly also gain some
performance by setting atime to off.
Yep, atime is set to off for all pools and filesystems. I looked
through the other possible properties, and nothing looked like
it would really affect things.
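
For anyone who wants to double-check the same thing on their own pool, here is a minimal sketch (not the script discussed in this thread). It assumes the tab-separated text printed by `zfs get -H -o name,value -r atime email-pool` has been captured into a string; the function name and the sample text are made up for illustration.

```python
def datasets_with_atime_on(zfs_get_output):
    """Return filesystem names whose atime value is not 'off'.

    Expects the tab-separated text printed by
    `zfs get -H -o name,value -r atime <pool>`.
    """
    names = []
    for line in zfs_get_output.splitlines():
        if not line.strip():
            continue
        name, value = line.split("\t")
        # Snapshots (names containing '@') may be listed too; atime
        # does not apply to them, so skip those rows.
        if "@" in name:
            continue
        if value.strip() != "off":
            names.append(name)
    return names


# Hypothetical sample output, made up for illustration only.
SAMPLE = (
    "email-pool\toff\n"
    "email-pool/A\toff\n"
    "email-pool/A@daily\t-\n"
    "email-pool/W\ton\n"
)

if __name__ == "__main__":
    print(datasets_with_atime_on(SAMPLE))
```

An empty result would mean atime is off everywhere, which matches what is reported above.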
One additional weird thing. My script hits each filesystem
(email-pool/A..Z) individually, so I can run zfs list -t snapshot and
find out how long each snapshot actually takes. Everything runs fine
until I get to around V or (normally) W. Then it can take a couple of
hours on the one FS. After that, the rest go quickly.
So, what operation exactly is taking "a couple of hours on the one FS"?
The only one I can imagine taking more than a minute would be 'zfs
destroy', but even that should be very rare on a snapshot. Is it always
the same FS that takes longer than the rest? Is the pool busy when you
do the slow operation?
I'll need to look at this. I expect the next wait will be again on Sunday.
My first-blush guess is that creating the snapshot itself is actually the
problem, but I'll likely have to wait until next Sunday to be sure. I've
added some debugging to my script, so I can see how long each part takes.
Also, it is always the 'W' filesystem's snapshot that seems to take a
long time to be created. Recently it showed up a little bit in the 'V'
filesystem as well.
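
The kind of per-step timing described above could look roughly like the following. This is only a sketch under assumptions, not the actual script: the helper names, the `runner` parameter, and the snapshot name `sunday` are all hypothetical. It builds one non-recursive `zfs snapshot` command per filesystem (email-pool/A..Z) and records how long each call takes.

```python
import string
import time


def time_step(label, action, clock=time.monotonic):
    """Run action() and return (label, elapsed_seconds)."""
    start = clock()
    action()
    return label, clock() - start


def snapshot_command(filesystem, snapname):
    """Build the argument list for one non-recursive `zfs snapshot`."""
    return ["zfs", "snapshot", f"{filesystem}@{snapname}"]


def run_all(snapname, runner, pool="email-pool"):
    """Snapshot <pool>/A..Z one at a time, timing each call.

    `runner` is called with the argument list for each command, so the
    caller decides whether commands are executed or only printed.
    """
    timings = []
    for letter in string.ascii_uppercase:
        cmd = snapshot_command(f"{pool}/{letter}", snapname)
        timings.append(time_step(" ".join(cmd), lambda c=cmd: runner(c)))
    return timings


if __name__ == "__main__":
    # Dry run: print each command instead of executing it.
    # To run for real, pass something like
    #   runner=lambda c: subprocess.run(c, check=True)
    for label, seconds in run_all("sunday", runner=print):
        print(f"{seconds:8.3f}s  {label}")
```

Wrapping each step separately is what makes it possible to tell whether the slow part is the snapshot creation or some other operation in the same run.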
You should be able to improve performance considerably (~26x) by
just doing one 'zfs snapshot -r', 'zfs destroy -r', and 'zfs rename -r'.
(rename -r is in progress, should be available in OpenSolaris soon; the
others are in s10u3.)
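
To illustrate the difference, here is a small sketch that only builds the command lines and does not run anything. The function names and the snapshot names are hypothetical. It contrasts 26 per-filesystem commands with a single recursive one. Note that `-r` applied to `email-pool@name` also covers the pool's top-level filesystem, not only A..Z.

```python
import string


def per_filesystem_commands(pool, snapname):
    """One `zfs snapshot` per child filesystem A..Z (26 commands)."""
    return [
        ["zfs", "snapshot", f"{pool}/{letter}@{snapname}"]
        for letter in string.ascii_uppercase
    ]


def recursive_snapshot(pool, snapname):
    """A single recursive snapshot of the pool and all descendants."""
    return ["zfs", "snapshot", "-r", f"{pool}@{snapname}"]


def recursive_destroy(pool, snapname):
    """Destroy the named snapshot on the pool and all descendants."""
    return ["zfs", "destroy", "-r", f"{pool}@{snapname}"]


def recursive_rename(pool, old, new):
    """Rename the named snapshot on the pool and all descendants."""
    return ["zfs", "rename", "-r", f"{pool}@{old}", f"{pool}@{new}"]


if __name__ == "__main__":
    print(len(per_filesystem_commands("email-pool", "daily")), "commands")
    print(" ".join(recursive_snapshot("email-pool", "daily")))
    print(" ".join(recursive_destroy("email-pool", "daily.old")))
    print(" ".join(recursive_rename("email-pool", "daily", "daily.old")))
```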
When I first set this stuff up, we could not use the -r option anywhere, so
I didn't use it in my script. A quick test verifies that I now CAN use it
as indicated above. Once my test on Sunday is done, I'll be using -r in my
script as well.
Also, all 3 pools are still 'formatted' as v2. I'll try upgrading all 3
before Sunday, and see if that helps as well.
--
Joe Barbey IT Services/Network Services
office: (715) 425-4357 Davee Library room 166C
cell: (715) 821-0008 UW - River Falls
_______________________________________________
zfs-discuss mailing list
zfs-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/zfs-discuss