On Sep 9, 2010, at 8:27 AM, Fei Xu <twinse...@hotmail.com> wrote: >> >> Service times here are crap. Disks are malfunctioning >> in some way. If >> your source disks can take seconds (or 10+ seconds) >> to reply, then of >> course your copy will be slow. Disk is probably >> having a hard time >> reading the data or something. >> > > > Yeah, that should not go over 15ms. I just cannot understand why it starts > ok with hundred GB files transfered and then suddenly fall to "sleep". > by the way, WDIDLE time is already disabled which might cause some issue. > I've changed to another system to test ZFS send between 8*1TB pool and 4*1TB > pool. hope everythings OK on this case.
This might be the dreaded WD TLER issue. Basically the drive keeps retrying a read operation over and over after a bit error trying to recover from a read error themselves. With ZFS one really needs to disable this and have the drives fail immediately. Check your drives to see if they have this feature, if so think about replacing the drives in the source pool that have long service times and make sure this feature is disabled on the destination pool drives. -Ross _______________________________________________ zfs-discuss mailing list zfs-discuss@opensolaris.org http://mail.opensolaris.org/mailman/listinfo/zfs-discuss