On Tue, Feb 2, 2010 at 7:41 PM, Brent Jones <br...@servuhome.net> wrote: > On Tue, Feb 2, 2010 at 12:05 PM, Arnaud Brand <t...@tib.cc> wrote: >> Hi folks, >> >> I'm having (as the title suggests) a problem with zfs send/receive. >> Command line is like this : >> pfexec zfs send -Rp tank/t...@snapshot | ssh remotehost pfexec zfs recv -v -F >> -d tank >> >> This works like a charm as long as the snapshot is small enough. >> >> When it gets too big (meaning somewhere between 17G and 900G), I get ssh >> errors (can't read from remote host). >> >> I tried various encryption options (the fastest being in my case arcfour) >> with no better results. >> I tried to setup a script to insert dd on the sending and receiving side to >> buffer the flow, still read errors. >> I tried with mbuffer (which gives better performance), it didn't get better. >> Today I tried with netcat (and mbuffer) and I got better throughput, but it >> failed at 269GB transferred. >> >> The two machines are connected to the switch with 2x1GbE (Intel) joined >> together with LACP. >> The switch logs show no errors on the ports. >> kstat -p | grep e1000g shows one recv error on the sending side. >> >> I can't find anything in the logs which could give me a clue about what's >> happening. >> >> I'm running build 131. >> >> If anyone has the slightest clue of where I could look or what I could do to >> pinpoint/solve the problem, I'd be very gratefull if (s)he could share it >> with me. >> >> Thanks and have a nice evening. >> >> Arnaud >> >> >> >> _______________________________________________ >> zfs-discuss mailing list >> zfs-discuss@opensolaris.org >> http://mail.opensolaris.org/mailman/listinfo/zfs-discuss >> >> > > This issue seems to have started after snv_129 for me. I get "connect > reset by peer", or transfers (of any kind) simply timeout. > > Smaller transfers succeed most of the time, while larger ones usually > fail. Rolling back to snv_127 (my last one) does not exhibit this > issue. I have not had time to narrow down any causes, but I did find > one bug report that found some TCP test scenarios failed during one of > the builds, but unable to find that CR at this time. > > -- > Brent Jones > br...@servuhome.net >
Ah, I found the CR that seemed to describe the situation (broken pipe/connection reset by peer) http://bugs.opensolaris.org/bugdatabase/view_bug.do?bug_id=6905510 -- Brent Jones br...@servuhome.net _______________________________________________ zfs-discuss mailing list zfs-discuss@opensolaris.org http://mail.opensolaris.org/mailman/listinfo/zfs-discuss