On Tue, Feb 2, 2010 at 7:41 PM, Brent Jones <br...@servuhome.net> wrote:
> On Tue, Feb 2, 2010 at 12:05 PM, Arnaud Brand <t...@tib.cc> wrote:
>> Hi folks,
>>
>> I'm having (as the title suggests) a problem with zfs send/receive.
>> The command line looks like this:
>> pfexec zfs send -Rp tank/t...@snapshot | ssh remotehost pfexec zfs recv -v -F -d tank
>>
>> This works like a charm as long as the snapshot is small enough.
>>
>> When it gets too big (meaning somewhere between 17G and 900G), I get ssh
>> errors (can't read from remote host).
>>
>> I tried various encryption options (the fastest being in my case arcfour)
>> with no better results.
>> I tried to set up a script to insert dd on the sending and receiving sides to
>> buffer the flow, but still got read errors.
>> I tried mbuffer (which gives better performance), but it didn't get better.
>> Today I tried with netcat (and mbuffer) and I got better throughput, but it
>> failed at 269GB transferred.
>>
>> The two machines are connected to the switch with 2x1GbE (Intel) joined
>> together with LACP.
>> The switch logs show no errors on the ports.
>> kstat -p | grep e1000g shows one recv error on the sending side.
>>
>> I can't find anything in the logs which could give me a clue about what's
>> happening.
>>
>> I'm running build 131.
>>
>> If anyone has the slightest clue of where I could look or what I could do to
>> pinpoint/solve the problem, I'd be very grateful if (s)he could share it
>> with me.
>>
>> Thanks and have a nice evening.
>>
>> Arnaud
>>
>>
>>
>> _______________________________________________
>> zfs-discuss mailing list
>> zfs-discuss@opensolaris.org
>> http://mail.opensolaris.org/mailman/listinfo/zfs-discuss
>>
>>
>
> This issue seems to have started after snv_129 for me. I get "connection
> reset by peer", or transfers (of any kind) simply time out.
>
> Smaller transfers succeed most of the time, while larger ones usually
> fail. After rolling back to snv_127 (my last one), I no longer see this
> issue. I have not had time to narrow down any causes, but I did find
> one bug report saying that some TCP test scenarios failed during one of
> the builds. I am unable to find that CR at this time.
>
> --
> Brent Jones
> br...@servuhome.net
>

Ah, I found the CR that seems to describe the situation (broken
pipe/connection reset by peer):

http://bugs.opensolaris.org/bugdatabase/view_bug.do?bug_id=6905510
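
In case it helps narrow down which end of the pipe gives up first, here is a
rough sketch (not something either of us has run against the failing
transfers) that runs a two-stage pipeline and reports the exit status of each
stage separately. It assumes bash is available; run_pipeline is a hypothetical
helper name, and the two command strings are whatever you already use for the
sending and transport sides.

```bash
#!/usr/bin/env bash
# Hypothetical helper: run "sender | transport" and report each stage's
# exit status, so a failure can be attributed to one side of the pipe.
# run_pipeline is an illustrative name, not part of zfs, ssh or mbuffer.
run_pipeline() {
    local sender=$1 transport=$2

    # Each stage runs in its own shell so that arbitrary command strings
    # (with their own arguments) can be passed in.
    bash -c "$sender" | bash -c "$transport"

    # PIPESTATUS must be copied immediately; the next command overwrites it.
    local status=("${PIPESTATUS[@]}")

    echo "sender exit: ${status[0]}, transport exit: ${status[1]}" >&2

    # Succeed only if both stages succeeded.
    [ "${status[0]}" -eq 0 ] && [ "${status[1]}" -eq 0 ]
}
```

With the command line from the original post, the first argument would be the
"pfexec zfs send -Rp ..." half and the second the "ssh remotehost pfexec zfs
recv -v -F -d tank" half. A non-zero transport status with a zero sender
status would point at the network/ssh side rather than at zfs send itself,
though that alone would not prove the TCP bug is the cause.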


-- 
Brent Jones
br...@servuhome.net
