Re: [Discuss-gnuradio] USB Issues

Michael Dickens Sun, 15 Jan 2006 14:39:46 -0800

On Jan 14, 2006, at 6:10 PM, Eric Blossom wrote:

Generally speaking, reliable throughput on the USRP is dominated by
the OS's ability to deliver USB packets with small interpacket gaps.
[snip] The hardware (if properly implemented), should be
able to drive the USB at full speed. [snip]

Under Linux, [snip] We keep the endpoint queue non-empty bysubmitting multiple

asynchronous requests.

Agreed on all accounts (including the snipped stuff). My goal in myFUSB code was to deliver / retrieve as much data as possible with aslittle delay as possible, so as to keep whatever OSX internalsoftware and hardware pipes full. Moving from sync (in LIBUSB) toasync (in my FUSB) offers a substantial improvement - not a surprisethere. While I'm happy with a 4x increase in throughput, another2-3x will certainly be useful by someone eventually. Bottom linefrom the below discussion: I really can't think of anything else thatwould speed up FUSB transfers under MacOS X while using the currentcode-base. Thoughts? - MLD


-------

The ::write() code requires 2 parts: (1) the actual ::write()command; and (2) a callback to deal with buffering. In (1), the codefinds an available buffer (blocks if necessary until one isavailable), copies the incoming data into that buffer, then writesthe copied data to the async USB pipe. When this particular data iswritten, a callback (2) is executed which checks to make sure thecorrect amount of data was written, then makes the buffer availablefor use again.

The ::read() code requires 3 parts: (1) the actual ::read() command;(2) a thread running the async USB read code; and (3) a callback todeal with buffering. In (2), the code gets an available buffer(blocks if necessary until one is available), then calls the asyncUSB pipe to read the data; this is all done within a "while()" loop,and thus happens as quickly as the thread can execute. When thisparticular data is read, a callback (3) is executed which copies theactual amount of read data into an intermediate buffer, overwritingoldest data if necessary (and printing a warning). The actual ::read() command (1) simply copies data out of the intermediate buffer,blocking until any amount of data is available.


The "speed" factors are making sure that:

(1) there are enough buffers so that there is no blocking(NUM_QUEUE_ITEMS);(2) buffers are big enough to prevent blocking and overflow(MAX_BLOCK_SIZE);(3) each async calls transfer enough data to fill or clear whateverbuffers MacOS X uses internally (MAX_BLOCK_SIZE);

(4) each async USB data transfer call happens often enough; and

(5) whatever code is generating the data and calling ::write()or ::read() gets enough CPU time to sustain the required data rate.

Defaults: NUM_QUEUE_ITEMS = 10; MAX_BLOCK_SIZE = 16*1024. This -always- results in exactly 41 underruns and overruns as printed bythe "test_usrp_standard_rx" and "test_usrp_standard_tx" executablesin usrp/host/apps/ (call these GR O/U's).

(a) Increasing (1) from 2 to 10 increases the throughput from about24 MBps to 29 MBps. There is no increase beyond that. Still 41 ofthe GR O/U's. Leave this at 10 for now.

(b) For (2): At 4*1024, there are numerous read overflows (from mycode) but no underflows (from my code); data rates are around 26MBps, and # of GR O/U's is 41. At 16*1024, over/underflows (from mycode), but still 41 GR O/U's; throughput is around 29 MBps.Increasing to 64*1024 or 640*1024 has no real effect on throughput orover/underflows or GR O/U's.

(c) Increasing (1) to 1024 and (2) to 1024*1024 results in 32 GR O/U's, and throughput drops to about 28 MBps. Interestingly, all ofthe write underruns happen immediately (within 1 second) then therest go without errors (for about 3.5 seconds). The read overrunsalways happen spread out, no matter (1) or (2). This is an absurdexample since we never want to allocate that much DRAM for USBbuffering.

(d) increasing thread priority in (4) and (5) doesn't make anydifference.

Because the ::write() is not through an intermediate buffer whilethe ::read() is, but the results are identical for a given set ofparameters, this leads me to believe that the delays are caused byOSX, and not (4) or (5). The primary way to decrease delays insideOSX is to remove the extra CoreFoundation-layer calls by goingdirectly to the kernel. This removal would decrease the number ofrequired threads and eliminate the "RunLoop" requirement (as found inLIBUSB, causing async calls to be effectively sync, and my currentcode too but in a separate thread so that ascnc calls are reallyasync), which could only speed up the throughput.



_______________________________________________
Discuss-gnuradio mailing list
Discuss-gnuradio@gnu.org
http://lists.gnu.org/mailman/listinfo/discuss-gnuradio

Re: [Discuss-gnuradio] USB Issues

Reply via email to