On 2016年12月21日 05:11, Andrey Smirnov wrote:
Current code that handles Tx buffer desciprtor ring scanning employs the following algorithm: 1. Restore current buffer descriptor pointer from TBPTRn 2. Process current descriptor 3. If current descriptor has BD_WRAP flag set set current descriptor pointer to start of the descriptor ring 4. If current descriptor points to start of the ring exit the loop, otherwise increment current descriptor pointer and go to #2 5. Store current descriptor in TBPTRn As it can be seen the way the code is implemented results in buffer descriptor ring being scanned starting at offset/descriptor #0. While covering proverbial "99%" of the cases, this algorithm becomes problematic for a number of edge cases. Consider the following scenario: guest OS driver initializes descriptor ring to N individual descriptors and starts sending data out. Depending on the volume of traffic and probably guest OS driver implementation it is possible that an edge case where a packet, spread across 2 descriptors is placed in descriptors N - 1 and 0 in that order(it is easy to imagine similar examples involving more than 2 descriptors). What happens then is aforementioned algorithm starts at descriptor 0, sees a descriptor marked as BD_LAST, which it happily sends out as a separate packet(very much malformed at this point) then the iteration continues and the first part of the original packet is tacked to the next transmission which ends up being bogus as well. This behvaiour can be pretty reliably observed when scp'ing data from a guest OS via TAP interface for files larger than 160K (every time for 700K+). This patch changes the scanning algorithm to do the following: 1. Restore "current" and "start" buffer descriptor pointer from TBPTRn 2. If "current" descriptor has BD_WRAP flag set "next" descriptor pointer to start of the descriptor ring otherwise set "next" to descriptor right after "current" 3. Process current descriptor 4. If current descriptore has BD_LAST(end of a packet) set save "next" descriptor pointer in TBPTRn 5. Set "current" descriptor pointer to "next" 6. If "current" descriptor points to "start" (from #1) exit the loop loop, otherwise go to #2
Hi, I'm not familiar with this card but this seems could be simply addressed by exiting the loop when bd_flags != BD_TX_READY instead of bd_addr != ring_base (which seems buggy for heavy load)?