netinet

Julian Elischer Fri, 04 Feb 2011 10:57:00 -0800

On 2/4/11 9:38 AM, Robert Watson wrote:

On Thu, 3 Feb 2011, John Baldwin wrote:
  1) Move per John Baldwin to mp_maxid
2) Some signed/unsigned errors found by Mac OS compiler (fromMichael)
  3) a couple of copyright updates on the effected files.
Note that mp_maxid is the maxium valid ID, so you typically have todo things like:
    for (i = 0; i <= mp_maxid; i++) {
        if (CPU_ABSENT(i))
            continue;
        ...
    }
There is a CPU_FOREACH() macro that does the above (but assumes youwant to skip over non-existent CPUs).
I'm finding the network stack requires quite a bit more along theselines, btw. I'd love also to have:
  PACKAGE_FOREACH()
  CORE_FOREACH()
  HWTHREAD_FOREACH()

I agree, which is why I usually support adding such iterators thoughsome people scream about them.(e.g. FOREACH_THREAD_IN_PROC and there is one for iterating throughvnets too.)

  CURPACKAGE()
  CURCORE()
  CURTHREAD()


also current jail, vnet, etc. (these (kinda) exist)

Available when putting together thread worker pools, distributingwork, identifying where to channel work, making dispatch decisionsand so on. It seems likely that in some scenarios, it will bedesirable to have worker thread topology linked to hardware topology-- for example, a network stack worker per core, with distributionof work targeting the closest worker (subject to orderingconstraints)...
Hmmm, this is more complicated. Can sctp_queue_to_mcore() handlethe fact that a cpu_to_use value might not be valid? If not youmight want to maintain a separate "dense" virtual CPU ID tablenumbered 0 .. mp_ncpus - 1 that maps to "present" FreeBSD CPU IDs.I think Robert has done something similar to support RSS in TCP.Does that make sense?
This proves somewhat complicated. I basically have two models,depending on whether RSS is involved (which adds an externalfactor). Without RSS, I build a contiguous workstream number space,which is then mapped via a table to the CPU ID space, allowingmappings and hashing to be done easily -- however, these refer toordered flow processing streams (i.e., "threads") rather than CPUs,in the strict sense. In the future with dynamic configuration, thisbecomes important because what I do is rebalance ordered processingstreams rather than work to CPUs. With RSS there has to be a linkbetween work distribution and the CPU identifiers shared by devicedrivers, hardware, etc, in which case RSS identifies viable CPUs asit starts (probably not quite correctly, I'll be looking for areview of that code shortly, cleaning it up currently).
This issue came up some at the BSDCan devsummit last year: as moreand more kernel subsystems need to exploit parallelism explicitly,the thread programming model isn't bad, but lacks a strong tie tohardware topology in order to help manage work distribution. Oneidea idly bandied around was to do something along the lines ofKSE/GCD for the kernel: provide a layered "work" model with orderingconstraints, rather than exploit threads directly, for work-orientedsubsystems. This is effectively what netisr does, but in a networkstack-specific way. But with crypto code, IPSEC, storage stuff,etc, all looking to exploit parallelism, perhaps a more generalmodel is called for.
Robert


_______________________________________________
svn-src-head@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/svn-src-head
To unsubscribe, send any mail to "svn-src-head-unsubscr...@freebsd.org"

Re: svn commit: r218232 - head/sys/netinet

Reply via email to