On Wed, Jul 30, 2014 at 12:03:38PM -0700, Venky Venkatesan wrote:
> Neil,
> 
> >Hey all-
> >         I've been trying to update the fedora dpdk package to support VFIO
> >enabled drivers and ran into a problem in which ixgbe didn't compile because 
> >the
> >rxtx_vec code uses sse4.2 instruction intrinsics, which aren't supported in 
> >the
> >default config I have.  I tried to remedy this by replacing the intrinsics 
> >with
> >the __builtin macros, but it was pointed out (correctly), that this doesn't 
> >work
> >properly.  So this is my second attempt, which I actually like a bit better. 
> > I
> >noted that code that uses intrinsics (ixgbe and the acl library), don't need 
> >to
> >have those instructions turned on build-wide.  Rather, we can just enable the
> >instructions in the specific code we want to build with support for that, and
> >test for instruction support dynamically at run time.  This allows me to 
> >build
> >the dpdk for a generic platform, but in such a way that some optimizations 
> >can
> >be used if the executing cpu supports them at run time.
> >
> >Change notes:
> >
> >v2)
> >     * Added Log messages to run time check failures per Konstantin
> >     * Removed run time check caching in acl per Konstantin
> >
> >Signed-off-by: Neil Horman <nhorman at tuxdriver.com>
> >CC: Thomas Monjalon <thomas.monjalon at 6wind.com>
> >
> 
> One possible change to look at - the _mm_shuffle_epi8 is something that is
> available all the way from SSSE3, so you should be able to leave that in.
> The SSE4.2 dependency is only for the popcount (and __builtin_popcountll
> will work for that anyway). That should make things simpler.
> 
> -Venky
> 

ugh, actually both the ixgbe and acl libraries use the popcnt instruction, so I
think I'd rather just keep the sse4.2 instruction set as it is, unless you think
the __builtin_popcnt is equally performant.

Neil

Reply via email to