[Numpy-discussion] NEP 38 - Universal SIMD intrinsics

2020-05-12 Thread Daniel Davies
Is clock-throttling of interest here? It would be really annoying if the code that chooses a macro implementation has to guess how much power will be consumed by each core. Or has to dynamically pick a macro implementation based on the current frequencies of all the cores. https://lemire.me/blog

Re: [Numpy-discussion] NEP 38 - Universal SIMD intrinsics

2020-02-13 Thread Ralf Gommers
Original Message- > From: NumPy-Discussion intel@python.org> On Behalf Of Matti Picus > Sent: Tuesday, February 11, 2020 11:19 PM > To: numpy-discussion@python.org > Subject: Re: [Numpy-discussion] NEP 38 - Universal SIMD intrinsics > > On 11/2/20 8:02 pm, Devulapal

Re: [Numpy-discussion] NEP 38 - Universal SIMD intrinsics

2020-02-13 Thread Jerome Kieffer
On Wed, 12 Feb 2020 19:36:10 + "Devulapalli, Raghuveer" wrote: > j. _mm512_sqrt_ps/pd (I could be wrong on this one, but from the little > google search I did, it seems like power ISA doesn’t have a vectorized sqrt > instruction) Hi, starting at Power7 (we are at Power9), the sqrt is avai

Re: [Numpy-discussion] NEP 38 - Universal SIMD intrinsics

2020-02-12 Thread Devulapalli, Raghuveer
rsal intrinsic and then benchmark. Raghuveer -Original Message- From: NumPy-Discussion On Behalf Of Matti Picus Sent: Tuesday, February 11, 2020 11:19 PM To: numpy-discussion@python.org Subject: Re: [Numpy-discussion] NEP 38 - Universal SIMD intrinsics On 11/2/20 8:02 pm, Devulapal

Re: [Numpy-discussion] NEP 38 - Universal SIMD intrinsics

2020-02-12 Thread Charles R Harris
On Wed, Feb 12, 2020 at 12:19 AM Matti Picus wrote: > On 11/2/20 8:02 pm, Devulapalli, Raghuveer wrote: > > > > On top of that the performance implications aren’t clear. Software > > implementations of hardware instructions might perform worse and might > > not even produce the same result. > > >

Re: [Numpy-discussion] NEP 38 - Universal SIMD intrinsics

2020-02-11 Thread Matti Picus
On 11/2/20 8:02 pm, Devulapalli, Raghuveer wrote: On top of that the performance implications aren’t clear. Software implementations of hardware instructions might perform worse and might not even produce the same result. The proposal for universal intrinsics does not enable replacing an

Re: [Numpy-discussion] NEP 38 - Universal SIMD intrinsics

2020-02-11 Thread Ralf Gommers
t easy to figure out) making that platform fall back to the generic non-SIMD version of the ufunc. Cheers, Ralf > > > *From:* NumPy-Discussion intel@python.org> *On Behalf Of *Ralf Gommers > *Sent:* Monday, February 10, 2020 9:17 PM > *To:* Discussion of Numerical Python &g

Re: [Numpy-discussion] NEP 38 - Universal SIMD intrinsics

2020-02-11 Thread Devulapalli, Raghuveer
ren’t clear. Software implementations of hardware instructions might perform worse and might not even produce the same result. From: NumPy-Discussion On Behalf Of Ralf Gommers Sent: Monday, February 10, 2020 9:17 PM To: Discussion of Numerical Python Subject: Re: [Numpy-discussion] NEP 38 - Univ

Re: [Numpy-discussion] NEP 38 - Universal SIMD intrinsics

2020-02-10 Thread Matti Picus
On 11/2/20 7:16 am, Ralf Gommers wrote: On Tue, Feb 4, 2020 at 2:00 PM Hameer Abbasi mailto:einstein.edi...@gmail.com>> wrote: —snip— > 1) Once NumPy adds the framework and initial set of Universal Intrinsic, if contributors want to leverage a new architecture specific SIMD

Re: [Numpy-discussion] NEP 38 - Universal SIMD intrinsics

2020-02-10 Thread Ralf Gommers
On Tue, Feb 4, 2020 at 2:00 PM Hameer Abbasi wrote: > —snip— > > > 1) Once NumPy adds the framework and initial set of Universal Intrinsic, > if contributors want to leverage a new architecture specific SIMD > instruction, will they be expected to add software implementation of this > instruction

Re: [Numpy-discussion] NEP 38 - Universal SIMD intrinsics

2020-02-04 Thread Hameer Abbasi
—snip— > 1) Once NumPy adds the framework and initial set of Universal Intrinsic, if > contributors want to leverage a new architecture specific SIMD instruction, > will they be expected to add software implementation of this instruction for > all other architectures too? In my opinion, if the

Re: [Numpy-discussion] NEP 38 - Universal SIMD intrinsics

2020-02-04 Thread Devulapalli, Raghuveer
worsens performance on another? Thanks, Raghuveer -Original Message- From: NumPy-Discussion On Behalf Of Daniele Nicolodi Sent: Tuesday, February 4, 2020 10:01 AM To: numpy-discussion@python.org Subject: Re: [Numpy-discussion] NEP 38 - Universal SIMD intrinsics On 04-02-2020 08:08

Re: [Numpy-discussion] NEP 38 - Universal SIMD intrinsics

2020-02-04 Thread Daniele Nicolodi
On 04-02-2020 08:08, Matti Picus wrote: > Together with Sayed Adel (cc) and Ralf, I am pleased to put the draft > version of NEP 38 [0] up for discussion. As per NEP 0, this is the next > step in the community accepting the approach layed out in the NEP. The > NEP PR [1] has already garnered a fair

[Numpy-discussion] NEP 38 - Universal SIMD intrinsics

2020-02-04 Thread Matti Picus
Together with Sayed Adel (cc) and Ralf, I am pleased to put the draft version of NEP 38 [0] up for discussion. As per NEP 0, this is the next step in the community accepting the approach layed out in the NEP. The NEP PR [1] has already garnered a fair amount of discussion about the viability of