Re: [flac-dev] About SSE intrinsincs in decoder

2022-07-05 Thread olivier tristan
You do not talk about the SSE 4.1 version in your bench. Have you tried this use case ? Thanks ! Le 04/07/2022 à 19:23, Martijn van Beurden a écrit : Op ma 4 jul. 2022 om 15:06 schreef olivier tristan : While I can understand the rationale for manual assembly as 32 bits x86 is dea

Re: [flac-dev] About SSE intrinsincs in decoder

2022-07-05 Thread Martijn van Beurden
Op di 5 jul. 2022 om 09:41 schreef olivier tristan : > You do not talk about the SSE 4.1 version in your bench. > > Have you tried this use case ? > I compared 4 compiles: one without any changes (so with all variants of the lpc functions, including the SSE4.1 ones) and three with variants of pla

[flac-dev] slow on apple silicon?

2022-07-05 Thread Scott Brown
Hi everyone, I've got a question regarding Apple Silicon chips and flac encoding. I'm not seeing the speed I think I should be seeing I compiled a universal flac 1.3.4 for Apple Silicon and 64 bit intel like so: ./configure -enable-static -disable-shared CFLAGS=" -arch arm64 -arch x86_64 -isysro

Re: [flac-dev] slow on apple silicon?

2022-07-05 Thread brianw
You raise excellent questions. I have not been paying attention to Apple's compiler support for the M1, M1 Pro, and M1 Max. Do you have the latest compiler from Apple? Are you sure that -O3 is the best option? I seem to recall that the 'best' has changed over the years. I also recall that Appl