On Wed, 27 Sep 2017 21:15:39 -0300 James Almer <jamr...@gmail.com> wrote:
> On 9/4/2017 5:53 PM, Michael Niedermayer wrote: > > On Mon, Sep 04, 2017 at 09:27:32PM +0200, wm4 wrote: > >> On Mon, 4 Sep 2017 21:18:35 +0200 > >> Michael Niedermayer <mich...@niedermayer.cc> wrote: > >> > >>> On Sat, Sep 02, 2017 at 09:47:38PM -0300, James Almer wrote: > >>>> From: Anton Khirnov <an...@khirnov.net> > >>>> > >>>> (cherry picked from commit e6bff23f1e11aefb16a2b5d6ee72bf7469c5a66e) > >>>> Signed-off-by: James Almer <jamr...@gmail.com> > >>>> --- > >>>> This is (afaics) the last API introduced to libav before the major bump. > >>>> > >>>> Now checking all the x86 flags that would require aligment of 16 bytes > >>>> or more. > >>>> > >>>> doc/APIchanges | 3 +++ > >>>> libavutil/cpu.c | 35 +++++++++++++++++++++++++++++++++++ > >>>> libavutil/cpu.h | 13 +++++++++++++ > >>>> libavutil/version.h | 2 +- > >>>> 4 files changed, 52 insertions(+), 1 deletion(-) > >>>> > >>>> diff --git a/doc/APIchanges b/doc/APIchanges > >>>> index 4effbf9364..6a57c210a9 100644 > >>>> --- a/doc/APIchanges > >>>> +++ b/doc/APIchanges > >>>> @@ -15,6 +15,9 @@ libavutil: 2015-08-28 > >>>> > >>>> API changes, most recent first: > >>>> > >>>> +2017-09-xx - xxxxxxx - lavu 55.75.100 / lavu 55.31.0 - cpu.h > >>>> + Add av_cpu_max_align() for querying maximum required data alignment. > >>>> + > >>>> 2017-09-01 - xxxxxxx - lavf 57.81.100 - avio.h > >>>> Add avio_read_partial(). > >>>> > >>>> diff --git a/libavutil/cpu.c b/libavutil/cpu.c > >>>> index a22da0fa8c..4f04da2460 100644 > >>>> --- a/libavutil/cpu.c > >>>> +++ b/libavutil/cpu.c > >>>> @@ -16,9 +16,11 @@ > >>>> * Foundation, Inc., 51 Franklin Street, Fifth Floor, Boston, MA > >>>> 02110-1301 USA > >>>> */ > >>>> > >>>> +#include <stddef.h> > >>>> #include <stdint.h> > >>>> #include <stdatomic.h> > >>>> > >>>> +#include "attributes.h" > >>>> #include "cpu.h" > >>>> #include "cpu_internal.h" > >>>> #include "config.h" > >>>> @@ -299,3 +301,36 @@ int av_cpu_count(void) > >>>> > >>>> return nb_cpus; > >>>> } > >>>> + > >>>> +size_t av_cpu_max_align(void) > >>>> +{ > >>>> + int av_unused flags = av_get_cpu_flags(); > >>>> + > >>>> +#if ARCH_ARM || ARCH_AARCH64 > >>>> + if (flags & AV_CPU_FLAG_NEON) > >>>> + return 16; > >>>> +#elif ARCH_PPC > >>>> + if (flags & AV_CPU_FLAG_ALTIVEC) > >>>> + return 16; > >>>> +#elif ARCH_X86 > >>>> + if (flags & (AV_CPU_FLAG_AVX2 | > >>>> + AV_CPU_FLAG_AVX | > >>>> + AV_CPU_FLAG_FMA4 | > >>>> + AV_CPU_FLAG_FMA3)) > >>>> + return 32; > >>>> + if (flags & (AV_CPU_FLAG_XOP | > >>>> + AV_CPU_FLAG_AESNI | > >>>> + AV_CPU_FLAG_SSE42 | > >>>> + AV_CPU_FLAG_SSE4 | > >>>> + AV_CPU_FLAG_SSSE3 | > >>>> + AV_CPU_FLAG_SSE3 | > >>>> + AV_CPU_FLAG_SSE2 | > >>>> + AV_CPU_FLAG_SSE | > >>>> + AV_CPU_FLAG_AVXSLOW | > >>>> + AV_CPU_FLAG_SSE3SLOW | > >>>> + AV_CPU_FLAG_SSE2SLOW)) > >>>> + return 16; > >>>> +#endif > >>>> + > >>>> + return 8; > >>>> +} > >>>> diff --git a/libavutil/cpu.h b/libavutil/cpu.h > >>>> index de05593446..9e5d40affe 100644 > >>>> --- a/libavutil/cpu.h > >>>> +++ b/libavutil/cpu.h > >>>> @@ -21,6 +21,8 @@ > >>>> #ifndef AVUTIL_CPU_H > >>>> #define AVUTIL_CPU_H > >>>> > >>>> +#include <stddef.h> > >>>> + > >>>> #include "attributes.h" > >>>> > >>>> #define AV_CPU_FLAG_FORCE 0x80000000 /* force usage of selected > >>>> flags (OR) */ > >>> > >>> > >>>> @@ -113,4 +115,15 @@ int av_parse_cpu_caps(unsigned *flags, const char > >>>> *s); > >>>> */ > >>>> int av_cpu_count(void); > >>>> > >>>> +/** > >>>> + * Get the maximum data alignment that may be required by FFmpeg. > >>>> + * > >>>> + * Note that this is affected by the build configuration and the CPU > >>>> flags mask, > >>>> + * so e.g. if the CPU supports AVX, but libavutil has been built with > >>>> + * --disable-avx or the AV_CPU_FLAG_AVX flag has been disabled through > >>>> + * av_set_cpu_flags_mask(), then this function will behave as if AVX > >>>> is not > >>>> + * present. > >>>> + */ > >>>> +size_t av_cpu_max_align(void); > >>> > >>> This might interact badly with runtime cpu flags/mask changes > >>> > >>> If its used to choose the alignment for allocated frames and > >>> after some are allocated the cpu flags are changed there could be > >>> still frames in queues that may not have sufficient alignment for the > >>> new flags > >> > >> There's no such thing as runtime CPU flag changes. > > > > We even have an API to change the cpu flags at runtime. > > > > av_set_cpu_flags_mask() and av_force_cpu_flags() > > > > There is no restriction in the API on when they can be called. > > > > you can call av_force_cpu_flags(0) then open a decoder then call > > av_force_cpu_flags(AV_CPU_FLAG_MMX|AV_CPU_FLAG_SSE) > > then open a filter or encoder > > then run the code and it could crash as the allocated frames from > > earlier are not aligned enough for the 2nd filter or encoder > > > > There also may be other scenarios where this can occur > > Alright, I just arrived to this cherry pick during merges. So my > question is, do i apply it and skip the commit that makes use of it to > choose alignment within ffmpeg? > > Even if unused at first, I'd rather not skip its addition. API users can rely on it. So you better make sure that can't break. Other aspects don't really matter. _______________________________________________ ffmpeg-devel mailing list ffmpeg-devel@ffmpeg.org http://ffmpeg.org/mailman/listinfo/ffmpeg-devel