Changeset f401a2af in ffmpeg

Timestamp:
Oct 4, 2022, 10:24:04 AM (2 years ago)
Author:
Martin Storsjö <martin@martin.st>
Branches:
master
Children:
faea56c9
Parents:
a6bf4d4
git-author:
Grzegorz Bernacki <gjb@semihalf.com> (10/03/22 14:10:14)
git-committer:
Martin Storsjö <martin@martin.st> (10/04/22 10:24:04)
Message:

lavc/aarch64: Add neon implementation for pix_abs8 functions.

Provide optimized implementation of pix_abs8 function for arm64.

Performance comparison tests are shown below:
pix_abs_1_1_c: 162.5
pix_abs_1_1_neon: 27.0
pix_abs_1_2_c: 174.0
pix_abs_1_2_neon: 23.5
pix_abs_1_3_c: 203.2
pix_abs_1_3_neon: 34.7

Benchmarks and tests are run with checkasm tool on AWS Graviton 3.

Co-authored-by: Martin Storsjö <martin@martin.st>
Signed-off-by: Grzegorz Bernacki <gjb@semihalf.com>
Signed-off-by: Martin Storsjö <martin@martin.st>

(No files)

Note: See TracChangeset for help on using the changeset viewer.