Changeset f401a2af in ffmpeg
- Timestamp:
- Oct 4, 2022, 10:24:04 AM
- Author:
- Martin Storsjö <martin@martin.st>
- Branches:
- master
- Children:
- faea56c9
- Parents:
- a6bf4d4
- git-author:
- Grzegorz Bernacki <gjb@semihalf.com> (10/03/22 14:10:14)
- git-committer:
- Martin Storsjö <martin@martin.st> (10/04/22 10:24:04)
- Message:
-
lavc/aarch64: Add neon implementation for pix_abs8 functions.
Provide an optimized implementation of the pix_abs8 functions for arm64.
Performance comparison tests are shown below:
pix_abs_1_1_c: 162.5
pix_abs_1_1_neon: 27.0
pix_abs_1_2_c: 174.0
pix_abs_1_2_neon: 23.5
pix_abs_1_3_c: 203.2
pix_abs_1_3_neon: 34.7
Benchmarks and tests were run with the checkasm tool on AWS Graviton 3.
Co-authored-by: Martin Storsjö <martin@martin.st>
Signed-off-by: Grzegorz Bernacki <gjb@semihalf.com>
Signed-off-by: Martin Storsjö <martin@martin.st>
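
For context, the pix_abs8 family computes a sum of absolute differences (SAD) over an 8-pixel-wide block, and the figures above put the NEON versions at roughly 6 to 7 times faster than the C ones. The following is a minimal scalar sketch of that operation, not the FFmpeg source and not the NEON code from this changeset. The names `sad8`, `sad8_x2` and `avg2` are hypothetical, and it is an assumption here that the benchmarked `pix_abs_1_1` to `pix_abs_1_3` entries are half-pel interpolated variants of the plain SAD.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>
#include <stdlib.h>

/* Rounded average of two pixels (hypothetical helper for half-pel interpolation). */
static inline int avg2(int a, int b)
{
    return (a + b + 1) >> 1;
}

/* Plain SAD over a block 8 pixels wide and h rows high.
 * stride is the distance in bytes between the starts of consecutive rows. */
static int sad8(const uint8_t *cur, const uint8_t *ref, ptrdiff_t stride, int h)
{
    int sum = 0;

    assert(h >= 0);
    for (int y = 0; y < h; y++) {
        for (int x = 0; x < 8; x++)
            sum += abs(cur[x] - ref[x]);
        cur += stride;
        ref += stride;
    }
    return sum;
}

/* SAD against the reference shifted half a pixel horizontally.
 * Each reference sample is the rounded average of two neighbours,
 * so this reads 9 reference pixels per row (ref[0] .. ref[8]). */
static int sad8_x2(const uint8_t *cur, const uint8_t *ref, ptrdiff_t stride, int h)
{
    int sum = 0;

    assert(h >= 0);
    for (int y = 0; y < h; y++) {
        for (int x = 0; x < 8; x++)
            sum += abs(cur[x] - avg2(ref[x], ref[x + 1]));
        cur += stride;
        ref += stride;
    }
    return sum;
}
```

A vectorized version would typically process a whole 8-pixel row per instruction and accumulate the absolute differences in wider lanes, which is where most of the speedup over a loop like this would be expected to come from.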
-
(No files)
-