Changeset e84d14df in ffmpeg
- Timestamp:
-
Jan 8, 2014, 1:43:30 AM
(11 years ago)
- Author:
- Ronald S. Bultje <rsbultje@gmail.com>
- Branches:
- master
- Children:
- 37b001d1
- Parents:
- b0517467
- git-author:
- Ronald S. Bultje <rsbultje@gmail.com> (01/04/14 15:08:47)
- git-committer:
- Ronald S. Bultje <rsbultje@gmail.com> (01/08/14 01:43:30)
- Message:
-
vp9/x86: idct_32x32_add_ssse3.
Sub-IDCTs will follow later. ped1080.webm goes from 9.295s to 8.191s
(13.5% faster). The IDCT itself goes from 4372 (intra) or 4337 (inter)
to 403 (intra) or 329 (inter) cycles for the DC-only form, 23755 (intra)
or 23723 (inter) to 3497 (intra) or 3607 (inter) cycles for the no-DC
form, which averages from 23393 (intra) or 16612 (inter) to 3449 (intra)
or 2392 (inter) for all 32x32s together, i.e. about ~7x faster (all
tests done on ped1080p.webm).
-
(No files)
-