[PATCH v3 0/3] arm64: Add optimized memset/memcpy/memmove functions
Stefan Roese
sr at denx.de
Wed Aug 11 16:28:15 CEST 2021
On 11.08.21 16:25, Tom Rini wrote:
> On Wed, Aug 11, 2021 at 04:02:39PM +0200, Stefan Roese wrote:
>>
>> On an NXP LX2160 based platform it has been noticed, that the currently
>> implemented memset/memcpy functions for aarch64 are suboptimal.
>> Especially the memset() for clearing the NXP MC firmware memory is very
>> expensive (time-wise).
>>
>> By using optimized functions, a speedup of ~ factor 6 has been measured.
>
> To be clear, you re-measured with the cache check code added, and this
> is the speed up?
I forgot to do this. BTW: I was wrong about the factor of ~6. According to
my notes, the speedup is ~ factor 4 using the optimized memset() version.
I'll follow up on this mail with some measurements for all affected
functions, using small and large sizes. Hopefully tomorrow.
Thanks,
Stefan