[U-Boot] [PATCH 1/3] memcpy: use 32-bit copies if possible
Peter Tyser
ptyser at xes-inc.com
Wed Oct 7 16:35:19 CEST 2009
Hi Alessandro,
> --- a/lib_generic/string.c
> +++ b/lib_generic/string.c
> @@ -449,7 +449,16 @@ char * bcopy(const char * src, char * dest, int count)
> void * memcpy(void * dest,const void *src,size_t count)
> {
> char *tmp = (char *) dest, *s = (char *) src;
> + u32 *d32 = (u32 *)dest, *s32 = (u32 *) src;
>
> + /* if both are aligned, use 32-bit copy */
> + if ( (((int)dest & 3) | ((int)src & 3) | (count & 3)) == 0 ) {
> + count /= 4;
> + while (count--)
> + *d32++ = *s32++;
> + return dest;
> + }
> + /* else, use 1-byte copy */
> while (count--)
> *tmp++ = *s++;
If we're adding this logic, what about adding it such that:
if (src/dest are 32-bit aligned and count > 3) {
perform 32-bit copies till count <= 3
}
perform remaining 8-bit copies till count == 0
You'd still get the performance boost but not have the requirement that
count is evenly divisible by 4. You could do byte copies before the
32-bit copies to align the src/dest in some cases, but that might be
overkill...
Same comment goes for the memset implementation.
Best,
Peter
More information about the U-Boot
mailing list