[U-Boot] [PATCH] crc32: Correct endianness of crc32 result
Albert ARIBAUD
albert.u.boot at aribaud.net
Thu Apr 18 08:20:27 CEST 2013
Hi Simon,
On Wed, 17 Apr 2013 13:59:48 -0700, Simon Glass <sjg at chromium.org>
wrote:
> Hi Albert,
>
> On Wed, Apr 17, 2013 at 12:23 PM, Albert ARIBAUD
> <albert.u.boot at aribaud.net> wrote:
> > Hi Simon -- and sorry for the dupe.
> >
> > On Wed, 17 Apr 2013 11:28:07 -0700, Simon Glass <sjg at chromium.org>
> > wrote:
> >
> >> I tried using:
> >>
> >> #ifdef USE_HOSTCC
> >> crc = htobe32(crc);
> >> #else
> >> crc = cpu_to_be32(crc);
> >> #endif
> >> memcpy(output, &crc, sizeof(crc));
> >>
> >>
> >> This is one instruction (4 bytes, 16%) smaller, but I suspect quite a
> >> lot slower due to the overhead of a very small memcpy().
> >>
> >> 43e2c1d8: e28d1008 add r1, sp, #8
> >> 43e2c1dc: e3a02004 mov r2, #4
> >> 43e2c1e0: e6bf0f30 rev r0, r0
> >> 43e2c1e4: e5210004 str r0, [r1, #-4]!
> >> 43e2c1e8: e1a00004 mov r0, r4
> >> 43e2c1ec: eb001af7 bl 43e32dd0 <memcpy>
> >
> > How about replacing the memcpy with an explicit put_unaligned(),
> > similar to what was done in
> >
> > http://www.mail-archive.com/u-boot@lists.denx.de/msg109555.html
> >
> > with get_unaligned()? The code will be longer than above, but shorter
> > than the above plus the memcpy(), and faster too -- actually, I'm
> > surprised that the compiler does not unroll the memcpy() on its own,
> > considering the size argument is a constant.
>
> Do you mean like this?
>
> #ifdef USE_HOSTCC
> crc = htobe32(crc);
> memcpy(output, &crc, sizeof(crc));
> #else
> crc = cpu_to_be32(crc);
> put_unaligned(crc, (uint32_t *)output);
> #endif
>
> This produces the same code as my original patch. If this is
> acceptable then I will do that, although it doesn't really seem any
> better.
Wolfgang may not like it any more than put_unaligned_be32() as it
builds upon it (and the disk patch could have used the be32 version as
well). Personally, I think we should allow and use these...
... and work on optimizing their implementation for ARM; we should be
able to reduce such put()s and get()s to a few instructions. And to
avoid any misunderstanding, yes, I volunteer for the optimizing work. :)
> Regards,
> Simon
>
> [and sorry for my dup]
Actually I'm the culprit.
Amicalement,
--
Albert.
More information about the U-Boot
mailing list