[U-Boot] [PATCH] ARM: UniPhier: remove __packed that causes a problem on GCC 4.9

Masahiro Yamada yamada.m at jp.panasonic.com
Thu Jan 8 10:03:46 CET 2015


On Wed, 7 Jan 2015 14:00:05 -0500
Tom Rini <trini at ti.com> wrote:

> On Wed, Jan 07, 2015 at 07:41:38PM +0900, Masahiro Yamada wrote:
> > The DDR PHY training function, ddrphy_prepare_training() would not
> > work if compiled with GCC 4.9.
> > 
> > The struct ddrphy (arch/arm/include/asm/arch-uniphier/ddrphy-regs.h)
> > is specified with __packed because it represents a hardware register
> > mapping, but it turned out to cause a problem on GCC 4.9.
> > 
> > If -mno-unaligned-access is specified (actually it is in
> > arch/arm/cpu/armv7/config.mk), GCC 4.9 is aware of the
> > __attribute__((packed)) and generates extra instructions to perform
> > the memory access in a way that does not cause unaligned access.
> > (Actually it is bogus here because the register base, the first
> > argument of the ddrphy_prepare_training(), is always given with a
> > 4-byte aligned address.)
> > 
> > Anyway, as a result, readl() / writel() is divided into byte-wise
> > accesses.  The problem is that this hardware only accepts 4-byte
> > register access.  Byte-wise accesses lead to unexpected behavior.
> > 
> > There are some options to avoid this problem.
> > 
> > [1] Remove -mno-unaligned-access
> > [2] Add __aligned(4) along with __packed to struct ddrphy
> > [3] Remove __packed from struct ddrphy
> > 
> > [1] solves the problem for ARMv7, but it does not for pre-ARMv6 and
> > ARMv6-M architectures where -mno-unaligned-access is default.
> > So, [1] does not seem reasonable in terms of code portability.
> > 
> > Both [2] and [3] work well, but [2] seems too much.  All the members
> > of struct ddrphy have the u32 type.  No padding would be inserted
> > even if __packed is dropped.
> > 
> > Signed-off-by: Masahiro Yamada <yamada.m at jp.panasonic.com>
> 
> I wanted to think about this for a minute.  I argued with myself a bit
> about [2] being the best choice, and lost.  I'm fairly sure that
> __packed on an struct of all u32 (on a 32bit platform) is in fact the
> wrong thing to do so yes, [3] is right.
> 
> Reviewed-by: Tom Rini <trini at ti.com>
> 


There is another (and I think the best) option:


[4] Sync writeb(), writew(), writel(), readlb(), readw(), readl() with Linux



This pitfall does not occur on Linux
because ARM Linux hard-codes writel()/readl() etc.




Linux:

#define __raw_writel __raw_writel
static inline void __raw_writel(u32 val, volatile void __iomem *addr)
{
        asm volatile("str %1, %0"
                     : "+Qo" (*(volatile u32 __force *)addr)
                     : "r" (val));
}




U-Boot:

#define __arch_putl(v,a)                (*(volatile unsigned int *)(a) = (v))

#define __raw_writel(v,a)       __arch_putl(v,a)






On Linux, writel() is always converted to "str" instruction.

On U-Boot, it depends on compiler's decesion.




Best Regards
Masahiro Yamada



More information about the U-Boot mailing list