[U-Boot] [PATCH v8 0/38] Switch over to real Kbuild

Masahiro Yamada yamada.m at jp.panasonic.com
Mon Feb 3 04:46:30 CET 2014


Hello Simon and Tom,


This is my analysis of Kbuild performance.



[1] -j option is working

What I must say first is -j option is working correctly with Kbuild.

You can double-check by following the steps below.

Apply Kbuild series v8 on
commit 07e2822d158940a0e8ba45b6ab0344ffa1011a07.


First, build without -j option.

$ make mrproper
$ time make CROSS_COMPILE=arm-linux-gnueabi-  snow_config all
Configuring for snow board...
  GEN     include/autoconf.mk.dep
  [snip]

real	1m15.089s
user	0m44.092s
sys	0m32.513s


Next, build with -j8 option.
(Run "make mrproper" every time because we want to be sure
that there are no generated files before build.)

$ make mrproper
$ time make -j8 CROSS_COMPILE=arm-linux-gnueabi-  snow_config all 
  [snip]

real	0m17.223s
user	0m50.010s
sys	0m29.038s


It is much faster with -j8 option than without -j option.
(4.3 times faster on my box.)



You will easily notice another proof that -j option is working.


If you do not add -j option, the shorten log will be displayed
in the alphabetical order of output file name:

  LD      arch/arm/cpu/built-in.o
  CC      arch/arm/cpu/armv7/cache_v7.o
  CC      arch/arm/cpu/armv7/cpu.o
  CC      arch/arm/cpu/armv7/syslib.o
  CC      arch/arm/cpu/armv7/s5p-common/cpu_info.o
  CC      arch/arm/cpu/armv7/s5p-common/timer.o
  CC      arch/arm/cpu/armv7/s5p-common/sromc.o
  CC      arch/arm/cpu/armv7/s5p-common/pwm.o
  LD      arch/arm/cpu/armv7/s5p-common/built-in.o
  LD      arch/arm/cpu/armv7/built-in.o
  AS      arch/arm/cpu/armv7/start.o
  CC      arch/arm/cpu/armv7/exynos/clock.o
  CC      arch/arm/cpu/armv7/exynos/power.o
 

On the other hand, if you add -j option, the log will be shown
in a different order:

  LD      arch/arm/cpu/built-in.o
  CC      arch/arm/cpu/armv7/cache_v7.o
  CC      arch/arm/cpu/armv7/exynos/clock.o
  CC      disk/part.o
  AS      arch/arm/lib/crt0.o
  CC      board/samsung/common/board.o
  CC      board/samsung/smdk5250/smdk5250_spl.o
  CC      common/main.o
  LD      drivers/block/built-in.o
  LD      board/samsung/common/built-in.o
  CC      arch/arm/cpu/armv7/cpu.o
  AS      arch/arm/lib/relocate.o
  CC      disk/part_dos.o
  CC      arch/arm/cpu/armv7/exynos/power.o



[2] Is Kbuild slower than the old U-Boot build system?

Yes, Kbuild is definitely slower.
(But, as far as I tested,  I don't think it is 3 times slower.)

Let's compare the build time with the conventional build system.

Checkout master branch.
(commit 07e2822d158940a0e8ba45b6ab0344ffa1011a07)

$ make mrproper
$ time make CROSS_COMPILE=arm-linux-gnueabi-  snow

real	0m45.612s
user	0m28.367s
sys	0m18.969s


So, Kbuild(=real 1m15.089s) is 1.6 times slower
than the old U-Boot build system.


[3] Why is Kbuild slower?

One reason is "fixdep".

The helper program, fixdep, parses the source file
and all headers included from it to search all CONFIG
macro used there. This is a rather heavy task.

If you don't know the reason why "fixdep" is necessary,
please read the comment block of scripts/basic/fixdep.c
It is true that fixdep is meaningless for now,
but it will be a great help when switching to Kconfig.
We will get more return than we pay.
(And Kconfig series is almost ready.
I will test more and post version 1 within a couple of weeks.)


Let' check how big the impact of fixdep is.

I prepared a patch for you to disable fixdep:
http://patchwork.ozlabs.org/patch/316057/

Apply it on
commit 07e2822d158940a0e8ba45b6ab0344ffa1011a07 + Kbuild v8

And then, build.

$ make mrproper
$ time make CROSS_COMPILE=arm-linux-gnueabi-  snow_config all 

real	1m2.436s
user	0m38.281s
sys	0m25.871s

It is faster by 1.2 times faster without fixdep than it is with fixdep.


Another big factor is "arg-check".
This excellent routine is defined in scripts/Kbuild.include.

The dependency tracking of U-Boot old build system is
absolutely unreliable.
It compares the timestamp between object files and source files,
but never checks the arguments given to the compiler.

Kbuild checks both of them to precisely detect which objects must
be re-built.

To see how heavy "arg-check" task is,
build with KBUILD_NOCMDDEP=1 option to disable "arg-check".

$ make mrproper
$ time make CROSS_COMPILE=arm-linux-gnueabi- KBUILD_NOCMDDEP=1 snow_config all

real	0m41.882s
user	0m28.432s
sys	0m14.971s

See?
It is as fast as the old U-boot system(=0m45.612) .


Conclusion:
The main reasons of the slow down with Kbuild are "fixdep" and "arg-check".
Both of them are really important features for Kbuild and Kconfig.
    - "fixdep" is mandatory for our better life with Kconfig.
    - "arg-check" is for perfect dependency tracking.



> > What's your plan about this series?
> > Are we ready to switch to Kbuild, or need more review?
> 
> Lets get the performance problem Simon found figured out, but then
> otherwise, yes, I think we're about ready to merge.

Tom, are you satisfied with my analisys?

But, please hold merging Kbuild series.
I will post version 9 with a little minor update.



Best Regards
Masahiro Yamada



More information about the U-Boot mailing list