[U-Boot] [PATCH 0/3] arm: reduce .bss section clear time
Pantelis Antoniou
panto at antoniou-consulting.com
Wed Jan 28 15:18:14 CET 2015
Hi Przemyslaw,
> On Jan 28, 2015, at 16:10 , Przemyslaw Marczak <p.marczak at samsung.com> wrote:
>
> Hello Stefan,
>
> On 01/28/2015 02:12 PM, Stefan Roese wrote:
>> Hi Przemyslaw,
>>
>> On 28.01.2015 13:55, Przemyslaw Marczak wrote:
>>> This patchset reduces the boot time for ARM architecture,
>>> Exynos boards, and boards with DFU enabled(ARM).
>>>
>>> For tested Trats2 device, this was done in three steps.
>>>
>>> First was enable the arch memcpy and memset.
>>> The second step was enable memset for .bss clear.
>>> The third step for reduce this operation is to keep .bss section
>>> small as possible.
>>>
>>> The .bss section will grow if we have a lot of static variables.
>>> This section is cleared before jump to the relocated U-Boot,
>>> and it's done word by word. To reduce the time for this step,
>>> we can enable arch memset, which uses multiple ARM registers.
>>>
>>> For configs with DFU enabled, we can find the dfu buffer in this section,
>>> which has at least 8MB (32MB for trats2). This is a lot of useless data,
>>> which is not required for standard boot. So this buffer should be dynamic
>>> allocated.
>>>
>>> Przemyslaw Marczak (3):
>>> exynos: config: enable arch memcpy and arch memset
>>> arm: relocation: clear .bss section with arch memset if defined
>>> dfu: mmc: file buffer: remove static allocation
>>>
>>> arch/arm/lib/crt0.S | 10 +++++++++-
>>> drivers/dfu/dfu_mmc.c | 25 ++++++++++++++++++++++---
>>> include/configs/exynos-common.h | 3 +++
>>> 3 files changed, 34 insertions(+), 4 deletions(-)
>>
>> Looking at the commit messages of this patchset I can conclude that your
>> overall boot time reduction is:
>>
>> from ~1527ms
>> to ~464ms
>>
>> This is amazing! Congrats. :)
>>
>
> Thank you. I was also amazed.
>
> The time results are taken with from the clock cycle counter, I think it's reliable. Some day I would like to check it using the oscilloscope.
>
>> We really should in general make more use of the optimized functions and
>> take care that the buffers (e.g. the DFU buffer in this case) are used
>> in a sane way.
>>
>> Thanks,
>> Stefan
>>
>>
>
> Yes you're right, I thought that Exynos config has enabled arch memcpy/set lib, before I checked this…
>
Those numbers are indeed incredible; I suppose the caches are disabled?
> Best regards,
> --
> Przemyslaw Marczak
> Samsung R&D Institute Poland
> Samsung Electronics
> p.marczak at samsung.com
Regards
— Pantelis
More information about the U-Boot
mailing list