[U-Boot] [PATCH 1/1] efi_loader: MAX_UTF8_PER_UTF16 = 3

Heinrich Schuchardt xypron.glpk at gmx.de
Mon Oct 9 18:59:44 UTC 2017


On 10/09/2017 08:43 PM, Alexander Graf wrote:
> 
> 
>> Am 09.10.2017 um 20:39 schrieb Heinrich Schuchardt <xypron.glpk at gmx.de>:
>>
>> A code point encoded by one UTF-16 symbol is converted to a
>> maximum of three UTF-8 symbols.
>>
>> 0xffff could be encoded as 0xef 0xbf 0xbf.
>> The first byte carries four bits, the second and third byte
>> carry six bits each.
>>
>> A code point encoded by two UTF-16 symbols is converted to four
>> UTF-8 symbols.
> 
> Yes, so why do you set it to 3?

The constant is not bytes per code point but bytes per UTF-16.

4 / 2 = 2 < 3

Cheers

Heinrich

> 
> Please also explain what exactly this actually fixes :).
> 
> 
> Alex
> 
>>
>> Fixes: 78178bb0c9d lib: add some utf16 handling helpers
>> Signed-off-by: Heinrich Schuchardt <xypron.glpk at gmx.de>
>> ---
>> The code is only used by efi_loader.
>> So this patch should go via efi-next.
>> ---
>> include/charset.h | 4 ++--
>> 1 file changed, 2 insertions(+), 2 deletions(-)
>>
>> diff --git a/include/charset.h b/include/charset.h
>> index 9c2866bbe6..2f7f7eacc9 100644
>> --- a/include/charset.h
>> +++ b/include/charset.h
>> @@ -9,7 +9,7 @@
>> #ifndef __CHARSET_H_
>> #define __CHARSET_H_
>>
>> -#define MAX_UTF8_PER_UTF16 4
>> +#define MAX_UTF8_PER_UTF16 3
>>
>> /**
>>  * utf16_strlen() - Get the length of an utf16 string
>> @@ -52,7 +52,7 @@ uint16_t *utf16_strdup(const uint16_t *s);
>>  * Converts 'size' characters of the utf16 string 'src' to utf8
>>  * written to the 'dest' buffer.
>>  *
>> - * NOTE that a single utf16 character can generate up to 4 utf8
>> + * NOTE that a single utf16 character can generate up to 3 utf8
>>  * characters.  See MAX_UTF8_PER_UTF16.
>>  *
>>  * @dest   the destination buffer to write the utf8 characters
>> -- 
>> 2.14.1
>>
> 



More information about the U-Boot mailing list