how does strlen count unicode in c

Question

how does strlen count unicode in c

6.6k views Asked by Horse SMith At 23 November 2014 at 08:18

I'm curious as to how strlen count unicode characters of multiple bytes in C.

Does it count each byte or character (as they can consist of several bytes) until first '\0'?

Original Q&A

There are 2 answers

Jens Gustedt On 23 November 2014 at 09:51

strlen only applies to strings, that is null terminated arrays of char. All multibyte encodings that are permitted inside strings have the property that they contain no internal null bytes, so strlen and other str functions such as strcat work fine.

If by "unicode" you mean arrays of wchar_t then this can contain null bytes, but here again this is no problem, none of the wchar_t elements itself will be null. And you shouldn't apply the str functions to such arrays, they are not defined for them.

**Yu Hao** · Accepted Answer · 2014-11-23T08:30:58+00:00

strlen() counts number of bytes until a \0 is encountered. This holds true for all strings.

For Unicode, note that the return value of strlen() may be affected by the possible existing \0 byte in a valid character other than the null terminator. If UTF-8 is used, it's fine because no valid character other than ASCII 0 can have a \0 byte, but it may not be true for other encodings.

TechQA.

how does strlen count unicode in c

There are 2 answers

Related Questions in C

Related Questions in UNICODE

Related Questions in COUNT

Related Questions in STRLEN

Popular Questions

Trending Questions