Normalize a Unicode string to get its canonical representation

Question

Normalize a Unicode string to get its canonical representation

942 views Asked by Clafou At 10 January 2012 at 23:53

Given that for example "à" (one Unicode character) can also be encoded as "\u0300a" (two Unicode characters, i.e. a combining grave accent (U+0300) followed by an a), is there functionality in .NET to normalize a string so that the latter is converted into the former?

I believe the former is deemed the canonical representation. My particular issue is that I've seen cases where the latter isn't displayed correctly by some browsers, but this could be useful in other scenarios too.

Original Q&A

There are 1 answers

**Clafou** · Answer 1 · 2012-01-10T23:57:27+00:00

Clafou On 10 January 2012 at 23:57

Just found it, duh! String.Normalize

TechQA.

Normalize a Unicode string to get its canonical representation

There are 1 answers

Related Questions in .NET

Related Questions in UNICODE

Related Questions in NORMALIZATION

Related Questions in DIACRITICS

Related Questions in UNICODE-NORMALIZATION

Popular Questions

Popular Tags

Trending Questions