6f4e% | Unicode is harder than you think · mcilloni's blog
https://mcilloni.ovh/2023/07/23/unicode-is-hard/
Saved on 2023-07-26 [19564 edays] via mcilloni.ovh
Modified 2023-08-20 [19589 edays]
blogs unicode

Reading the excellent article by JeanHeyd Meneide on how broken string encoding in C/C++ is made me realise that Unicode is a topic that is often overlooked by a large number of developers. In my experience, there’s a lot of confusion and wrong expectations on what Unicode is, and what best practices to follow when dealing with strings that may contain characters outside of the ASCII range.

This article attempts to briefly summarise and clarify some of the most common misconceptions I’ve seen people struggle with, and some of the pitfalls that tend to recur in codebases that have to deal with non-ASCII text.