![]() On modern servers, this performance boost will be all but negligible. It does not follow the Unicode rules and will result in undesirable sorting or comparison in some situations, such as when using particular languages or characters. Utf8mb4_general_ci is a simplified set of sorting rules which aims to do as well as it can while taking many short-cuts designed to improve speed. Utf8mb4_unicode_ci is based on the official Unicode rules for universal sorting and comparison, which sorts accurately in a wide range of languages. Much of what's written below is not of much interest anymore if you can use one of the newer collations instead. People reading this now should probably use one of these newer collations instead of either _unicode or _general. Note: Newer versions of MySQL have updated Unicode sorting rules, available under names such as utf8mb4_0900_ai_ci for equivalent rules based on Unicode 9.0 - and with no equivalent _general variant. The fixed version was given the name utf8mb4. Confusingly, utf8 is a flawed UTF-8 implementation from early MySQL versions which remains only for backward compatibility. Note: In MySQL you have to use utf8mb4 rather than utf8. The differences are in how text is sorted and compared. These two collations are both for the UTF-8 character encoding. ![]()
0 Comments
Leave a Reply. |
AuthorWrite something about yourself. No need to be fancy, just an overview. ArchivesCategories |