Unicode assigns a number—called a code point—to every character you can type or see: letters, digits, emoji, punctuation, and entire writing systems. Think of it as a shared map for text.
Smiling Face | U+1F600 | 😀 |
Heart | U+2764 | ❤ |
Snowflake | U+2744 | ❄ |
Ukrainian Flag | U+1F1FA U+1F1E6 | 🇺🇦 |
Each character has a hexadecimal identifier like U+0410
(Cyrillic capital “A”). Many characters also have combining marks—for example, an accent—so the same visual letter can be written in more than one way.
Because of those alternate forms, software often applies normalization (NFC/NFD/NFKC/NFKD) so that strings compare consistently. If two strings look the same but won’t match, normalization is usually the fix.
The tool below shows the list of Unicode blocks — groups of characters that belong together (Latin, Cyrillic, Greek, Emoji, and many more).
Open directly: fontgenerator.info/unicodes/