Unicode
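There are several reasons why someone might choose to use Unicode code points in regex patterns.
One is to distinguish characters that look alike but have different code points, such as the following pairs: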
- Latin Letter 'A' and Greek Letter 'Alpha'
  - Latin Capital Letter A: A (U+0041)
  - Greek Capital Letter Alpha: Α (U+0391)
- Latin Letter 'B' and Greek Letter 'Beta'
  - Latin Capital Letter B: B (U+0042)
  - Greek Capital Letter Beta: Β (U+0392)
- Latin Letter 'E' and Greek Letter 'Epsilon'
  - Latin Capital Letter E: E (U+0045)
  - Greek Capital Letter Epsilon: Ε (U+0395)
- Latin Letter 'I' and Greek Letter 'Iota'
  - Latin Capital Letter I: I (U+0049)
  - Greek Capital Letter Iota: Ι (U+0399)
- Latin Letter 'O' and Greek Letter 'Omicron'
  - Latin Capital Letter O: O (U+004F)
  - Greek Capital Letter Omicron: Ο (U+039F)
- Latin Letter 'P' and Greek Letter 'Rho'
  - Latin Capital Letter P: P (U+0050)
  - Greek Capital Letter Rho: Ρ (U+03A1)
- Latin Letter 'X' and Greek Letter 'Chi'
  - Latin Capital Letter X: X (U+0058)
  - Greek Capital Letter Chi: Χ (U+03A7)
- Latin Letter 'Y' and Greek Letter 'Upsilon'
  - Latin Capital Letter Y: Y (U+0059)
  - Greek Capital Letter Upsilon: Υ (U+03A5)
- Latin Letter 'C' and Cyrillic Letter 'Es'
  - Latin Capital Letter C: C (U+0043)
  - Cyrillic Capital Letter Es: С (U+0421)
- Latin Letter 'a' and Cyrillic Letter 'a'
  - Latin Small Letter a: a (U+0061)
  - Cyrillic Small Letter a: а (U+0430)
- Latin Letter 'e' and Cyrillic Letter 'ie'
  - Latin Small Letter e: e (U+0065)
  - Cyrillic Small Letter ie: е (U+0435)
- Latin Letter 'o' and Cyrillic Letter 'o'
  - Latin Small Letter o: o (U+006F)
  - Cyrillic Small Letter o: о (U+043E)
- Latin Letter 'p' and Cyrillic Letter 'er'
  - Latin Small Letter p: p (U+0070)
  - Cyrillic Small Letter er: р (U+0440)
- Latin Letter 'y' and Cyrillic Letter 'u'
  - Latin Small Letter y: y (U+0079)
  - Cyrillic Small Letter u: у (U+0443)
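
Because the pairs above render almost identically, a pattern written with literal letters can hide which script it actually matches. The sketch below shows one way to make this explicit in Python by writing the ranges as `\uXXXX` escapes. The names `LATIN_WORD`, `GREEK_OR_CYRILLIC`, and `find_lookalikes` are hypothetical helpers introduced for illustration, and the block ranges (U+0370 to U+03FF for Greek and Coptic, U+0400 to U+04FF for Cyrillic) are a simplifying assumption that ignores the extended blocks.

```python
import re
import unicodedata

# ASCII Latin letters only, written as code points: A-Z (U+0041-U+005A), a-z (U+0061-U+007A).
LATIN_WORD = re.compile(r"[\u0041-\u005A\u0061-\u007A]+")

# Assumption: only the basic Greek and Coptic block and the basic Cyrillic block are checked.
GREEK_OR_CYRILLIC = re.compile(r"[\u0370-\u03FF\u0400-\u04FF]")


def find_lookalikes(text):
    """Return (index, character, Unicode name) for each Greek or Cyrillic character."""
    return [
        (m.start(), m.group(), unicodedata.name(m.group(), "UNKNOWN"))
        for m in GREEK_OR_CYRILLIC.finditer(text)
    ]


# The second letter is Cyrillic Small Letter a (U+0430), not Latin Small Letter a (U+0061).
spoofed = "p\u0430ypal"

print(LATIN_WORD.fullmatch("paypal") is not None)  # True
print(LATIN_WORD.fullmatch(spoofed) is not None)   # False
print(find_lookalikes(spoofed))                    # [(1, 'а', 'CYRILLIC SMALL LETTER A')]
```

Writing the class as escapes rather than pasted letters keeps the source unambiguous: a reviewer can see that the pattern covers U+0041 (Latin A) and not U+0391 (Greek Alpha), even though the two glyphs look the same on screen.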