Unicode Sets

| Name | Starting (Unicode) | Ending (Unicode) | Starting (UTF-8) | Ending (UTF-8) | Reason | Exclusion Importance for URLs | Exclusion Importance for Blacklist |
| --- | --- | --- | --- | --- | --- | --- | --- |
| Latin-1 | 0000 | 001F | 00 | 1F | Non-printable | High | High |
| Latin-1 Supplement | 007F | 009F | C280 | C2A0 | Non-printable | High | High |
| Latin-1 Supplement | 00A6 | 00A8 | C2A6 | C2A8 | Bars, dots, etc. | High | High |
| Latin-1 Supplement | 00AA | 00AD | C2AA | C2AD | Arrows, bars, dash | High | High |
| Latin-1 Supplement | 00AF | 00B0 | C2AF | C2B0 | Bars and dots | High | High |
| Latin-1 Supplement | 00B7 | 00B7 | C2B7 | C2B7 | Dot | High | High |
| Latin-1 Supplement | 00BA | 00BB | C2BA | C2BB | Dot and arrow | High | High |
| Spacing Modifiers | 02BB | 02FF | CABB | CBBF | Misc. punctuation | High | High |
| Combining Diacritical Marks | 0300 | 036F | CC80 | CDAF | Combining marks. These should only be used when combined with the previous character, never standalone. Depending on the Unicode implementation, you might therefore need to verify that the previous character can take a combining mark. | High | High |
| Hebrew Combining | 0590 | 05CF | D690 | D78F | Hebrew combining marks. These should only be used when combined with the previous character, never standalone. Depending on the Unicode implementation, you might therefore need to verify that the previous character can take a combining mark. | High | High |

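The range rows and the combining-mark caveat above could be checked along these lines. This is a minimal sketch, not a definitive implementation: the names `EXCLUDED_RANGES`, `is_excluded`, `contains_excluded`, and `has_standalone_combining_mark` are hypothetical, only a few rows are copied in, and treating only letters and digits as valid base characters is an assumption that may be too strict for some scripts.

```python
import unicodedata

# A few (start, end) code point ranges copied from the rows above.
EXCLUDED_RANGES = [
    (0x0000, 0x001F),  # Latin-1: non-printable
    (0x00A6, 0x00A8),  # Latin-1 Supplement: bars, dots, etc.
    (0x00B7, 0x00B7),  # Latin-1 Supplement: dot
    (0x02BB, 0x02FF),  # Spacing Modifiers: misc. punctuation
]


def is_excluded(ch: str) -> bool:
    """Return True if a single character falls inside any excluded range."""
    cp = ord(ch)
    return any(start <= cp <= end for start, end in EXCLUDED_RANGES)


def contains_excluded(text: str) -> bool:
    """Return True if any character of the text is in an excluded range."""
    return any(is_excluded(ch) for ch in text)


def has_standalone_combining_mark(text: str) -> bool:
    """Return True if a combining mark appears without a base character before it."""
    prev_is_base = False
    for ch in text:
        if unicodedata.category(ch).startswith("M"):
            # General categories Mn, Mc and Me are combining marks.
            if not prev_is_base:
                return True
            # prev_is_base stays True so stacked marks on one base pass.
        else:
            # Assumption: only letters (L*) and digits (N*) count as bases.
            prev_is_base = unicodedata.category(ch)[0] in ("L", "N")
    return False
```

With this sketch, `"e\u0301"` (a letter followed by a combining acute accent) passes the combining-mark check, while a string that starts with `"\u0301"` is flagged.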
The rows below belong to various languages. Keep the languages you intend to support (e.g. Hangul Jamo if you want to support Korean).

| Name | Starting (Unicode) | Ending (Unicode) | Starting (UTF-8) | Ending (UTF-8) | Reason | Exclusion Importance for URLs | Exclusion Importance for Blacklist |
| --- | --- | --- | --- | --- | --- | --- | --- |
| Syriac | 0700 | 074F | DC80 | DD8F | Unlikely to be used | Low | High |
| Thaana | 0780 | 07BF | DE80 | DEBF | Unlikely to be used | Low | High |
| N'Ko | 07C0 | 07FF | DF80 | DFBF | Unlikely to be used | Low | High |
| Samaritan | 0800 | 083F | E0A080 | E0A0BF | Unlikely to be used | Low | High |
| Mandaic | 0840 | 085F | E0A180 | E0A19F | Unlikely to be used | Low | High |
| Arabic Extended | 08A0 | 08FF | E0A2A0 | E0A3BF | Unlikely to be used | Low | High |
| Devanagari | 0900 | 097F | E0A480 | E0A5BF | Unlikely to be used | Low | High |
| Bengali | 0980 | 09FF | E0A680 | E0A7BF | Unlikely to be used | Low | High |
| Gurmukhi | 0A00 | 0A7F | E0A880 | E0A9BF | Unlikely to be used | Low | High |
| Gujarati | 0A80 | 0AFF | E0AA80 | E0ABBF | Unlikely to be used | Low | High |
| Oriya | 0B00 | 0B7F | E0AC80 | E0ADBF | Unlikely to be used | Low | High |
| Tamil | 0B80 | 0BFF | E0AE80 | E0AFBF | Unlikely to be used | Low | High |
| Telugu | 0C00 | 0C7F | E0B080 | E0B1BF | Unlikely to be used | Low | High |
| Kannada | 0C80 | 0CFF | E0B280 | E0B3BF | Unlikely to be used | Low | High |
| Malayalam | 0D00 | 0D7F | E0B480 | E0B5BF | Unlikely to be used | Low | High |
| Sinhala | 0D80 | 0DFF | E0B680 | E0B7BF | Unlikely to be used | Low | High |
| Thai | 0E00 | 0E7F | E0B880 | E0B9BF | Unlikely to be used | Low | High |
| Lao | 0E80 | 0EFF | E0BA80 | E0BBBF | Unlikely to be used | Low | High |
| Tibetan | 0F00 | 0FFF | E0BC80 | E0BFBF | Unlikely to be used | Low | High |
| Myanmar | 1000 | 109F | E18080 | E1829F | Unlikely to be used | Low | High |
| Georgian | 10A0 | 10FF | E182A0 | E183BF | Unlikely to be used | Low | High |
| Hangul Jamo | 1100 | 11FF | E18480 | E187BF | Unlikely to be used | Low | Low |
| Ethiopic | 1200 | 137F | E18880 | E18DBF | Unlikely to be used | Low | High |
| Ethiopic Supplement | 1380 | 139F | E18E80 | E18E9F | Unlikely to be used | Low | High |
| Cherokee | 13A0 | 13FF | E18EA0 | E18FBF | Unlikely to be used | Low | High |
| Canadian Aboriginal | 1400 | 167F | E19080 | E199BF | Unlikely to be used | Low | High |
| Ogham | 1680 | 169F | E19A80 | E19A9F | Symbols | High | High |
| Runic | 16A0 | 16FF | E19AA0 | E19BBF | Symbols | High | High |
| Tagalog | 1700 | 171F | E19C80 | E19C9F | Unlikely to be used | Low | High |
| Hanunoo | 1720 | 173F | E19CA0 | E19CBF | Unlikely to be used | Low | High |
| Buhid | 1740 | 175F | E19D80 | E19D9F | Unlikely to be used | Low | High |
| Tagbanwa | 1760 | 177F | E19DA0 | E19DBF | Unlikely to be used | Low | High |
| Khmer | 1780 | 17FF | E19E80 | E19FBF | Unlikely to be used | Low | High |
| Mongolian | 1800 | 18AF | E1A080 | E1A2AF | Unlikely to be used | Low | High |
| Canadian Aboriginal Extended | 18B0 | 18FF | E1A2B0 | E1A3BF | Unlikely to be used | Low | High |
| Limbu | 1900 | 194F | E1A480 | E1A58F | Unlikely to be used | Low | High |
| Tai Le | 1950 | 197F | E1A590 | E1A5BF | Unlikely to be used | Low | High |
| New Tai Lue | 1980 | 19DF | E1A680 | E1A79F | Unlikely to be used | Low | High |
| Khmer Symbols | 19E0 | 19FF | E1A7A0 | E1A7BF | Unlikely to be used | Low | High |
| Buginese | 1A00 | 1A1F | E1A880 | E1A89F | Unlikely to be used | Low | High |
| Tai Tham | 1A20 | 1AAF | E1A8A0 | E1AAAF | Unlikely to be used | Low | High |
| Balinese | 1B00 | 1B7F | E1AC80 | E1ADBF | Unlikely to be used | Low | High |
| Sundanese | 1B80 | 1BBF | E1AE80 | E1AEBF | Unlikely to be used | Low | High |
| Batak | 1BC0 | 1BFF | E1AF80 | E1AFBF | Unlikely to be used | Low | High |
| Lepcha | 1C00 | 1C4F | E1B080 | E1B18F | Unlikely to be used | Low | High |
| Ol Chiki | 1C50 | 1C7F | E1B190 | E1B1BF | Unlikely to be used | Low | High |

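One way to act on the two importance columns and on the advice to keep supported languages is sketched below. It assumes that a "High" importance means the range should be excluded in that context, which the sheet implies but does not state outright. `Row`, `ROWS`, and `ranges_to_exclude` are hypothetical names, and only a handful of the language rows above are copied in.

```python
from typing import Iterable, List, NamedTuple, Tuple


class Row(NamedTuple):
    name: str
    start: int
    end: int
    url_importance: str        # "High" or "Low"
    blacklist_importance: str  # "High" or "Low"


# A small sample of the language rows above.
ROWS = [
    Row("Thai", 0x0E00, 0x0E7F, "Low", "High"),
    Row("Hangul Jamo", 0x1100, 0x11FF, "Low", "Low"),
    Row("Ogham", 0x1680, 0x169F, "High", "High"),
    Row("Runic", 0x16A0, 0x16FF, "High", "High"),
    Row("Khmer", 0x1780, 0x17FF, "Low", "High"),
]


def ranges_to_exclude(
    rows: Iterable[Row],
    context: str,
    supported: Iterable[str] = (),
) -> List[Tuple[int, int]]:
    """Pick the ranges to exclude for 'url' or 'blacklist' use.

    Rows whose name is listed in `supported` are always kept (never excluded).
    """
    if context not in ("url", "blacklist"):
        raise ValueError("context must be 'url' or 'blacklist'")
    keep = set(supported)
    selected = []
    for row in rows:
        if row.name in keep:
            continue  # a language you intend to support
        importance = (
            row.url_importance if context == "url" else row.blacklist_importance
        )
        if importance == "High":  # assumption: High means "exclude"
            selected.append((row.start, row.end))
    return selected
```

For example, `ranges_to_exclude(ROWS, "blacklist", supported={"Thai"})` leaves Thai untouched and returns the Ogham, Runic, and Khmer ranges, while the `"url"` context returns only Ogham and Runic.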
The rows below are extended characters for some languages (and other miscellaneous items such as phonetic marks). Keep these if you intend to support that language and its extensions.
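
| Name | Starting (Unicode) | Ending (Unicode) | Starting (UTF-8) | Ending (UTF-8) | Reason | Exclusion Importance for URLs | Exclusion Importance for Blacklist |
| --- | --- | --- | --- | --- | --- | --- | --- |
| Sundanese Supplement | 1CC0 | 1CCF | E1B380 | E1B38F | Unlikely to be used | Low | High |
| Vedic Extensions | 1CD0 | 1CFF | E1B390 | E1B3BF | Unlikely to be used | Low | High |
| Phonetic Extensions | 1D00 | 1DBF | E1B480 | E1B6BF | Unlikely to be used | Low | High |
| Combining Diacritical Marks | 1DC0 | 1DFF | E1B780 | E1B7BF | Combining marks. These should only be used when combined with the previous character, never standalone. Depending on the Unicode implementation, you might therefore need to verify that the previous character can take a combining mark. | High | High |
| Latin Extended | 1E00 | 1EFF | E1B880 | E1BBBF | Unlikely to be used | Low | High |
| Greek Extended | 1F00 | 1FFF | E1BC80 | E1BFBF | Unlikely to be used | Low | High |
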
The rows below contain mostly punctuation, pictures, shapes, etc. You can likely exclude all of these.
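
| Name | Starting (Unicode) | Ending (Unicode) | Starting (UTF-8) | Ending (UTF-8) | Reason | Exclusion Importance for URLs | Exclusion Importance for Blacklist |
| --- | --- | --- | --- | --- | --- | --- | --- |
| General Punctuation | 2000 | 206F | E28080 | E281AF | Symbols and punctuation | High | High |
| Superscripts and Subscripts | 2070 | 209F | E281B0 | E2829F | Unlikely to be used | Low | High |
| Combining Diacritical Marks for Symbols | 20D0 | 20FF | E28390 | E283BF | Symbols and punctuation | High | High |
| Letterlike | 2100 | 214F | E28480 | E2858F | Unlikely to be used | Low | High |
| Number Forms | 2150 | 218F | E28590 | E2868F | Unlikely to be used | Low | High |
| Arrows | 2190 | 21FF | E28690 | E287BF | Arrows | High | High |
| Mathematical Operators | 2200 | 22FF | E28880 | E28BBF | Arrows, bars, dash | High | High |
| Misc. Technical | 2300 | 23FF | E28C80 | E28FBF | Arrows, bars, dash | High | High |
| Control Pictures | 2400 | 243F | E29080 | E290BF | Words | High | High |
| Optical Characters | 2440 | 245F | E29180 | E2919F | Symbols, arrows, dots | High | High |
| Enclosed | 2460 | 24FF | E291A0 | E293BF | Letters and numbers | Low | High |
| Box Drawing | 2500 | 257F | E29480 | E295BF | Lines | High | High |
| Block Elements | 2580 | 259F | E29680 | E2969F | Blocks | High | High |
| Shapes | 25A0 | 25FF | E296A0 | E297BF | Arrows, bars, dots, etc. | High | High |
| Misc. Symbols | 2600 | 26FF | E29880 | E29BBF | Symbols, pictures, etc. | High | High |
| Dingbats | 2700 | 27BF | E29C80 | E29EBF | Symbols, pictures, etc. | High | High |
| Misc. Math | 27C0 | 27EF | E29F80 | E29FAF | Symbols, pictures, etc. | High | High |
| Arrows | 27F0 | 27FF | E29FB0 | E29FBF | Arrows | High | High |
| Braille | 2800 | 28FF | E2A080 | E2A3BF | Dots | High | High |
| Arrows | 2900 | 297F | E2A480 | E2A5BF | Arrows | High | High |
| Misc. Math | 2980 | 29FF | E2A680 | E2A7BF | Symbols | High | High |
| Math | 2A00 | 2AFF | E2A880 | E2ABBF | Symbols | High | High |
| Arrows | 2B00 | 2BFF | E2AC80 | E2AFBF | Symbols | High | High |
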
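If the symbol and punctuation ranges above are all excluded, one plausible way to apply them is a single compiled character class. The sketch below is only an illustration: `SYMBOL_RANGES`, `build_pattern`, `contains_symbol`, and `strip_symbols` are hypothetical names, only five of the rows are copied in, and the `\uXXXX` escape form assumes every code point fits in four hex digits, which holds for the ranges in this sheet.

```python
import re
from typing import Iterable, Pattern, Tuple

# A sample of the symbol rows above, as (start, end) code points.
SYMBOL_RANGES = [
    (0x2190, 0x21FF),  # Arrows
    (0x2500, 0x257F),  # Box Drawing
    (0x2580, 0x259F),  # Block Elements
    (0x2600, 0x26FF),  # Misc. Symbols
    (0x2800, 0x28FF),  # Braille
]


def build_pattern(ranges: Iterable[Tuple[int, int]]) -> Pattern[str]:
    """Compile a character class such as [\\u2190-\\u21FF\\u2500-\\u257F]."""
    parts = [f"\\u{start:04X}-\\u{end:04X}" for start, end in ranges]
    return re.compile("[" + "".join(parts) + "]")


SYMBOL_PATTERN = build_pattern(SYMBOL_RANGES)


def contains_symbol(text: str) -> bool:
    """Return True if the text contains any character from the ranges."""
    return SYMBOL_PATTERN.search(text) is not None


def strip_symbols(text: str) -> str:
    """Return the text with every character from the ranges removed."""
    return SYMBOL_PATTERN.sub("", text)
```

Whether to reject input outright (`contains_symbol`) or silently drop the characters (`strip_symbols`) depends on the use case; for URLs, rejecting is usually the safer choice because stripping can change what a string refers to.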
The rows below contain archaic and historic characters or extensions that are unlikely to be used. Only include the sets that you are certain you need.
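
| Name | Starting (Unicode) | Ending (Unicode) | Starting (UTF-8) | Ending (UTF-8) | Reason | Exclusion Importance for URLs | Exclusion Importance for Blacklist |
| --- | --- | --- | --- | --- | --- | --- | --- |
| Glagolitic | 2C00 | 2C5F | E2B080 | E2B19F | Unlikely to be used | Low | High |
| Latin Extended | 2C60 | 2C7F | E2B1A0 | E2B1BF | Unlikely to be used | Low | High |
| Coptic | 2C80 | 2CFF | E2B280 | E2B3BF | Unlikely to be used | Low | High |
| Georgian | 2D00 | 2D2F | E2B480 | E2B4AF | Unlikely to be used | Low | High |
| Tifinagh | 2D30 | 2D7F | E2B4B0 | E2B5BF | Unlikely to be used | Low | High |
| Ethiopic Extended | 2D80 | 2DDF | E2B680 | E2B79F | Unlikely to be used | Low | High |
| Cyrillic Extended | 2DE0 | 2DFF | E2B7A0 | E2B7BF | Unlikely to be used | Low | High |
| Punctuation | 2E00 | 2E7F | E2B880 | E2B9BF | Punctuation | High | High |
| CJK Punctuation | 3000 | 303F | E38080 | E380BF | Punctuation | High | High |
| Lisu and above | A4D0 | ABFF | EA9390 | EAAFBF | Random | High | High |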
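The UTF-8 columns follow mechanically from the code point columns, so a row can be cross-checked with a few lines of code. This is a minimal sketch; `utf8_hex` and `row_is_consistent` are hypothetical helper names, not part of the sheet.

```python
def utf8_hex(code_point: int) -> str:
    """Return the UTF-8 encoding of a code point as uppercase hex."""
    return chr(code_point).encode("utf-8").hex().upper()


def row_is_consistent(start: int, end: int, utf8_start: str, utf8_end: str) -> bool:
    """Check that a row's UTF-8 columns match its code point columns."""
    return (
        utf8_hex(start) == utf8_start.upper()
        and utf8_hex(end) == utf8_end.upper()
    )


# Example: the Glagolitic row (2C00 to 2C5F, E2B080 to E2B19F).
glagolitic_ok = row_is_consistent(0x2C00, 0x2C5F, "E2B080", "E2B19F")
```

Under this check, the Latin-1 Supplement row listed as 007F to 009F with UTF-8 values C280 to C2A0 does not round-trip (007F encodes as 7F and 009F as C29F, while C280 and C2A0 decode to 0080 and 00A0), so that row may be worth reviewing before it is used.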