A | B | C | D | E | F | G | H | I | J | K | L | M | N | O | P | Q | R | S | T | ||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
1 | For sorting names in phone or address books or similar, we believe that it is most natural to have names in the UI language's native script sort first, followed by names in other writing systems. | This spreadsheet includes only languages that use non-Latin scripts and already have CLDR collation data. | |||||||||||||||||||
2 | The default script sort order is: | Latin Greek Cyrillic Hebrew Arabic Ethiopic Indic Thai Lao Tibetan Myanmar Khmer Hangul Kana Bopomofo Han (http://www.unicode.org/charts/collation/) | Indic = Devanagari Bengali Gurmukhi Gujarati Oriya Tamil Telugu Kannada Malayalam Sinhala | ||||||||||||||||||
3 | Language code | Language name | Which script(s) should sort first, before the remaining scripts? | Comments | Script codes | ||||||||||||||||
4 | ar | Arabic | Arabic | already in CLDR data | Arab | ||||||||||||||||
5 | as | Assamese | Bengali Devanagari Gurmukhi Gujarati Oriya Tamil Telugu Kannada Malayalam Sinhala | For Indic languages, we should sort all Indic scripts first before non-Indic scripts, unless more specific data is available. | Beng Deva Guru Gujr Orya Taml Telu Knda Mlym Sinh | ||||||||||||||||
6 | az | Azerbaijani | Latin Cyrillic | Latn Cyrl | |||||||||||||||||
7 | be | Belarusian | Cyrillic | Cyrl | |||||||||||||||||
8 | bg | Bulgarian | Cyrillic | Cyrl | |||||||||||||||||
9 | bn | Bengali | Bengali Devanagari Gurmukhi Gujarati Oriya Tamil Telugu Kannada Malayalam Sinhala | Beng Deva Guru Gujr Orya Taml Telu Knda Mlym Sinh | |||||||||||||||||
10 | bs_Cyrl | Bosnian/Cyrillic | Cyrillic | already in CLDR data | Cyrl | ||||||||||||||||
11 | bs | Bosnian/Latin | Latin Cyrillic | already in CLDR data | Latn Cyrl | ||||||||||||||||
12 | dz | Dzongkha | Tibetan | Tibt | |||||||||||||||||
13 | el | Greek | Greek | Grek | |||||||||||||||||
14 | fa | Persian | Arabic | Arab | |||||||||||||||||
15 | gu | Gujarati | Gujarati Devanagari Bengali Gurmukhi Oriya Tamil Telugu Kannada Malayalam Sinhala | Gujr Deva Beng Guru Orya Taml Telu Knda Mlym Sinh | |||||||||||||||||
16 | he | Hebrew | Hebrew | already in CLDR data | Hebr | ||||||||||||||||
17 | hi | Hindi | Devanagari Bengali Gurmukhi Gujarati Oriya Tamil Telugu Kannada Malayalam Sinhala | Deva Beng Guru Gujr Orya Taml Telu Knda Mlym Sinh | |||||||||||||||||
18 | hr | Croatian | Latin Cyrillic | already in CLDR data | Latn Cyrl | ||||||||||||||||
19 | hy | Armenian | Armenian | Armn | |||||||||||||||||
20 | ja | Japanese | Kana Kanji | Kanji=Han; JIS X 4061 actually sorts Greek Cyrillic Latin Kana Kanji: http://search.cpan.org/~sadahiro/Lingua-JA-Sort-JIS-0.10/JIS.pod & http://homepage1.nifty.com/nomenclator/perl/ShiftJIS-Collate.html | Kana Hani | ||||||||||||||||
21 | kk | Kazakh | Cyrillic | Cyrl | |||||||||||||||||
22 | km | Khmer | Khmer | Khmr | |||||||||||||||||
23 | kn | Kannada | Kannada Devanagari Bengali Gurmukhi Gujarati Oriya Tamil Telugu Malayalam Sinhala | Knda Deva Beng Guru Gujr Orya Taml Telu Mlym Sinh | |||||||||||||||||
24 | ko | Korean | Hangul Hanja | Hanja=Han | Hang Hani | ||||||||||||||||
25 | kok | Konkani | Devanagari Bengali Gurmukhi Gujarati Oriya Tamil Telugu Kannada Malayalam Sinhala | Deva Beng Guru Gujr Orya Taml Telu Knda Mlym Sinh | |||||||||||||||||
26 | mk | Macedonian | Cyrillic | Cyrl | |||||||||||||||||
27 | ml | Malayalam | Malayalam Latin Devanagari Arabic Tamil Kannada Telugu Bengali Gurmukhi Gujarati Oriya Sinhala | native speaker's specific list | Mlym Latn Deva Arab Taml Knda Telu Beng Guru Gujr Orya Sinh | ||||||||||||||||
28 | mr | Marathi | Devanagari Bengali Gurmukhi Gujarati Oriya Tamil Telugu Kannada Malayalam Sinhala | Deva Beng Guru Gujr Orya Taml Telu Knda Mlym Sinh | |||||||||||||||||
29 | my | Burmese | Myanmar | Mymr | |||||||||||||||||
30 | or | Oriya | Oriya Devanagari Bengali Gurmukhi Gujarati Tamil Telugu Kannada Malayalam Sinhala | Orya Deva Beng Guru Gujr Taml Telu Knda Mlym Sinh | |||||||||||||||||
31 | pa | Punjabi | Gurmukhi Devanagari Bengali Gujarati Oriya Tamil Telugu Kannada Malayalam Sinhala Arabic | Guru Deva Beng Gujr Orya Taml Telu Knda Mlym Sinh Arab | |||||||||||||||||
32 | ps | Pashto | Arabic | Arab | |||||||||||||||||
33 | ru | Russian | Cyrillic | already in CLDR data | Cyrl | ||||||||||||||||
34 | si | Sinhala | Sinhala Devanagari Bengali Gurmukhi Gujarati Oriya Tamil Telugu Kannada Malayalam | Sinh Deva Beng Guru Gujr Orya Taml Telu Knda Mlym | |||||||||||||||||
35 | sr_Latn | Serbian/Latin | Latin Cyrillic | already in CLDR data | Latn Cyrl | ||||||||||||||||
36 | sr | Serbian/Cyrillic | Cyrillic Latin | already in CLDR data | Cyrl Latn | ||||||||||||||||
37 | ta | Tamil | Tamil Devanagari Bengali Gurmukhi Gujarati Oriya Telugu Kannada Malayalam Sinhala | Taml Deva Beng Guru Gujr Orya Telu Knda Mlym Sinh | |||||||||||||||||
38 | te | Telugu | Telugu Devanagari Bengali Gurmukhi Gujarati Oriya Tamil Kannada Malayalam Sinhala | Telu Deva Beng Guru Gujr Orya Taml Knda Mlym Sinh | |||||||||||||||||
39 | th | Thai | Thai | already in CLDR data | Thai | ||||||||||||||||
40 | uk | Ukrainian | Cyrillic | Cyrl | |||||||||||||||||
41 | ur | Urdu | Arabic | Arab | |||||||||||||||||
42 | zh-u-co-pinyin | Chinese/simplified Pinyin order | Han | Hani | |||||||||||||||||
43 | zh-u-co-gb2312 | Chinese/simplified GB2312 order | Latin Han | Latin first to match charset order | Latn Hani | ||||||||||||||||
44 | zh-u-co-stroke | Chinese/traditional Stroke order | Han Bopomofo | Hani Bopo | |||||||||||||||||
45 | zh-u-co-zhuyin | Chinese/traditional Bopomofo order | Han Bopomofo | Hani Bopo | |||||||||||||||||
46 | zh-u-co-big5han | Chinese/traditional Big5 order | Latin Han Bopomofo | Latin first to match charset order | Latn Hani Bopo | ||||||||||||||||
47 | zh-u-co-unihan | Chinese, Unicode Radical/Stroke order | Han Bopomofo | Hani Bopo | |||||||||||||||||
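
The table gives, per language, only the scripts that should sort first; the remaining scripts keep the default order from row 2. A minimal Python sketch of that rule follows. It is an illustration under stated assumptions, not how CLDR or any collation library implements reordering: the names `DEFAULT_ORDER`, `full_script_order`, `script_of`, and `sort_names` are hypothetical, the script of a name is guessed from the first word of its first letter's Unicode character name (a rough heuristic covering only a few scripts), and names within one script are compared by code point rather than by real collation rules.

```python
import unicodedata

# Default script order from row 2 of the sheet, with "Indic" expanded
# into the ten scripts listed there.
DEFAULT_ORDER = [
    "Latn", "Grek", "Cyrl", "Hebr", "Arab", "Ethi",
    "Deva", "Beng", "Guru", "Gujr", "Orya",
    "Taml", "Telu", "Knda", "Mlym", "Sinh",
    "Thai", "Laoo", "Tibt", "Mymr", "Khmr",
    "Hang", "Kana", "Bopo", "Hani",
]

# Assumption: the first word of a character's Unicode name is a good enough
# hint of its script for this demo. Only a handful of scripts are mapped.
NAME_PREFIX_TO_SCRIPT = {
    "LATIN": "Latn", "GREEK": "Grek", "CYRILLIC": "Cyrl",
    "HEBREW": "Hebr", "ARABIC": "Arab", "DEVANAGARI": "Deva",
    "THAI": "Thai", "HANGUL": "Hang",
    "HIRAGANA": "Kana", "KATAKANA": "Kana", "CJK": "Hani",
}


def full_script_order(first_scripts):
    """Put the language's preferred scripts first, then the rest in default order."""
    rest = [s for s in DEFAULT_ORDER if s not in first_scripts]
    return list(first_scripts) + rest


def script_of(text):
    """Guess the script code of the first letter in text, or None if unknown."""
    for ch in text:
        if ch.isalpha():
            prefix = unicodedata.name(ch, "").split(" ")[0]
            return NAME_PREFIX_TO_SCRIPT.get(prefix)
    return None


def sort_names(names, first_scripts):
    """Sort names by script rank first, then by code point within a script."""
    order = full_script_order(first_scripts)
    rank = {script: i for i, script in enumerate(order)}

    def key(name):
        # Names in unmapped scripts sort after all known scripts.
        return (rank.get(script_of(name), len(order)), name)

    return sorted(names, key=key)


# Row "ru": Cyrillic first, then the default order (Latin, Greek, ...).
print(sort_names(["Anna", "Ωmega", "Иван"], ["Cyrl"]))
```

For the `ru` row this places the Cyrillic name ahead of the Latin and Greek ones. In CLDR collation data the same intent is expressed with a reorder setting such as `[reorder Cyrl]`, and a production address book would rely on a collation library rather than on a hand-rolled key like this one.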