Root Zone LGR for script: Latin (Latn) | rz-lgr-5-latin-script-26may22-en |
---|
This document is mechanically formatted from the above XML file for the LGR. It provides additional summary data and explanatory text. The XML file remains the sole normative specification of the LGR.
Date | 2022-05-26 |
---|---|
LGR Version | 5 (Root Zone LGR for the Latin Script) |
Language | und-Latn (Latin Script) |
Scope | domain: "." (Root) |
Unicode Version | 11.0.0 |
This file contains Label Generation Rules (LGR) for the Latin script for the Root Zone. For more details on this LGR and additional background on the script, see “Proposal for a Latin Script Root Zone LGR” [Proposal-Latin]], including the appendices [Proposal-Latin-Appendices]. This file is one of a set of LGR files that together form an integrated LGR for the DNS Root Zone [RZ-LGR-5]. The format of this file follows [RFC 7940].
According to Section 5, “Repertoire” in [Proposal-Latin], the repertoire contains the 197 code points needed to write hundreds of languages in the Latin script. An additional 7 code points are available as part of 21 explicitly defined combining sequences which form part of the repertoire. One additional in-script sequence is defined for use in variants. This results in a total of 219 repertoire elements.
The repertoire is based on [MSR-5], which is a subset of Unicode 11.0.0 [Unicode 11.0].
Each code point is tagged with the script or scripts with which the code point is used. For each repertoire element, one or more references document sufficient justification for inclusion in the repertoire; see “References” below. For code points that are part of the repertoire, comments identify the languages using it along with their [EGIDS] level.
Code points outside the Latin script that are listed in this file are targets for out-of-script variants and are identified by a reflexive (identity) variant of type “out-of-repertoire-var”. They do not form part of the repertoire.
As part of the Root Zone, this LGR includes neither decimal digits nor the HYPHEN-MINUS.
The Latin script is a major writing system of the world, and the most widely used in terms of the number of languages and speakers, with circa 70% of the world’s readers and writers making use of this script. From a list of 1,189 languages using the Latin script [Omniglot] the 212 languages that were taken into consideration contain all 182 languages with [EGIDS] level 1–4 together with many languages with EGIDS level 5, each spoken by more than 1 million estimated speakers. Altogether over 100 languages are cited here to justify specific additions to the repertoire, but many other languages may also be written using some subset of the repertoire of this LGR. In a few cases, code points were excluded in [MSR-5] due to security concerns; for the affected languages, only a subrepertoire could be supported. More details in Section 3, “Background on Script and Principal Languages Using It” of [Proposal-Latin].
According to Section 6, “Variants”, in [Proposal-Latin], this LGR defines the principles and methodology for developing both in-script and cross-scripts variants. The variant sets are defined in Section 6.3, “Variant Sets” and detailed analysis can be found in Appendix D of [Proposal-Latin]. See also Section 6.4, “Other Considerations for Variant Analysis”.
Variant Disposition: Except for limited exceptions for the variants defined for Latin Small Letter Sharp S and Latin Small Letter Dotless I all variants defined here result in a variant label disposition of “blocked”
The specification of variants in the Root Zone LGR follows the guidelines in [RFC 8228].
In the case of U+00DF (ß) Latin Small Letter Sharp S used in German, the LGR includes the code point together with a variant relationship with the sequence of two letters “ss” (U+0073 U+0073), as described in Section 6.4.2, “IDNA2003 Compatibility” in [Proposal-Latin]. To reduce the number of allocatable labels in case the applied-for label contains multiple sharp s letters, the LGR defines special variant types and actions so that only two allocatable variants are allowed for each label: the label as applied-for (original label), and the label with only the sequence ss. Note that this restriction is independent of the restriction on variants of dotless i, see below, so that a total of up to four allocatable labels may exist.
Overlapped Variant Sequence: Both “ss” and “s” coexist in the repertoire and “s” has variant relationships on its own. These variants thus overlap: making the variant set well-behaved for index variant calculation requires that the sequence “ss” also be given variants to all permutations of variants for the letter s followed by itself, as well as all transitive variants due to other variants for U+00DF (ß).
The LGR defines a variant between U+0131 (Small Latin Letter Dotless i) and U+0069 (Small Latin Letter i) such that labels containing the any dotless i have allocatable variant labels that only have U+0069 (i), as described in Section 6.4.2, “IDNA2003 Compatibility” in [Proposal-Latin]. To reduce the number of allocatable labels in case the applied-for label contains multiple instances of dotless i, the LGR defines special variant types and actions so that only two allocatable variants are allowed for each label: the label as applied-for (original label), and the label with only dotted letters i. Note that this restriction is independent of the restriction on variants of sharp s, see above, so that a total of up to four allocatable labels may exist.
U+00DF (ß) Sharp S has been given the reflexive variant type “r-eszett” and U+0131 (ı) Dotless i has been given the reflexive variant of type “r-dotless”. (By convention, the prefix “r-“ marks a type used in a reflexive variant mapping, that is, it represents an instance of the original code point at that location in a variant label, see Section 5.3.4 in [RFC 7940].)
The variant mapping from U+00DF (ß) Sharp S to “ss” is of type “eszett-to-ss”, while the variant type for the mapping from “ss” to Sharp S is “blocked”. The variant mapping from U+0131 (ı) Dotless i to “i” is of type “dotted”, while the variant type for the mapping from “i” to Dotless i is “blocked”. Special <action> elements defined for this LGR use these types to ensure the following restrictions. (See also “Latin-specific Actions” below.)
To limit the number of allocatable variants, a label is only allocatable when the variants for any U+00DF (ß) in the label are all “ss” ; or the variants for any Dotless i are all “i” ; or both; or when it is the original label as applied for. As a result of these actions, any label may result in at most four allocatable labels including the original label as applied-for. However, if it is applied for without Sharp S or Dotless i it will not result in any allocatable variant labels.
The Latin script is closely related to the Greek and Cyrillic scripts, with a more distant relation to the Armenian script. These relationships give rise to shared forms that necessitate cross-script variants. A number of letter forms like “i”, “o” and “c” are rather generic in appearance and thus give rise to cross-script variant relations with otherwise unrelated scripts.
Where homoglyphs or near homoglyphs with Latin code points exist in other scripts, out-of-repertoire variants are defined with a comment “Cross-script homoglyph” or “Cross-script near homoglyph” respectively.
Because variant mappings, including cross-script variants, must be symmetric and transitive this LGR inherits additional blocked cross-script and in-script variants by integration; while not alwayas further identified, all such variants are listed here in full for ease of reference. However, always use the Common LGR [RZ-LGR-5] for determining cross-script collisions of labels.
In particular, the Latin LGR inherits a number in-script variants as result of integration, primarily due to variants with the Greek script. These variants are marked in comments as “Required for integration”.
For more details, see Section 6.3.2 “Cross-Script Variants” in [Proposal-Latin].
Overlapped sequence: one sequence with a variant “ss” is overlapped with a code point “s” that has a variant of its own. To ensure that the sets of variant labels are well-behaved, additional variant mappings had to be defined, including the following out-of-repertoire sequences: U+0455 U+0455 (ѕѕ), U+0D1F U+0D1F (ടട).
The LGR defines no character classes.
The LGR includes the set of required default WLE rules and actions applicable to the Root Zone and defined in [MSR-5]. They are marked with ⍟.
This LGR does not define Latin-specific Whole Label Evaluation Rules.
The LGR includes the set of required default actions applicable to the Root Zone and defined in [MSR-5]. They are marked with ⍟.
The LGR contains additional Latin-specific actions as described in Section 6 of [Proposal-Latin]. These resolve the extended set of variant types into a disposition for variant labels of either “allocatable” or “blocked”. Latin-specific actions that are triggered by the LGR-specific variant types described above limit the “allocatable” variant labels to those containing only “ss” or dotted “i” variants or both, while disallowing mixed use of “ss” and “ß” or Dotless i and “i” respectively, except as in the original applied-for label. To account for original code points in a permuted variant, reflexive variant mappings with an “r-” prefix are used. (See [RFC 7940]).
Note that variant mapping types are not symmetric: they depend on which code point is considered the source or the target in a given mapping. As specified in [RFC 7940], mapping types are evaluated for each permutation of a label and its variants, with code points that are unchanged in a given label given the type of their “reflexive” mapping. The actions finally evaluate the collected set of mapping types and resolve them into one of two dispositions for the variant label. Per [RFC 7940] actions are always applied one after the other, and the evaluation stops at the first action that assigns a disposition to a given label.
For more information on how to assign a variant label disposition under this LGR, see Section 6.4.2, “IDNA2003 Compatibility” in [Proposal-Latin]. The specification of variants in the LGR follows the guidelines in [RFC 8228].
The Root Zone LGR for the Latin script was developed by the Latin Generation Panel. For additional detail on methodology and contributors see Sections 4 and 8 in [Proposal-Latin], as well as [RZ-LGR-5-Overview].
The following general references are cited in this document:
For references consulted particularly in designing the repertoire for the Latin script for the Root Zone please see details in the Table of References below. References [0] and up refer to the Unicode Standard versions in which corresponding code points were initially encoded. References [101] and up correspond to a source given in [Proposal-Latin] for justifying the inclusion of the corresponding code points. Entries in the table may have multiple source reference values.
Number of elements in repertoire | 219 | ||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Out-of-repertoire variants | 67 | ||||||||||||||||
Total entries in table | 286 | ||||||||||||||||
Number of code points for each script |
|
||||||||||||||||
Number of code points | 262 | ||||||||||||||||
Number of sequences | 24 | ||||||||||||||||
Longest code point sequence | 3 | ||||||||||||||||
Code points defined via sequence | 7 |
The following table lists the repertoire by code point (or code point sequence). The data in the Script and Name column are extracted from the Unicode character database. Where a comment in the original LGR is equal to the character name, it has been suppressed.
Some code points that may be part of a valid label under this LGR only occur as part of one or more sequences. Such code points are not listed individually in the table.
For any code point or sequence for which a variant is defined, additional information is provided in the Variants column. Some code points or sequences listed in the following table are not part of the repertoire itself; they document targets for out-of-repertoire variant mappings as indicated. See also the legend provided below the table.
Code Point |
Glyph | Script | Name | Ref | Part of Repertoire |
Variants | Comment |
---|---|---|---|---|---|---|---|
U+0061 | a | Latin | LATIN SMALL LETTER A | [0], [99] | ✔ | set 1 | Basic Latin |
U+0061 U+0331 | a̱ | {Latin, Inherited} | LATIN SMALL LETTER A + COMBINING MACRON BELOW | [129], [146] | ✔ | Nuer (4) | |
U+0062 | b | Latin | LATIN SMALL LETTER B | [0], [99] | ✔ | Basic Latin | |
U+0063 | c | Latin | LATIN SMALL LETTER C | [0], [99] | ✔ | set 2 | Basic Latin |
U+0064 | d | Latin | LATIN SMALL LETTER D | [0], [99] | ✔ | Basic Latin | |
U+0065 | e | Latin | LATIN SMALL LETTER E | [0], [99] | ✔ | set 3 | Basic Latin |
U+0065 U+0331 | e̱ | {Latin, Inherited} | LATIN SMALL LETTER E + COMBINING MACRON BELOW | [146] | ✔ | Nuer (4) | |
U+0066 | f | Latin | LATIN SMALL LETTER F | [0], [99] | ✔ | set 4 | Basic Latin |
U+0067 | g | Latin | LATIN SMALL LETTER G | [0], [99] | ✔ | set 5 | Basic Latin |
U+0067 U+0303 | g̃ | {Latin, Inherited} | LATIN SMALL LETTER G + COMBINING TILDE | [142], [143] | ✔ | set 6 | Guarani (1) |
U+0068 | h | Latin | LATIN SMALL LETTER H | [0], [99] | ✔ | set 7 | Basic Latin |
U+0069 | i | Latin | LATIN SMALL LETTER I | [0], [99] | ✔ | set 8 | Basic Latin |
U+0069 U+0331 | i̱ | {Latin, Inherited} | LATIN SMALL LETTER I + COMBINING MACRON BELOW | [146] | ✔ | Nuer (4) | |
U+006A | j | Latin | LATIN SMALL LETTER J | [0], [99] | ✔ | set 9 | Basic Latin |
U+006B | k | Latin | LATIN SMALL LETTER K | [0], [99] | ✔ | Basic Latin | |
U+006C | l | Latin | LATIN SMALL LETTER L | [0], [99] | ✔ | set 10 | Basic Latin |
U+006D | m | Latin | LATIN SMALL LETTER M | [0], [99] | ✔ | Basic Latin | |
U+006D U+0327 | m̧ | {Latin, Inherited} | LATIN SMALL LETTER M + COMBINING CEDILLA | [136], [213], [214] | ✔ | Marshallese (1) | |
U+006E | n | Latin | LATIN SMALL LETTER N | [0], [99] | ✔ | set 11 | Basic Latin |
U+006E U+0304 | n̄ | {Latin, Inherited} | LATIN SMALL LETTER N + COMBINING MACRON | [136], [200], [213] | ✔ | set 12 | Raga (Hano) (3), Marshallese (1) |
U+006E U+0308 | n̈ | {Latin, Inherited} | LATIN SMALL LETTER N + COMBINING DIAERESIS | [276] | ✔ | Malagasy (1) | |
U+006F | o | Latin | LATIN SMALL LETTER O | [0], [99] | ✔ | set 13 | Basic Latin |
U+006F U+0327 | o̧ | {Latin, Inherited} | LATIN SMALL LETTER O + COMBINING CEDILLA | [136] | ✔ | Marshallese (1) | |
U+006F U+0331 | o̱ | {Latin, Inherited} | LATIN SMALL LETTER O + COMBINING MACRON BELOW | [129], [146] | ✔ | Nuer (4) | |
U+0070 | p | Latin | LATIN SMALL LETTER P | [0], [99] | ✔ | set 14 | Basic Latin |
U+0071 | q | Latin | LATIN SMALL LETTER Q | [0], [99] | ✔ | set 15 | Basic Latin |
U+0072 | r | Latin | LATIN SMALL LETTER R | [0], [99] | ✔ | set 16 | Basic Latin |
U+0072 U+0303 | r̃ | {Latin, Inherited} | LATIN SMALL LETTER R WITH TILDE | [147] | ✔ | Hausa (2) | |
U+0073 | s | Latin | LATIN SMALL LETTER S | [0], [99] | ✔ | set 17 | Basic Latin |
U+0073 U+0073 | ss | {Latin} | LATIN SMALL LETTER S + LATIN SMALL LETTER S | ✔ | set 18 | Sequence added for variant mapping | |
U+0074 | t | Latin | LATIN SMALL LETTER T | [0], [99] | ✔ | Basic Latin | |
U+0075 | u | Latin | LATIN SMALL LETTER U | [0], [99] | ✔ | set 19 | Basic Latin |
U+0076 | v | Latin | LATIN SMALL LETTER V | [0], [99] | ✔ | set 20 | Basic Latin |
U+0077 | w | Latin | LATIN SMALL LETTER W | [0], [99] | ✔ | Basic Latin | |
U+0078 | x | Latin | LATIN SMALL LETTER X | [0], [99] | ✔ | set 21 | Basic Latin |
U+0079 | y | Latin | LATIN SMALL LETTER Y | [0], [99] | ✔ | set 22 | Basic Latin |
U+007A | z | Latin | LATIN SMALL LETTER Z | [0], [99] | ✔ | Basic Latin | |
U+00DF | ß | Latin | LATIN SMALL LETTER SHARP S | [0], [119] | ✔ | set 18 | German (1) |
U+00E0 | à | Latin | LATIN SMALL LETTER A WITH GRAVE | [0], [106], [114], [130], [131], [132] | ✔ | set 23 | Italian (1), French (1), Galician (2), Wolof (4) |
U+00E1 | á | Latin | LATIN SMALL LETTER A WITH ACUTE | [0], [100], [101], [102], [103], [105], [106], [107], [108] | ✔ | set 1 | Spanish (1), Czech (1), Icelandic (1), Faroese (2), Chuukese (2), Galician (2), Lule Sami (2), Northern Sami (2) |
U+00E2 | â | Latin | LATIN SMALL LETTER A WITH CIRCUMFLEX | [0], [106], [109], [110], [113], [114], [115], [116], [117], [275] | ✔ | Vietnamese (1), Romanian (1), Skolt Sami (2), French (1), Galician (2), West Frisian (1), Friulian (4), Xavante (4) | |
U+00E3 | ã | Latin | LATIN SMALL LETTER A WITH TILDE | [0], [141], [142], [143], [144], [145] | ✔ | set 24 | Umbundu (3), Guarani (1), Nauruan (3), Khoekhoe (4) |
U+00E4 | ä | Latin | LATIN SMALL LETTER A WITH DIAERESIS | [0], [107], [119], [120], [121], [122], [123], [124], [125], [126], [127], [128], [129] | ✔ | set 25 | German (1), Finnish (1), Turkmen (1), Estonian (1), Swedish (1), Lule Sami (2), Yapese (2), Dinka (4), Kaqchikel (4), Bashkir (4), Alsatian (5), Nuer (4) |
U+00E5 | å | Latin | LATIN SMALL LETTER A WITH RING ABOVE | [0], [107], [120], [123], [139], [140] | ✔ | Danish (1), Finnish (1), Chamorro (1), Swedish (1), Lule Sami (2) | |
U+00E6 | æ | Latin | LATIN SMALL LETTER AE | [0], [102], [103], [139] | ✔ | set 26 | Danish (1), Icelandic (1), Faroese (2) |
U+00E7 | ç | Latin | LATIN SMALL LETTER C WITH CEDILLA | [0], [106], [114], [116], [121], [127], [157], [158], [159], [160], [161] | ✔ | set 27 | Turkish (1), Turkmen (1), Kurdish (2), French (1), Azerbaijani (1), Basque (1), Galician (2), Friulian (4), Bashkir (4) |
U+00E8 | è | Latin | LATIN SMALL LETTER E WITH GRAVE | [0], [114], [130], [175], [182], [183] | ✔ | French (1), Italian (1), Afrikaans (1), Haitian Creole (1), French (1) | |
U+00E9 | é | Latin | LATIN SMALL LETTER E WITH ACUTE | [0], [100], [101], [102], [105], [106], [114], [115], [117], [130], [132], [275] | ✔ | French (1), Italian (1), Spanish (1), Czech (1), Icelandic (1), Chuukese (2), Galician (2), Wolof (4), Xavante (4), West Frisian (2) | |
U+00EA | ê | Latin | LATIN SMALL LETTER E WITH CIRCUMFLEX | [0], [109], [114], [115], [116], [158], [173], [174], [175] | ✔ | French (1), Tswana (1), Afrikaans (1), Vietnamese (1), Kurdish (2), West Frisian (2), Friulian (4) | |
U+00EB | ë | Latin | LATIN SMALL LETTER E WITH DIAERESIS | [0], [114], [115], [124], [126], [129], [132], [175], [176], [177], [179], [180] | ✔ | set 28 | Afrikaans (1), Albanian (1), French (1), Uyghur (2), Yapese (2), Wolof (4), Drehu (4), Kaqchikel (4), West Frisian (2), Nuer (4) |
U+00EC | ì | Latin | LATIN SMALL LETTER I WITH GRAVE | [0], [130], [206], [208] | ✔ | Italian (1) | |
U+00ED | í | Latin | LATIN SMALL LETTER I WITH ACUTE | [0], [100], [101], [102], [103], [106], [127] | ✔ | set 8 | Spanish (1), Czech (1), Icelandic (1), Faroese (2), Galician (2), Bashkir (4) |
U+00EE | î | Latin | LATIN SMALL LETTER I WITH CIRCUMFLEX | [0], [110], [114], [116], [158], [175] | ✔ | Afrikaans (1), Romanian (1), Kurdish (2), French (1), Friulian (4) | |
U+00EF | ï | Latin | LATIN SMALL LETTER I WITH DIAERESIS | [0], [114], [115], [125], [126], [175] | ✔ | set 8 | Afrikaans (1), French (1), Kaqchikel (4), Dinka (4), West Frisian (2) |
U+00F0 | ð | Latin | LATIN SMALL LETTER ETH | [0], [102], [103] | ✔ | Faroese (2), Icelandic (1) | |
U+00F1 | ñ | Latin | LATIN SMALL LETTER N WITH TILDE | [0], [106], [127], [132], [136], [142], [143], [144], [149], [160], [197], [205], [221], [222], [223], [224], [225], [226], [227], [228], [229] | ✔ | set 12 | Spanish (1), Fula (3), Chamorro (1), Filipino (1), Guarani (1), Chavacano (4), Basque (1), Galician (2), Iloco (3), Quechua (3), Cape Verdean Creole (4), Waray-Waray (3), Wolof (4), Nauruan (3), Lozi (4), Bashkir (4), Marshallese (1), Mandinka (5), Igbo (2) |
U+00F2 | ò | Latin | LATIN SMALL LETTER O WITH GRAVE | [0], [130], [182], [183] | ✔ | set 29 | Italian (1), Haitian Creole (1) |
U+00F3 | ó | Latin | LATIN SMALL LETTER O WITH ACUTE | [0], [100], [101], [102], [105], [106], [132], [152] | ✔ | set 13 | Spanish (1), Polish (1), Czech (1), Icelandic (1), Chuukese (2), Galician (2), Wolof (4) |
U+00F4 | ô | Latin | LATIN SMALL LETTER O WITH CIRCUMFLEX | [0], [106], [109], [114], [115], [116], [117], [173], [174], [175], [230], [275] | ✔ | Tswana (1), Afrikaans (1), Vietnamese (1), French (1), Northern Sotho (1), West Frisian (2), Galician (2), Friulian (4), Xavante (4) | |
U+00F5 | õ | Latin | LATIN SMALL LETTER O WITH TILDE | [0], [113], [117], [122], [141], [142], [143], [144], [145], [275] | ✔ | set 30 | Estonian (1), Skolt Sami (2), Umbundu (3), Guarani (1), Nauruan (3), Xavante (4), Khoekhoe (4) |
U+00F6 | ö | Latin | LATIN SMALL LETTER O WITH DIAERESIS | [0], [115], [119], [120], [123], [124], [125], [126], [127], [129], [157], [175], [179], [180], [232] | ✔ | set 31 | German (1), Finnish (1), Afrikaans (1), Turkish (1), Swedish (1), Uygur (2), Yapese (2), Drehu (4), Kaqchikel (4), Dinka (4), Bashkir (4), Chechen (2), 1992 Version, West Frisian (2), Nuer (4) |
U+00F8 | ø | Latin | LATIN SMALL LETTER O WITH STROKE | [0], [103], [139] | ✔ | Danish (1), Faroese (2) | |
U+00F9 | ù | Latin | LATIN SMALL LETTER U WITH GRAVE | [0], [114], [130], [206], [245], [246], [253] | ✔ | set 32 | Italian (1), French (1), Papiamento (1) |
U+00FA | ú | Latin | LATIN SMALL LETTER U WITH ACUTE | [0], [100], [101], [102], [103], [105], [106], [115] | ✔ | set 19 | Spanish (1), Czech (1), Icelandic (1), Faroese (2), Chuukese (2), West Frisian (2), Galician (2) |
U+00FB | û | Latin | LATIN SMALL LETTER U WITH CIRCUMFLEX | [0], [114], [115], [116], [158], [175], [202], [243] | ✔ | Afrikaans (1), Kurdish (2), French (1), Miskito (2), West Frisian (2), Friulian (4), Zazaki (4) | |
U+00FC | ü | Latin | LATIN SMALL LETTER U WITH DIAERESIS | [0], [100], [106], [114], [119], [123], [126], [127], [157], [159], [161], [175], [179] | ✔ | set 19 | German (1), Spanish (1), Afrikaans (1), Turkish (1), Swedish (1), French (1), Azeri (1), Basque (1), Galician (2), Uygur (2), Kaqchikel (4), Bashkir (4) |
U+00FD | ý | Latin | LATIN SMALL LETTER Y WITH ACUTE | [0], [101], [102], [103], [121], [142], [143] | ✔ | set 33 | Turkmen (1), Czech (1), Icelandic (1), Faroese (2), Guarani (1) |
U+00FE | þ | Latin | LATIN SMALL LETTER THORN | [0], [102] | ✔ | Icelandic (1) | |
U+00FF | ÿ | Latin | LATIN SMALL LETTER Y WITH DIAERESIS | [0], [114], [253], [257] | ✔ | set 34 | French (1) |
U+0101 | ā | Latin | LATIN SMALL LETTER A WITH MACRON | [0], [133], [134], [135], [136] | ✔ | set 24 | Latvian (1), Tongan (1), Hawaiian (2), Marshallese (1) |
U+0103 | ă | Latin | LATIN SMALL LETTER A WITH BREVE | [0], [109], [110] | ✔ | set 35 | Vietnamese (1), Romanian (1) |
U+0105 | ą | Latin | LATIN SMALL LETTER A WITH OGONEK | [0], [137], [138] | ✔ | Polish (1), Lithuanian (1) | |
U+0107 | ć | Latin | LATIN SMALL LETTER C WITH ACUTE | [0], [150], [151], [152] | ✔ | set 36 | Croatian (1), Serbian (1), Polish (1) |
U+0109 | ĉ | Latin | LATIN SMALL LETTER C WITH CIRCUMFLEX | [0], [255] | ✔ | Esperanto (3) | |
U+010B | ċ | Latin | LATIN SMALL LETTER C WITH DOT ABOVE | [0], [163] | ✔ | set 36 | Maltese (1) |
U+010D | č | Latin | LATIN SMALL LETTER C WITH CARON | [0], [108], [133], [150], [151], [153], [154] | ✔ | Croatian (1), Serbian (1), Latvian (1), Slovak (1), Northern Sami (2), Lithuanian (1) | |
U+010F | ď | Latin | LATIN SMALL LETTER D WITH CARON | [0], [101], [153] | ✔ | Czech (1), Slovak (1) | |
U+0111 | đ | Latin | LATIN SMALL LETTER D WITH STROKE | [0], [108], [109], [150], [151], [168] | ✔ | Croatian (1), Serbian (1), Vietnamese (1), Northern Sami (2), Brahui (5) | |
U+0113 | ē | Latin | LATIN SMALL LETTER E WITH MACRON | [0], [133], [134], [135], [184] | ✔ | set 37 | Latvian (1), Hawaiian (2), Tongan (1), Minangkabau (5) |
U+0117 | ė | Latin | LATIN SMALL LETTER E WITH DOT ABOVE | [0], [138], [154] | ✔ | Lithuanian (1) | |
U+0119 | ę | Latin | LATIN SMALL LETTER E WITH OGONEK | [0], [138], [152], [154], [185] | ✔ | Polish (1), Palauan (2), Lithuanian (1) | |
U+011B | ě | Latin | LATIN SMALL LETTER E WITH CARON | [0], [101], [172] | ✔ | Czech (1), Sorbian (4) | |
U+011D | ĝ | Latin | LATIN SMALL LETTER G WITH CIRCUMFLEX | [0], [255] | ✔ | Esperanto (3) | |
U+011F | ğ | Latin | LATIN SMALL LETTER G WITH BREVE | [0], [127], [157], [159], [201], [202] | ✔ | set 38 | Turkish (1), Tatar (2), Azeri (1), Bashkir (4), Zaza (5) |
U+0121 | ġ | Latin | LATIN SMALL LETTER G WITH DOT ABOVE | [0], [163] | ✔ | set 39 | Maltese (1) |
U+0123 | ģ | Latin | LATIN SMALL LETTER G WITH CEDILLA | [0], [133], [168] | ✔ | set 39 | Latvian (1), Brahui (5) |
U+0125 | ĥ | Latin | LATIN SMALL LETTER H WITH CIRCUMFLEX | [0], [255] | ✔ | Esperanto (3) | |
U+0127 | ħ | Latin | LATIN SMALL LETTER H WITH STROKE | [0], [163] | ✔ | set 40 | Maltese (1) |
U+0129 | ĩ | Latin | LATIN SMALL LETTER I WITH TILDE | [0], [142], [143], [145], [186], [209] | ✔ | set 41 | Guarani (1), Cubeo (3), Khoekhoe (4), Kikuyu (5) |
U+012B | ī | Latin | LATIN SMALL LETTER I WITH MACRON | [0], [133], [134], [135], [138] | ✔ | set 41 | Latvian (1), Lithuanian (1), Hawaiian (2), Tongan (1) |
U+012F | į | Latin | LATIN SMALL LETTER I WITH OGONEK | [0], [154] | ✔ | Lithuanian (1) | |
U+0131 | ı | Latin | LATIN SMALL LETTER DOTLESS I | [0], [157], [159], [201], [203] | ✔ | set 8 | Turkish (1), Tatar (2), Azeri (1) |
U+0135 | ĵ | Latin | LATIN SMALL LETTER J WITH CIRCUMFLEX | [0], [255] | ✔ | Esperanto (3) | |
U+0137 | ķ | Latin | LATIN SMALL LETTER K WITH CEDILLA | [0], [133] | ✔ | Latvian (1) | |
U+013A | ĺ | Latin | LATIN SMALL LETTER L WITH ACUTE | [0], [153] | ✔ | Slovak (1) | |
U+013C | ļ | Latin | LATIN SMALL LETTER L WITH CEDILLA | [0], [133], [168], [213], [214] | ✔ | Latvian (1), Marshallese (1), Brahui (5) | |
U+013E | ľ | Latin | LATIN SMALL LETTER L WITH CARON | [0], [153] | ✔ | Slovak (1) | |
U+0142 | ł | Latin | LATIN SMALL LETTER L WITH STROKE | [0], [152] | ✔ | Polish (1) | |
U+0144 | ń | Latin | LATIN SMALL LETTER N WITH ACUTE | [0], [107], [152], [168], [172] | ✔ | set 11 | Polish (1), Lule Sami (2), Sorbian (4), Brahui (5) |
U+0146 | ņ | Latin | LATIN SMALL LETTER N WITH CEDILLA | [0], [133], [136] | ✔ | Latvian (1), Marshallese (1) | |
U+0148 | ň | Latin | LATIN SMALL LETTER N WITH CARON | [0], [101], [121], [153] | ✔ | Turkmen (1), Czech (1), Slovak (1) | |
U+014B | ŋ | Latin | LATIN SMALL LETTER ENG | [0], [108], [125], [129], [132], [146], [148], [170], [188], [189], [190], [191], [192], [193], [194], [195], [196], [197], [198], [199] | ✔ | set 11 | Inari Saami (2), Dagaare - Burkina Faso (4), Dagbani (Dagomba), (4), Northern Sami (2), Ewondo (3), Luganda (3), Wolof (4), Adzera (4), Nuer (4), Ga (4), Dinka (4), Duala (3), Ewe (3), Soga (5), Alur (5), Mandinka (5), Acholi (5), Bambara (4), Nuer (4) |
U+014D | ō | Latin | LATIN SMALL LETTER O WITH MACRON | [0], [134], [135], [136] | ✔ | set 30 | Hawaiian (2), Marshallese (1), Tongan (1) |
U+0151 | ő | Latin | LATIN SMALL LETTER O WITH DOUBLE ACUTE | [0], [233], [234] | ✔ | Hungarian (1) | |
U+0153 | œ | Latin | LATIN SMALL LIGATURE OE | [0], [114], [253] | ✔ | French (1) | |
U+0155 | ŕ | Latin | LATIN SMALL LETTER R WITH ACUTE | [0], [153], [168] | ✔ | set 42 | Slovak (1), Brahui (5) |
U+0159 | ř | Latin | LATIN SMALL LETTER R WITH CARON | [0], [101], [172] | ✔ | Czech (1), Sorbian (4) | |
U+015B | ś | Latin | LATIN SMALL LETTER S WITH ACUTE | [0], [152], [258] | ✔ | Polish (1), Montenegrin (1) | |
U+015D | ŝ | Latin | LATIN SMALL LETTER S WITH CIRCUMFLEX | [0], [255] | ✔ | Esperanto (3) | |
U+015F | ş | Latin | LATIN SMALL LETTER S WITH CEDILLA | [0], [121], [127], [157], [158], [159], [168], [201], [202] | ✔ | Turkish (1), Turkmen (1), Kurdish (2), Tatar (2), Azeri (1), Bashkir (4), Brahui (5), Zaza (5) | |
U+0161 | š | Latin | LATIN SMALL LETTER S WITH CARON | [0], [108], [133], [150], [151], [154], [174], [230] | ✔ | Tswana (1), Croatian (1), Serbian (1), Latvian (1), Northern Sotho (1), Northern Sami (2), Lithuanian (1) | |
U+0165 | ť | Latin | LATIN SMALL LETTER T WITH CARON | [0], [101], [153] | ✔ | Czech (1), Slovak (1) | |
U+0167 | ŧ | Latin | LATIN SMALL LETTER T WITH STROKE | [0], [108], [168] | ✔ | Northern Sami (2), Brahui (5) | |
U+0169 | ũ | Latin | LATIN SMALL LETTER U WITH TILDE | [0], [141], [142], [143], [144], [145], [209] | ✔ | set 43 | Umbundu (3), Guarani (1), Nauruan (3), Khoekhoe (4), Kikuyu (5) |
U+016B | ū | Latin | LATIN SMALL LETTER U WITH MACRON | [0], [133], [134], [135], [136], [138], [154] | ✔ | set 43 | Latvian (1), Hawaiian (2), Lithuanian (1), Marshallese (1), Tongan (1) |
U+016D | ŭ | Latin | LATIN SMALL LETTER U WITH BREVE | [0], [255] | ✔ | Esperanto (3) | |
U+016F | ů | Latin | LATIN SMALL LETTER U WITH RING ABOVE | [0], [101] | ✔ | Czech (1) | |
U+0171 | ű | Latin | LATIN SMALL LETTER U WITH DOUBLE ACUTE | [0], [233], [234] | ✔ | Hungarian (1) | |
U+0173 | ų | Latin | LATIN SMALL LETTER U WITH OGONEK | [0], [138], [154] | ✔ | Lithuanian (1) | |
U+0175 | ŵ | Latin | LATIN SMALL LETTER W WITH CIRCUMFLEX | [0], [247], [256] | ✔ | Chichewa (3), Welsh (2) | |
U+0177 | ŷ | Latin | LATIN SMALL LETTER Y WITH CIRCUMFLEX | [0], [256] | ✔ | Welsh (2) | |
U+017A | ź | Latin | LATIN SMALL LETTER Z WITH ACUTE | [0], [152], [168], [172], [252], [258] | ✔ | set 44 | Polish (1), Brahui (5), Sorbian (4), Montenegrin (1) |
U+017C | ż | Latin | LATIN SMALL LETTER Z WITH DOT ABOVE | [0], [152], [163] | ✔ | set 44 | Polish (1), Maltese (1) |
U+017E | ž | Latin | LATIN SMALL LETTER Z WITH CARON | [0], [108], [121], [133], [150], [151], [153], [154], [232] | ✔ | Lithuanian (1), Croatian (1), Serbian (1), Turkmen (1), Latvian (1), Slovak (1), Northern Sami (2), Chechen (2) 1925 Version | |
U+0188 | ƈ | Latin | LATIN SMALL LETTER C WITH HOOK | [0], [277] | ✔ | Serer (5) | |
U+0192 | ƒ | Latin | LATIN SMALL LETTER F WITH HOOK | [0], [170] | ✔ | set 4 | Ewe (3) |
U+0199 | ƙ | Latin | LATIN SMALL LETTER K WITH HOOK | [0], [147] | ✔ | Hausa (2) | |
U+01A1 | ơ | Latin | LATIN SMALL LETTER O WITH HORN | [0], [109] | ✔ | set 45 | Vietnamese (1) |
U+01A5 | ƥ | Latin | LATIN SMALL LETTER P WITH HOOK | [0], [277] | ✔ | Serer (5) | |
U+01AD | ƭ | Latin | LATIN SMALL LETTER T WITH HOOK | [0], [277] | ✔ | Serer (5) | |
U+01B0 | ư | Latin | LATIN SMALL LETTER U WITH HORN | [0], [109] | ✔ | Vietnamese (1) | |
U+01B4 | ƴ | Latin | LATIN SMALL LETTER Y WITH HOOK | [0], [148], [149], [251] | ✔ | Dagaare - Burkina Faso (4), Fula (3) | |
U+01DD | ǝ | Latin | LATIN SMALL LETTER TURNED E | [0], [240] | ✔ | set 46 | Kanuri (3) |
U+01E7 | ǧ | Latin | LATIN SMALL LETTER G WITH CARON | [0], [113] | ✔ | set 38 | Skolt Sami (2) |
U+01E9 | ǩ | Latin | LATIN SMALL LETTER K WITH CARON | [0], [113] | ✔ | Skolt Sami (2) | |
U+01EF | ǯ | Latin | LATIN SMALL LETTER EZH WITH CARON | [0], [113] | ✔ | Skolt Sami (2) | |
U+0219 | ș | Latin | LATIN SMALL LETTER S WITH COMMA BELOW | [3], [110] | ✔ | Romanian (1) | |
U+021B | ț | Latin | LATIN SMALL LETTER T WITH COMMA BELOW | [3], [110] | ✔ | Romanian (1) | |
U+024D | ɍ | Latin | LATIN SMALL LETTER R WITH STROKE | [8], [240] | ✔ | set 47 | Kanuri (3) |
U+0253 | ɓ | Latin | LATIN SMALL LETTER B WITH HOOK | [0], [147], [148], [250] | ✔ | Hausa (2), Dagaare - Burkina Faso (4), Pulaar (3) | |
U+0254 | ɔ | Latin | LATIN SMALL LETTER OPEN O | [0], [129], [146], [148], [169], [170], [189], [190], [193], [194], [236], [237] | ✔ | Dagaare - Burkina Faso (4), Dagbani (Dagomba) (4), Lingala (2), Akan (3), Ewondo (3), Fon (3), Nuer (4), Ga (4), Duala (3), EWE (3), Nuer (4) | |
U+0254 U+0308 | ɔ̈ | {Latin, Inherited} | LATIN SMALL LETTER OPEN O + COMBINING DIAERESIS | [125] | ✔ | DINKA (4) | |
U+0254 U+0331 | ɔ̱ | {Latin, Inherited} | LATIN SMALL LETTER OPEN O + COMBINING MACRON BELOW | [129], [146] | ✔ | Nuer (4) | |
U+0256 | ɖ | Latin | LATIN SMALL LETTER D WITH TAIL | [0], [169], [170] | ✔ | Fon (3), Ewe (3) | |
U+0257 | ɗ | Latin | LATIN SMALL LETTER D WITH HOOK | [0], [147], [149], [250] | ✔ | Hausa (2), Fula (3) | |
U+0259 | ə | Latin | LATIN SMALL LETTER SCHWA | [0], [159], [170], [190], [241] | ✔ | set 46 | Azeri, Azerbaijani (1), Ewondo (3), Ewe (3), Bugis (3) |
U+025B | ɛ | Latin | LATIN SMALL LETTER OPEN E | [0], [129], [148], [169], [170], [189], [190], [193], [194], [199], [212], [236], [237], [238] | ✔ | set 48 | Dagaare - Burkina Faso (4), Lingala (2), Akan (3), Ewondo (3), Dagbani (Dagomba), (4), Fon (3), Mossi (3), Ga (4), Ewe (3), Duala (3), Bambara (4), Nuer (4) |
U+025B U+0308 | ɛ̈ | {Latin, Inherited} | LATIN SMALL LETTER OPEN E + COMBINING DIAERESIS | [125], [129], [146], [239] | ✔ | Nuer (4), Dinka (4) | |
U+025B U+0331 | ɛ̱ | {Latin, Inherited} | LATIN SMALL LETTER OPEN E + COMBINING MACRON BELOW | [129], [146], [239] | ✔ | Nuer (4) | |
U+025B U+0331 U+0308 | ɛ̱̈ | {Latin, Inherited} | LATIN SMALL LETTER OPEN E + COMBINING MACRON BELOW + COMBINING DIAERESIS | [129], [146], [239] | ✔ | Nuer (4) | |
U+0260 | ɠ | Latin | LATIN SMALL LETTER G WITH HOOK | [0], [278] | ✔ | Kpelle (5) | |
U+0263 | ɣ | Latin | LATIN SMALL LETTER GAMMA | [0], [125], [129], [146], [170], [189] | ✔ | set 22 | Dagbani (Dagomba) (4), Nuer (4), Dinka (4), Ewe (3), Nuer (4) |
U+0268 | ɨ | Latin | LATIN SMALL LETTER I WITH STROKE | [0], [186], [189], [210], [211] | ✔ | Cubeo (3), Dagbani (Dagomba) (4), HIxkaryána (4), Maasai (5) | |
U+0268 U+0303 | ɨ̃ | {Latin, Inherited} | LATIN SMALL LETTER I WITH STROKE + COMBINING TILDE | [186] | ✔ | Cubeo (3) | |
U+0269 | ɩ | Latin | LATIN SMALL LETTER IOTA | [0], [148], [212] | ✔ | set 8 | Dagaare - Burkina Faso (4), Mossi (3) |
U+0272 | ɲ | Latin | LATIN SMALL LETTER N WITH LEFT HOOK | [0], [199], [218], [219] | ✔ | Susu (4), Zarma (4), Bambara (4) | |
U+0289 | ʉ | Latin | LATIN SMALL LETTER U BAR | [0], [186], [187], [211] | ✔ | Cubeo (3), Maasai (5) | |
U+0289 U+0303 | ʉ̃ | {Latin, Inherited} | LATIN SMALL LETTER U BAR + COMBINING TILDE | [186], [187] | ✔ | Cubeo (3) | |
U+028B | ʋ | Latin | LATIN SMALL LETTER V WITH HOOK | [0], [148], [170], [212], [238] | ✔ | set 19 | Dagaare - Burkina Faso (4), Mossi (3), Ewe (3) |
U+0292 | ʒ | Latin | LATIN SMALL LETTER EZH | [0], [113], [189] | ✔ | set 49 | Skolt Sami (2), Dagbani (Dagomba) (4) |
U+0390 | ΐ | Greek | GREEK SMALL LETTER IOTA WITH DIALYTIKA AND TONOS | [0] | ✗ | set 8 | Not part of repertoire |
U+03AC | ά | Greek | GREEK SMALL LETTER ALPHA WITH TONOS | [0] | ✗ | set 1 | Not part of repertoire |
U+03AD | έ | Greek | GREEK SMALL LETTER EPSILON WITH TONOS | [0] | ✗ | set 48 | Not part of repertoire |
U+03AE | ή | Greek | GREEK SMALL LETTER ETA WITH TONOS | [0] | ✗ | set 11 | Not part of repertoire |
U+03AF | ί | Greek | GREEK SMALL LETTER IOTA WITH TONOS | [0] | ✗ | set 8 | Not part of repertoire |
U+03B0 | ΰ | Greek | GREEK SMALL LETTER UPSILON WITH DIALYTIKA AND TONOS | [0] | ✗ | set 19 | Not part of repertoire |
U+03B1 | α | Greek | GREEK SMALL LETTER ALPHA | [0] | ✗ | set 1 | Not part of repertoire |
U+03B2 | β | Greek | GREEK SMALL LETTER BETA | [0] | ✗ | set 18 | Not part of repertoire |
U+03B3 | γ | Greek | GREEK SMALL LETTER GAMMA | [0] | ✗ | set 22 | Not part of repertoire |
U+03B5 | ε | Greek | GREEK SMALL LETTER EPSILON | [0] | ✗ | set 48 | Not part of repertoire |
U+03B7 | η | Greek | GREEK SMALL LETTER ETA | [0] | ✗ | set 11 | Not part of repertoire |
U+03B9 | ι | Greek | GREEK SMALL LETTER IOTA | [0] | ✗ | set 8 | Not part of repertoire |
U+03BD | ν | Greek | GREEK SMALL LETTER NU | [0] | ✗ | set 20 | Not part of repertoire |
U+03BF | ο | Greek | GREEK SMALL LETTER OMICRON | [0] | ✗ | set 13 | Not part of repertoire |
U+03C1 | ρ | Greek | GREEK SMALL LETTER RHO | [0] | ✗ | set 14 | Not part of repertoire |
U+03C2 | ς | Greek | GREEK SMALL LETTER FINAL SIGMA | [0] | ✗ | set 45 | Not part of repertoire |
U+03C3 | σ | Greek | GREEK SMALL LETTER SIGMA | [0] | ✗ | set 45 | Not part of repertoire |
U+03C5 | υ | Greek | GREEK SMALL LETTER UPSILON | [0] | ✗ | set 19 | Not part of repertoire |
U+03CA | ϊ | Greek | GREEK SMALL LETTER IOTA WITH DIALYTIKA | [0] | ✗ | set 8 | Not part of repertoire |
U+03CB | ϋ | Greek | GREEK SMALL LETTER UPSILON WITH DIALYTIKA | [0] | ✗ | set 19 | Not part of repertoire |
U+03CC | ό | Greek | GREEK SMALL LETTER OMICRON WITH TONOS | [0] | ✗ | set 13 | Not part of repertoire |
U+03CD | ύ | Greek | GREEK SMALL LETTER UPSILON WITH TONOS | [0] | ✗ | set 19 | Not part of repertoire |
U+0430 | а | Cyrillic | CYRILLIC SMALL LETTER A | [0] | ✗ | set 1 | Not part of repertoire |
U+0433 | г | Cyrillic | CYRILLIC SMALL LETTER GHE | [0] | ✗ | set 16 | Not part of repertoire |
U+0435 | е | Cyrillic | CYRILLIC SMALL LETTER IE | [0] | ✗ | set 3 | Not part of repertoire |
U+043E | о | Cyrillic | CYRILLIC SMALL LETTER O | [0] | ✗ | set 13 | Not part of repertoire |
U+0440 | р | Cyrillic | CYRILLIC SMALL LETTER ER | [0] | ✗ | set 14 | Not part of repertoire |
U+0441 | с | Cyrillic | CYRILLIC SMALL LETTER ES | [0] | ✗ | set 2 | Not part of repertoire |
U+0443 | у | Cyrillic | CYRILLIC SMALL LETTER U | [0] | ✗ | set 22 | Not part of repertoire |
U+0445 | х | Cyrillic | CYRILLIC SMALL LETTER HA | [0] | ✗ | set 21 | Not part of repertoire |
U+0451 | ё | Cyrillic | CYRILLIC SMALL LETTER IO | [0] | ✗ | set 28 | Not part of repertoire |
U+0453 | ѓ | Cyrillic | CYRILLIC SMALL LETTER GJE | [0] | ✗ | set 42 | Not part of repertoire |
U+0455 | ѕ | Cyrillic | CYRILLIC SMALL LETTER DZE | [0] | ✗ | set 17 | Not part of repertoire |
U+0455 U+0455 | ѕѕ | {Cyrillic} | CYRILLIC SMALL LETTER DZE + CYRILLIC SMALL LETTER DZE | [0] | ✗ | set 18 | Not part of repertoire |
U+0456 | і | Cyrillic | CYRILLIC SMALL LETTER BYELORUSSIAN-UKRAINIAN I | [0] | ✗ | set 8 | Not part of repertoire |
U+0457 | ї | Cyrillic | CYRILLIC SMALL LETTER YI | [0] | ✗ | set 8 | Not part of repertoire |
U+0458 | ј | Cyrillic | CYRILLIC SMALL LETTER JE | [0] | ✗ | set 9 | Not part of repertoire |
U+045B | ћ | Cyrillic | CYRILLIC SMALL LETTER TSHE | [0] | ✗ | set 40 | Not part of repertoire |
U+045F | џ | Cyrillic | CYRILLIC SMALL LETTER DZHE | [0] | ✗ | set 50 | Not part of repertoire |
U+0493 | ғ | Cyrillic | CYRILLIC SMALL LETTER GHE WITH STROKE | [0] | ✗ | set 47 | Not part of repertoire |
U+04AB | ҫ | Cyrillic | CYRILLIC SMALL LETTER ES WITH DESCENDER | [0] | ✗ | set 27 | Not part of repertoire |
U+04AF | ү | Cyrillic | CYRILLIC SMALL LETTER STRAIGHT U | [0] | ✗ | set 22 | Not part of repertoire |
U+04BB | һ | Cyrillic | CYRILLIC SMALL LETTER SHHA | [0] | ✗ | set 7 | Not part of repertoire |
U+04CF | ӏ | Cyrillic | CYRILLIC SMALL LETTER PALOCHKA | [8] | ✗ | set 10 | Not part of repertoire |
U+04D1 | ӑ | Cyrillic | CYRILLIC SMALL LETTER A WITH BREVE | [0] | ✗ | set 35 | Not part of repertoire |
U+04D3 | ӓ | Cyrillic | CYRILLIC SMALL LETTER A WITH DIAERESIS | [0] | ✗ | set 25 | Not part of repertoire |
U+04D5 | ӕ | Cyrillic | CYRILLIC SMALL LIGATURE A IE | [0] | ✗ | set 26 | Not part of repertoire |
U+04D9 | ә | Cyrillic | CYRILLIC SMALL LETTER SCHWA | [0] | ✗ | set 46 | Not part of repertoire |
U+04E1 | ӡ | Cyrillic | CYRILLIC SMALL LETTER ABKHASIAN DZE | [0] | ✗ | set 49 | Not part of repertoire |
U+04E7 | ӧ | Cyrillic | CYRILLIC SMALL LETTER O WITH DIAERESIS | [0] | ✗ | set 31 | Not part of repertoire |
U+04F1 | ӱ | Cyrillic | CYRILLIC SMALL LETTER U WITH DIAERESIS | [0] | ✗ | set 34 | Not part of repertoire |
U+0566 | զ | Armenian | ARMENIAN SMALL LETTER ZA | [0] | ✗ | set 15 | Not part of repertoire |
U+0570 | հ | Armenian | ARMENIAN SMALL LETTER HO | [0] | ✗ | set 7 | Not part of repertoire |
U+0572 | ղ | Armenian | ARMENIAN SMALL LETTER GHAD | [0] | ✗ | set 11 | Not part of repertoire |
U+0578 | ո | Armenian | ARMENIAN SMALL LETTER VO | [0] | ✗ | set 11 | Not part of repertoire |
U+057D | ս | Armenian | ARMENIAN SMALL LETTER SEH | [0] | ✗ | set 19 | Not part of repertoire |
U+0581 | ց | Armenian | ARMENIAN SMALL LETTER CO | [0] | ✗ | set 5 | Not part of repertoire |
U+0582 | ւ | Armenian | ARMENIAN SMALL LETTER YIWN | [0] | ✗ | set 8 | Not part of repertoire |
U+0585 | օ | Armenian | ARMENIAN SMALL LETTER OH | [0] | ✗ | set 13 | Not part of repertoire |
U+05D5 | ו | Hebrew | HEBREW LETTER VAV | [0] | ✗ | set 8 | Not part of repertoire |
U+05E1 | ס | Hebrew | HEBREW LETTER SAMEKH | [0] | ✗ | set 13 | Not part of repertoire |
U+0B20 | ଠ | Oriya | ORIYA LETTER TTHA | [0] | ✗ | set 13 | Not part of repertoire |
U+0D1F | ട | Malayalam | MALAYALAM LETTER TTA | [0] | ✗ | set 17 | Not part of repertoire |
U+0D1F U+0D1F | ടട | {Malayalam} | MALAYALAM LETTER TTA + MALAYALAM LETTER TTA | [0] | ✗ | set 18 | Not part of repertoire |
U+0D20 | ഠ | Malayalam | MALAYALAM LETTER TTHA | [0] | ✗ | set 13 | Not part of repertoire |
U+1004 | င | Myanmar | MYANMAR LETTER NGA | [3] | ✗ | set 2 | Not part of repertoire |
U+101D | ဝ | Myanmar | MYANMAR LETTER WA | [3] | ✗ | set 13 | Not part of repertoire |
U+1E13 | ḓ | Latin | LATIN SMALL LETTER D WITH CIRCUMFLEX BELOW | [0], [164], [257] | ✔ | Venda (1) | |
U+1E21 | ḡ | Latin | LATIN SMALL LETTER G WITH MACRON | [0], [200] | ✔ | set 6 | Raga (Hano) (3) |
U+1E3D | ḽ | Latin | LATIN SMALL LETTER L WITH CIRCUMFLEX BELOW | [0], [164], [257] | ✔ | Venda (1) | |
U+1E45 | ṅ | Latin | LATIN SMALL LETTER N WITH DOT ABOVE | [0], [164], [257] | ✔ | set 11 | Venda (1) |
U+1E49 | ṉ | Latin | LATIN SMALL LETTER N WITH LINE BELOW | [0], [220] | ✔ | Pitjantjatjara (4) | |
U+1E4B | ṋ | Latin | LATIN SMALL LETTER N WITH CIRCUMFLEX BELOW | [0], [164], [257] | ✔ | Venda (1) | |
U+1E63 | ṣ | Latin | LATIN SMALL LETTER S WITH DOT BELOW | [0], [254] | ✔ | Yoruba (2) | |
U+1E6D | ṭ | Latin | LATIN SMALL LETTER T WITH DOT BELOW | [0], [242] | ✔ | Mizo (4) | |
U+1E71 | ṱ | Latin | LATIN SMALL LETTER T WITH CIRCUMFLEX BELOW | [0], [164], [257] | ✔ | Venda (1) | |
U+1E8D | ẍ | Latin | LATIN SMALL LETTER X WITH DIAERESIS | [0], [248], [249] | ✔ | Mam (4) | |
U+1EA1 | ạ | Latin | LATIN SMALL LETTER A WITH DOT BELOW | [0], [109] | ✔ | Vietnamese (1) | |
U+1EA3 | ả | Latin | LATIN SMALL LETTER A WITH HOOK ABOVE | [0], [109] | ✔ | set 23 | Vietnamese (1) |
U+1EA5 | ấ | Latin | LATIN SMALL LETTER A WITH CIRCUMFLEX AND ACUTE | [0], [109] | ✔ | Vietnamese (1) | |
U+1EA7 | ầ | Latin | LATIN SMALL LETTER A WITH CIRCUMFLEX AND GRAVE | [0], [109] | ✔ | Vietnamese (1) | |
U+1EA9 | ẩ | Latin | LATIN SMALL LETTER A WITH CIRCUMFLEX AND HOOK ABOVE | [0], [109] | ✔ | Vietnamese (1) | |
U+1EAB | ẫ | Latin | LATIN SMALL LETTER A WITH CIRCUMFLEX AND TILDE | [0], [109] | ✔ | Vietnamese (1) | |
U+1EAD | ậ | Latin | LATIN SMALL LETTER A WITH CIRCUMFLEX AND DOT BELOW | [0], [109] | ✔ | Vietnamese (1) | |
U+1EAF | ắ | Latin | LATIN SMALL LETTER A WITH BREVE AND ACUTE | [0], [109] | ✔ | Vietnamese (1) | |
U+1EB1 | ằ | Latin | LATIN SMALL LETTER A WITH BREVE AND GRAVE | [0], [109] | ✔ | Vietnamese (1) | |
U+1EB3 | ẳ | Latin | LATIN SMALL LETTER A WITH BREVE AND HOOK ABOVE | [0], [109] | ✔ | Vietnamese (1) | |
U+1EB5 | ẵ | Latin | LATIN SMALL LETTER A WITH BREVE AND TILDE | [0], [109] | ✔ | Vietnamese (1) | |
U+1EB7 | ặ | Latin | LATIN SMALL LETTER A WITH BREVE AND DOT BELOW | [0], [109] | ✔ | Vietnamese (1) | |
U+1EB9 | ẹ | Latin | LATIN SMALL LETTER E WITH DOT BELOW | [0], [254] | ✔ | Yoruba (2) | |
U+1EB9 U+0300 | ẹ̀ | {Latin, Inherited} | LATIN SMALL LETTER E WITH DOT BELOW + COMBINING GRAVE ACCENT | [254] | ✔ | Yoruba (2) | |
U+1EB9 U+0301 | ẹ́ | {Latin, Inherited} | LATIN SMALL LETTER E WITH DOT BELOW + COMBINING ACUTE ACCENT | [254] | ✔ | Yoruba (2) | |
U+1EBB | ẻ | Latin | LATIN SMALL LETTER E WITH HOOK ABOVE | [0], [109] | ✔ | Vietnamese (1) | |
U+1EBD | ẽ | Latin | LATIN SMALL LETTER E WITH TILDE | [0], [117], [141], [142], [143], [186], [187], [275] | ✔ | set 37 | Umbundu (3), Guarani (1), Cubeo (3), Xavante (4) |
U+1EBF | ế | Latin | LATIN SMALL LETTER E WITH CIRCUMFLEX AND ACUTE | [0], [109] | ✔ | Vietnamese (1) | |
U+1EC1 | ề | Latin | LATIN SMALL LETTER E WITH CIRCUMFLEX AND GRAVE | [0], [109] | ✔ | Vietnamese (1) | |
U+1EC3 | ể | Latin | LATIN SMALL LETTER E WITH CIRCUMFLEX AND HOOK ABOVE | [0], [109] | ✔ | Vietnamese (1) | |
U+1EC5 | ễ | Latin | LATIN SMALL LETTER E WITH CIRCUMFLEX AND TILDE | [0], [109] | ✔ | Vietnamese (1) | |
U+1EC7 | ệ | Latin | LATIN SMALL LETTER E WITH CIRCUMFLEX AND DOT BELOW | [0], [109] | ✔ | Vietnamese (1) | |
U+1EC9 | ỉ | Latin | LATIN SMALL LETTER I WITH HOOK ABOVE | [0], [109] | ✔ | set 8 | Vietnamese (1) |
U+1ECB | ị | Latin | LATIN SMALL LETTER I WITH DOT BELOW | [0], [205] | ✔ | Igbo (2) | |
U+1ECD | ọ | Latin | LATIN SMALL LETTER O WITH DOT BELOW | [0], [136], [204], [205], [215], [216], [254] | ✔ | Igbo (2), Yoruba (2), Marshallese (1) | |
U+1ECD U+0300 | ọ̀ | {Latin, Inherited} | LATIN SMALL LETTER O WITH DOT BELOW + COMBINING GRAVE ACCENT | [254] | ✔ | Yoruba (2) | |
U+1ECD U+0301 | ọ́ | {Latin, Inherited} | LATIN SMALL LETTER O WITH DOT BELOW + COMBINING ACUTE ACCENT | [254] | ✔ | Yoruba (2) | |
U+1ECF | ỏ | Latin | LATIN SMALL LETTER O WITH HOOK ABOVE | [0], [109] | ✔ | set 29 | Vietnamese (1) |
U+1ED1 | ố | Latin | LATIN SMALL LETTER O WITH CIRCUMFLEX AND ACUTE | [0], [109] | ✔ | Vietnamese (1) | |
U+1ED3 | ồ | Latin | LATIN SMALL LETTER O WITH CIRCUMFLEX AND GRAVE | [0], [109] | ✔ | Vietnamese (1) | |
U+1ED5 | ổ | Latin | LATIN SMALL LETTER O WITH CIRCUMFLEX AND HOOK ABOVE | [0], [109] | ✔ | Vietnamese (1) | |
U+1ED7 | ỗ | Latin | LATIN SMALL LETTER O WITH CIRCUMFLEX AND TILDE | [0], [109] | ✔ | Vietnamese (1) | |
U+1ED9 | ộ | Latin | LATIN SMALL LETTER O WITH CIRCUMFLEX AND DOT BELOW | [0], [109] | ✔ | Vietnamese (1) | |
U+1EDB | ớ | Latin | LATIN SMALL LETTER O WITH HORN AND ACUTE | [0], [109] | ✔ | Vietnamese (1) | |
U+1EDD | ờ | Latin | LATIN SMALL LETTER O WITH HORN AND GRAVE | [0], [109] | ✔ | Vietnamese (1) | |
U+1EDF | ở | Latin | LATIN SMALL LETTER O WITH HORN AND HOOK ABOVE | [0], [109] | ✔ | Vietnamese (1) | |
U+1EE1 | ỡ | Latin | LATIN SMALL LETTER O WITH HORN AND TILDE | [0], [109] | ✔ | Vietnamese (1) | |
U+1EE3 | ợ | Latin | LATIN SMALL LETTER O WITH HORN AND DOT BELOW | [0], [109] | ✔ | Vietnamese (1) | |
U+1EE5 | ụ | Latin | LATIN SMALL LETTER U WITH DOT BELOW | [0], [109], [204], [205] | ✔ | set 50 | Vietnamese (1), Igbo (2) |
U+1EE7 | ủ | Latin | LATIN SMALL LETTER U WITH HOOK ABOVE | [0], [109] | ✔ | set 32 | Vietnamese (1) |
U+1EE9 | ứ | Latin | LATIN SMALL LETTER U WITH HORN AND ACUTE | [0], [109] | ✔ | Vietnamese (1) | |
U+1EEB | ừ | Latin | LATIN SMALL LETTER U WITH HORN AND GRAVE | [0], [109] | ✔ | Vietnamese (1) | |
U+1EED | ử | Latin | LATIN SMALL LETTER U WITH HORN AND HOOK ABOVE | [0], [109] | ✔ | Vietnamese (1) | |
U+1EEF | ữ | Latin | LATIN SMALL LETTER U WITH HORN AND TILDE | [0], [109] | ✔ | Vietnamese (1) | |
U+1EF1 | ự | Latin | LATIN SMALL LETTER U WITH HORN AND DOT BELOW | [0], [109] | ✔ | Vietnamese (1) | |
U+1EF3 | ỳ | Latin | LATIN SMALL LETTER Y WITH GRAVE | [0], [109] | ✔ | set 33 | Vietnamese (1) |
U+1EF5 | ỵ | Latin | LATIN SMALL LETTER Y WITH DOT BELOW | [0], [109] | ✔ | Vietnamese (1) | |
U+1EF7 | ỷ | Latin | LATIN SMALL LETTER Y WITH HOOK ABOVE | [0], [109] | ✔ | set 33 | Vietnamese (1) |
U+1EF9 | ỹ | Latin | LATIN SMALL LETTER Y WITH TILDE | [0], [109], [142] | ✔ | Vietnamese (1), Guarani (1) |
Throughout this LGR, a code point sequence may be annotated with a string in ALL CAPS that is constructed on the same principle as a name for a Unicode Named Sequence. No claim is made that a sequence thus annotated is in fact a named sequence, nor that the annotation in such case actually corresponds to the formal name of a named sequence.
Number of variant sets | 50 | ||||||
---|---|---|---|---|---|---|---|
Largest variant set | 14 | ||||||
Ordinary Variants by Type |
|
||||||
Reflexive Variants by Type |
|
The following tables list all variant sets defined in this LGR, except for singleton sets. Each table lists all variant mapping pairs of the set; one per row. Mappings are assumed to be symmetric: each row documents both forward (→) and reverse (←) mapping directions. In each table, the mappings are sorted by Source value in ascending code point order; shading is used to group mappings from the same source code point or sequence.
Where the type of both forward and reverse mappings are the same, a single value is given in the Type column; otherwise the types for forward and reverse mappings, as well as comments and references, are listed above one another. For summary counts, both forward and reverse mappings are always counted separately.
A mapping where source and target are the same is reflexive. Variant sets consisting of only a single reflexive mapping are not shown as a set. Instead, the variant type of the mapping is listed in the Variants column of the Repertoire by Code Point table. Reflexive mappings that are part of a larger set are indicated with a “≡” and are counted once per entry.
In any LGR with variant specifications that are well behaved, all members within each variant set are defined as variants of each other; the mappings in each set are symmetric and transitive; and all variant sets are disjoint.
Source | Glyph | Target | Glyph | Type | Ref | Comment | |
---|---|---|---|---|---|---|---|
0061 | a | 00E1 | á | ↔ | blocked | Required for integration | |
0061 | a | 03AC | ά | ↔ | blocked | ||
0061 | a | 03B1 | α | ↔ | blocked | Cross-script near homoglyph | |
0061 | a | 0430 | а | ↔ | blocked | Cross-script homoglyph | |
00E1 | á | 03AC | ά | ↔ | blocked | Cross-script near homoglyph | |
00E1 | á | 03B1 | α | ↔ | blocked | ||
00E1 | á | 0430 | а | ↔ | blocked | ||
03AC | ά | 03AC | ά | ≡ | out-of-repertoire-var | Out-of-repertoire | |
03AC | ά | 03B1 | α | ↔ | blocked | ||
03AC | ά | 0430 | а | ↔ | blocked | ||
03B1 | α | 03B1 | α | ≡ | out-of-repertoire-var | Out-of-repertoire | |
03B1 | α | 0430 | а | ↔ | blocked | ||
0430 | а | 0430 | а | ≡ | out-of-repertoire-var | Out-of-repertoire |
Source | Glyph | Target | Glyph | Type | Ref | Comment | |
---|---|---|---|---|---|---|---|
0063 | c | 0441 | с | ↔ | blocked | Cross-script homoglyph | |
0063 | c | 1004 | င | ↔ | blocked | Cross-script near homoglyph | |
0441 | с | 0441 | с | ≡ | out-of-repertoire-var | Out-of-repertoire | |
0441 | с | 1004 | င | ↔ | blocked | ||
1004 | င | 1004 | င | ≡ | out-of-repertoire-var | Out-of-repertoire |
Source | Glyph | Target | Glyph | Type | Ref | Comment | |
---|---|---|---|---|---|---|---|
0065 | e | 0435 | е | ↔ | blocked | Cross-script homoglyph | |
0435 | е | 0435 | е | ≡ | out-of-repertoire-var | Out-of-repertoire |
Source | Glyph | Target | Glyph | Type | Ref | Comment | |
---|---|---|---|---|---|---|---|
0066 | f | 0192 | ƒ | ↔ | blocked | Generally acceptable alternate glyph |
Source | Glyph | Target | Glyph | Type | Ref | Comment | |
---|---|---|---|---|---|---|---|
0067 | g | 0581 | ց | ↔ | blocked | Cross-script near homoglyph | |
0581 | ց | 0581 | ց | ≡ | out-of-repertoire-var | Out-of-repertoire |
Source | Glyph | Target | Glyph | Type | Ref | Comment | |
---|---|---|---|---|---|---|---|
0067 0303 | g̃ | 1E21 | ḡ | ↔ | blocked | Glyphs either homoglyph or nearly identical |
Source | Glyph | Target | Glyph | Type | Ref | Comment | |
---|---|---|---|---|---|---|---|
0068 | h | 04BB | һ | ↔ | blocked | Cross-script homoglyph | |
0068 | h | 0570 | հ | ↔ | blocked | Cross-script near homoglyph | |
04BB | һ | 04BB | һ | ≡ | out-of-repertoire-var | Out-of-repertoire | |
04BB | һ | 0570 | հ | ↔ | blocked | ||
0570 | հ | 0570 | հ | ≡ | out-of-repertoire-var | Out-of-repertoire |
Source | Glyph | Target | Glyph | Type | Ref | Comment | |
---|---|---|---|---|---|---|---|
0069 | i | 00ED | í | ↔ | blocked | Required for integration | |
0069 | i | 00EF | ï | ↔ | blocked | Required for integration | |
0069 | i | 0131 | ı | → | blocked | IDNA2003 Compatibility | |
← | dotted | IDNA2003 Compatibility | |||||
0069 | i | 0269 | ɩ | ↔ | blocked | Required for integration | |
0069 | i | 0390 | ΐ | ↔ | blocked | ||
0069 | i | 03AF | ί | ↔ | blocked | ||
0069 | i | 03B9 | ι | ↔ | blocked | ||
0069 | i | 03CA | ϊ | ↔ | blocked | ||
0069 | i | 0456 | і | ↔ | blocked | Cross-script homoglyph | |
0069 | i | 0457 | ї | ↔ | blocked | ||
0069 | i | 0582 | ւ | ↔ | blocked | ||
0069 | i | 05D5 | ו | ↔ | blocked | ||
0069 | i | 1EC9 | ỉ | ↔ | blocked | Glyphs either homoglyph or nearly identical | |
00ED | í | 00EF | ï | ↔ | blocked | Required for integration | |
00ED | í | 0131 | ı | ↔ | blocked | Required for integration | |
00ED | í | 0269 | ɩ | ↔ | blocked | Required for integration | |
00ED | í | 0390 | ΐ | ↔ | blocked | ||
00ED | í | 03AF | ί | ↔ | blocked | Cross-script homoglyph | |
00ED | í | 03B9 | ι | ↔ | blocked | ||
00ED | í | 03CA | ϊ | ↔ | blocked | ||
00ED | í | 0456 | і | ↔ | blocked | ||
00ED | í | 0457 | ї | ↔ | blocked | ||
00ED | í | 0582 | ւ | ↔ | blocked | ||
00ED | í | 05D5 | ו | ↔ | blocked | ||
00ED | í | 1EC9 | ỉ | ↔ | blocked | Required for integration | |
00EF | ï | 0131 | ı | ↔ | blocked | Required for integration | |
00EF | ï | 0269 | ɩ | ↔ | blocked | Required for integration | |
00EF | ï | 0390 | ΐ | ↔ | blocked | ||
00EF | ï | 03AF | ί | ↔ | blocked | ||
00EF | ï | 03B9 | ι | ↔ | blocked | ||
00EF | ï | 03CA | ϊ | ↔ | blocked | Cross-script homoglyph | |
00EF | ï | 0456 | і | ↔ | blocked | ||
00EF | ï | 0457 | ї | ↔ | blocked | Cross-script homoglyph | |
00EF | ï | 0582 | ւ | ↔ | blocked | ||
00EF | ï | 05D5 | ו | ↔ | blocked | ||
00EF | ï | 1EC9 | ỉ | ↔ | blocked | Required for integration | |
0131 | ı | 0131 | ı | ≡ | r-dotless | Dotless form | |
0131 | ı | 0269 | ɩ | ↔ | blocked | Glyphs either homoglyph or nearly identical | |
0131 | ı | 0390 | ΐ | ↔ | blocked | ||
0131 | ı | 03AF | ί | ↔ | blocked | ||
0131 | ı | 03B9 | ι | ↔ | blocked | Cross-script homoglyph | |
0131 | ı | 03CA | ϊ | ↔ | blocked | ||
0131 | ı | 0456 | і | ↔ | blocked | ||
0131 | ı | 0457 | ї | ↔ | blocked | ||
0131 | ı | 0582 | ւ | ↔ | blocked | ||
0131 | ı | 05D5 | ו | ↔ | blocked | Cross-script near homoglyph | |
0131 | ı | 1EC9 | ỉ | ↔ | blocked | Required for integration | |
0269 | ɩ | 0390 | ΐ | ↔ | blocked | ||
0269 | ɩ | 03AF | ί | ↔ | blocked | ||
0269 | ɩ | 03B9 | ι | ↔ | blocked | Cross-script homoglyph | |
0269 | ɩ | 03CA | ϊ | ↔ | blocked | ||
0269 | ɩ | 0456 | і | ↔ | blocked | ||
0269 | ɩ | 0457 | ї | ↔ | blocked | ||
0269 | ɩ | 0582 | ւ | ↔ | blocked | Cross-script near homoglyph | |
0269 | ɩ | 05D5 | ו | ↔ | blocked | ||
0269 | ɩ | 1EC9 | ỉ | ↔ | blocked | Required for integration | |
0390 | ΐ | 0390 | ΐ | ≡ | out-of-repertoire-var | Out-of-repertoire | |
0390 | ΐ | 03AF | ί | ↔ | blocked | ||
0390 | ΐ | 03B9 | ι | ↔ | blocked | ||
0390 | ΐ | 03CA | ϊ | ↔ | blocked | ||
0390 | ΐ | 0456 | і | ↔ | blocked | ||
0390 | ΐ | 0457 | ї | ↔ | blocked | ||
0390 | ΐ | 0582 | ւ | ↔ | blocked | ||
0390 | ΐ | 05D5 | ו | ↔ | blocked | ||
0390 | ΐ | 1EC9 | ỉ | ↔ | blocked | ||
03AF | ί | 03AF | ί | ≡ | out-of-repertoire-var | Out-of-repertoire | |
03AF | ί | 03B9 | ι | ↔ | blocked | ||
03AF | ί | 03CA | ϊ | ↔ | blocked | ||
03AF | ί | 0456 | і | ↔ | blocked | ||
03AF | ί | 0457 | ї | ↔ | blocked | ||
03AF | ί | 0582 | ւ | ↔ | blocked | ||
03AF | ί | 05D5 | ו | ↔ | blocked | ||
03AF | ί | 1EC9 | ỉ | ↔ | blocked | ||
03B9 | ι | 03B9 | ι | ≡ | out-of-repertoire-var | Out-of-repertoire | |
03B9 | ι | 03CA | ϊ | ↔ | blocked | ||
03B9 | ι | 0456 | і | ↔ | blocked | ||
03B9 | ι | 0457 | ї | ↔ | blocked | ||
03B9 | ι | 0582 | ւ | ↔ | blocked | ||
03B9 | ι | 05D5 | ו | ↔ | blocked | ||
03B9 | ι | 1EC9 | ỉ | ↔ | blocked | ||
03CA | ϊ | 03CA | ϊ | ≡ | out-of-repertoire-var | Out-of-repertoire | |
03CA | ϊ | 0456 | і | ↔ | blocked | ||
03CA | ϊ | 0457 | ї | ↔ | blocked | Cross-script homoglyph | |
03CA | ϊ | 0582 | ւ | ↔ | blocked | ||
03CA | ϊ | 05D5 | ו | ↔ | blocked | ||
03CA | ϊ | 1EC9 | ỉ | ↔ | blocked | ||
0456 | і | 0456 | і | ≡ | out-of-repertoire-var | Out-of-repertoire | |
0456 | і | 0457 | ї | ↔ | blocked | ||
0456 | і | 0582 | ւ | ↔ | blocked | ||
0456 | і | 05D5 | ו | ↔ | blocked | ||
0456 | і | 1EC9 | ỉ | ↔ | blocked | ||
0457 | ї | 0457 | ї | ≡ | out-of-repertoire-var | Out-of-repertoire | |
0457 | ї | 0582 | ւ | ↔ | blocked | ||
0457 | ї | 05D5 | ו | ↔ | blocked | ||
0457 | ї | 1EC9 | ỉ | ↔ | blocked | ||
0582 | ւ | 0582 | ւ | ≡ | out-of-repertoire-var | Out-of-repertoire | |
0582 | ւ | 05D5 | ו | ↔ | blocked | ||
0582 | ւ | 1EC9 | ỉ | ↔ | blocked | ||
05D5 | ו | 05D5 | ו | ≡ | out-of-repertoire-var | Out-of-repertoire | |
05D5 | ו | 1EC9 | ỉ | ↔ | blocked |
Source | Glyph | Target | Glyph | Type | Ref | Comment | |
---|---|---|---|---|---|---|---|
006A | j | 0458 | ј | ↔ | blocked | Cross-script homoglyph | |
0458 | ј | 0458 | ј | ≡ | out-of-repertoire-var | Out-of-repertoire |
Source | Glyph | Target | Glyph | Type | Ref | Comment | |
---|---|---|---|---|---|---|---|
006C | l | 04CF | ӏ | ↔ | blocked | Cross-script homoglyph | |
04CF | ӏ | 04CF | ӏ | ≡ | out-of-repertoire-var | Out-of-repertoire |
Source | Glyph | Target | Glyph | Type | Ref | Comment | |
---|---|---|---|---|---|---|---|
006E | n | 0144 | ń | ↔ | blocked | Required for integration | |
006E | n | 014B | ŋ | ↔ | blocked | Required for integration | |
006E | n | 03AE | ή | ↔ | blocked | ||
006E | n | 03B7 | η | ↔ | blocked | Cross-script near homoglyph | |
006E | n | 0572 | ղ | ↔ | blocked | ||
006E | n | 0578 | ո | ↔ | blocked | Cross-script near homoglyph | |
006E | n | 1E45 | ṅ | ↔ | blocked | Required for integration | |
0144 | ń | 014B | ŋ | ↔ | blocked | Required for integration | |
0144 | ń | 03AE | ή | ↔ | blocked | ||
0144 | ń | 03B7 | η | ↔ | blocked | ||
0144 | ń | 0572 | ղ | ↔ | blocked | ||
0144 | ń | 0578 | ո | ↔ | blocked | ||
0144 | ń | 1E45 | ṅ | ↔ | blocked | Glyphs either homoglyph or nearly identical | |
014B | ŋ | 03AE | ή | ↔ | blocked | ||
014B | ŋ | 03B7 | η | ↔ | blocked | Cross-script near homoglyph | |
014B | ŋ | 0572 | ղ | ↔ | blocked | Cross-script near homoglyph | |
014B | ŋ | 0578 | ո | ↔ | blocked | ||
014B | ŋ | 1E45 | ṅ | ↔ | blocked | Required for integration | |
03AE | ή | 03AE | ή | ≡ | out-of-repertoire-var | Out-of-repertoire | |
03AE | ή | 03B7 | η | ↔ | blocked | ||
03AE | ή | 0572 | ղ | ↔ | blocked | ||
03AE | ή | 0578 | ո | ↔ | blocked | ||
03AE | ή | 1E45 | ṅ | ↔ | blocked | ||
03B7 | η | 03B7 | η | ≡ | out-of-repertoire-var | Out-of-repertoire | |
03B7 | η | 0572 | ղ | ↔ | blocked | ||
03B7 | η | 0578 | ո | ↔ | blocked | ||
03B7 | η | 1E45 | ṅ | ↔ | blocked | ||
0572 | ղ | 0572 | ղ | ≡ | out-of-repertoire-var | Out-of-repertoire | |
0572 | ղ | 0578 | ո | ↔ | blocked | ||
0572 | ղ | 1E45 | ṅ | ↔ | blocked | ||
0578 | ո | 0578 | ո | ≡ | out-of-repertoire-var | Out-of-repertoire | |
0578 | ո | 1E45 | ṅ | ↔ | blocked |
Source | Glyph | Target | Glyph | Type | Ref | Comment | |
---|---|---|---|---|---|---|---|
006E 0304 | n̄ | 00F1 | ñ | ↔ | blocked | Glyphs either homoglyph or nearly identical |
Source | Glyph | Target | Glyph | Type | Ref | Comment | |
---|---|---|---|---|---|---|---|
006F | o | 00F3 | ó | ↔ | blocked | Required for integration | |
006F | o | 03BF | ο | ↔ | blocked | Cross-script homoglyph | |
006F | o | 03CC | ό | ↔ | blocked | ||
006F | o | 043E | о | ↔ | blocked | Cross-script homoglyph | |
006F | o | 0585 | օ | ↔ | blocked | Cross-script homoglyph | |
006F | o | 05E1 | ס | ↔ | blocked | Cross-script near homoglyph | |
006F | o | 0B20 | ଠ | ↔ | blocked | Cross-script near homoglyph | |
006F | o | 0D20 | ഠ | ↔ | blocked | Cross-script near homoglyph | |
006F | o | 101D | ဝ | ↔ | blocked | Cross-script near homoglyph | |
00F3 | ó | 03BF | ο | ↔ | blocked | ||
00F3 | ó | 03CC | ό | ↔ | blocked | Cross-script homoglyph | |
00F3 | ó | 043E | о | ↔ | blocked | ||
00F3 | ó | 0585 | օ | ↔ | blocked | ||
00F3 | ó | 05E1 | ס | ↔ | blocked | ||
00F3 | ó | 0B20 | ଠ | ↔ | blocked | ||
00F3 | ó | 0D20 | ഠ | ↔ | blocked | ||
00F3 | ó | 101D | ဝ | ↔ | blocked | ||
03BF | ο | 03BF | ο | ≡ | out-of-repertoire-var | Out-of-repertoire | |
03BF | ο | 03CC | ό | ↔ | blocked | ||
03BF | ο | 043E | о | ↔ | blocked | ||
03BF | ο | 0585 | օ | ↔ | blocked | ||
03BF | ο | 05E1 | ס | ↔ | blocked | ||
03BF | ο | 0B20 | ଠ | ↔ | blocked | ||
03BF | ο | 0D20 | ഠ | ↔ | blocked | ||
03BF | ο | 101D | ဝ | ↔ | blocked | ||
03CC | ό | 03CC | ό | ≡ | out-of-repertoire-var | Out-of-repertoire | |
03CC | ό | 043E | о | ↔ | blocked | ||
03CC | ό | 0585 | օ | ↔ | blocked | ||
03CC | ό | 05E1 | ס | ↔ | blocked | ||
03CC | ό | 0B20 | ଠ | ↔ | blocked | ||
03CC | ό | 0D20 | ഠ | ↔ | blocked | ||
03CC | ό | 101D | ဝ | ↔ | blocked | ||
043E | о | 043E | о | ≡ | out-of-repertoire-var | Out-of-repertoire | |
043E | о | 0585 | օ | ↔ | blocked | ||
043E | о | 05E1 | ס | ↔ | blocked | ||
043E | о | 0B20 | ଠ | ↔ | blocked | ||
043E | о | 0D20 | ഠ | ↔ | blocked | ||
043E | о | 101D | ဝ | ↔ | blocked | ||
0585 | օ | 0585 | օ | ≡ | out-of-repertoire-var | Out-of-repertoire | |
0585 | օ | 05E1 | ס | ↔ | blocked | ||
0585 | օ | 0B20 | ଠ | ↔ | blocked | ||
0585 | օ | 0D20 | ഠ | ↔ | blocked | ||
0585 | օ | 101D | ဝ | ↔ | blocked | ||
05E1 | ס | 05E1 | ס | ≡ | out-of-repertoire-var | Out-of-repertoire | |
05E1 | ס | 0B20 | ଠ | ↔ | blocked | ||
05E1 | ס | 0D20 | ഠ | ↔ | blocked | ||
05E1 | ס | 101D | ဝ | ↔ | blocked | ||
0B20 | ଠ | 0B20 | ଠ | ≡ | out-of-repertoire-var | Out-of-repertoire | |
0B20 | ଠ | 0D20 | ഠ | ↔ | blocked | ||
0B20 | ଠ | 101D | ဝ | ↔ | blocked | ||
0D20 | ഠ | 0D20 | ഠ | ≡ | out-of-repertoire-var | Out-of-repertoire | |
0D20 | ഠ | 101D | ဝ | ↔ | blocked | ||
101D | ဝ | 101D | ဝ | ≡ | out-of-repertoire-var | Out-of-repertoire |
Source | Glyph | Target | Glyph | Type | Ref | Comment | |
---|---|---|---|---|---|---|---|
0070 | p | 03C1 | ρ | ↔ | blocked | Cross-script near homoglyph | |
0070 | p | 0440 | р | ↔ | blocked | Cross-script homoglyph | |
03C1 | ρ | 03C1 | ρ | ≡ | out-of-repertoire-var | Out-of-repertoire | |
03C1 | ρ | 0440 | р | ↔ | blocked | ||
0440 | р | 0440 | р | ≡ | out-of-repertoire-var | Out-of-repertoire |
Source | Glyph | Target | Glyph | Type | Ref | Comment | |
---|---|---|---|---|---|---|---|
0071 | q | 0566 | զ | ↔ | blocked | Cross-script near homoglyph | |
0566 | զ | 0566 | զ | ≡ | out-of-repertoire-var | Out-of-repertoire |
Source | Glyph | Target | Glyph | Type | Ref | Comment | |
---|---|---|---|---|---|---|---|
0072 | r | 0433 | г | ↔ | blocked | Cross-script near homoglyph | |
0433 | г | 0433 | г | ≡ | out-of-repertoire-var | Out-of-repertoire |
Source | Glyph | Target | Glyph | Type | Ref | Comment | |
---|---|---|---|---|---|---|---|
0073 | s | 0455 | ѕ | ↔ | blocked | Cross-script homoglyph | |
0073 | s | 0D1F | ട | ↔ | blocked | Cross-script near homoglyph | |
0455 | ѕ | 0455 | ѕ | ≡ | out-of-repertoire-var | Out-of-repertoire | |
0455 | ѕ | 0D1F | ട | ↔ | blocked | ||
0D1F | ട | 0D1F | ട | ≡ | out-of-repertoire-var | Out-of-repertoire |
Source | Glyph | Target | Glyph | Type | Ref | Comment | |
---|---|---|---|---|---|---|---|
0073 0073 | ss | 00DF | ß | → | blocked | IDNA2003 Compatibility | |
← | eszett-to-ss | IDNA2003 Compatibility | |||||
0073 0073 | ss | 03B2 | β | ↔ | blocked | ||
0073 0073 | ss | 0455 0455 | ѕѕ | ↔ | blocked | Cross-script homoglyph | |
0073 0073 | ss | 0D1F 0D1F | ടട | ↔ | blocked | Cross-script near homoglyph | |
00DF | ß | 00DF | ß | ≡ | r-eszett | Eszett | |
00DF | ß | 03B2 | β | ↔ | blocked | Cross-script near homoglyph | |
00DF | ß | 0455 0455 | ѕѕ | ↔ | blocked | ||
00DF | ß | 0D1F 0D1F | ടട | ↔ | blocked | ||
03B2 | β | 03B2 | β | ≡ | out-of-repertoire-var | Out-of-repertoire | |
03B2 | β | 0455 0455 | ѕѕ | ↔ | blocked | ||
03B2 | β | 0D1F 0D1F | ടട | ↔ | blocked | ||
0455 0455 | ѕѕ | 0455 0455 | ѕѕ | ≡ | out-of-repertoire-var | Out-of-repertoire | |
0455 0455 | ѕѕ | 0D1F 0D1F | ടട | ↔ | blocked | ||
0D1F 0D1F | ടട | 0D1F 0D1F | ടട | ≡ | out-of-repertoire-var | Out-of-repertoire |
Source | Glyph | Target | Glyph | Type | Ref | Comment | |
---|---|---|---|---|---|---|---|
0075 | u | 00FA | ú | ↔ | blocked | Required for integration | |
0075 | u | 00FC | ü | ↔ | blocked | Required for integration | |
0075 | u | 028B | ʋ | ↔ | blocked | Required for integration | |
0075 | u | 03B0 | ΰ | ↔ | blocked | ||
0075 | u | 03C5 | υ | ↔ | blocked | ||
0075 | u | 03CB | ϋ | ↔ | blocked | ||
0075 | u | 03CD | ύ | ↔ | blocked | ||
0075 | u | 057D | ս | ↔ | blocked | Cross-script near homoglyph | |
00FA | ú | 00FC | ü | ↔ | blocked | Required for integration | |
00FA | ú | 028B | ʋ | ↔ | blocked | Required for integration | |
00FA | ú | 03B0 | ΰ | ↔ | blocked | ||
00FA | ú | 03C5 | υ | ↔ | blocked | ||
00FA | ú | 03CB | ϋ | ↔ | blocked | ||
00FA | ú | 03CD | ύ | ↔ | blocked | Cross-script near homoglyph | |
00FA | ú | 057D | ս | ↔ | blocked | ||
00FC | ü | 028B | ʋ | ↔ | blocked | Required for integration | |
00FC | ü | 03B0 | ΰ | ↔ | blocked | ||
00FC | ü | 03C5 | υ | ↔ | blocked | ||
00FC | ü | 03CB | ϋ | ↔ | blocked | Cross-script near homoglyph | |
00FC | ü | 03CD | ύ | ↔ | blocked | ||
00FC | ü | 057D | ս | ↔ | blocked | ||
028B | ʋ | 03B0 | ΰ | ↔ | blocked | ||
028B | ʋ | 03C5 | υ | ↔ | blocked | ||
028B | ʋ | 03CB | ϋ | ↔ | blocked | ||
028B | ʋ | 03CD | ύ | ↔ | blocked | ||
028B | ʋ | 057D | ս | ↔ | blocked | ||
03B0 | ΰ | 03B0 | ΰ | ≡ | out-of-repertoire-var | Out-of-repertoire | |
03B0 | ΰ | 03C5 | υ | ↔ | blocked | ||
03B0 | ΰ | 03CB | ϋ | ↔ | blocked | ||
03B0 | ΰ | 03CD | ύ | ↔ | blocked | ||
03B0 | ΰ | 057D | ս | ↔ | blocked | ||
03C5 | υ | 03C5 | υ | ≡ | out-of-repertoire-var | Out-of-repertoire | |
03C5 | υ | 03CB | ϋ | ↔ | blocked | ||
03C5 | υ | 03CD | ύ | ↔ | blocked | ||
03C5 | υ | 057D | ս | ↔ | blocked | ||
03CB | ϋ | 03CB | ϋ | ≡ | out-of-repertoire-var | Out-of-repertoire | |
03CB | ϋ | 03CD | ύ | ↔ | blocked | ||
03CB | ϋ | 057D | ս | ↔ | blocked | ||
03CD | ύ | 03CD | ύ | ≡ | out-of-repertoire-var | Out-of-repertoire | |
03CD | ύ | 057D | ս | ↔ | blocked | ||
057D | ս | 057D | ս | ≡ | out-of-repertoire-var | Out-of-repertoire |
Source | Glyph | Target | Glyph | Type | Ref | Comment | |
---|---|---|---|---|---|---|---|
0076 | v | 03BD | ν | ↔ | blocked | Cross-script near homoglyph | |
03BD | ν | 03BD | ν | ≡ | out-of-repertoire-var | Out-of-repertoire |
Source | Glyph | Target | Glyph | Type | Ref | Comment | |
---|---|---|---|---|---|---|---|
0078 | x | 0445 | х | ↔ | blocked | Cross-script homoglyph | |
0445 | х | 0445 | х | ≡ | out-of-repertoire-var | Out-of-repertoire |
Source | Glyph | Target | Glyph | Type | Ref | Comment | |
---|---|---|---|---|---|---|---|
0079 | y | 0263 | ɣ | ↔ | blocked | Required for integration | |
0079 | y | 03B3 | γ | ↔ | blocked | Cross-script near homoglyph | |
0079 | y | 0443 | у | ↔ | blocked | Cross-script homoglyph | |
0079 | y | 04AF | ү | ↔ | blocked | Cross-script near homoglyph | |
0263 | ɣ | 03B3 | γ | ↔ | blocked | Cross-script near homoglyph | |
0263 | ɣ | 0443 | у | ↔ | blocked | ||
0263 | ɣ | 04AF | ү | ↔ | blocked | ||
03B3 | γ | 03B3 | γ | ≡ | out-of-repertoire-var | Out-of-repertoire | |
03B3 | γ | 0443 | у | ↔ | blocked | ||
03B3 | γ | 04AF | ү | ↔ | blocked | ||
0443 | у | 0443 | у | ≡ | out-of-repertoire-var | Out-of-repertoire | |
0443 | у | 04AF | ү | ↔ | blocked | ||
04AF | ү | 04AF | ү | ≡ | out-of-repertoire-var | Out-of-repertoire |
Source | Glyph | Target | Glyph | Type | Ref | Comment | |
---|---|---|---|---|---|---|---|
00E0 | à | 1EA3 | ả | ↔ | blocked | Glyphs either homoglyph or nearly identical |
Source | Glyph | Target | Glyph | Type | Ref | Comment | |
---|---|---|---|---|---|---|---|
00E3 | ã | 0101 | ā | ↔ | blocked | Glyphs either homoglyph or nearly identical |
Source | Glyph | Target | Glyph | Type | Ref | Comment | |
---|---|---|---|---|---|---|---|
00E4 | ä | 04D3 | ӓ | ↔ | blocked | Cross-script homoglyph | |
04D3 | ӓ | 04D3 | ӓ | ≡ | out-of-repertoire-var | Out-of-repertoire |
Source | Glyph | Target | Glyph | Type | Ref | Comment | |
---|---|---|---|---|---|---|---|
00E6 | æ | 04D5 | ӕ | ↔ | blocked | Cross-script homoglyph | |
04D5 | ӕ | 04D5 | ӕ | ≡ | out-of-repertoire-var | Out-of-repertoire |
Source | Glyph | Target | Glyph | Type | Ref | Comment | |
---|---|---|---|---|---|---|---|
00E7 | ç | 04AB | ҫ | ↔ | blocked | Cross-script near homoglyph | |
04AB | ҫ | 04AB | ҫ | ≡ | out-of-repertoire-var | Out-of-repertoire |
Source | Glyph | Target | Glyph | Type | Ref | Comment | |
---|---|---|---|---|---|---|---|
00EB | ë | 0451 | ё | ↔ | blocked | Cross-script homoglyph | |
0451 | ё | 0451 | ё | ≡ | out-of-repertoire-var | Out-of-repertoire |
Source | Glyph | Target | Glyph | Type | Ref | Comment | |
---|---|---|---|---|---|---|---|
00F2 | ò | 1ECF | ỏ | ↔ | blocked | Glyphs either homoglyph or nearly identical |
Source | Glyph | Target | Glyph | Type | Ref | Comment | |
---|---|---|---|---|---|---|---|
00F5 | õ | 014D | ō | ↔ | blocked | Glyphs either homoglyph or nearly identical |
Source | Glyph | Target | Glyph | Type | Ref | Comment | |
---|---|---|---|---|---|---|---|
00F6 | ö | 04E7 | ӧ | ↔ | blocked | Cross-script homoglyph | |
04E7 | ӧ | 04E7 | ӧ | ≡ | out-of-repertoire-var | Out-of-repertoire |
Source | Glyph | Target | Glyph | Type | Ref | Comment | |
---|---|---|---|---|---|---|---|
00F9 | ù | 1EE7 | ủ | ↔ | blocked | Glyphs either homoglyph or nearly identical |
Source | Glyph | Target | Glyph | Type | Ref | Comment | |
---|---|---|---|---|---|---|---|
00FD | ý | 1EF3 | ỳ | ↔ | blocked | Variant due to transitivity | |
00FD | ý | 1EF7 | ỷ | ↔ | blocked | Glyphs either homoglyph or nearly identical | |
1EF3 | ỳ | 1EF7 | ỷ | ↔ | blocked | Glyphs either homoglyph or nearly identical |
Source | Glyph | Target | Glyph | Type | Ref | Comment | |
---|---|---|---|---|---|---|---|
00FF | ÿ | 04F1 | ӱ | ↔ | blocked | Cross-script homoglyph | |
04F1 | ӱ | 04F1 | ӱ | ≡ | out-of-repertoire-var | Out-of-repertoire |
Source | Glyph | Target | Glyph | Type | Ref | Comment | |
---|---|---|---|---|---|---|---|
0103 | ă | 04D1 | ӑ | ↔ | blocked | Cross-script homoglyph | |
04D1 | ӑ | 04D1 | ӑ | ≡ | out-of-repertoire-var | Out-of-repertoire |
Source | Glyph | Target | Glyph | Type | Ref | Comment | |
---|---|---|---|---|---|---|---|
0107 | ć | 010B | ċ | ↔ | blocked | Glyphs either homoglyph or nearly identical |
Source | Glyph | Target | Glyph | Type | Ref | Comment | |
---|---|---|---|---|---|---|---|
0113 | ē | 1EBD | ẽ | ↔ | blocked | Glyphs either homoglyph or nearly identical |
Source | Glyph | Target | Glyph | Type | Ref | Comment | |
---|---|---|---|---|---|---|---|
011F | ğ | 01E7 | ǧ | ↔ | blocked | Glyphs either homoglyph or nearly identical |
Source | Glyph | Target | Glyph | Type | Ref | Comment | |
---|---|---|---|---|---|---|---|
0121 | ġ | 0123 | ģ | ↔ | blocked | Glyphs either homoglyph or nearly identical |
Source | Glyph | Target | Glyph | Type | Ref | Comment | |
---|---|---|---|---|---|---|---|
0127 | ħ | 045B | ћ | ↔ | blocked | Cross-script homoglyph | |
045B | ћ | 045B | ћ | ≡ | out-of-repertoire-var | Out-of-repertoire |
Source | Glyph | Target | Glyph | Type | Ref | Comment | |
---|---|---|---|---|---|---|---|
0129 | ĩ | 012B | ī | ↔ | blocked | Glyphs either homoglyph or nearly identical |
Source | Glyph | Target | Glyph | Type | Ref | Comment | |
---|---|---|---|---|---|---|---|
0155 | ŕ | 0453 | ѓ | ↔ | blocked | Cross-script near homoglyph | |
0453 | ѓ | 0453 | ѓ | ≡ | out-of-repertoire-var | Out-of-repertoire |
Source | Glyph | Target | Glyph | Type | Ref | Comment | |
---|---|---|---|---|---|---|---|
0169 | ũ | 016B | ū | ↔ | blocked | Glyphs either homoglyph or nearly identical |
Source | Glyph | Target | Glyph | Type | Ref | Comment | |
---|---|---|---|---|---|---|---|
017A | ź | 017C | ż | ↔ | blocked | Glyphs either homoglyph or nearly identical |
Source | Glyph | Target | Glyph | Type | Ref | Comment | |
---|---|---|---|---|---|---|---|
01A1 | ơ | 03C2 | ς | ↔ | blocked | ||
01A1 | ơ | 03C3 | σ | ↔ | blocked | Cross-script near homoglyph | |
03C2 | ς | 03C2 | ς | ≡ | out-of-repertoire-var | Out-of-repertoire | |
03C2 | ς | 03C3 | σ | ↔ | blocked | ||
03C3 | σ | 03C3 | σ | ≡ | out-of-repertoire-var | Out-of-repertoire |
Source | Glyph | Target | Glyph | Type | Ref | Comment | |
---|---|---|---|---|---|---|---|
01DD | ǝ | 0259 | ə | ↔ | blocked | Glyphs either homoglyph or nearly identical | |
01DD | ǝ | 04D9 | ә | ↔ | blocked | Cross-script homoglyph | |
0259 | ə | 04D9 | ә | ↔ | blocked | Cross-script homoglyph | |
04D9 | ә | 04D9 | ә | ≡ | out-of-repertoire-var | Out-of-repertoire |
Source | Glyph | Target | Glyph | Type | Ref | Comment | |
---|---|---|---|---|---|---|---|
024D | ɍ | 0493 | ғ | ↔ | blocked | Cross-script near homoglyph | |
0493 | ғ | 0493 | ғ | ≡ | out-of-repertoire-var | Out-of-repertoire |
Source | Glyph | Target | Glyph | Type | Ref | Comment | |
---|---|---|---|---|---|---|---|
025B | ɛ | 03AD | έ | ↔ | blocked | Cross-script homoglyph | |
025B | ɛ | 03B5 | ε | ↔ | blocked | Cross-script homoglyph | |
03AD | έ | 03AD | έ | ≡ | out-of-repertoire-var | Out-of-repertoire | |
03AD | έ | 03B5 | ε | ↔ | blocked | ||
03B5 | ε | 03B5 | ε | ≡ | out-of-repertoire-var | Out-of-repertoire |
Source | Glyph | Target | Glyph | Type | Ref | Comment | |
---|---|---|---|---|---|---|---|
0292 | ʒ | 04E1 | ӡ | ↔ | blocked | Cross-script homoglyph | |
04E1 | ӡ | 04E1 | ӡ | ≡ | out-of-repertoire-var | Out-of-repertoire |
Source | Glyph | Target | Glyph | Type | Ref | Comment | |
---|---|---|---|---|---|---|---|
045F | џ | 045F | џ | ≡ | out-of-repertoire-var | Out-of-repertoire | |
045F | џ | 1EE5 | ụ | ↔ | blocked | Cross-script near homoglyph |
Implict defined by script tag | 8 |
---|
The following table lists all named and implicit classes with their definition and a list of their members intersected with the current repertoire (for larger classes, this list is elided).
Name | Definition | Count | Members or Ranges | Ref | Comment |
---|---|---|---|---|---|
implicit | Tag=sc:Armn | 8 | {0566 0570 0572 0578 057D 0581-0582 0585} | Any character tagged as Armenian | |
implicit | Tag=sc:Cyrl | 28 | {0430 0433 0435 043E 0440-0441 0443 0445 0451 0453 0455-0458 045B 045F 0493 04AB 04AF 04BB 04CF 04D1 04D3 04D5 04D9 04E1 04E7 04F1} | Any character tagged as Cyrillic | |
implicit | Tag=sc:Grek | 22 | {0390 03AC-03B3 03B5 03B7 03B9 03BD 03BF 03C1-03C3 03C5 03CA-03CD} | Any character tagged as Greek | |
implicit | Tag=sc:Hebr | 2 | {05D5 05E1} | Any character tagged as Hebrew | |
implicit | Tag=sc:Latn | 197 | {0061-007A 00DF-00F6 00F8-00FF 0101 0103 0105 0107 0109 010B 010D 010F 0111 0113 0117 0119 011B 011D 011F 0121 0123 0125 0127 0129 012B 012F 0131 0135 0137 ...} | Any character tagged as Latin | |
implicit | Tag=sc:Mlym | 2 | {0D1F-0D20} | Any character tagged as Malayalam | |
implicit | Tag=sc:Mymr | 2 | {1004 101D} | Any character tagged as Myanmar | |
implicit | Tag=sc:Orya | 1 | {0B20} | Any character tagged as Oriya |
Number of rules | 1 |
---|---|
Used to trigger actions | 1 |
The following table lists all named rules defined in the LGR and indicates whether they are used as trigger in an action or as context (when or not-when) for a code point or variant.
Name | Regular Expression | Used as Trigger |
Anchor | Used as Context |
Ref | Comment |
---|---|---|---|---|---|---|
leading-combining-mark | (start)[[\p{gc=Mn}] ∪ [∅=\p{gc=Mc}]] |
✔ | Default WLE rule matching labels with leading combining marks ⍟ |
The following table lists the actions that are used to assign dispositions to labels and variant labels based on the specified conditions. The order of actions defines their precedence: the first action triggered by a label is the one defining its disposition.
# | Condition | Rule / Variant Set | Disposition | Ref | Comment | |
---|---|---|---|---|---|---|
1 | if label matches | leading-combining-mark | → | invalid | labels with leading combining marks are invalid ⍟ | |
2 | if at least one variant is in | {out-of-repertoire-var} | → | invalid | any variant label with a code point out of repertoire is invalid ⍟ | |
3 | if at least one variant is in | {blocked} | → | blocked | any variant label containing blocked variants is blocked ⍟ | |
4 | if each variant is in | {r-eszett r-dotless} | → | valid | any remaining label containing only original code points is valid | |
5 | if each variant is in | {r-eszett dotted} | → | allocatable | any label with all dotted letters i and sharp s as applied for is allocatable | |
6 | if each variant is in | {eszett-to-ss r-dotless} | → | allocatable | any label with no sharp s and dotless letters i as applied for is allocatable | |
7 | if each variant is in | {eszett-to-ss dotted} | → | allocatable | any label with no sharp s and all dotted letters i is allocatable | |
8 | if at least one variant is in | {eszett-to-ss dotted} | → | blocked | any variant label with a mix of variant forms is blocked | |
9 | if each variant is in | {allocatable} | → | allocatable | variant labels with all variants allocatable are allocatable ⍟ | |
10 | if any label (catch-all) | → | valid | catch all (default action) ⍟ |
The following lists the references cited for specific code points, variants, classes, rules or actions in this LGR. For General references refer to the "References" section in the Description.
[0] | The Unicode Standard 1.1 Any code point originally encoded in Unicode 1.1 |
[3] | The Unicode Standard 3.0 Any code point originally encoded in Unicode 3.0 |
[8] | The Unicode Standard 5.0 Any code point originally encoded in Unicode 5.0 |
[99] | CO Controls and Basic Latin, The Unicode Standard https://unicode.org/charts/PDF/U0000.pdf Any code point cited is part of the Basic Latin (ASCII) set |
[100] | ICANN, Second Level Reference Label Generation Rules for Spanish https://www.icann.org/sites/default/files/packages/lgr/lgr-second-level-spanish-30aug16-en.html (Accessed on 31 August 2018) |
[101] | Omniglot, Czech (čeština) https://www.omniglot.com/writing/czech.htm (Accessed on 31 August 2018) |
[102] | Omniglot, Icelandic (Íslenska) https://www.omniglot.com/writing/icelandic.htm (Accessed on 31 August 2018) |
[103] | Omniglot, Faroese (føroyskt mál) https://www.omniglot.com/writing/faroese.htm (Accessed on 31 August 2018) |
[105] | Omniglot, Chuukese (Chuuk) https://www.omniglot.com/writing/chuukese.htm (Accessed on 31 August 2018) |
[106] | SCRIPTSOURCE, Galician written with Latin script https://www.webcitation.org/6siTI8ieQ (Accessed on 31 August 2018) |
[107] | Omniglot, Lule Sámi (julevsámegiella) https://www.omniglot.com/writing/lulesami.htm (Accessed on 31 August 2018) |
[108] | Wikipedia, Northern Sami https://en.wikipedia.org/wiki/Northern_Sami (Accessed on 4 September 2018) |
[109] | Omniglot, Vietnamese (tiếng việt / 㗂越) https://www.omniglot.com/writing/vietnamese.htm (Accessed on 4 September 2018) |
[110] | Omniglot, Romanian (limba română) https://www.omniglot.com/writing/romanian.htm (Accessed on 4 September 2018) |
[113] | Omniglot, Skolt Sámi (Sääˊmǩiõll / Nuõrttsää’m) https://www.omniglot.com/writing/skoltsami.htm (Accessed on 4 September 2018) |
[114] | Omniglot, French (français) https://www.omniglot.com/writing/french.htm (Accessed on 4 September 2018) |
[115] | Omniglot, West Frisian (Frysk) https://www.omniglot.com/writing/westfrisian.htm (Accessed on 4 September 2018) |
[116] | Omniglot, Friulian (furlan/marilenghe) https://www.omniglot.com/writing/friulian.htm (Accessed on 4 September 2018) |
[117] | Summer Institute of Linguistics, Pequeno dicionário: Xavante-Português, Português-Xavante https://www.sil.org/resources/archives/17019 (Accessed on 1 October 2020) |
[119] | Omniglot, German (Deutsch) https://www.omniglot.com/writing/german.htm (Accessed on 4 September 2018) |
[120] | Omniglot, Finnish (suomi) https://www.omniglot.com/writing/finnish.htm (Accessed on 4 September 2018) |
[121] | Omniglot, Turkmen (Türkmen dili / Түркмен дили) https://www.omniglot.com/writing/turkmen.htm (Accessed on 4 September 2018) |
[122] | Omniglot, Estonian (eesti keel) https://www.omniglot.com/writing/estonian.htm (Accessed on 4 September 2018) |
[123] | Omniglot, Swedish (svenska) https://www.omniglot.com/writing/swedish.htm (Accessed on 4 September 2018) |
[124] | Omniglot, Yapese (Waab) https://www.omniglot.com/writing/yapese.htm (Accessed on 4 September 2018) |
[125] | Omniglot, Dinka (Thuɔŋjäŋ) https://www.omniglot.com/writing/dinka.php (Accessed on 4 September 2018) |
[126] | Omniglot, Kaqchikel (Kaqchikel Ch’ab’äl) https://www.omniglot.com/writing/kaqchikel.htm (Accessed on 4 September 2018) |
[127] | Omniglot, Bashkir/Bashkort (Башҡорт теле / Başqort tele) https://www.omniglot.com/writing/bashkir.htm (Accessed on 4 September 2018) |
[128] | Omniglot, Alsatian (Ëlsässisch) https://www.omniglot.com/writing/alsatian.htm (Accessed on 4 September 2018) |
[129] | Wikipedia, Nuer language https://en.wikipedia.org/wiki/Nuer_language (Accessed on 4 September 2018) |
[130] | Omniglot, Italian (italiano) https://www.omniglot.com/writing/italian.htm (Accessed on 4 September 2018) |
[131] | Wikipedia, Italian orthography https://en.wikipedia.org/wiki/Italian_orthography (Accessed on 4 September 2018) |
[132] | Omniglot, Wolof (Wollof) https://www.omniglot.com/writing/wolof.htm (Accessed on 4 September 2018) |
[133] | Omniglot, Latvian (latviešu valoda) https://www.omniglot.com/writing/latvian.htm (Accessed on 4 September 2018) |
[134] | Omniglot, Tongan (Faka-Tonga) https://www.omniglot.com/writing/tongan.htm (Accessed on 4 September 2018) |
[135] | Omniglot, Hawaiian (ʻŌlelo Hawaiʻi) https://www.omniglot.com/writing/hawaiian.htm (Accessed on 4 September 2018) |
[136] | Omniglot, Marshallese (kajin m̧ajeļ) https://www.omniglot.com/writing/marshallese.php (Accessed on 4 September 2018) |
[137] | Omniglot, Polish (polski) https://www.omniglot.com/writing/polish.htm (Accessed on 4 September 2018) |
[138] | Omniglot, Lithuanian (lietuvių kalba) https://www.omniglot.com/writing/lithuanian.htm (Accessed on 4 September 2018) |
[139] | Omniglot, Danish (dansk) https://www.omniglot.com/writing/danish.htm (Accessed on 4 September 2018) |
[140] | Omniglot, Chamorro (chamoru) https://www.omniglot.com/writing/chamorro.htm (Accessed on 4 September 2018) |
[141] | Omniglot, Umbundu (Úmbúndú) https://www.omniglot.com/writing/umbundu.htm (Accessed on 4 September 2018) |
[142] | Omniglot, Guaraní (Avañe’ẽ) https://www.omniglot.com/writing/guarani.htm (Accessed on 4 September 2018) |
[143] | Wikipedia, Guarani alphabet https://en.wikipedia.org/wiki/Guarani_alphabet (Accessed on 4 September 2018) |
[144] | Omniglot, Nauruan (Ekaiairũ Naoero) https://www.omniglot.com/writing/nauruan.htm (Accessed on 4 September 2018) |
[145] | Omniglot, Khoekhoe (Khoekhoegowab) https://www.omniglot.com/writing/khoekhoe.htm (Accessed on 4 September 2018) |
[146] | Omniglot, Nuer (Naath) https://www.omniglot.com/writing/nuer.htm (Accessed on 4 September 2018) |
[147] | Omniglot, Hausa (Harshen Hausa / هَرْشَن هَوْسَ) https://www.omniglot.com/writing/hausa.htm (Accessed on 4 September 2018) |
[148] | Omniglot, Dagaare https://www.omniglot.com/writing/dagaare.htm (Accessed on 4 September 2018) |
[149] | Omniglot, Fula (Fulfulde, Pulaar, Pular’Fulaare) https://www.omniglot.com/writing/fula.htm (Accessed on 4 September 2018) |
[150] | Omniglot, Croatian (Hrvatski) https://www.omniglot.com/writing/croatian.htm (Accessed on 4 September 2018) |
[151] | Omniglot, Serbian (српски / srpski) https://www.omniglot.com/writing/serbian.htm (Accessed on 4 September 2018) |
[152] | Wikipedia, Polish language https://en.wikipedia.org/wiki/Polish_language (Accessed on 4 September 2018) |
[153] | Omniglot, Slovak (slovenčina) https://www.omniglot.com/writing/slovak.htm (Accessed on 4 September 2018) |
[154] | Evertype Publishing, Lithuanian lietuvių kalba Version 1.1 https://www.evertype.com/alphabets/lithuanian.pdf (Accessed on 4 September 2018) |
[157] | Omniglot, Turkish (Türkçe) https://www.omniglot.com/writing/turkish.htm (Accessed on 4 September 2018) |
[158] | Omniglot, Kurdish (Kurdî / کوردی) https://www.omniglot.com/writing/kurdish.htm (Accessed on 4 September 2018) |
[159] | Omniglot, Azerbaijani (آذربايجانجا ديلي / Азәрбајҹан дили / Azərbaycan dili) https://www.omniglot.com/writing/azeri.htm (Accessed on 4 September 2018) |
[160] | Omniglot, Basque (euskara) https://www.omniglot.com/writing/basque.htm (Accessed on 4 September 2018) |
[161] | Wikipedia, Basque language https://en.wikipedia.org/wiki/Basque_language#Writing_system (Accessed on 4 September 2018) |
[163] | Omniglot, Maltese (Malti) https://www.omniglot.com/writing/maltese.htm (Accessed on 4 September 2018) |
[164] | Omniglot, Venda (Tshivenḓa / Luvenḓa) https://www.omniglot.com/writing/venda.htm (Accessed on 4 September 2018) |
[168] | Omniglot, Brahui (Bráhuí / براوی) https://www.omniglot.com/writing/brahui.htm (Accessed on 4 September 2018) |
[169] | Wikipedia, Fon language https://en.wikipedia.org/wiki/Fon_language (Accessed on 4 September 2018) |
[170] | Omniglot, Ewe (Eʋegbe) https://www.omniglot.com/writing/ewe.htm (Accessed on 4 September 2018) |
[172] | Omniglot, Sorbian (hornjoserbsce/dolnoserbski) https://www.omniglot.com/writing/sorbian.htm (Accessed on 4 September 2018) |
[173] | Peace corps, Botswana, An Introduction to Setswana Language https://files.peacecorps.gov/multimedia/audio/languagelessons/botswana/Bw_Setswana_Language_Lessons.pdf (Accessed on 4 September 2018) |
[174] | Omniglot, Tswana (Setswana) https://www.omniglot.com/writing/tswana.php (Accessed on 4 September 2018) |
[175] | Wikipedia, Afrikaans https://en.wikipedia.org/wiki/Afrikaans (Accessed on 4 September 2018) |
[176] | Omniglot, Albanian (shqip / gjuha shqipe) https://www.omniglot.com/writing/albanian.htm (Accessed on 4 September 2018) |
[177] | Wikipedia, Albanian alphabet https://en.wikipedia.org/wiki/Albanian_alphabet (Accessed on 4 September 2018) |
[179] | Wikipedia, Uyghur Latin alphabet https://en.wikipedia.org/wiki/Uyghur_Latin_alphabet (Accessed on 4 September 2018) |
[180] | Omniglot, Drehu (Deʼu) https://www.omniglot.com/writing/drehu.php (Accessed on 4 September 2018) |
[182] | Omniglot, Haitian Creole (Kreyòl ayisyen) https://www.omniglot.com/writing/haitiancreole.htm (Accessed on 4 September 2018) |
[183] | Wikipedia, Haitian Creole https://en.wikipedia.org/wiki/Haitian_Creole#Orthography (Accessed on 4 September 2018) |
[184] | Omniglot, Minangkabau (Baso Minangkabau / باسو مينڠكاباو) https://www.omniglot.com/writing/minangkabau.htm (Accessed on 4 September 2018) |
[185] | Omniglot, Palauan (a tekoi er a Belau) https://www.omniglot.com/writing/palauan.htm (Accessed on 4 September 2018) |
[186] | Omniglot, Cubeo (pãmié) https://www.omniglot.com/writing/cubeo.htm (Accessed on 4 September 2018) |
[187] | Editorial Alberto Lleras Camargo, Diccionario Ilustrado Bilingüe cubeo-español español-cubeo https://www.sil.org/system/files/reapdata/10/58/27/10582785843693992331766506069073895620/40337_01.pdf (Accessed on 4 September 2018) |
[188] | Omniglot, Inari Saami (Anarâškielâ) https://www.omniglot.com/writing/inarisami.htm (Accessed on 4 September 2018) |
[189] | Omniglot, Compiled by Wolfram Siegel, DAGBANI https://www.omniglot.com/charts/dagbani.pdf (Accessed on 4 September 2018) |
[190] | Omniglot, Ewondo https://www.omniglot.com/writing/ewondo.php (Accessed on 4 September 2018) |
[191] | Omniglot, Luganda (Oluganda) https://www.omniglot.com/writing/ganda.php (Accessed on 4 September 2018) |
[192] | Omniglot, Adzera https://www.omniglot.com/writing/adzera.htm (Accessed on 4 September 2018) |
[193] | Omniglot, Ga (Gã) https://www.omniglot.com/writing/ga.htm (Accessed on 4 September 2018) |
[194] | Omniglot, Duala (Duálá) https://www.omniglot.com/writing/duala.php (Accessed on 4 September 2018) |
[195] | Omniglot, Soga (Lusoga) https://www.omniglot.com/writing/soga.htm (Accessed on 4 September 2018) |
[196] | Omniglot, Alur (Lur) https://www.omniglot.com/writing/alur.htm (Accessed on 4 September 2018) |
[197] | Omniglot, Mandinka (Mandi’nka kango / لغة مندنكا) https://www.omniglot.com/writing/mandinka.htm (Accessed on 4 September 2018) |
[198] | Omniglot, Acholi (Lwo) https://www.omniglot.com/writing/acholi.htm (Accessed on 4 September 2018) |
[199] | Omniglot, Bambara (Bamanankan) https://www.omniglot.com/writing/bambara.htm (Accessed on 4 September 2018) |
[200] | Omniglot, Raga (Hano) https://www.omniglot.com/writing/raga.htm (Accessed on 4 September 2018) |
[201] | Omniglot, Tatar (tatarça / татарча / تاتارچا) https://www.omniglot.com/writing/tatar.htm (Accessed on 4 September 2018) |
[202] | Omniglot, Zaza (Zazaki / زازاکی) https://www.omniglot.com/writing/zazaki.htm (Accessed on 4 September 2018) |
[203] | Wikipedia, Turkish alphabet https://en.wikipedia.org/wiki/Turkish_alphabet (Accessed on 4 September 2018) |
[204] | School of English, Adam Michiewicz University, Poznań, Poland, Poznań Studies in Contemporary Linguistics 43(1),2007, pp. 169-180, A Demographic Igbo Orthography https://www.degruyter.com/downloadpdf/j/psicl.2007.43.issue-1/v10010-007-0009-0/v10010-007-0009-0.pdf (Accessed on 4 September 2018) |
[205] | Omniglot, Igbo (Asụsụ Igbo) https://www.omniglot.com/writing/igbo.htm (Accessed on 4 September 2018) |
[206] | ItalianPod101, Italian Accents and Proper Italian Pronunciation https://www.italianpod101.com/italian-accents (Accessed on 4 September 2018) |
[208] | Reverso Dictionary, venerdì translation | Italian-English dictionary https://dictionary.reverso.net/italian-english/venerd%C3%AC (Accessed on 4 September 2018) |
[209] | Omniglot, Kikuyu (Gĩkũyũ) https://www.omniglot.com/writing/kikuyu.htm (Accessed on 4 September 2018) |
[210] | Omniglot, Hixkaryána https://www.omniglot.com/writing/hixkaryana.htm (Accessed on 4 September 2018) |
[211] | Omniglot, Maasai (ɔl Maa) https://www.omniglot.com/writing/maasai.htm (Accessed on 4 September 2018) |
[212] | Omniglot, Mossi (Mòoré) https://www.omniglot.com/writing/mossi.htm (Accessed on 4 September 2018) |
[213] | Omniglot, Jenesis. The Bible in Marshallese, 2009., Contributed by Wolfgang Kuhl https://www.omniglot.com/babel/marshallese.htm (Accessed on 4 September 2018) |
[214] | Wikipedia, Cedilla https://en.wikipedia.org/wiki/Cedilla#Marshallese (Accessed on 4 September 2018) |
[215] | Wikipedia, Marshallese language https://en.wikipedia.org/wiki/Marshallese_language#Display_issues (Accessed on 4 September 2018) |
[216] | Trussel, Marshallese-English Online Dictionary https://www.trussel2.com/MOD/ (Accessed on 4 September 2018) |
[218] | Omniglot, Susu (Sosoxi) https://www.omniglot.com/writing/susu.htm (Accessed on 4 September 2018) |
[219] | Omniglot, Zarma (Zarmaciine) https://www.omniglot.com/writing/zarma.htm (Accessed on 4 September 2018) |
[220] | Omniglot, Pitjantjatjara https://www.omniglot.com/writing/pitjantjatjara.htm (Accessed on 4 September 2018) |
[221] | Omniglot, Spanish (español/castellano) https://www.omniglot.com/writing/spanish.htm (Accessed on 4 September 2018) |
[222] | Omniglot, Filipino (wikang Filipino) https://www.omniglot.com/writing/filipino.htm (Accessed on 4 September 2018) |
[223] | Omniglot, Chavacano https://www.omniglot.com/writing/chavacano.php (Accessed on 4 September 2018) |
[224] | Wikipedia, Ilocano language https://en.wikipedia.org/wiki/Ilocano_language#Modern_alphabet (Accessed on 4 September 2018) |
[225] | Omniglot, Quechua (Runasimi) https://www.omniglot.com/writing/quechua.htm (Accessed on 4 September 2018) |
[226] | Wikipedia, Quechua alphabet https://en.wikipedia.org/wiki/Quechua_alphabet (Accessed on 4 September 2018) |
[227] | Omniglot, Cape Verdean Creole (Kriolu) https://www.omniglot.com/writing/kriol.php (Accessed on 4 September 2018) |
[228] | Omniglot, Waray-Waray https://www.omniglot.com/writing/waray.php (Accessed on 4 September 2018) |
[229] | Omniglot, Lozi (siLozi) https://www.omniglot.com/writing/lozi.htm (Accessed on 4 September 2018) |
[230] | africanlanguages.com, Sesotho sa Leboa (Northern Sotho) https://africanlanguages.com/northern_sotho/ (Accessed on 4 September 2018) |
[232] | Wikipedia, Chechen language https://en.wikipedia.org/wiki/Chechen_language (Accessed on 4 September 2018) |
[233] | Omniglot, Hungarian (magyar) https://www.omniglot.com/writing/hungarian.htm (Accessed on 4 September 2018) |
[234] | Wikipedia, Hungarian alphabet https://en.wikipedia.org/wiki/Hungarian_alphabet (Accessed on 4 September 2018) |
[236] | Omniglot, Lingala https://www.omniglot.com/writing/lingala.htm (Accessed on 4 September 2018) |
[237] | Omniglot, Akan https://www.omniglot.com/writing/akan.htm (Accessed on 4 September 2018) |
[238] | Wikipedia, Mossi language https://en.wikipedia.org/wiki/Mossi_language (Accessed on 4 September 2018) |
[239] | SIL-Sudan, OCCASIONAL PAPERS in the study of SUDANESE LANGUAGES No. 9 (p. 75) https://www.sil.org/system/files/reapdata/10/06/46/100646256099282892829790816212446104791/OPSL_9.pdf (Accessed on 4 September 2018) |
[240] | Omniglot, Kanuri https://www.omniglot.com/writing/kanuri.htm (Accessed on 4 September 2018) |
[241] | Omniglot, Bugis (Basa Ugi ) https://www.omniglot.com/writing/bugis.htm (Accessed on 4 September 2018) |
[242] | Omniglot, Mizo (Mizo ṭawng) https://www.omniglot.com/writing/mizo.htm (Accessed on 4 September 2018) |
[243] | Omniglot, Miskito (Mískitu) https://www.omniglot.com/writing/miskito.htm (Accessed on 4 September 2018) |
[245] | Wikipedia, Papiamento https://en.wikipedia.org/wiki/Papiamento (Accessed on 4 September 2018) |
[246] | Omniglot, Papiamento (Papiamentu) https://www.omniglot.com/writing/papiamento.php (Accessed on 4 September 2018) |
[247] | Omniglot, Chichewa (Chicheŵa) https://www.omniglot.com/writing/chichewa.php (Accessed on 4 September 2018) |
[248] | Native Languages of the Americas website, Vocabulary in Native American Languages: Mam Words https://www.native-languages.org/mam_words.htm (Accessed on 4 September 2018) |
[249] | Omniglot, Mam (Qyol Mam) https://www.omniglot.com/writing/mam.htm (Accessed on 4 September 2018) |
[250] | Wikipedia, Pulaar language https://en.wikipedia.org/wiki/Pulaar_language (Accessed on 4 September 2018) |
[251] | Wikipedia, Fula language https://en.wikipedia.org/wiki/Fula_language#Writing_systems (Accessed on 4 September 2018) |
[252] | Wikipedia, Polish alphabet https://en.wikipedia.org/wiki/Polish_alphabet (Accessed on 4 September 2018) |
[253] | Wikipedia, French orthography https://en.wikipedia.org/wiki/French_orthography (Accessed on 4 September 2018) |
[254] | Omniglot, Yoruba (Èdè Yorùbá) https://www.omniglot.com/writing/yoruba.htm (Accessed on 4 September 2018) |
[255] | Omniglot, Esperanto https://www.omniglot.com/writing/esperanto.htm (Accessed on 4 September 2018) |
[256] | Omniglot, Welsh (Cymraeg) https://www.omniglot.com/writing/welsh.htm (Accessed on 4 September 2018) |
[257] | Wikipedia, List of Latin-script letters https://en.wikipedia.org/wiki/List_of_Latin-script_letters (Accessed on 4 September 2018) |
[258] | Omniglot, Montenegrin https://www.omniglot.com/writing/montenegrin.htm (Accessed on 20 March 2019) |
[275] | Omniglot, Shavante https://www.omniglot.com/writing/shavante.php (Accessed on 24 September 2020) |
[276] | Wikipedia, Malagasy Language https://en.wikipedia.org/wiki/Malagasy_language (Accessed on 24 September 2020) |
[277] | Wikipedia, Serer language, https://en.wikipedia.org/wiki/Serer_language, accessed on 6 April 2021 |
[278] | Wikipedia, Kpelle language, https://en.wikipedia.org/wiki/Kpelle_language, accessed on 6 April 2021 |