This document specifies a reference set of Label Generation Rules for Serbian using a limited repertoire as appropriate for a second level domain.
All references converge on 30 Cyrillic code points (23 +7 as defined by RFC 5992 [130]).
There is an IDN table published in the IANA Repository of IDN Practices for Serbian by .rs (Serbia cctld) in [700].There is another new Cyrillic TLD created in Serbia: .срб, it uses the same repertoire as .rs.
The CLDR auxiliary set includes all code points to support Russian, but this is not supported by other sources, even for an extended set. In addition, there is some use of 6 vowels with double grave and inverted breve in Serbian phonology and poetics, but this is not germane to IDNs.
Letters documented in some references but not included:
U+0449 CYRILLIC SMALL LETTER SHCHA
U+044A CYRILLIC SMALL LETTER HARD SIGN
U+044B CYRILLIC SMALL LETTER YERU
U+044C CYRILLIC SMALL LETTER SOFT SIGN
U+044D CYRILLIC SMALL LETTER E
U+044E CYRILLIC SMALL LETTER YU
U+044F CYRILLIC SMALL LETTER YA
U+0451 CYRILLIC SMALL LETTER IO
U+0430 030F CYRILLIC SMALL LETTER A WITH DOUBLE GRAVE ACCENT
U+0430 0311 CYRILLIC SMALL LETTER A WITH INVERTED BREVE
U+0435 030F CYRILLIC SMALL LETTER IE WITH DOUBLE GRAVE ACCENT
U+0435 0311 CYRILLIC SMALL LETTER IE WITH INVERTED BREVE
U+0438 030F CYRILLIC SMALL LETTER I WITH DOUBLE GRAVE ACCENT
U+0438 0311 CYRILLIC SMALL LETTER I WITH INVERTED BREVE
U+043E 030F CYRILLIC SMALL LETTER O WITH DOUBLE GRAVE ACCENT
U+043E 0311 CYRILLIC SMALL LETTER O WITH INVERTED BREVE
U+0440 030F CYRILLIC SMALL LETTER ER WITH DOUBLE GRAVE ACCENT
U+0440 0311 CYRILLIC SMALL LETTER ER WITH INVERTED BREVE
U+0443 030F CYRILLIC SMALL LETTER U WITH DOUBLE GRAVE ACCENT
U+0443 0311 CYRILLIC SMALL LETTER U WITH INVERTED BREVE
None.
None.
This LGR defines no named character classes.
Common rules only:
Hyphen Restrictions — restrictions on the allowable placement of hyphens (no leading/ending hyphen and no hyphen in positions 3 and 4). These restrictions are described in section 4.2.3.1 of RFC5891 [120]. They are implemented here as context rule on U+002D (-) HYPHEN-MINUS.
Leading Combining Marks — restrictions on the allowable placement of combining marks (no leading combining mark). This rule is described in section 4.2.3.2 of RFC5891 [120].
Actions included are the default actions for LGRs as well as those needed to invalidate labels with misplaced combining marks.
Variant-related actions included to facilitate integration as appropriate.
This reference LGR for Serbian for the 2nd Level has been developed by Michel Suignard and Asmus Freytag, verified in expert reviews by Michael Everson, Nicholas Ostler, and Wil Tan, and based on multiple open public consultations.
Language tag has been updated.
General reference for the language:
In the listing of the repertoire by code point, references starting from [0] refer to the version of the Unicode Standard in which the corresponding code point was initially encoded. Other references, (starting from [100]) document usage of code points. For more details, see the Table of References below.
]]>