Overview

This file contains Label Generation Rules (LGR) for the Lao script for the Root zone. For more details on this LGR and its development, see "Proposal for a Lao Script Root Zone LGR [Proposal]". The format of this file follows [RFC 7940].

Repertoire

In addition to the 51 code points according to Section 5 “Repertoire” in [Proposal], the sequence 0EB2 0EB0 has been defined to facilitate implementation of WLE rule follows-vafter-context as a context rule. The repertoire only includes code points used by languages that are actively written in the Khmer script. The repertoire is based on [MSR-4], which is a subset of [Unicode 6.3].

Each code point or range is tagged with the script or scripts that the code point is used with, and one or more references documenting sufficient justification for inclusion in the repertoire, see "References" below.

Variants

According to Section 6, "Variants" in [Proposal], this LGR defines no variants.

Character Classes

Some consonants have been given the tag of Cf, which indicates final consonants. Other character classes that have been used are semi-consonant, tone-mark, vowel-above, vowel-before, vowel-below and vowel-after. See Section 5 of the [Proposal].

Whole Label Evaluation (WLE) and Context Rules

Default Whole Label Evaluation Rules and Actions

The LGR includes the set of required default WLE rules and actions applicable to the Root Zone and defined in [MSR-4]. They are marked with ⍟. The default prohibition on leading combining marks is equivalent to ensuring that a label only starts with a consonant or vowel-before.

Lao-specific Rules

Rules provided in the LGR as described in Section 7 of [Proposal] reasonably restrict labels so that they conform to Lao syllable structure. These constraints are presented exclusively as context rules.

The rules are:

follows-consonant — A context rule for semi-consonant. See Section 7 in [Proposal]. (WLE Rule 1)
precedes-consonant — A context rule for vowel-before. See Section 7 in [Proposal]. (WLE Rule 2)
follows-main-consonant — A context rule for vowel-below, and vowel-above. See Section 7 in [Proposal]. (WLE Rule 3)
follows-C-tonemark-vabove — A context rule for vowel-after. See Section 7 in [Proposal]. (WLE Rule 4)
follows-vbefore-consonant-cluster — A context rule for a vowel-after sequence. It incorporates consonant-cluster. See Section 7 in [Proposal]. (WLE Rule 5)
follows-C-vabove-vbelow — A context rule for tone mark. See Section 7 in [Proposal]. (WLE Rule 6)
follows-Cf — A context rule for U+0ECC (໌ ) LAO CANCELLATION MARK. See Section 7 in [Proposal]. (WLE Rule 7)
repetition-mark-limit.—. A rule that limits the occurrence of U+0EC6 ( ໆ ) LAO KO LA at the label end. See Section 7 in [Proposal]. (WLE Rule 8)

No context rules apply to “consonant” code points. For discussion, see Section 5.1 “Consonants” in [Proposal].

Methodology and Contributors

For methodology and contributors, see Sections 4 and 8 of [Proposal].

References

The following general references are cited in this document:

[MSR-4]: Integration Panel, "Maximal Starting Repertoire — MSR-4 Overview and Rationale", 7 February 2019, https://www.icann.org/en/system/files/files/msr-4-overview-25jan19-en.pdf
[Proposal]: Lao Generation Panel, "Proposal for Lao Script Root Zone LGR", https://www.icann.org/en/system/files/files/proposal-lao-lgr-31jan17-en.pdf
[RFC 7940]: Davies, K. and A. Freytag, "Representing Label Generation Rulesets Using XML", RFC 7940, August 2016, http://www.rfc-editor.org/info/rfc7940
[Unicode 6.3]: The Unicode Consortium. The Unicode Standard, Version 6.3.0, (Mountain View, CA: The Unicode Consortium, 2013. ISBN 978-1-936213-08-5) http://www.unicode.org/versions/Unicode6.3.0/

For references consulted particularly in designing the repertoire for the Lao script for the Root Zone please see details in the Table of References below. Reference [0] refers to Unicode Standard version in which corresponding code points were initially encoded. References [201], [202], [203], [204], 205], [206], & [207] correspond to sources justifying the inclusion of or classification for the corresponding code points. Single code points or ranges may have multiple source reference values.

]]> The Unicode Standard 1.1 Lao grammar book published by the Ministry of Education in 1967, see Appendix B, Figure 1 Lao grammar book published by the Ministry of Education in 1967, see Appendix B, Figure 2 Lao grammar book published by the Ministry of Education in 1967, see Appendix B, Figure 3 Lao grammar book published by the Ministry of Education in 2000, see Appendix B, Figure 4 Lao grammar book published by the Ministry of Education in 2000, see Appendix B, Figure 5 Lao grammar book published by the Ministry of Education in 2000, see Appendix B, Figure 6 Lao grammar 1935, see Appendix B, Figure 7