<?xml version='1.0' encoding='utf-8'?>
<lgr xmlns="urn:ietf:params:xml:ns:lgr-1.0">
  <meta>
    <version comment="Proposal LGR for Devanagari">3.0</version>
    <date>2018-04-22</date>
    <unicode-version>6.3.0</unicode-version>
    <language>und-Deva</language>
    <scope type="domain">.</scope>
    <description type="text/html"><![CDATA[
    <h1>Label Generation Rules for Devanagari script</h1>
    
    <h2>Overview</h2>
    <p>This file contains Label Generation Rules (LGR) for the Devanagari script as would be appropriate for the 
    Root zone. For more details on this proposal, see &ldquo;Proposal for a Devanagari Script Root Zone Label Generation 
    Rule-Set (LGR)&rdquo; [Proposal]. The format of this file follows [RFC 7940].</p>
    
    <h2>Repertoire</h2>
    <p>The Root Zone LGR for the Devanagari script lists 83 unique code points in addition to 27 sequences, bringing the total repertoire entries to 110 . The two sequences U+0931 U+094D U+092F (ऱ्य) and U+0931 U+094D U+0939 (ऱ्ह) limit the character U+0931 (DEVANAGARI LETTER RRA) in its own specific context beyond which it does not stand by itself. Accordingly, while U+0931 (ऱ) is not listed by itself, it brings the total of distinct code points to 84.</p>
    
    <p>A number of other sequences have been defined in connection with the definition of variants (see "Variants" below).</p>

     <p>The repertoire includes code points used by languages written in Devanagari that
     fall within [EGIDS] scale 1 to 4. Boro, Braj, Dhundari, Mundari, Kharia  have also been additionally covered. 
     Though listed in EGIDS scale 4, Saraiki is not covered, because the Devanagari 
     script is &ldquo;no longer in use&rdquo; by the Saraiki community. For more details, see Section 5 &ldquo;Repertoire&rdquo; in [Proposal]). A non-exhaustive list of languages using each code point can be found in the comments.</p>

    <p>The repertoire is based on [MSR-4], which is a subset of Unicode 6.3 [Unicode 6.3].    </p>

    <h2>Variants</h2>
    <p>According to Section 6 &ldquo;Variants&rdquo;, in [Proposal], this LGR defines variants which are
    &ldquo;Confusing due to deviation from normally perceived character formations by the larger linguistic 
    community&rdquo; These cases are not of mere visual similarity as they involve some deviations from 
    the widely accepted norms of Devanagari Akshar formations. These can cause confusion even to a
    careful observer and are hence proposed as variants. They fall into three broad categories:</p>
    <ul>
    <li> Vowel/Vowel sign followed by Nukta</li>
    <li> Unique Vowels and Vowel Signs required for Kashmiri</li>
	<li> Variants based on Candrabindu and Candra Vowel Signs followed by Anusvara</li>
    </ul>
    <p>Variant Disposition: All variants are of type &ldquo;blocked&rdquo;, making labels that 
    differ only by these variants mutually exclusive: whichever label containing either of 
    these variants is chosen earlier would be delegated, while any other equivalent label should be blocked.</p>
    
    <p>In addition to these, cross-script variant analysis of Devanagari has been carried out by the NBGP. Possible cross-script
    variant cases were found with the Gurmukhi and Bengali script and have been mentioned in  Appendix 1 of the [Proposal].</p>

    <p>Context Rules for Variants: some of the variants defined in this LGR are "effective null variants", that is,
    both some  code points in the source map to "nothing" in the target with all other code points unchanged. 
    (Because mappings are symmetric, it does not matter whether it is the forward or reverse mapping that 
    maps to "null"). Such variants require a context rule to keep the variant well behaved. Symmetry requires 
    the same context rule for both forward and reverse mappings.</p>
    
    <p>In other cases, the sequences or code points making up source and target are constrained by explicit context 
    rules on the code points (or by implicit context rules defined for the adjacent code points). In such a case, any 
    variants may require context rules that match the intersection between the effective contexts for both source and 
    target; otherwise, a sequence might be considered valid in some variant label when it would not be valid in an 
    equivalent context in an original label.</p> 

      
    <h2>Character Classes</h2>
    <p>Devanagari is an alphasyllabary and the heart of the writing system is 
    the <em>akshar</em>. It is this unit, which is instinctively recognized by users of the script.
    The writing system of Devanagari could be summed up as composed of Consonants,  Halant, Vowels, Anusvara, Candrabindu, Nukta and Visarga.</p>
    
    <p>Consonants: Devanagari consonants all contain an implicit schwa /&#x0259;/. 
    To make a full  syllable, consonants may be followed by certain code points from one or more 
    of the  other groups (see &ldquo;WLE rules&rdquo; below). See Section &ldquo;3.3.1 The Consonants&rdquo; of the [Proposal].</p>
    
    <p>Halant: All consonants contain an implicit vowel (schwa).
    A special sign is needed to denote that this implicit vowel is stripped off. 
    This is known as the Halant (U+094D). The Halant thus joins two consonants and 
    creates conjuncts, which can be generally from 2 to 4 consonant combinations. 
    In rare cases, it can join up to 5 consonants. However, this LGR will not enforce any length limit.

    See section 3.3.2 &ldquo;The Implicit Vowel Killer: Halant&rdquo; in [Proposal].</p>

    <p>Vowels: There are separate code points for vowels that are pronounced 
    independently  at the beginning of a syllable or after a vowel sound. To indicate 
    a Vowel sound following a consonant other than the implicit shwa sound, a vowel sign (Matra) is 
    attached to the consonant. There is an equivalent Matra for each vowel excepting the U+0905.  
    See Section &ldquo;3.3.3 Vowels&rdquo; of the [Proposal] </p>
    
    <p>Anusvara : The Anusvara shows a nasal at  the end of a syllable. See Section &ldquo;3.3.4 The Anusvara&rdquo; of the [Proposal].</p>
    
    <p>Candrabindu : A Candrabindu denotes nasalization of the preceding vowel. Present-day 
    Hindi users tend to replace the Candrabindu by the Anusvara. 
    See Section &ldquo;3.3.5 Nasalization: Candrabindu&rdquo; of the [Proposal].</p>
    
    <p>Nukta : The nukta sign is placed below a certain number of consonants 
    to represent sounds found only in words borrowed from Perso-Arabic, English and other non-Aryan sources. 
    It is also placed under U+0921 and U+0922 to indicate 
    flapped sounds. Apart from this, Santali language uses Nukta adjoined to certain vowels and vowel signs. 
    See Section &ldquo;3.3.6 Nukta&rdquo; of the [Proposal].</p>

    <p>Visarga: The Visarga (U+0903), representing an aspiration at the end of a syllable, is 
    frequently used in Sanskrit. See Section &ldquo;3.3.7 Visarga and Avagraha&rdquo; of the [Proposal].</p>

      
    <h2>Whole Label Evaluation (WLE) rules</h2>
    <h3>Default Whole Label Evaluation Rules</h3>
    <p>The LGR includes the set of required default WLE rules and actions applicable to 
        the Root Zone and defined in [MSR-4]. They are marked with &#x235F;.</p> 
        
    <h3>Devanagari specific Rules</h3>
    <p>These rules ensure that the Devanagari label conforms
    to akshar formation norms for the Devanagari script.  These norms are exclusively 
    presented as context rules.</p>
    
    <p>The following symbols are used in the names and comments for WLE rules: </p>
    <ul>
      <li>C    &#x2192;    Consonant</li>
      <li>M    &#x2192;    Matra</li>
      <li>V    &#x2192;    Vowel</li>
      <li>B    &#x2192;    Anusvara (Bindu)</li> 
      <li>D    &#x2192;    Candrabindu</li>
      <li>X    &#x2192;    Visarga</li>
      <li>H    &#x2192;    Halant / Virama</li>
      <li>N    &#x2192;    Nukta</li>
        <li>S    &#x2192;    Eyelash Reph (C2HC3) where:
        <ul> 
          <li>C2 is U+0931 DEVANAGARI LETTER RRA</li>
          <li>H is U+094D DEVANAGARI SIGN VIRAMA</li>
          <li>C3 is either
                <ul>
                <li>U+092F DEVANAGARI LETTER YA</li>
                <li>or U+0939 DEVANAGARI LETTER HA</p></li>
                </ul>
          </li>
        </ul>
        </li></ul>
    
    <p>The rules are: </p>
     <ul>
     <li>1.    N: must be preceded only by either of C1, V1 and M1</li>
     <li>2.    H: must be preceded by C or CN</li>
     <li>3.    M: must be preceded by C or CN</li>
     <li>4.    X: must be preceded by either of V, C, N or M</li>
     <li>5.    B: must be preceded by either of V, C, N or M </li>
     <li>6.    D: must be preceded by either of V, C, N or M </li>
     <li>7.    V: Can NOT be preceded by H </li>
     </ul>
    <p>Additional rules are used only for variants where a Nukta maps to a "null", or that are overlapped:</p>
    <ul>
        <li>Variant is not defined if followed by a Nukta</li>
        <li>Variant undefined if it is not followed by V or C (including RRA) or end of label</li>
    </ul>
    
     <p>See Section &ldquo;7 Whole Label Evaluation Rules (WLE)&rdquo; of the [Proposal].</p>
    
    <h2>Overall Development Process and Methodology</h2>
    <p>Under the Neo-Brahmi Generation Panel, there are many different scripts 
    belonging to separate Unicode blocks. Each of these scripts has been assigned 
    a separate LGR; however, Neo-Brahmi GP ensured that the fundamental philosophy 
    behind building those LGRs are all in sync with all other Brahmi derived 
    scripts. This is the Devanagari LGR, which caters to multiple languages written 
    using Devanagari belonging to EGIDS scale 1 to 4.</p>
    
    <p>For additional details and contributors, see Sections 4 and  8 of the [Proposal].</p>
    
    <h2>References</h2> 
    <p>References [0] to [11] refer to the Unicode Standard versions in which corresponding code 
    points were initially encoded. Reference [100] and up correspond to sources given 
    in [Proposal] for justifying the inclusion of for the corresponding code points. 
    Single code point or ranges may have multiple source reference values.</p>
    
    <p>In addition, the following references are cited in this document:</p>
    <dl class="references">

     <dt>[MSR-4]</dt>
     <dd>Integration Panel, "Maximal Starting Repertoire — MSR-4 Overview and Rationale", 
	 7 February 2019 https://www.icann.org/en/system/files/files/msr-4-overview-25jan19-en.pdf</dd>

     <dt>[Proposal]</dt> 
     <dd>Neo-Brahmi Generation Panel, “Proposal for a Devanagari Script Root Zone Label 
     Generation Rule-Set (LGR)”, 22 April 2019, https://www.icann.org/en/system/files/files/proposal-devanagari-lgr-22apr19-en.pdf</dd>

     <dt>[RFC 7940]</dt>
     <dd>Davies, K. and A. Freytag, "Representing Label Generation Rulesets Using XML", 
     RFC 7940, August 2016, http://www.rfc-editor.org/info/rfc7940. </dd> 

     <dt>[EGIDS]</dt>
     <dd>Expanded Graded Intergenerational Disruption Scale, https://www.ethnologue.com/about/language-status (Accessed on 13th Nov. 2017)</dd>
     
     <dt>[Unicode 6.3]</dt>
     <dd>The Unicode Consortium. The Unicode Standard, Version 6.3.0, (Mountain View, CA: The Unicode Consortium, 2013. ISBN 978-1-936213-08-5) 
     http://www.unicode.org/versions/Unicode6.3.0/</dd>
    </dl>

    <p>For more details for references [100] and up and [0] and up refer to the Table of References below.</p>
]]></description>
    <references>
      <reference comment="Any code point originally encoded in Unicode 1.1" id="0">The Unicode Standard 1.1</reference>
      <reference comment="Any code point originally encoded in Unicode 5.0" id="8">The Unicode Standard 5.0</reference>
      <reference comment="Any code point originally encoded in Unicode 5.1" id="9">The Unicode Standard 5.1</reference>
      <reference comment="Any code point originally encoded in Unicode 6.0" id="11">The Unicode Standard 6.0</reference>
      <reference id="100">Devanāgarī VIP Team, "Variant Issues Report", ICANN 3rd Oct. 2011, https://archive.icann.org/en/topics/new-gtlds/devanagari-vip-issues-report-03oct11-en.pdf (Accessed on 10th Oct. 2017)</reference>
      <reference id="101">Omniglot, "Hindi", https://www.omniglot.com/writing/hindi.htm (Accessed on 10th Oct. 2017)</reference>
      <reference id="102">Omniglot, "Marathi", https://www.omniglot.com/writing/marathi.htm (Accessed on 10th Oct. 2017)</reference>
      <reference id="103">Omniglot, "Sanskrit", https://www.omniglot.com/writing/sanskrit.htm (Accessed on 10th Oct. 2017)</reference>
      <reference id="104">Omniglot, "Sindhi", https://www.omniglot.com/writing/sindhi.htm (Accessed on 10th Oct. 2017)</reference>
      <reference id="105">Omniglot, "Kashmiri", https://www.omniglot.com/writing/kashmiri.htm (Accessed on 10th Oct. 2017)</reference>
      <reference id="106">Unicode 10.0.0, "South and Central Asia-I - Official Scripts of India", Page 456 (R5 and R5a), http://www.unicode.org/versions/Unicode10.0.0/ch12.pdf (Accessed on 13th Nov. 2017)</reference>
      <reference id="107">Unicode Indic Group, "Devanagari Eyelash Ra", http://unicode.org/~emuller/iwg/p8/utcdoc.html (Accessed on 13th Nov. 2017)</reference>
      <reference id="108">M.K. Raina, "How to read and write Kashmiri in Devanagari?", http://www.koshur.org/pdf/Let%20Us%20Learn%20Kashmiri.pdf (Accessed on 12th Dec. 2017)</reference>
      <reference id="109">Central Hindi Directorate-Ministry of HRD-Govt. of India, "Devanāgarī Alphabet and its Romanization", http://hindinideshalaya.nic.in/english/hindi_orgin/devnagarithesysmbols.html (Accessed on 12th Dec. 2017</reference>
      <reference id="110">Omniglot, "Bodo", https://www.omniglot.com/writing/bodo.htm (Accessed on 12th Dec. 2017)</reference>
      <reference id="111">Omniglot, "Maithili", https://www.omniglot.com/writing/maithili.htm (Accessed on 12th Dec. 2017)</reference>
      <reference id="112">Omniglot, "Konkani", https://www.omniglot.com/writing/konkani.htm (Accessed on 20th May. 2018)</reference>
      <reference id="113">Omniglot, "Nepali", https://www.omniglot.com/writing/nepali.htm (Accessed on 20th May. 2018)</reference>
    </references>
  </meta>
  <data>
    <char comment="DEVANAGARI SIGN CANDRABINDU: Bodo, Hindi, Kashmiri, Konkani, Maithili, Marathi, Nepali, Santali, Sanskrit" cp="0901" ref="0 101 102 103 105 108 110 111 112 113" tag="candrabindu" when="follows-only-V-or-C-or-N-or-M">
      <var cp="0945 0902" type="blocked" when="follows-only-C-or-CN"/>
    </char>
    <char comment="DEVANAGARI SIGN ANUSVARA: Most of the languages given in section ‎3.2 of the [Proposal]." cp="0902" ref="0 101 102 103 113" tag="anusvara" when="follows-only-V-or-C-or-N-or-M">
      <var comment="Devanagari variant" cp="093A" type="blocked"/>
      <var comment="cross-script homoglyph" cp="0A02" type="blocked"/>
    </char>
    <char comment="DEVANAGARI SIGN VISARGA: Most of the languages given in section ‎3.2 of the [Proposal]." cp="0903" ref="0 101 102 103 113" tag="visarga" when="follows-only-V-or-C-or-N-or-M"/>
    <char comment="DEVANAGARI LETTER A: Most of the languages given in section ‎3.2 of the [Proposal]." cp="0905" not-when="preceded-by-H" ref="0 101 102 103 104 113" tag="vowel"/>
    <char cp="0905 0901" not-when="preceded-by-H">
      <var cp="0972 0902" type="blocked"/>
    </char>
    <char comment="DEVANAGARI LETTER A + DEVANAGARI SIGN ANUSVARA" cp="0905 0902" not-when="preceded-by-H" ref="0 105 108">
      <var comment="Devanagari variant" cp="0973" type="blocked"/>
    </char>
    <char comment="DEVANAGARI LETTER AA: Most of the languages given in section ‎3.2 of the [Proposal]." cp="0906" not-when="preceded-by-H" ref="0 101 102 103 104 113" tag="vowel V1">
      <var comment="Devanagari variant" cp="0906 093C" not-when="followed-by-N" type="blocked"/>
    </char>
    <char cp="0906 0901" not-when="preceded-by-H">
      <var cp="0906 093C 0901" type="blocked"/>
      <var cp="0911 0902" type="blocked"/>
    </char>
    <char comment="DEVANAGARI LETTER AA + DEVANAGARI SIGN ANUSVARA" cp="0906 0902" not-when="preceded-by-H" ref="0 105 108">
      <var cp="0906 093C 0902" type="blocked" when="followed-by-V-C-or-end"/>
      <var comment="Devanagari variant" cp="0974" type="blocked" when="followed-by-V-C-or-end"/>
    </char>
    <char comment="DEVANAGARI LETTER AA + DEVANAGARI SIGN NUKTA" cp="0906 093C" not-when="preceded-by-H">
      <var comment="Devanagari variant" cp="0906" not-when="followed-by-N" type="blocked"/>
    </char>
    <char cp="0906 093C 0901" not-when="preceded-by-H">
      <var cp="0906 0901" type="blocked"/>
      <var cp="0911 0902" type="blocked"/>
    </char>
    <char cp="0906 093C 0902" not-when="preceded-by-H">
      <var cp="0906 0902" type="blocked" when="followed-by-V-C-or-end"/>
      <var cp="0974" type="blocked" when="followed-by-V-C-or-end"/>
    </char>
    <char comment="DEVANAGARI LETTER I: Most of the languages given in section ‎3.2 of the [Proposal]." cp="0907" not-when="preceded-by-H" ref="0 101 102 103 104 113" tag="vowel">
      <var comment="cross-script homoglyph" cp="0A19" type="blocked"/>
    </char>
    <char comment="DEVANAGARI LETTER II: Most of the languages given in section ‎3.2 of the [Proposal]." cp="0908" not-when="preceded-by-H" ref="0 101 102 103 104 113" tag="vowel"/>
    <char comment="DEVANAGARI LETTER U: Most of the languages given in section ‎3.2 of the [Proposal]." cp="0909" not-when="preceded-by-H" ref="0 101 102 103 104 113" tag="vowel">
      <var comment="cross-script homoglyph" cp="0A24" type="blocked"/>
    </char>
    <char comment="DEVANAGARI LETTER UU: Most of the languages given in section ‎3.2 of the [Proposal]." cp="090A" not-when="preceded-by-H" ref="0 101 102 103 104 113" tag="vowel"/>
    <char comment="DEVANAGARI LETTER VOCALIC R: Hindi, Marathi, Sanskrit" cp="090B" not-when="preceded-by-H" ref="0 101 102 103" tag="vowel"/>
    <char comment="DEVANAGARI LETTER CANDRA E: Hindi" cp="090D" not-when="preceded-by-H" ref="0 101" tag="vowel"/>
    <char cp="090D 0902" not-when="preceded-by-H">
      <var cp="090F 0901" type="blocked"/>
    </char>
    <char comment="DEVANAGARI LETTER SHORT E: Kashmiri" cp="090E" not-when="preceded-by-H" ref="0 105 108" tag="vowel">
      <var comment="Devanagari variant" cp="0910" type="blocked"/>
    </char>
    <char comment="DEVANAGARI LETTER E: Most of the languages given in section ‎3.2 of the [Proposal]." cp="090F" not-when="preceded-by-H" ref="0 101 102 103 104 105 108 113" tag="vowel"/>
    <char cp="090F 0901" not-when="preceded-by-H">
      <var cp="090D 0902" type="blocked"/>
    </char>
    <char comment="DEVANAGARI LETTER AI: Most of the languages given in section ‎3.2 of the [Proposal]." cp="0910" not-when="preceded-by-H" ref="0 101 102 103 104 105 108 113" tag="vowel">
      <var cp="090E" type="blocked"/>
    </char>
    <char comment="DEVANAGARI LETTER CANDRA O: Hindi, Kashmiri, Konkani, Marathi, Kashmiri" cp="0911" not-when="preceded-by-H" ref="0 100 101 102 108 112" tag="vowel"/>
    <char cp="0911 0902" not-when="preceded-by-H">
      <var cp="0906 0901" type="blocked"/>
      <var cp="0906 093C 0901" type="blocked"/>
    </char>
    <char comment="DEVANAGARI LETTER SHORT O: Kashmiri" cp="0912" not-when="preceded-by-H" ref="0 105 108" tag="vowel"/>
    <char comment="DEVANAGARI LETTER O: Most of the languages given in section ‎3.2 of the [Proposal]." cp="0913" not-when="preceded-by-H" ref="0 101 102 103 104 105 108 113" tag="vowel V1">
      <var comment="Devanagari variant" cp="0913 093C" not-when="followed-by-N" type="blocked"/>
    </char>
    <char comment="DEVANAGARI LETTER O + DEVANAGARI SIGN NUKTA" cp="0913 093C" not-when="preceded-by-H">
      <var comment="Devanagari variant" cp="0913" not-when="followed-by-N" type="blocked"/>
    </char>
    <char comment="DEVANAGARI LETTER AU: Most of the languages given in section ‎3.2 of the [Proposal]." cp="0914" not-when="preceded-by-H" ref="0 101 102 103 104 105 108 113" tag="vowel">
      <var comment="Devanagari variant" cp="0975" type="blocked"/>
    </char>
    <char comment="DEVANAGARI LETTER KA: Most of the languages given in section ‎3.2 of the [Proposal]." cp="0915" ref="0 101 102 103 104 105 108 113" tag="consonant C1"/>
    <char comment="DEVANAGARI LETTER KHA: Most of the languages given in section ‎3.2 of the [Proposal]." cp="0916" ref="0 101 102 103 104 105 108 113" tag="consonant C1"/>
    <char comment="DEVANAGARI LETTER GA: Most of the languages given in section ‎3.2 of the [Proposal]." cp="0917" ref="0 101 102 103 104 105 108 113" tag="consonant C1">
      <var comment="cross-script homoglyph" cp="0A17" type="blocked"/>
    </char>
    <char comment="DEVANAGARI LETTER GHA: Most of the languages given in section ‎3.2 of the [Proposal]." cp="0918" ref="0 101 102 103 104 113" tag="consonant">
      <var comment="cross-script homoglyph" cp="0A2C" type="blocked"/>
    </char>
    <char comment="DEVANAGARI LETTER NGA: Most of the languages given in section ‎3.2 of the [Proposal]." cp="0919" ref="0 101 102 103 113" tag="consonant"/>
    <char comment="DEVANAGARI LETTER CA: Most of the languages given in section ‎3.2 of the [Proposal]." cp="091A" ref="0 101 102 103 104 105 108 113" tag="consonant C1"/>
    <char comment="DEVANAGARI LETTER CHA: Most of the languages given in section ‎3.2 of the [Proposal]." cp="091B" ref="0 101 102 103 104 105 108 113" tag="consonant C1"/>
    <char comment="DEVANAGARI LETTER JA: Most of the languages given in section ‎3.2 of the [Proposal]." cp="091C" ref="0 101 102 103 104 105 108 113" tag="consonant C1"/>
    <char comment="DEVANAGARI LETTER JHA: Most of the languages given in section ‎3.2 of the [Proposal]." cp="091D" ref="0 101 102 103 104 113" tag="consonant"/>
    <char comment="DEVANAGARI LETTER NYA: Most of the languages given in section ‎3.2 of the [Proposal]." cp="091E" ref="0 101 102 103 113" tag="consonant"/>
    <char comment="DEVANAGARI LETTER TTA: Most of the languages given in section ‎3.2 of the [Proposal]." cp="091F" ref="0 101 102 103 104 105 108 113" tag="consonant">
      <var comment="cross-script homoglyph" cp="0A1F" type="blocked"/>
    </char>
    <char comment="DEVANAGARI LETTER TTHA: Most of the languages given in section ‎3.2 of the [Proposal]." cp="0920" ref="0 101 102 103 104 105 108 113" tag="consonant">
      <var comment="cross-script homoglyph" cp="0A20" type="blocked"/>
    </char>
    <char comment="DEVANAGARI LETTER DDA: Most of the languages given in section ‎3.2 of the [Proposal]." cp="0921" ref="0 101 102 103 104 105 108 113" tag="consonant C1"/>
    <char comment="DEVANAGARI LETTER DDHA: Most of the languages given in section ‎3.2 of the [Proposal]." cp="0922" ref="0 101 102 103 104 113" tag="consonant C1">
      <var comment="cross-script homoglyph" cp="0A2B" type="blocked"/>
    </char>
    <char comment="DEVANAGARI LETTER NNA: Most of the languages given in section ‎3.2 of the [Proposal]." cp="0923" ref="0 101 102 103 104 113" tag="consonant"/>
    <char comment="DEVANAGARI LETTER TA: Most of the languages given in section ‎3.2 of the [Proposal]." cp="0924" ref="0 101 102 103 104 105 108 113" tag="consonant"/>
    <char comment="DEVANAGARI LETTER TA + DEVANAGARI SIGN VIRAMA + DEVANAGARI LETTER TA: Sequence has cross-script variant" cp="0924 094D 0924">
      <var comment="cross-script homoglyph" cp="0A1C" type="blocked"/>
    </char>
    <char comment="DEVANAGARI LETTER THA: Most of the languages given in section ‎3.2 of the [Proposal]." cp="0925" ref="0 101 102 103 104 105 108 113" tag="consonant"/>
    <char comment="DEVANAGARI LETTER DA: Most of the languages given in section ‎3.2 of the [Proposal]." cp="0926" ref="0 101 102 103 104 105 108 113" tag="consonant"/>
    <char comment="DEVANAGARI LETTER DHA: Most of the languages given in section ‎3.2 of the [Proposal]." cp="0927" ref="0 101 102 103 104 105 108 113" tag="consonant"/>
    <char comment="DEVANAGARI LETTER NA: Most of the languages given in section ‎3.2 of the [Proposal]." cp="0928" ref="0 101 102 103 104 105 108 113" tag="consonant"/>
    <char comment="DEVANAGARI LETTER PA: Most of the languages given in section ‎3.2 of the [Proposal]." cp="092A" ref="0 101 102 103 104 105 108 113" tag="consonant">
      <var comment="cross-script homoglyph" cp="0A27" type="blocked"/>
    </char>
    <char comment="DEVANAGARI LETTER PA + DEVANAGARI SIGN VIRAMA + DEVANAGARI LETTER TTA + DEVANAGARI VOWEL SIGN I, Sequence has cross-script variant" cp="092A 094D 091F 093F">
      <var comment="cross-script homoglyph" cp="0A07" type="blocked"/>
    </char>
    <char comment="DEVANAGARI LETTER PA + DEVANAGARI SIGN VIRAMA + DEVANAGARI LETTER TTA + DEVANAGARI VOWEL SIGN II, Sequence has cross-script variant" cp="092A 094D 091F 0940">
      <var comment="cross-script homoglyph" cp="0A08" type="blocked"/>
    </char>
    <char comment="DEVANAGARI LETTER PA + DEVANAGARI SIGN VIRAMA + DEVANAGARI LETTER TTA + DEVANAGARI VOWEL SIGN E, Sequence has cross-script variant" cp="092A 094D 091F 0947">
      <var cp="092A 094D 091F 0946" type="blocked"/>
      <var comment="cross-script homoglyph" cp="0A0F" type="blocked"/>
    </char>
    <char cp="092A 094D 091F 0946">
      <var cp="092A 094D 091F 0947" type="blocked"/>
      <var cp="0A0F" type="blocked"/>
    </char>
    <char comment="DEVANAGARI LETTER PHA: Most of the languages given in section ‎3.2 of the [Proposal]." cp="092B" ref="0 101 102 103 104 105 108 113" tag="consonant C1"/>
    <char comment="DEVANAGARI LETTER BA: Most of the languages given in section ‎3.2 of the [Proposal]." cp="092C" ref="0 101 102 103 104 105 108 113" tag="consonant"/>
    <char comment="DEVANAGARI LETTER BHA: Most of the languages given in section ‎3.2 of the [Proposal]." cp="092D" ref="0 101 102 103 104 105 108 113" tag="consonant">
      <var comment="cross-script homoglyph" cp="0A2E" type="blocked"/>
    </char>
    <char comment="DEVANAGARI LETTER MA: Most of the languages given in section ‎3.2 of the [Proposal]." cp="092E" ref="0 101 102 103 104 105 108 113" tag="consonant">
      <var comment="cross-script homoglyph" cp="09AE" type="blocked"/>
      <var comment="cross-script homoglyph" cp="0A38" type="blocked"/>
    </char>
    <char comment="DEVANAGARI LETTER YA: Most of the languages given in section ‎3.2 of the [Proposal]." cp="092F" ref="0 101 102 103 104 105 108 113" tag="consonant"/>
    <char comment="DEVANAGARI LETTER RA: Most of the languages given in section ‎3.2 of the [Proposal]." cp="0930" ref="0 101 102 103 104 105 108 113" tag="consonant"/>
    <char comment="Section 7, Case of Eyelash Reph: To enable Eyelash Reph Formation with Rra+Halanta+Ya and prevent stand-alone RRA from occuring; Konkani, Marathi, Nepali" cp="0931 094D 092F" ref="0 102 106 107 112 113"/>
    <char comment="Section 7, Case of Eyelash Reph: To enable Eyelash Reph Formation with Rra+Halanta+Ha and prevent stand-alone RRA from occuring; Konkani, Marathi, Nepali" cp="0931 094D 0939" ref="0 102 106 107 112 113"/>
    <char comment="DEVANAGARI LETTER LA: Most of the languages given in section ‎3.2 of the [Proposal]." cp="0932" ref="0 101 102 103 104 105 108 113" tag="consonant"/>
    <char comment="DEVANAGARI LETTER LLA: Bodo, Konkani, Marathi, Nepali, Sanskrit" cp="0933" ref="0 102 103 110 112 113" tag="consonant"/>
    <char comment="DEVANAGARI LETTER VA: Most of the languages given in section ‎3.2 of the [Proposal]." cp="0935" ref="0 101 102 103 104 105 108 113" tag="consonant">
      <var comment="cross-script homoglyph" cp="0A15" type="blocked"/>
    </char>
    <char comment="DEVANAGARI LETTER SHA: Most of the languages given in section ‎3.2 of the [Proposal]." cp="0936" ref="0 101 102 103 104 105 108 113" tag="consonant"/>
    <char comment="DEVANAGARI LETTER SSA: Most of the languages given in section ‎3.2 of the [Proposal]." cp="0937" ref="0 101 102 103 104 113" tag="consonant"/>
    <char comment="DEVANAGARI LETTER SA: Most of the languages given in section ‎3.2 of the [Proposal]." cp="0938" ref="0 101 102 103 104 105 108 113" tag="consonant"/>
    <char comment="DEVANAGARI LETTER HA: Most of the languages given in section ‎3.2 of the [Proposal]." cp="0939" ref="0 101 102 103 104 105 108 113" tag="consonant">
      <var comment="cross-script homoglyph" cp="0A35" type="blocked"/>
    </char>
    <char comment="DEVANAGARI VOWEL SIGN OE: Kashmiri" cp="093A" ref="11 105 108" tag="matra" when="follows-only-C-or-CN">
      <var comment="Devanagari variant" cp="0902" type="blocked"/>
      <var comment="cross-script homoglyph" cp="0A02" type="blocked"/>
    </char>
    <char comment="DEVANAGARI VOWEL SIGN OOE: Kashmiri" cp="093B" ref="11 105 108" tag="matra" when="follows-only-C-or-CN">
      <var comment="Devanagari variant" cp="093E 0902" type="blocked" when="followed-by-V-C-or-end"/>
      <var cp="093E 093C 0902" type="blocked" when="followed-by-V-C-or-end"/>
    </char>
    <char comment="DEVANAGARI SIGN NUKTA: Bodo, Hindi, Kashmiri, Maithili, Santali, Sindhi" cp="093C" ref="0 101 105 108 109 110 111" tag="nukta" when="follows-either-C1-V1-or-M1">
      <var comment="cross-script homoglyph" cp="0A3C" type="blocked"/>
    </char>
    <char comment="DEVANAGARI VOWEL SIGN AA: Most of the languages given in section ‎3.2 of the [Proposal]." cp="093E" ref="0 101 102 103 113" tag="matra M1" when="follows-only-C-or-CN">
      <var comment="Devanagari variant" cp="093E 093C" not-when="followed-by-N" type="blocked"/>
    </char>
    <char comment="DEVANAGARI VOWEL SIGN AA + DEVANAGARI SIGN NUKTA" cp="093E 093C" when="follows-only-C-or-CN">
      <var comment="Devanagari variant" cp="093E" not-when="followed-by-N" type="blocked"/>
    </char>
    <char cp="093E 0901" when="follows-only-C-or-CN">
      <var cp="093E 093C 0901" type="blocked"/>
      <var cp="0949 0902" type="blocked"/>
    </char>
    <char comment="DEVANAGARI VOWEL SIGN AA + DEVANAGARI SIGN ANUSVARA" cp="093E 0902" when="follows-only-C-or-CN">
      <var comment="Devanagari variant" cp="093B" type="blocked" when="followed-by-V-C-or-end"/>
      <var cp="093E 093C 0902" type="blocked" when="followed-by-V-C-or-end"/>
    </char>
    <char cp="093E 093C 0901" when="follows-only-C-or-CN">
      <var cp="093E 0901" type="blocked"/>
      <var cp="0949 0902" type="blocked"/>
    </char>
    <char cp="093E 093C 0902" when="follows-only-C-or-CN">
      <var cp="093B" type="blocked" when="followed-by-V-C-or-end"/>
      <var cp="093E 0902" type="blocked" when="followed-by-V-C-or-end"/>
    </char>
    <char comment="DEVANAGARI VOWEL SIGN I: Most of the languages given in section ‎3.2 of the [Proposal]." cp="093F" ref="0 101 102 103 113" tag="matra" when="follows-only-C-or-CN">
      <var comment="cross-script homoglyph" cp="09BF" type="blocked"/>
      <var comment="cross-script homoglyph" cp="0A3F" type="blocked"/>
    </char>
    <char comment="DEVANAGARI VOWEL SIGN II: Most of the languages given in section ‎3.2 of the [Proposal]." cp="0940" ref="0 101 102 103 113" tag="matra" when="follows-only-C-or-CN">
      <var comment="cross-script homoglyph" cp="0A40" type="blocked"/>
    </char>
    <char comment="DEVANAGARI VOWEL SIGN U: Most of the languages given in section ‎3.2 of the [Proposal]." cp="0941" ref="0 101 102 103 113" tag="matra" when="follows-only-C-or-CN"/>
    <char comment="DEVANAGARI VOWEL SIGN UU: Most of the languages given in section ‎3.2 of the [Proposal]." cp="0942" ref="0 101 102 103 113" tag="matra" when="follows-only-C-or-CN"/>
    <char comment="DEVANAGARI VOWEL SIGN VOCALIC R: Hindi, Marathi, Sanskrit" cp="0943" ref="0 101 102 103" tag="matra" when="follows-only-C-or-CN"/>
    <char comment="DEVANAGARI VOWEL SIGN CANDRA E = candra; Hindi, Kashmiri, Konkani, Marathi, Sanskrit" cp="0945" ref="0 100 101 108" tag="matra" when="follows-only-C-or-CN">
      <var comment="cross-script homoglyph" cp="0A71" type="blocked"/>
    </char>
    <char cp="0945 0902" when="follows-only-C-or-CN">
      <var cp="0901" type="blocked" when="follows-only-C-or-CN"/>
    </char>
    <char comment="DEVANAGARI VOWEL SIGN SHORT E: Kashmiri" cp="0946" ref="0 105 108" tag="matra" when="follows-only-C-or-CN">
      <var comment="Devanagari variant" cp="0947" type="blocked"/>
      <var comment="cross-script homoglyph" cp="0A47" type="blocked"/>
      <var comment="cross-script homoglyph" cp="0A4B" type="blocked"/>
    </char>
    <char comment="DEVANAGARI VOWEL SIGN E: Most of the languages given in section ‎3.2 of the [Proposal]." cp="0947" ref="0 101 102 103 105 108 113" tag="matra" when="follows-only-C-or-CN">
      <var comment="Devanagari variant" cp="0946" type="blocked"/>
      <var comment="cross-script homoglyph" cp="0A47" type="blocked"/>
      <var comment="cross-script homoglyph" cp="0A4B" type="blocked"/>
    </char>
    <char comment="DEVANAGARI VOWEL SIGN AI: Most of the languages given in section ‎3.2 of the [Proposal]." cp="0948" ref="0 101 102 103 113" tag="matra" when="follows-only-C-or-CN">
      <var comment="cross-script homoglyph" cp="0A48" type="blocked"/>
    </char>
    <char comment="DEVANAGARI VOWEL SIGN CANDRA O: Hindi, Kashmiri, Konkani, Marathi" cp="0949" ref="0 100 108" tag="matra" when="follows-only-C-or-CN"/>
    <char cp="0949 0902" when="follows-only-C-or-CN">
      <var cp="093E 0901" type="blocked"/>
      <var cp="093E 093C 0901" type="blocked"/>
    </char>
    <char comment="DEVANAGARI VOWEL SIGN SHORT O: Kashmiri" cp="094A" ref="0 105 108" tag="matra" when="follows-only-C-or-CN"/>
    <char comment="DEVANAGARI VOWEL SIGN O: Most of the languages given in section ‎3.2 of the [Proposal]." cp="094B" ref="0 101 102 103 105 108 113" tag="matra M1" when="follows-only-C-or-CN">
      <var comment="Devanagari variant" cp="094B 093C" not-when="followed-by-N" type="blocked"/>
    </char>
    <char comment="DEVANAGARI VOWEL SIGN O + DEVANAGARI SIGN NUKTA" cp="094B 093C" when="follows-only-C-or-CN">
      <var comment="Devanagari variant" cp="094B" not-when="followed-by-N" type="blocked"/>
    </char>
    <char comment="DEVANAGARI VOWEL SIGN AU: Most of the languages given in section ‎3.2 of the [Proposal]." cp="094C" ref="0 101 102 103 105 108 113" tag="matra" when="follows-only-C-or-CN">
      <var comment="Devanagari variant" cp="094F" type="blocked"/>
    </char>
    <char comment="DEVANAGARI SIGN VIRAMA = halant; suppresses inherent vowel: Most of the languages given in section ‎3.2 of the [Proposal]." cp="094D" ref="0 101 102 103 105 108 113" tag="halant" when="follows-only-C-or-CN"/>
    <char comment="DEVANAGARI VOWEL SIGN AW: Kashmiri" cp="094F" ref="0 105 108" tag="matra" when="follows-only-C-or-CN">
      <var comment="Devanagari variant" cp="094C" type="blocked"/>
    </char>
    <char comment="DEVANAGARI VOWEL SIGN UE: Kashmiri" cp="0956" ref="11 105 108" tag="matra" when="follows-only-C-or-CN">
      <var comment="cross-script homoglyph" cp="0A41" type="blocked"/>
    </char>
    <char comment="DEVANAGARI VOWEL SIGN UUE: Kashmiri" cp="0957" ref="11 105 108" tag="matra" when="follows-only-C-or-CN">
      <var comment="cross-script homoglyph" cp="0A42" type="blocked"/>
    </char>
    <char comment="DEVANAGARI LETTER CANDRA A: Kashmiri, Konkani, Marathi" cp="0972" not-when="preceded-by-H" ref="9 100 102 108 112" tag="vowel"/>
    <char cp="0972 0902" not-when="preceded-by-H">
      <var cp="0905 0901" type="blocked"/>
    </char>
    <char comment="DEVANAGARI LETTER OE: Kashmiri" cp="0973" not-when="preceded-by-H" ref="11 105 108" tag="vowel">
      <var comment="Devanagari variant" cp="0905 0902" type="blocked"/>
    </char>
    <char comment="DEVANAGARI LETTER OOE: Kashmiri" cp="0974" not-when="preceded-by-H" ref="11 105 108" tag="vowel">
      <var comment="Devanagari variant" cp="0906 0902" type="blocked" when="followed-by-V-C-or-end"/>
      <var cp="0906 093C 0902" type="blocked" when="followed-by-V-C-or-end"/>
    </char>
    <char comment="DEVANAGARI LETTER AW: Kashmiri" cp="0975" not-when="preceded-by-H" ref="11 105 108" tag="vowel">
      <var cp="0914" type="blocked"/>
    </char>
    <char comment="DEVANAGARI LETTER UE: Kashmiri" cp="0976" not-when="preceded-by-H" ref="11 105 108" tag="vowel"/>
    <char comment="DEVANAGARI LETTER UUE: Kashmiri" cp="0977" not-when="preceded-by-H" ref="11 105 108" tag="vowel"/>
    <char comment="DEVANAGARI LETTER GGA: Sindhi" cp="097B" ref="8 104" tag="consonant"/>
    <char comment="DEVANAGARI LETTER JJA: Sindhi" cp="097C" ref="8 104" tag="consonant"/>
    <char comment="DEVANAGARI LETTER DDDA: Sindhi" cp="097E" ref="8 104" tag="consonant"/>
    <char comment="DEVANAGARI LETTER BBA: Sindi" cp="097F" ref="8 104" tag="consonant"/>
    <char comment="Cross-script homoglyph" cp="09AE">
      <var comment="cross-script homoglyph" cp="092E" type="blocked"/>
      <var comment="out-of-repertoire" cp="09AE" type="out-of-repertoire-var"/>
      <var comment="cross-script homoglyph" cp="0A38" type="blocked"/>
    </char>
    <char comment="Cross-script homoglyph" cp="09BF">
      <var comment="cross-script homoglyph" cp="093F" type="blocked"/>
      <var comment="out-of-repertoire" cp="09BF" type="out-of-repertoire-var"/>
      <var comment="cross-script homoglyph" cp="0A3F" type="blocked"/>
    </char>
    <char comment="Cross-script homoglyph" cp="0A02">
      <var comment="cross-script homoglyph" cp="0902" type="blocked"/>
      <var comment="cross-script homoglyph" cp="093A" type="blocked"/>
      <var comment="out-of-repertoire" cp="0A02" type="out-of-repertoire-var"/>
    </char>
    <char comment="Cross-script homoglyph" cp="0A07">
      <var comment="cross-script homoglyph" cp="092A 094D 091F 093F" type="blocked"/>
      <var comment="out-of-repertoire" cp="0A07" type="out-of-repertoire-var"/>
    </char>
    <char comment="Cross-script homoglyph" cp="0A08">
      <var comment="cross-script homoglyph" cp="092A 094D 091F 0940" type="blocked"/>
      <var comment="out-of-repertoire" cp="0A08" type="out-of-repertoire-var"/>
    </char>
    <char comment="Cross-script homoglyph" cp="0A0F">
      <var cp="092A 094D 091F 0946" type="blocked"/>
      <var comment="cross-script homoglyph" cp="092A 094D 091F 0947" type="blocked"/>
      <var comment="out-of-repertoire" cp="0A0F" type="out-of-repertoire-var"/>
    </char>
    <char comment="Cross-script homoglyph" cp="0A15">
      <var comment="cross-script homoglyph" cp="0935" type="blocked"/>
      <var comment="out-of-repertoire" cp="0A15" type="out-of-repertoire-var"/>
    </char>
    <char comment="Cross-script homoglyph" cp="0A17">
      <var comment="cross-script homoglyph" cp="0917" type="blocked"/>
      <var comment="out-of-repertoire" cp="0A17" type="out-of-repertoire-var"/>
    </char>
    <char comment="Cross-script homoglyph" cp="0A19">
      <var comment="cross-script homoglyph" cp="0907" type="blocked"/>
      <var comment="out-of-repertoire" cp="0A19" type="out-of-repertoire-var"/>
    </char>
    <char comment="Cross-script homoglyph" cp="0A1C">
      <var comment="cross-script homoglyph" cp="0924 094D 0924" type="blocked"/>
      <var comment="out-of-repertoire" cp="0A1C" type="out-of-repertoire-var"/>
    </char>
    <char comment="Cross-script homoglyph" cp="0A1F">
      <var comment="cross-script homoglyph" cp="091F" type="blocked"/>
      <var comment="out-of-repertoire" cp="0A1F" type="out-of-repertoire-var"/>
    </char>
    <char comment="Cross-script homoglyph" cp="0A20">
      <var comment="cross-script homoglyph" cp="0920" type="blocked"/>
      <var comment="out-of-repertoire" cp="0A20" type="out-of-repertoire-var"/>
    </char>
    <char comment="Cross-script homoglyph" cp="0A24">
      <var comment="cross-script homoglyph" cp="0909" type="blocked"/>
      <var comment="out-of-repertoire" cp="0A24" type="out-of-repertoire-var"/>
    </char>
    <char comment="Cross-script homoglyph" cp="0A27">
      <var comment="cross-script homoglyph" cp="092A" type="blocked"/>
      <var comment="out-of-repertoire" cp="0A27" type="out-of-repertoire-var"/>
    </char>
    <char comment="Cross-script homoglyph" cp="0A2B">
      <var comment="cross-script homoglyph" cp="0922" type="blocked"/>
      <var comment="out-of-repertoire" cp="0A2B" type="out-of-repertoire-var"/>
    </char>
    <char comment="Cross-script homoglyph" cp="0A2C">
      <var comment="cross-script homoglyph" cp="0918" type="blocked"/>
      <var comment="out-of-repertoire" cp="0A2C" type="out-of-repertoire-var"/>
    </char>
    <char comment="Cross-script homoglyph" cp="0A2E">
      <var comment="cross-script homoglyph" cp="092D" type="blocked"/>
      <var comment="out-of-repertoire" cp="0A2E" type="out-of-repertoire-var"/>
    </char>
    <char comment="Cross-script homoglyph" cp="0A35">
      <var comment="cross-script homoglyph" cp="0939" type="blocked"/>
      <var comment="out-of-repertoire" cp="0A35" type="out-of-repertoire-var"/>
    </char>
    <char comment="Cross-script homoglyph" cp="0A38">
      <var comment="cross-script homoglyph" cp="092E" type="blocked"/>
      <var comment="cross-script homoglyph" cp="09AE" type="blocked"/>
      <var comment="out-of-repertoire" cp="0A38" type="out-of-repertoire-var"/>
    </char>
    <char comment="Cross-script homoglyph" cp="0A3C">
      <var comment="cross-script homoglyph" cp="093C" type="blocked"/>
      <var comment="out-of-repertoire" cp="0A3C" type="out-of-repertoire-var"/>
    </char>
    <char comment="Cross-script homoglyph" cp="0A3F">
      <var comment="cross-script homoglyph" cp="093F" type="blocked"/>
      <var comment="cross-script homoglyph" cp="09BF" type="blocked"/>
      <var comment="out-of-repertoire" cp="0A3F" type="out-of-repertoire-var"/>
    </char>
    <char comment="Cross-script homoglyph" cp="0A40">
      <var comment="cross-script homoglyph" cp="0940" type="blocked"/>
      <var comment="out-of-repertoire" cp="0A40" type="out-of-repertoire-var"/>
    </char>
    <char comment="Cross-script homoglyph" cp="0A41">
      <var comment="cross-script homoglyph" cp="0956" type="blocked"/>
      <var comment="out-of-repertoire" cp="0A41" type="out-of-repertoire-var"/>
    </char>
    <char comment="Cross-script homoglyph" cp="0A42">
      <var comment="cross-script homoglyph" cp="0957" type="blocked"/>
      <var comment="out-of-repertoire" cp="0A42" type="out-of-repertoire-var"/>
    </char>
    <char comment="Cross-script homoglyph" cp="0A47">
      <var comment="cross-script homoglyph" cp="0946" type="blocked"/>
      <var comment="cross-script homoglyph" cp="0947" type="blocked"/>
      <var comment="out-of-repertoire" cp="0A47" type="out-of-repertoire-var"/>
      <var comment="cross-script homoglyph" cp="0A4B" type="blocked"/>
    </char>
    <char comment="Cross-script homoglyph" cp="0A48">
      <var comment="cross-script homoglyph" cp="0948" type="blocked"/>
      <var comment="out-of-repertoire" cp="0A48" type="out-of-repertoire-var"/>
    </char>
    <char comment="Cross-script homoglyph" cp="0A4B">
      <var comment="cross-script homoglyph" cp="0946" type="blocked"/>
      <var comment="cross-script homoglyph" cp="0947" type="blocked"/>
      <var comment="cross-script homoglyph" cp="0A47" type="blocked"/>
      <var comment="out-of-repertoire" cp="0A4B" type="out-of-repertoire-var"/>
    </char>
    <char comment="Cross-script homoglyph" cp="0A71">
      <var comment="cross-script homoglyph" cp="0945" type="blocked"/>
      <var comment="out-of-repertoire" cp="0A71" type="out-of-repertoire-var"/>
    </char>
  </data>
  <rules>
    <class name="C" from-tag="consonant" comment="Any consonant"/>
    <class name="C1" from-tag="C1" comment="Consonants followed by Nukta"/>
    <class name="V" from-tag="vowel" comment="Any independent vowel"/>
    <class name="V1" from-tag="V1" comment="Vowels followed by Nukta (Required for Santali language)"/>
    <class name="M" from-tag="matra" comment="Any vowel sign (matra)"/>
    <class name="M1" from-tag="M1" comment="Matras followed by Nukta (Required for Santali language)"/>
    <class name="H" from-tag="halant" comment="The Halant (VIRAMA)"/>
    <class name="N" from-tag="nukta" comment="The Nukta"/>
    <rule name="leading-combining-mark" comment="Default rule from MSR-4 ⍟">
      <start/>
      <union>
        <class property="gc:Mn"/>
        <class property="gc:Mc"/>
      </union>
    </rule>
    <rule name="follows-either-C1-V1-or-M1" comment="Section 7, WLE 1: Rule for Nukta which restricts its preceding characters">
      <look-behind>
        <choice>
          <class by-ref="C1"/>
          <class by-ref="M1"/>
          <class by-ref="V1"/>
        </choice>
      </look-behind>
      <anchor/>
    </rule>
    <rule name="follows-only-C-or-CN" comment="Section 7, WLE 2, 3: Halant or Matra characters can only be preceded by consonant or a nukta which is preceded by a consonant">
      <look-behind>
        <choice>
          <class by-ref="C"/>
          <rule comment="nukta-preceded-by-consonant">
            <class by-ref="C"/>
            <class by-ref="N"/>
          </rule>
        </choice>
      </look-behind>
      <anchor/>
    </rule>
    <rule name="follows-only-V-or-C-or-N-or-M" comment="Section 7, WLE 4, 5, 6: Anusvara, Candrabindu or Visarga characters can only be preceded by either Vowel or Consonant or Nukta or Matra">
      <look-behind>
        <choice>
          <class by-ref="V"/>
          <class by-ref="C"/>
          <class by-ref="N"/>
          <class by-ref="M"/>
        </choice>
      </look-behind>
      <anchor/>
    </rule>
    <rule name="preceded-by-H" comment="Section 7, WLE 7: A vowel is not allowed to follow an H">
      <look-behind>
        <class by-ref="H"/>
      </look-behind>
      <anchor/>
    </rule>
    <rule name="followed-by-N" comment="Section 6.4.1, Variant Context Rule for Nukta: Variant to be generated only when not followed by Nukta.">
      <anchor/>
	  <look-ahead>
        <class by-ref="N"/>
      </look-ahead>
    </rule>
	<rule name="followed-by-V-C-or-end" comment="Section 6.1.2, Variant to be generated when followed by V or C (including RRA) or end of label">
	  <anchor/>
	  <look-ahead>
		<choice>
		  <class by-ref="V"/>
	  	  <class by-ref="C"/>
		  <char cp="0931"/>
		  <end/>
		</choice>
	  </look-ahead>
	</rule>
    <action disp="invalid" match="leading-combining-mark"/>
    <action disp="invalid" any-variant="out-of-repertoire-var" comment="any variant label with a code point out of repertoire is invalid ⍟"/>
    <action disp="blocked" any-variant="blocked" comment="default action MSR-4 ⍟"/>
    <action disp="allocatable" any-variant="allocatable" comment="default action MSR-4 ⍟"/>
    <action disp="valid" comment="catch all; default action from MSR-4 ⍟"/>
  </rules>
</lgr>
