This file contains a set of Label Generation Rules for the Japanese script (JLGR) using a limited repertoire as would be appropriate for the Root Zone.
For more details, see "Proposal for a Japanese Script Root Zone LGR" [Proposal-Japanese]. The format of this file follows [RFC 7940].
The repertoire includes 6356 Han characters, 85 Hiragana characters, 89 Katakana characters, and 2 Common (quasi-Kanji) characters, for a total of 6532 repertoire elements. One of the Common characters is shared between Han, Hiragana, and Katakana, while the other is shared between Hiragana and Katakana only. For more details, see Section 5, "Repertoire" in [Proposal-Japanese].
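As a quick sanity check on the arithmetic above, the per-script counts can be summed as follows. This is only an illustrative sketch: the dictionary name and keys are hypothetical, and the figures are copied from the preceding paragraph rather than computed from the repertoire table itself.

```python
# Per-script counts as quoted in the text (not derived from the table).
# Each Common character is counted once, even though it is shared
# between several scripts.
repertoire_counts = {
    "Han": 6356,
    "Hiragana": 85,
    "Katakana": 89,
    "Common": 2,
}

total = sum(repertoire_counts.values())
print(total)  # 6532
```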
The repertoire is contained in [MSR-5], which is a subset of [Unicode 11.0].
Each code point is tagged with the script or scripts that the code point is used with, and one or more references documenting sufficient justification for inclusion in the repertoire; see "References" below.
This LGR defines the following named character classes:
According to Sections 6 and 7 of [Proposal-Japanese], the LGR defines variants acquired through integration with the Chinese RZ-LGR [CLGR] and Korean RZ-LGR [KLGR], as well as visually identical variants within the Japanese script.
Of the imported variants, only those that result in an in-script variant for the Japanese script are explicitly listed in this LGR. However, all cross-repertoire mappings will be applied as part of integration into the Root Zone LGR.
All variant labels are "blocked".
The following source references are used for variant entries in this LGR:
The LGR includes the set of required default WLE rules and actions applicable to the Root Zone and defined in [MSR-5]. They are marked with ⍟.
The LGR defines a number of actions that compute a label disposition based on WLE rules or variant mapping types. All of these are common to all LGRs, and none are specific to this LGR.
The LGR includes the set of required default actions applicable to the Root Zone and defined in [MSR-5]. They are marked with ⍟.
The Root Zone LGR for the Japanese script was developed by the Japanese Script Generation Panel (JGP). For further details on methodology and contributors, see Sections 4 and 9 in [Proposal-Japanese], as well as [RZ-LGR-x-Overview].
References [0] and up refer to the Unicode Standard versions in which corresponding code points were initially encoded. For example, a reference value of [0] means a character was first encoded in Unicode Version 1.1. References [100] and up indicate that the code point is included in one of the following source references: JIS X 0208:1997, Survey on the user perception of the homographic character set specified by JGP, CLGR repertoire, and KLGR repertoire. For example, a reference value of [300] means a code point listed here is included in the CLGR repertoire, which is a superset of the KLGR Hanja repertoire. References cited for variants indicate whether variant mappings match a source reference in mapping, variant type, or both. Entries in the table may have multiple source reference values.
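The numbering convention above can be sketched in a few lines of Python. This is a hedged illustration, not part of the LGR: the function names are hypothetical, the input is assumed to be a space-separated list of numeric reference ids, and only the two ids named in the text ([0] and [300]) are given a label. Every other id would have to be looked up in the Table of References.

```python
# Labels for the only two reference ids spelled out in the text.
# All other ids are deliberately left unresolved (assumption: they are
# defined in the Table of References, which is not reproduced here).
KNOWN_EXAMPLES = {
    0: "Unicode Version 1.1",
    300: "CLGR repertoire",
}


def reference_kind(ref: int) -> str:
    """Classify a reference id by the numeric range it falls in."""
    if ref < 0:
        raise ValueError("reference ids are non-negative")
    if ref < 100:
        return "unicode-version"
    return "source-reference"


def describe(refs: str) -> list:
    """Split a space-separated reference list and classify each id."""
    result = []
    for token in refs.split():
        ref = int(token)
        label = KNOWN_EXAMPLES.get(ref, "see Table of References")
        result.append((ref, reference_kind(ref), label))
    return result


print(describe("0 300"))
```

An entry that carries several reference values is simply classified value by value, which matches the note that table entries may cite more than one source.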
In addition, the following general references are cited in this document:
For additional detail on references cited in the remainder of the document, refer to the Table of References below.
Where links to certain references are not available, the data may be archived as part of [Proposal-Japanese].
]]>