Skip to main content

New gTLD Program: String Similarity Contention Sets

ICANN is publishing today the contention sets identified by the string similarity review for applications submitted as part of the New gTLD Program. Review the full list of contention sets here: PDF [162 KB], CSV [65 KB] or from the Applicants' Corner page on the new gTLD microsite.

Overall statistics about Contention Sets

  • 2 Non-Exact Match Contention Sets
    • .hotels & .hoteis
    • .unicorn & .unicom
  • 230 Exact Match Contention Sets
  • 754 Total Applications in contention

The Current Application Status page on the new gTLD microsite will be updated to reflect these contention sets.

The role of the String Similarity Panel is to assess whether a proposed gTLD string creates a probability of user confusion due to similarity with any reserved name, any existing TLD, any requested IDN ccTLD, or any new gTLD string applied for in the current application round.

Per the Applicant Guidebook, a contention set is a group of two or more applications containing identical or visually similar applied-for gTLD strings. For more information on string contention procedures, please refer to Module 4 [PDF, 428 KB] of the Applicant Guidebook.

As a reminder, the objection filing period is open until 13 March 2013. For more information refer to the Objection & Dispute Resolution page.


More Announcements
Domain Name System
Internationalized Domain Name ,IDN,"IDNs are domain names that include characters used in the local representation of languages that are not written with the twenty-six letters of the basic Latin alphabet ""a-z"". An IDN can contain Latin letters with diacritical marks, as required by many European languages, or may consist of characters from non-Latin scripts such as Arabic or Chinese. Many languages also use other types of digits than the European ""0-9"". The basic Latin alphabet together with the European-Arabic digits are, for the purpose of domain names, termed ""ASCII characters"" (ASCII = American Standard Code for Information Interchange). These are also included in the broader range of ""Unicode characters"" that provides the basis for IDNs. The ""hostname rule"" requires that all domain names of the type under consideration here are stored in the DNS using only the ASCII characters listed above, with the one further addition of the hyphen ""-"". The Unicode form of an IDN therefore requires special encoding before it is entered into the DNS. The following terminology is used when distinguishing between these forms: A domain name consists of a series of ""labels"" (separated by ""dots""). The ASCII form of an IDN label is termed an ""A-label"". All operations defined in the DNS protocol use A-labels exclusively. The Unicode form, which a user expects to be displayed, is termed a ""U-label"". The difference may be illustrated with the Hindi word for ""test"" — परीका — appearing here as a U-label would (in the Devanagari script). A special form of ""ASCII compatible encoding"" (abbreviated ACE) is applied to this to produce the corresponding A-label: xn--11b5bs1di. A domain name that only includes ASCII letters, digits, and hyphens is termed an ""LDH label"". Although the definitions of A-labels and LDH-labels overlap, a name consisting exclusively of LDH labels, such as""icann.org"" is not an IDN."