Skip to main content
Resources

Community Experiences with the InterNIC Whois Data Problem Reports System

Contents

Executive Summary

I. Applicable Provisions of the ICANN RAA

II. Implementation of the WDPRS

III. Statistics from Operation of the WDPRS

IV. Impact of WDPRS on Improved Whois Data Accuracy


Executive Summary

This Report summarizes ICANN's experience with the operation of the Whois Data Problem Report system (WDPRS) during a 12-month reporting period that ended 28 February 2005. ICANN developed this system to receive and track complaints about inaccurate or incomplete Whois data entries. Individuals who encounter such entries may notify ICANN by completing an online form, which is then forwarded to the registrar of record for appropriate action. The WDPRS is one of the tools that ICANN uses to improve the accuracy of Whois data. Last year ICANN streamlined the system to provide for both greater automation and expanded functionality. The new system includes all gTLDs; the replaced system addressed .com, .net and .org only.

Under the WDPRS, ICANN is able to track how many reports are filed and how many of these are "confirmed" by the reporter so they may be sent to the registrar of record. After a prescribed period, ICANN asks the person filing the report to check the Whois data again, and indicate whether (i) the data was fixed; (ii) the domain name was deleted; (iii) the data was unchanged; or (iv) there is some other disposition.

On average, there were 2,865 reports confirmed each full month during the reporting period. (There were 31,533 confirmed reports total. While the report covers 12 months, only partial data is available for the first month of the reporting period).

A total of 16,941 unique domain names were the subject of the 31,533 Whois Data Problem Reports. Multiple reports for the same domain name accounted for 14,592 (one domain name received 61 reports).

Reports were submitted by 3,122 different individuals; the top 20 contributing individuals accounted for over 58% of those reports.

The analysis performed on the data indicates that more than 63% of the names reported were corrected, suspended, or are no longer registered.

The differences in methodology and reporting mechanisms from the previous reporting period make it difficult to draw significant, specific conclusions about the impact of the WDRPS on Whois data accuracy. While the numbers of reports to the WDRPS is higher than in the previous period, it is believed that this is a function of greater awareness and a simplification of the reporting process. This reflects enhanced attention paid to Whois data accuracy. To further improve our understanding of the potential impact, ICANN has added a component to this analysis by conducting a study of a random sampling of Whois data from across gTLDs.

Introduction

The following is a report summarizing ICANN's experience with the operation of the Whois Data Problem Report system ("Report") at InterNIC.net (http://wdprs.internic.net/) since publication of the previous report on 31 March 2004 (http://www.icann.org/Whois/WDPRS-report-final.pdf). These reports are published pursuant to Section II.C.10.a of Amendment 6 to the ICANN/DOC Memorandum of Understanding, which provides that:

ICANN shall publish a report no later than March 31, 2004, and annually thereafter, providing statistical and narrative information on community experiences with the InterNIC WHOIS Data Problem Reports system. The report shall include statistics on the number of WHOIS data inaccuracies reported to date, the number of unique domain names with reported inaccuracies, and registrar handling of the submitted reports. The narrative information shall include an evaluation of the impact of the WHOIS Data Problem Reports system on improved accuracy of WHOIS data (http://www.icann.org/general/amend6-jpamou-17sep03.htm).

Another report required by the same section of the MOU, entitled Implementation of the Whois Data Reminder Policy, was published on 30 November 2004 (http://www.icann.org/Whois/WDRP-Implementation-30Nov04.pdf).

Whois data for generic Top Level Domains (gTLDs) includes information about the registrant, administrative contact, technical contact, and name servers associated with each domain name. This information is used for a variety of important purposes, including resolution of technical network issues, identification and verification of online merchants, investigations by consumer protection and law enforcement authorities, enforcement of intellectual property rights, identification of sources of spam e-mail, and determinations of whether a domain name is available for registration. Whois services have been available on the Internet since the early 1980s and continue to be broadly used. According to an online survey of over 3000 participants (representing businesses, governments, ISPs, registrars, individuals, and non-commercial organizations) conducted by the ICANN Domain Name Supporting Organization in 2001, Internet users broadly consider accurate Whois data to be important and support measures to improve its accuracy http://www.dnso.org/dnso/notes/WhoisTF/20020625.TFWhois-report.htm.

I. Applicable Provisions of the ICANN Registrar Accreditation Agreement

ICANN's contracts with accredited registrars require them to obtain contact information from registrants, to provide it publicly by a Whois service, and to investigate and correct any reported inaccuracies in the contact information for names they sponsor. Several provisions of the ICANN Registrar Accreditation Agreement (RAA) (http://www.icann.org/registrars/ra-agreement-17may01.htm) relate to Whois data, including:

3.3.1 At its expense, Registrar shall provide an interactive web page and a port 43 Whois service providing free public query-based access to up-to-date (i.e., updated at least daily) data concerning all active Registered Names sponsored by Registrar for each TLD in which it is accredited. The data accessible shall consist of elements that are designated from time to time according to an ICANN adopted specification or policy. Until ICANN otherwise specifies by means of an ICANN adopted specification or policy, this data shall consist of the following elements as contained in Registrar's database:

3.3.1.1 The name of the Registered Name;

3.3.1.2 The names of the primary name server and secondary name server(s) for the Registered Name;

3.3.1.3 The identity of Registrar (which may be provided through Registrar's website);

3.3.1.4 The original creation date of the registration;

3.3.1.5 The expiration date of the registration;

3.3.1.6 The name and postal address of the Registered Name Holder;

3.3.1.7 The name, postal address, e-mail address, voice telephone number, and (where available) fax number of the technical contact for the Registered Name; and

3.3.1.8 The name, postal address, e-mail address, voice telephone number, and (where available) fax number of the administrative contact for the Registered Name.

 

3.7.7 Registrar shall require all Registered Name Holders to enter into an electronic or paper registration agreement with Registrar including at least the following provisions:

3.7.7.1 The Registered Name Holder shall provide to Registrar accurate and reliable contact details and promptly correct and update them during the term of the Registered Name registration, including: the full name, postal address, e-mail address, voice telephone number, and fax number if available of the Registered Name Holder; name of authorized person for contact purposes in the case of an Registered Name Holder that is an organization, association, or corporation; and the data elements listed in Subsections 3.3.1.2, 3.3.1.7 and 3.3.1.8.

3.7.7.2 A Registered Name Holder's willful provision of inaccurate or unreliable information, its willful failure promptly to update information provided to Registrar, or its failure to respond for over fifteen calendar days to inquiries by Registrar concerning the accuracy of contact details associated with the Registered Name Holder's registration shall constitute a material breach of the Registered Name Holder-registrar contract and be a basis for cancellation of the Registered Name registration.

3.7.7.3 Any Registered Name Holder that intends to license use of a domain name to a third party is nonetheless the Registered Name Holder of record and is responsible for providing its own full contact information and for providing and updating accurate technical and administrative contact information adequate to facilitate timely resolution of any problems that arise in connection with the Registered Name. A Registered Name Holder licensing use of a Registered Name according to this provision shall accept liability for harm caused by wrongful use of the Registered Name, unless it promptly discloses the identity of the licensee to a party providing the Registered Name Holder reasonable evidence of actionable harm.

3.7.8 Registrar shall abide by any specifications or policies established according to Section 4 requiring reasonable and commercially practicable (a) verification, at the time of registration, of contact information associated with a Registered Name sponsored by Registrar or (b) periodic re-verification of such information (emphasis added). Registrar shall, upon notification by any person of an inaccuracy in the contact information associated with a Registered Name sponsored by Registrar, take reasonable steps to investigate that claimed inaccuracy. In the event Registrar learns of inaccurate contact information associated with a Registered Name it sponsors, it shall take reasonable steps to correct that inaccuracy.

Based on the above provisions of the RAA, a registrar must:

  • Require each registrant to submit (and keep updated) accurate contact details (3.7.7.1);
  • Provide both a web-based and Port 43 Whois service providing access to complete contact information for all TLDs covered under the RAA (3.3.1);
  • Require registrants to agree that willfully submitting inaccurate contact details (or failing to respond within 15 days to an inquiry regarding accuracy) shall be a basis for cancellation of the registration (3.7.7.2); and
  • Take reasonable steps to investigate and correct the contact details in response to any reported inaccuracy (3.7.8).

ICANN has taken several steps to improve the accuracy of Whois data. These include:

  • On 10 May 2002, ICANN reminded registrars of the importance of understanding their obligations regarding the accuracy of Whois data in a "Registrar Advisory Concerning Whois Data Accuracy" (http://www.icann.org/announcements/advisory-10may02.htm).
  • On 3 September 2002, ICANN took additional steps to improve the accuracy of Whois data, see Announcement on Steps to Improve Whois Data Accuracy (http://www.icann.org/announcements/announcement-03sep02.htm). ICANN developed a system for receiving and tracking complaints about inaccurate or incomplete Whois data. As noted above, the first annual report on the "Whois Data Problem Reports System" was published on 31 March 2004 and covered information about that process.
  • On 27 March 2003, ICANN adopted the Whois Data Reminder Policy (WDRP) (http://www.icann.org/registrars/wdrp.htm) as a consensus policy. That Policy requires that a registrar present current Whois information to each registrant, at least annually, and remind the registrant that the provision of false data can be grounds for cancellation of a registration. Registrants must review their Whois data and make any necessary corrections.
  • On 3 April 2003, shortly after adopting the WDRP, ICANN issued a "Registrar Advisory Concerning the '15-day Period' in Whois Accuracy Requirements" (http://www.icann.org/announcements/advisory-03apr03.htm). That advisory provided guidance on a registrarâ's right to cancel a registration because of a registrant's (i) "willful provision of inaccurate or unreliable information"; (ii) "willful failure promptly to update information;" or (iii) a "failure to respond for over fifteen calendar days to inquiries by Registrar concerning the accuracy of contact details." The advisory also reiterated that a registrar has the right to cancel a registration in such cases, but is not required to do so.

II. Implementation of the Whois Data Problem Report System (WDPRS)

In order to assist registrars in complying with the contract obligations outlined above, ICANN implemented the Whois Data Problem Report System (WDPRS) on 3 September 2002. The goal of the WDPRS is to streamline the process for receiving and tracking complaints about inaccurate and incomplete Whois data, and thereby help improve the accuracy of Whois data.

Reports of inaccurate Whois data under the WDPRS are submitted through the InterNIC website, operated by ICANN as a public resource containing information relating to domain registration services. The centerpiece of the WDPRS is a centralized online form, available at http://wdprs.internic.net, for submitting reports about Whois data inaccuracies. The form requests Internet users (called "reporters" in this context) to specify the domain name they believe is inaccurate, and their name and email address. After entering "submit", the reporter is shown the Whois record for that domain name, and asked to indicate the inaccuracy. The system then sends the reporter an email request for confirmation of the report, which he has up to five days to acknowledge or the report is deleted.

Once the report is confirmed by the reporter, it is automatically forwarded to the registrar of record for handling. As of 31 March 2005, there are 468 ICANN-accredited registrars. A complete list of accredited registrars is available on the ICANN website at http://www.icann.org/registrars/accredited-list.html, and on the InterNIC website at http://www.internic.net/regist.html. (The InterNIC registrar listing can be sorted by location of registrar or by languages supported.)

Several enhancements have been made to the WDPRS since it was first implemented in 2002. A full description of the system's functionality at launch may be found in the 31 March 2004 report on "Community Experiences with the InterNIC Whois Data Problem Reports System" (http://www.icann.org/Whois/WDPRS-report-final.pdf).

III. Statistics from Operation of the WDPRS

The following sections provide a statistical summary of operation of the Whois Data Problem Report System. These statistics cover the operation of the system from the last report's cut-off date of 29 February 2004 until this year's cut-off date of 28 February 2005. It includes information concerning: (A) the number of Whois data inaccuracies reported; (B) the number of unique domain names with reported inaccuracies; and (C) registrar handling of the submitted reports. Although the type of data that is being collected has changed since the 2004 Report, and the reporting period considered in last year's report period was 50% longer (18 months instead of 12), this Report will make comparisons when appropriate.

A. Reported Data Inaccuracies

A total of 31,553 Whois Data Problem Reports were confirmed by their senders during the 12-month reporting period. (The 2004 Report indicated that 24,148 submissions had been confirmed during that 18-month reporting period.) The following table indicates the number of reports confirmed per month (the numbers for March 2004 reflect a partial month due to system changeover):

Date

Reports Confirmed

Mar-04

16

Apr-04

2003

May-04

2513

Jun-04

2978

Jul-04

3462

Aug-04

2998

Sep-04

2120

Oct-04

2289

Nov-04

3018

Dec-04

3594

Jan-05

3806

Feb-05

2736

Total

31533

On average, there were 2865 reports confirmed each month. During the previous reporting period, there were, on average, 1,342 reports confirmed every month.

On a per TLD basis, .com represented 62% of confirmed reports, with .info and .biz constituting 14% and 12% respectively. The statistics for those and other gTLDs are included in the following table:

TLD Reports # Reports % Reports per 10000 registrations*
.com 19705 62.49% 5.80
.info 4332 13.74% 12.92
.biz 3715 11.78% 32.37
.net 2504 7.94% 4.59
.org 1268 4.02% 3.73
.name 6 0.02% 0.42

*Based on registrations as of 31 December 2004

A total of 3122 different individuals submitted reports. On average, each reporter submitted approximately 10 reports, while some individuals submitted significantly more reports. Out of a total of 31533 confirmed reports, the numbers of reports per individual, for the top 20 reporters are as follows:

Top 20 Reporters # Reports Submitted
1 4035
2 2186
3 1197
4 1183
5 1058
6 891
7 881
8 770
9 715
10 592
11 572
12 555
13 532
14 513
15 505
16 482
17 482
18 415
19 414
20 339
Total 18317

As this table shows, fewer than 1% of all those who filed reports (20 people) were responsible for over 58% (18,317 out of 31,533) of all Whois inaccuracy reports submitted to ICANN during the reporting period. The 2004 Report indicated that the top 20 ( 0.3%) of reporters were responsible for over 40% (9,938 out of 24,148) of Whois inaccuracy reports.

There is evidence that individuals also are reporting single domains when they discover a problem -- there were 2,363 individuals who submitted exactly one report. Of these, 2,116 were later sent a follow-up message, and 922 of those did follow-up on their report, a return rate of about 44%. (The overall average return rate was about 51%).

From both anecdotal information received by ICANN and text accompanying the body of these reports we conclude that most, if not all of the high volume reporters are driven by a concern about abuses involving spam. In well over 80% of the reports filed, the reporter indicated "spam" as a factor in the body of the report.

B. Unique Domain Names

A total of 16,941 unique domain names were the subject of Whois Data Problem Reports. As reported above, there were a total of 31,533 reports confirmed. Accordingly, just over 14592 of the reports were "duplicates", meaning that the domain name they referred to was already the subject of a report made during the reporting period.

Top 20 Domain Names Reported Reports per Domain Name
1 61
2 50
3 47
4 42
5 41
6 38
7 37
8 36
9 34
10 34
11 33
12 33
13 32
14 31
15 30
16 30
17 29
18 28
19 27
20 27

In the previous reporting period, a total of 16045 unique domain names were the subject of Whois Data Problem Reports, and just over one-third of the reports were "duplicates." The following discussion of Whois accuracy focuses on number of individual domain names reported, not the total number of raw reports.

C. Registrar Handling

Under the streamlined WDPRS in effect during the reporting period, the reporter is asked to inform ICANN of the status of the data after the reported inaccuracy is forwarded to the registrar and a suitable period for action has elapsed (“follow-up reportâ€

Domain Name System
Internationalized Domain Name ,IDN,"IDNs are domain names that include characters used in the local representation of languages that are not written with the twenty-six letters of the basic Latin alphabet ""a-z"". An IDN can contain Latin letters with diacritical marks, as required by many European languages, or may consist of characters from non-Latin scripts such as Arabic or Chinese. Many languages also use other types of digits than the European ""0-9"". The basic Latin alphabet together with the European-Arabic digits are, for the purpose of domain names, termed ""ASCII characters"" (ASCII = American Standard Code for Information Interchange). These are also included in the broader range of ""Unicode characters"" that provides the basis for IDNs. The ""hostname rule"" requires that all domain names of the type under consideration here are stored in the DNS using only the ASCII characters listed above, with the one further addition of the hyphen ""-"". The Unicode form of an IDN therefore requires special encoding before it is entered into the DNS. The following terminology is used when distinguishing between these forms: A domain name consists of a series of ""labels"" (separated by ""dots""). The ASCII form of an IDN label is termed an ""A-label"". All operations defined in the DNS protocol use A-labels exclusively. The Unicode form, which a user expects to be displayed, is termed a ""U-label"". The difference may be illustrated with the Hindi word for ""test"" — परीका — appearing here as a U-label would (in the Devanagari script). A special form of ""ASCII compatible encoding"" (abbreviated ACE) is applied to this to produce the corresponding A-label: xn--11b5bs1di. A domain name that only includes ASCII letters, digits, and hyphens is termed an ""LDH label"". Although the definitions of A-labels and LDH-labels overlap, a name consisting exclusively of LDH labels, such as""icann.org"" is not an IDN."