
ICANN's Open Data Initiative Early Pilot Platforms Now Available


'Open data is publicly available data that can be universally and readily accessed, used, and redistributed free of charge. It is structured for usability and computability.'
-'The Global Impact of Open Data,' by Andrew Young and Stefaan Verhulst

Today, ICANN is pleased to announce the launch of the Open Data Initiative pilot program, which is now available to all interested parties. The pilot consists of four different open data platforms, each of which holds several years of Registry Monthly Reports of Activity and Transactions data. We chose this dataset for several reasons: the data is already confirmed to be publicly redistributable; the current publication method is awkward, with many files spread over many pages on ICANN's website; and several community members have already asked for this dataset to be made available in a more convenient format. Please note that the dataset in this pilot program is subject to the same three-month publication delay required by the registry agreements.

The four publication platforms each differ in their approach. One is an in-house effort built on the open source CKAN package. The other three use commercial products from Enigma, OpenDataSoft, and Socrata. The goal of providing different implementations using the same source dataset is to help explore different approaches to this open data.

The pilot program is described in detail here. Links to the individual software platforms are also available on that page.

We're actively seeking discussion of and feedback on the pilot program, to help set the future direction of this project. We've established a mailing list, odi-pilot@icann.org, for this purpose. Please visit https://mm.icann.org/mailman/listinfo/odi-pilot to subscribe. The vendors of the platforms have also been invited to subscribe to this list.

Please note that we are still in the early stages of the Open Data Initiative. The single dataset does not exercise the full power of many of the tools, but feel free to go beyond what is initially set up and explore the tools as much as possible. It is too soon, however, for a true side-by-side comparison of the different approaches. Our current goal is not to pick a platform as a result of the pilot program, but to gather feedback that will inform that decision later.

The pilot platforms will be available until at least 1 November 2017.


Domain Name System
Internationalized Domain Name (IDN)

IDNs are domain names that include characters used in the local representation of languages that are not written with the twenty-six letters of the basic Latin alphabet "a-z". An IDN can contain Latin letters with diacritical marks, as required by many European languages, or may consist of characters from non-Latin scripts such as Arabic or Chinese. Many languages also use other types of digits than the European "0-9". The basic Latin alphabet together with the European-Arabic digits are, for the purpose of domain names, termed "ASCII characters" (ASCII = American Standard Code for Information Interchange). These are also included in the broader range of "Unicode characters" that provides the basis for IDNs.

The "hostname rule" requires that all domain names of the type under consideration here are stored in the DNS using only the ASCII characters listed above, with the one further addition of the hyphen "-". The Unicode form of an IDN therefore requires special encoding before it is entered into the DNS.

The following terminology is used when distinguishing between these forms. A domain name consists of a series of "labels" (separated by "dots"). The ASCII form of an IDN label is termed an "A-label". All operations defined in the DNS protocol use A-labels exclusively. The Unicode form, which a user expects to be displayed, is termed a "U-label". The difference may be illustrated with the Hindi word for "test", परीका, appearing here as a U-label would (in the Devanagari script). A special form of "ASCII compatible encoding" (abbreviated ACE) is applied to this to produce the corresponding A-label: xn--11b5bs1di.

A domain name that only includes ASCII letters, digits, and hyphens is termed an "LDH label". Although the definitions of A-labels and LDH labels overlap, a name consisting exclusively of LDH labels, such as "icann.org", is not an IDN.
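
The relationship between U-labels, A-labels, and LDH labels can be sketched in a few lines of Python. This is a minimal illustration, not ICANN's tooling: the function names (`to_a_label`, `to_u_label`, `is_ldh_label`, `is_idn`) are hypothetical, and the sketch relies on Python's built-in `idna` codec, which follows the older IDNA 2003 rules and may differ from what a given registry applies today.

```python
import string

ACE_PREFIX = "xn--"  # prefix that marks an ASCII-compatible-encoded label
LDH_CHARS = set(string.ascii_letters + string.digits + "-")


def is_ldh_label(label: str) -> bool:
    """True if the label uses only ASCII letters, digits, and hyphens."""
    return bool(label) and all(ch in LDH_CHARS for ch in label)


def to_a_label(u_label: str) -> str:
    """Convert a Unicode label to its ASCII form (illustrative helper)."""
    return u_label.encode("idna").decode("ascii")


def to_u_label(a_label: str) -> str:
    """Convert an A-label back to the Unicode form shown to users."""
    return a_label.encode("ascii").decode("idna")


def is_idn(domain: str) -> bool:
    """True if any label is non-ASCII or carries the ACE prefix.

    A simplification: it does not check that an xn-- label decodes
    to a valid U-label.
    """
    labels = [label for label in domain.split(".") if label]
    return any(
        not is_ldh_label(label) or label.lower().startswith(ACE_PREFIX)
        for label in labels
    )


if __name__ == "__main__":
    u_label = "परीका"                # the U-label quoted in the glossary entry
    a_label = to_a_label(u_label)
    print(a_label)                   # xn--11b5bs1di, the A-label quoted above
    print(to_u_label(a_label) == u_label)
    print(is_idn("icann.org"))       # LDH labels only, so not an IDN
```

The `is_idn` check mirrors the closing point of the entry: `xn--11b5bs1di` and `icann.org` both consist solely of LDH characters, and only the ACE prefix distinguishes the first as an IDN.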