Independent Submission                                    A. El-Sherbiny
Request for Comments: 5564                                      M. Farah
Category: Informational                                         UN-ESCWA
ISSN: 2070-1721                                              I. Oueichek
                                            Syrian Telecom Establishment
                                                             A. Al-Zoman
                                                          SaudiNIC, CITC
                                                           February 2010
        
Independent Submission                                    A. El-Sherbiny
Request for Comments: 5564                                      M. Farah
Category: Informational                                         UN-ESCWA
ISSN: 2070-1721                                              I. Oueichek
                                            Syrian Telecom Establishment
                                                             A. Al-Zoman
                                                          SaudiNIC, CITC
                                                           February 2010
        

Linguistic Guidelines for the Use of the Arabic Language in Internet Domains

互联网领域使用阿拉伯语的语言指南

Abstract

摘要

This document constitutes technical specifications for the use of Arabic in Internet domain names and provides linguistic guidelines for Arabic domain names. It addresses Arabic-specific linguistic issues pertaining to the use of Arabic language in domain names.

本文件构成互联网域名中使用阿拉伯语的技术规范,并提供阿拉伯语域名的语言指南。它解决了与在域名中使用阿拉伯语有关的特定于阿拉伯语的语言问题。

Status of This Memo

关于下段备忘

This document is not an Internet Standards Track specification; it is published for informational purposes.

本文件不是互联网标准跟踪规范;它是为了提供信息而发布的。

This is a contribution to the RFC Series, independently of any other RFC stream. The RFC Editor has chosen to publish this document at its discretion and makes no statement about its value for implementation or deployment. Documents approved for publication by the RFC Editor are not a candidate for any level of Internet Standard; see Section 2 of RFC 5741.

这是对RFC系列的贡献,独立于任何其他RFC流。RFC编辑器已选择自行发布此文档,并且未声明其对实现或部署的价值。RFC编辑批准发布的文件不适用于任何级别的互联网标准;见RFC 5741第2节。

Information about the current status of this document, any errata, and how to provide feedback on it may be obtained at http://www.rfc-editor.org/info/rfc5564.

有关本文件当前状态、任何勘误表以及如何提供反馈的信息,请访问http://www.rfc-editor.org/info/rfc5564.

Copyright Notice

版权公告

Copyright (c) 2010 IETF Trust and the persons identified as the document authors. All rights reserved.

版权所有(c)2010 IETF信托基金和确定为文件作者的人员。版权所有。

This document is subject to BCP 78 and the IETF Trust's Legal Provisions Relating to IETF Documents (http://trustee.ietf.org/license-info) in effect on the date of publication of this document. Please review these documents carefully, as they describe your rights and restrictions with respect to this document.

本文件受BCP 78和IETF信托有关IETF文件的法律规定的约束(http://trustee.ietf.org/license-info)自本文件出版之日起生效。请仔细阅读这些文件,因为它们描述了您对本文件的权利和限制。

This document may not be modified, and derivative works of it may not be created, except to format it for publication as an RFC or to translate it into languages other than English.

不得修改本文件,也不得创建其衍生作品,除非将其格式化为RFC出版或将其翻译为英语以外的其他语言。

Table of Contents

目录

   1. Introduction ....................................................2
   2. Arabic Language-Specific Issues .................................3
      2.1. Linguistic Issues ..........................................4
           2.1.1. Diacritics (Tashkeel) and Shadda ....................4
           2.1.2. Kasheeda or Tatweel (Horizontal Character
                  Size Extension) .....................................5
           2.1.3. Character Folding ...................................5
      2.2. Supported Character Set ....................................6
      2.3. Arabic Linguistic Issues Affected by Technical
           Constraints ................................................8
           2.3.1. Numerals ............................................8
           2.3.2. The Space Character .................................8
   3. Summary and Conclusion ..........................................8
   4. Security Considerations .........................................9
   5. Acknowledgments .................................................9
   6. References ......................................................9
      6.1. Normative References .......................................9
      6.2. Informative References .....................................9
        
   1. Introduction ....................................................2
   2. Arabic Language-Specific Issues .................................3
      2.1. Linguistic Issues ..........................................4
           2.1.1. Diacritics (Tashkeel) and Shadda ....................4
           2.1.2. Kasheeda or Tatweel (Horizontal Character
                  Size Extension) .....................................5
           2.1.3. Character Folding ...................................5
      2.2. Supported Character Set ....................................6
      2.3. Arabic Linguistic Issues Affected by Technical
           Constraints ................................................8
           2.3.1. Numerals ............................................8
           2.3.2. The Space Character .................................8
   3. Summary and Conclusion ..........................................8
   4. Security Considerations .........................................9
   5. Acknowledgments .................................................9
   6. References ......................................................9
      6.1. Normative References .......................................9
      6.2. Informative References .....................................9
        
1. Introduction
1. 介绍

The Internet Engineering Task Force (IETF) issued in March 2003 a set of RFCs for Internationalized Domain Names (IDN) ([1], [2], and [3]), which were planned to become the de facto standard for all languages. In 2007 and 2008, the following working drafts were released that propose revisions to the IDNA protocol:

互联网工程任务组(IETF)于2003年3月发布了一套国际域名(IDN)([1]、[2]和[3])的RFC,计划成为所有语言的事实标准。2007年和2008年,发布了以下工作草案,对IDNA议定书提出了修订:

o Internationalized Domain Names for Applications (IDNA): Background, Explanation, and Rationale [5]

o 应用程序国际化域名(IDNA):背景、解释和基本原理[5]

o Internationalized Domain Names in Applications (IDNA): Protocol [6]

o 应用程序中的国际化域名(IDNA):协议[6]

o An updated IDNA criterion for right-to-left scripts [7]

o 用于从右向左脚本的更新IDNA标准[7]

o The Unicode code points and IDNA [8]

o Unicode代码点和IDNA[8]

These documents are known collectively as "IDNA2008".

这些文件统称为“IDNA2008”。

This document constitutes a technical specification for the implementation of the IDN standards in the case of the Arabic language. It will allow the use of standard language tables to write domain names in Arabic characters. Therefore, it should be considered as a logical extension to the IDN standards. It thus presents guidelines for the proper use of Arabic characters with the IDN standards in an Arabic language context.

本文件构成了以阿拉伯语实施IDN标准的技术规范。它将允许使用标准语言表格以阿拉伯语字符书写域名。因此,应将其视为IDN标准的逻辑扩展。因此,它提出了在阿拉伯语环境中正确使用符合IDN标准的阿拉伯语字符的准则。

This document reflects the recommendations of the Arab Working Group on Arabic Domain Names (AWG-ADN), established by the League of Arab States (LAS), based on standardisation efforts of the United Nations Economic and Social Commission for Western Asia (UN-ESCWA) and on that group's document, "Guidelines for an Arabic Internet Domain Name" [9]. This document is also in full harmony with recent rigorous discussions that took place within the major language communities that use the Arabic script in their languages.

本文件反映了阿拉伯国家联盟(阿盟)根据联合国西亚经济社会委员会(联合国-西亚经社会)的标准化工作以及该工作组的文件“阿拉伯互联网域名指南”建立的阿拉伯域名工作组(AWG-ADN)的建议[9]。本文件还与最近在使用阿拉伯文的主要语言社区内进行的严格讨论完全一致。

This document provides guidelines for the ways Arabic characters may be used for registering Internet domain names and how linguistic-specific issues should be handled. A few rules are recommended for application at the protocol level.

本文件提供了有关如何使用阿拉伯语字符注册互联网域名以及如何处理语言特定问题的指南。建议在协议级别应用一些规则。

The key words "MUST", "REQUIRED", "SHOULD", "RECOMMENDED", and "MAY" in this document are to be interpreted as described in RFC 2119 [4].

本文件中的关键词“必须”、“要求”、“应该”、“建议”和“可能”应按照RFC 2119[4]中的说明进行解释。

Comments on this document are solicited and should be addressed to the working group's mailing list at ESCWA-ICTD@un.org and/or the author(s).

已征求对本文件的意见,并应将其发送至西亚经社会工作组的邮寄名单-ICTD@un.org和/或作者。

2. Arabic Language-Specific Issues
2. 阿拉伯文特定问题

The main objective of the creation of Arabic domain names is to have a vehicle to increase Internet use amongst all strata of the Arabic-speaking communities.

创建阿拉伯语域名的主要目标是使阿拉伯语社区的所有阶层都能更多地使用互联网。

Furthermore, a non-user-friendly domain name would further add to the ambiguity and the eccentricity of the Internet to the Arabic-speaking communities, thus contributing negatively to the spread of the

此外,非用户友好的域名将进一步增加阿拉伯语社区对互联网的模糊性和偏心性,从而对互联网的传播产生负面影响

Internet and leading to further isolation of these communities at the global level.

互联网,导致这些社区在全球范围内进一步孤立。

Hence, there have been intensive efforts (especially those spearheaded by Dr. Al-Zoman and contributed to by UN-ESCWA and its Arabic Domain Names Task Force (ADN-TF)) to reach consensus on a multitude of linguistic issues with the following goals:

因此,已经进行了大量努力(特别是由Al-Zoman博士牵头并由联合国-西亚经社会及其阿拉伯域名工作队(ADN-TF)提供帮助的努力),以就许多语言问题达成共识,目标如下:

o To define the accepted Arabic character set to be used for writing domain names in Arabic, which is the subject of this document.

o 定义用于以阿拉伯语编写域名的公认阿拉伯语字符集,这是本文档的主题。

o To define the top-level domains of the Arabic domain name tree structure (i.e., Arabic gTLDs and ccTLDs). This goal will be handled in a separate document.

o 定义阿拉伯语域名树结构的顶级域(即阿拉伯语gtld和cctld)。这一目标将在单独的文件中处理。

The first meeting of the AWG-ADN, held in Damascus from January-February 2005, gave special attention to the following:

2005年1月至2月在大马士革举行的特设工作组第一次会议特别注意以下事项:

o Simplification of the domain names, whenever possible, to facilitate the interaction of the Arabic user with the Internet.

o 尽可能简化域名,以方便阿拉伯语用户与互联网的互动。

o Adoption of solutions that do not lead to confusion either in reading or in writing, provided that this does not compromise the linguistic correctness of used words.

o 采用不会导致阅读或写作混乱的解决方案,前提是这不会影响所用词语的语言正确性。

o Mixing Arabic and non-Arabic letters in the domain name label is not acceptable.

o 在域名标签中混合使用阿拉伯语和非阿拉伯语字母是不可接受的。

2.1. Linguistic Issues
2.1. 语言问题

There are a number of linguistic issues that have been proposed with respect to the use of the Arabic language in domain names. This section will highlight some of them. This section is based on the papers of Dr. Al-Zoman ([10] and [11]) and on the report of the first meeting of AWG-ADN [12]. For details, the reader is encouraged to review these references.

关于在域名中使用阿拉伯语,已经提出了一些语言学问题。本节将重点介绍其中一些。本节以Al Zoman博士([10]和[11])的论文以及AWG-ADN第一次会议的报告[12]为基础。有关详细信息,鼓励读者查阅这些参考资料。

2.1.1. Diacritics (Tashkeel) and Shadda
2.1.1. 变音符号(塔什干)和沙达

Tashkeel and Shadda are accent marks placed above or below Arabic letters to produce proper pronunciation. They are thus used to differentiate different meanings for different words with the same base characters.

塔什基尔和沙达是放在阿拉伯字母上方或下方的重音符号,以产生正确的发音。因此,它们用于区分具有相同基本字符的不同单词的不同含义。

Neither Tashkeel nor Shadda are permitted in zone files when registering domain names in the Arabic language, although they are permitted in the current edition of IDNA2008. They can be supported

在以阿拉伯语注册域名时,Tashkeel和Shadda都不允许在区域文件中使用,尽管当前版本的IDNA2008允许使用它们。他们可以得到支持

or ignored, if necessary, in the user interface with local mappings and can be stripped before IDNA processing.

或者在用户界面中忽略本地映射(如有必要),并且可以在IDNA处理之前剥离。

The following are their Unicode presentations:

以下是他们的Unicode演示文稿:

U+064B ARABIC FATHATAN U+064C ARABIC DAMMATAN U+064D ARABIC KASRATAN U+064E ARABIC FATHA U+064F ARABIC DAMMA U+0650 ARABIC KASRA U+0651 ARABIC SHADDA U+0652 ARABIC SUKUN

U+064B阿拉伯语字母U+064C阿拉伯语字母U+064D阿拉伯语字母U+064E阿拉伯语字母U+064F阿拉伯语字母U+0650阿拉伯语字母U+0651阿拉伯语字母U+0652阿拉伯语字母U+0652

2.1.2. Kasheeda or Tatweel (Horizontal Character Size Extension)
2.1.2. Kasheeda或Tatweel(水平字符大小扩展)

Kasheeda (U+0640 ARABIC TATWEEL) must not be used in Arabic domain names and should be disallowed for Arabic language domain names. The Kasheeda is not a letter and does not have an effect on pronunciation. It is used to extend the horizontal length or change the shape of the preceding letter for graphical representation purposes in Arabic writing. Accordingly, it has no value for the writing of domain names. The same applies to all languages using the Arabic script. The authors recommend that it should be disallowed at the protocol level.

Kasheeda(U+0640 ARABIC TATWEEL)不得用于阿拉伯语域名,也不得用于阿拉伯语域名。Kasheeda不是一个字母,对发音没有影响。它用于延长水平长度或更改前面字母的形状,以便在阿拉伯文书写中进行图形表示。因此,它对域名的书写没有任何价值。这同样适用于所有使用阿拉伯文的语言。作者建议在协议层面上不允许这样做。

2.1.3. Character Folding
2.1.3. 字符折叠

Character folding is the process where multiple letters (that may have some similarity with respect to their shapes) are folded into one shape. Examples of such Arabic characters include:

字符折叠是将多个字母(在形状上可能有一些相似之处)折叠成一个形状的过程。此类阿拉伯字符的示例包括:

o Folding Teh Marbuta (U+0629) and Heh (U+0647) at the end of a word

o 在一个单词的末尾折叠Marbuta(U+0629)和Heh(U+0647)

o Folding different forms of Hamzah (U+0622, U+0623, U+0625, U+0627)

o 折叠不同形式的哈姆扎赫(U+0622、U+0623、U+0625、U+0627)

o Folding Alef Maksura (U+0649) and Yeh (U+064A) at the end of a word

o 在单词末尾折叠Alef Maksura(U+0649)和Yeh(U+064A)

o Folding Waw with Hamzah Above (U+0624) and Waw (U+0648)

o 上面有Hamzah(U+0624)和Waw(U+0648)的折叠Waw

With respect to the Arabic language, character folding is not acceptable because it changes the meaning of words and is against the principle of spelling rules. Replacing a character valid for use in domain names with another character also valid for use in domain names, which may have a similar shape, will give a different meaning. This will lead to only one word representing several words consisting

就阿拉伯语而言,字符折叠是不可接受的,因为它改变了单词的含义,违反了拼写规则的原则。将一个在域名中有效的字符替换为另一个在域名中也有效的字符(可能具有类似的形状),将给出不同的含义。这将导致只有一个单词代表多个单词

of all the combinations of folded characters. Hence, the other words will be masked by a single word [10].

所有折叠字符的组合。因此,其他单词将被单个单词所掩盖[10]。

Mis-spelling or handwriting errors do occur, leading to mixing different characters despite the fact that this is not the case in published and printed materials. One of the motivations of this effort is to preserve the language, particularly with the spread of the globalization movement. Within this context, character folding is working against this motivation since it is going to have a negative effect on the principle and ethics of the language. Technology should work to preserve the language and not to destroy it. Thus, character folding should not be allowed. The case of digits is treated in a separate section below.

拼写错误或书写错误确实会发生,导致不同字符的混合,尽管在出版和印刷材料中并非如此。这一努力的动机之一是保护语言,特别是随着全球化运动的蔓延。在这种情况下,字符折叠与这种动机背道而驰,因为它将对语言的原则和道德产生负面影响。技术应该致力于保护语言,而不是破坏它。因此,不应允许字符折叠。数字的情况在下面单独的一节中处理。

2.2. Supported Character Set
2.2. 支持的字符集

A domain name to be written in Arabic must be composed of a sequence of the following UNICODE characters and the FULL STOP (u+002E) to separate the labels. These are based on UNICODE version 5.0. The tables below are constructed using an inclusion-based approach. Thus, characters that are not part of these tables are prohibited.

用阿拉伯语书写的域名必须由以下UNICODE字符序列和句号(u+002E)组成,以分隔标签。它们基于UNICODE 5.0版。下表采用基于包含的方法构建。因此,不属于这些表的字符是被禁止的。

             +---------+-------------------------------------+
             | Unicode | Character Name                      |
             +---------+-------------------------------------+
             | 0621    | ARABIC LETTER HAMZA                 |
             | 0622    | ARABIC LETTER ALEF WITH MADDA ABOVE |
             | 0623    | ARABIC LETTER ALEF WITH HAMZA ABOVE |
             | 0624    | ARABIC LETTER WAW WITH HAMZA ABOVE  |
             | 0625    | ARABIC LETTER ALEF WITH HAMZA BELOW |
             | 0626    | ARABIC LETTER YEH WITH HAMZA ABOVE  |
             | 0627    | ARABIC LETTER ALEF                  |
             | 0628    | ARABIC LETTER BEH                   |
             | 0629    | ARABIC LETTER TEH MARBUTA           |
             | 062A    | ARABIC LETTER TEH                   |
             | 062B    | ARABIC LETTER THEH                  |
             | 062C    | ARABIC LETTER JEEM                  |
             | 062D    | ARABIC LETTER HAH                   |
             | 062E    | ARABIC LETTER KHAH                  |
             | 062F    | ARABIC LETTER DAL                   |
             | 0630    | ARABIC LETTER THAL                  |
             | 0631    | ARABIC LETTER REH                   |
             | 0632    | ARABIC LETTER ZAIN                  |
             | 0633    | ARABIC LETTER SEEN                  |
             | 0634    | ARABIC LETTER SHEEN                 |
             | 0635    | ARABIC LETTER SAD                   |
             | 0636    | ARABIC LETTER DAD                   |
             | 0637    | ARABIC LETTER TAH                   |
        
             +---------+-------------------------------------+
             | Unicode | Character Name                      |
             +---------+-------------------------------------+
             | 0621    | ARABIC LETTER HAMZA                 |
             | 0622    | ARABIC LETTER ALEF WITH MADDA ABOVE |
             | 0623    | ARABIC LETTER ALEF WITH HAMZA ABOVE |
             | 0624    | ARABIC LETTER WAW WITH HAMZA ABOVE  |
             | 0625    | ARABIC LETTER ALEF WITH HAMZA BELOW |
             | 0626    | ARABIC LETTER YEH WITH HAMZA ABOVE  |
             | 0627    | ARABIC LETTER ALEF                  |
             | 0628    | ARABIC LETTER BEH                   |
             | 0629    | ARABIC LETTER TEH MARBUTA           |
             | 062A    | ARABIC LETTER TEH                   |
             | 062B    | ARABIC LETTER THEH                  |
             | 062C    | ARABIC LETTER JEEM                  |
             | 062D    | ARABIC LETTER HAH                   |
             | 062E    | ARABIC LETTER KHAH                  |
             | 062F    | ARABIC LETTER DAL                   |
             | 0630    | ARABIC LETTER THAL                  |
             | 0631    | ARABIC LETTER REH                   |
             | 0632    | ARABIC LETTER ZAIN                  |
             | 0633    | ARABIC LETTER SEEN                  |
             | 0634    | ARABIC LETTER SHEEN                 |
             | 0635    | ARABIC LETTER SAD                   |
             | 0636    | ARABIC LETTER DAD                   |
             | 0637    | ARABIC LETTER TAH                   |
        
             | 0638    | ARABIC LETTER ZAH                   |
             | 0639    | ARABIC LETTER AIN                   |
             | 063A    | ARABIC LETTER GHAIN                 |
             | 0641    | ARABIC LETTER FEH                   |
             | 0642    | ARABIC LETTER QAF                   |
             | 0643    | ARABIC LETTER KAF                   |
             | 0644    | ARABIC LETTER LAM                   |
             | 0645    | ARABIC LETTER MEEM                  |
             | 0646    | ARABIC LETTER NOON                  |
             | 0647    | ARABIC LETTER HEH                   |
             | 0648    | ARABIC LETTER WAW                   |
             | 0649    | ARABIC LETTER ALEF MAKSURA          |
             | 064A    | ARABIC LETTER YEH                   |
             | 0660    | ARABIC-INDIC DIGIT ZERO             |
             | 0661    | ARABIC-INDIC DIGIT ONE              |
             | 0662    | ARABIC-INDIC DIGIT TWO              |
             | 0663    | ARABIC-INDIC DIGIT THREE            |
             | 0664    | ARABIC-INDIC DIGIT FOUR             |
             | 0665    | ARABIC-INDIC DIGIT FIVE             |
             | 0666    | ARABIC-INDIC DIGIT SIX              |
             | 0667    | ARABIC-INDIC DIGIT SEVEN            |
             | 0668    | ARABIC-INDIC DIGIT EIGHT            |
             | 0669    | ARABIC-INDIC DIGIT NINE             |
             +---------+-------------------------------------+
        
             | 0638    | ARABIC LETTER ZAH                   |
             | 0639    | ARABIC LETTER AIN                   |
             | 063A    | ARABIC LETTER GHAIN                 |
             | 0641    | ARABIC LETTER FEH                   |
             | 0642    | ARABIC LETTER QAF                   |
             | 0643    | ARABIC LETTER KAF                   |
             | 0644    | ARABIC LETTER LAM                   |
             | 0645    | ARABIC LETTER MEEM                  |
             | 0646    | ARABIC LETTER NOON                  |
             | 0647    | ARABIC LETTER HEH                   |
             | 0648    | ARABIC LETTER WAW                   |
             | 0649    | ARABIC LETTER ALEF MAKSURA          |
             | 064A    | ARABIC LETTER YEH                   |
             | 0660    | ARABIC-INDIC DIGIT ZERO             |
             | 0661    | ARABIC-INDIC DIGIT ONE              |
             | 0662    | ARABIC-INDIC DIGIT TWO              |
             | 0663    | ARABIC-INDIC DIGIT THREE            |
             | 0664    | ARABIC-INDIC DIGIT FOUR             |
             | 0665    | ARABIC-INDIC DIGIT FIVE             |
             | 0666    | ARABIC-INDIC DIGIT SIX              |
             | 0667    | ARABIC-INDIC DIGIT SEVEN            |
             | 0668    | ARABIC-INDIC DIGIT EIGHT            |
             | 0669    | ARABIC-INDIC DIGIT NINE             |
             +---------+-------------------------------------+
        

Source: Supporting the Arabic Language in Domain Names [10] Table 1: CHARACTERS FROM UNICODE ARABIC TABLE (0600-06FF)

来源:支持域名中的阿拉伯语[10]表1:UNICODE阿拉伯语表中的字符(0600-06FF)

                       +---------+-----------------+
                       | Unicode | Digit Name      |
                       +---------+-----------------+
                       | 0030    | DIGIT ZERO      |
                       | 0031    | DIGIT ONE       |
                       | 0032    | DIGIT TWO       |
                       | 0033    | DIGIT THREE     |
                       | 0034    | DIGIT FOUR      |
                       | 0035    | DIGIT FIVE      |
                       | 0036    | DIGIT SIX       |
                       | 0037    | DIGIT SEVEN     |
                       | 0038    | DIGIT EIGHT     |
                       | 0039    | DIGIT NINE      |
                       | 002D    | HYPHEN-MINUS    |
                       +---------+-----------------+
        
                       +---------+-----------------+
                       | Unicode | Digit Name      |
                       +---------+-----------------+
                       | 0030    | DIGIT ZERO      |
                       | 0031    | DIGIT ONE       |
                       | 0032    | DIGIT TWO       |
                       | 0033    | DIGIT THREE     |
                       | 0034    | DIGIT FOUR      |
                       | 0035    | DIGIT FIVE      |
                       | 0036    | DIGIT SIX       |
                       | 0037    | DIGIT SEVEN     |
                       | 0038    | DIGIT EIGHT     |
                       | 0039    | DIGIT NINE      |
                       | 002D    | HYPHEN-MINUS    |
                       +---------+-----------------+
        

Source: Supporting the Arabic Language in Domain Names [10] Table 2: CHARACTERS FROM UNICODE BASIC LATIN TABLE (0000-007F)

来源:支持域名中的阿拉伯语[10]表2:UNICODE基本拉丁表(0000-007F)中的字符

2.3. Arabic Linguistic Issues Affected by Technical Constraints
2.3. 受技术限制影响的阿拉伯语语言问题

In this section, technical aspects of some linguistic issues are discussed.

本节讨论了一些语言问题的技术方面。

2.3.1. Numerals
2.3.1. 数字

In the Arab countries, there are two sets of numerical digits used:

在阿拉伯国家,使用了两组数字:

o Set I: (0, 1, 2, 3, 4, 5, 6, 7, 8, 9) mostly used in the western part of the Arab world.

o 第一组:(0,1,2,3,4,5,6,7,8,9)主要用于阿拉伯世界的西部。

o Set II: (u+0660, u+0661, u+0662, u+0663, u+0664, u+0665, u+0666, u+0667, u+0668, u+0669) mostly used in the eastern part of the Arab world.

o 第二组:(u+0660、u+0661、u+0662、u+0663、u+0664、u+0665、u+0666、u+0667、u+0668、u+0669)主要用于阿拉伯世界东部。

Both sets may be supported in the user interface; however, the rule of numeral homogeneity must be observed. The rule specifies that digits from the Arabic-Indic set of numerals (u+0660 to u+0669) should not be allowed to mix with ASCII digits (u+0030 to u+0039) within the same Arabic domain name label. Thus, the appearance of a digit from one set prevents the use of any other digit from the other set.

用户界面中可能支持这两种设置;然而,必须遵守数字同质性规则。该规则规定,阿拉伯-印度数字集(u+0660至u+0669)中的数字不得与同一阿拉伯域名标签内的ASCII数字(u+0030至u+0039)混合。因此,一个集合中的一个数字的出现可以防止使用另一个集合中的任何其他数字。

2.3.2. The Space Character
2.3.2. 空间特征

The space character is strictly disallowed in domain names, as it is a control character. Instead, the hyphen (Al-sharta, i.e., u+02D) is proposed as a separator between Arabic words to avoid confusion that can take place if the words are typed without a separator.

域名中严格禁止使用空格字符,因为它是一个控制字符。相反,建议使用连字符(Al-sharta,即u+02D)作为阿拉伯语单词之间的分隔符,以避免在没有分隔符的情况下键入单词时可能出现的混淆。

It is acceptable to use the hyphen to separate between words within the same domain name label.

可以使用连字符分隔同一域名标签内的单词。

3. Summary and Conclusion
3. 总结与结论

The proposed guidelines are in full accordance with the IETF IDN standards and take into account Arabic-language-specific issues within a compromise between grammatical rules of the Arabic language and ease of use of that language on the Internet.

建议的指南完全符合IETF IDN标准,并在阿拉伯语语法规则和该语言在互联网上的易用性之间的折衷中考虑了阿拉伯语的具体问题。

In summary, the guidelines specify that, in Arabic domain names:

总之,准则规定,在阿拉伯语域名中:

o Accent marks (Tashkeel and Shadda) are not permitted.

o 不允许使用重音符号(塔什基尔和沙达)。

o Character folding is not permitted.

o 不允许字符折叠。

o If a numeral from the Arabic-Indic or ASCII digit sets appears in a label, numeral homogeneity is required.

o 如果标签中出现阿拉伯-印度或ASCII数字集的数字,则需要数字同质性。

o The hyphen must be used as a word separator instead of space.

o 连字符必须用作单词分隔符,而不是空格。

4. Security Considerations
4. 安全考虑

No particular security considerations could be identified regarding the use of Arabic characters in writing domain names. In particular, any potential visual confusion between different character strings is avoided using the guidelines proposed in this document.

关于在编写域名时使用阿拉伯字符,无法确定具体的安全考虑。特别是,使用本文档中提出的指南,可以避免不同字符串之间的任何潜在视觉混淆。

5. Acknowledgments
5. 致谢

ESCWA ICT Division provided support and funding for the development of this document with the objective of reaching a standard for comprehensive Arabic domain names. Thanks are due to SaudiNIC for its continuous efforts in supporting the development of Arabic domain names.

西亚经社会信息和通信技术司为编写本文件提供了支助和资金,目的是达到全面的阿拉伯域名标准。感谢SaudiNIC在支持阿拉伯域名发展方面的持续努力。

John Klensin and Harald Alvestrand reviewed the document and provided useful editorial and substantive support to enrich it.

John Klensin和Harald Alvestrand审查了该文件,并提供了有益的编辑和实质性支持,以充实该文件。

6. References
6. 工具书类
6.1. Normative References
6.1. 规范性引用文件

[1] Faltstrom, P., Hoffman, P., and A. Costello, "Internationalizing Domain Names in Applications (IDNA)", RFC 3490, March 2003.

[1] Faltstrom,P.,Hoffman,P.,和A.Costello,“应用程序中的域名国际化(IDNA)”,RFC 34902003年3月。

[2] Hoffman, P. and M. Blanchet, "Nameprep: A Stringprep Profile for Internationalized Domain Names (IDN)", RFC 3491, March 2003.

[2] Hoffman,P.和M.Blanchet,“Nameprep:国际化域名(IDN)的Stringprep配置文件”,RFC 34912003年3月。

[3] Costello, A., "Punycode: A Bootstring encoding of Unicode for Internationalized Domain Names in Applications (IDNA)", RFC 3492, March 2003.

[3] Costello,A.,“Punycode:应用程序中国际化域名的Unicode引导字符串编码(IDNA)”,RFC 3492,2003年3月。

[4] Bradner, S., "Key words for use in RFCs to Indicate Requirement Levels", BCP 14, RFC 2119, March 1997.

[4] Bradner,S.,“RFC中用于表示需求水平的关键词”,BCP 14,RFC 2119,1997年3月。

6.2. Informative References
6.2. 资料性引用

[5] Klensin, J., "Internationalized Domain Names for Applications (IDNA): Definitions, Background and Rationale", Work in Progress, September 2008.

[5] Klensin,J.,“应用程序的国际化域名(IDNA):定义、背景和基本原理”,正在进行的工作,2008年9月。

[6] Klensin, J., "Internationalized Domain Names in Applications (IDNA): Protocol", Work in Progress, September 2008.

[6] Klensin,J.,“应用程序中的国际化域名(IDNA):协议”,正在进行的工作,2008年9月。

[7] Alvestrand, H. and C. Karp, "An updated IDNA criterion for right-to-left scripts", Work in Progress, July 2008.

[7] Alvestrand,H.和C.Karp,“从右向左脚本的最新IDNA标准”,正在进行的工作,2008年7月。

[8] Faltstrom, P., "The Unicode Codepoints and IDNA", Work in Progress, July 2008.

[8] Faltstrom,P.,“Unicode码点和IDNA”,正在进行的工作,2008年7月。

[9] United Nations Economic and Social Commission for Western Asia (UN-ESCWA), "Guidelines for an Arabic Domain Name System (ADNS)", Work in Progress, November 2007.

[9] 联合国西亚经济社会委员会(西亚经社会),“阿拉伯域名系统(ADNS)指南”,正在进行的工作,2007年11月。

[10] Al-Zoman, A., "Supporting the Arabic Language in Domain Names", October 2003, <http://www.arabic-domains.org/docs/ NIC-docs/SupportingArabicDomainNmaes.pdf>.

[10] Al Zoman,A.,“在域名中支持阿拉伯语”,2003年10月<http://www.arabic-domains.org/docs/ NIC文档/supportingarbicdomainmaes.pdf>。

[11] Al-Zoman, A., "Arabic Top-Level Domains", Paper presented in Expert Group Meeting on Promotion of Digital Arabic Content, the United Nations, Economic and Social Commission for Western Asia, Beirut, June 2003.

[11] Al-Zoman,A.,“阿拉伯顶级域名”,在促进阿拉伯数字内容专家组会议上提交的论文,联合国西亚经济社会委员会,贝鲁特,2003年6月。

[12] League of Arab States, "Report of the first meeting of AWG-ADN, Damascus", February 2005, <http://www.arabic-domains.org/ar/intrnational-entites.php>.

[12] 阿拉伯国家联盟,“特设工作组第一次会议报告,大马士革”,2005年2月<http://www.arabic-domains.org/ar/intrnational-entites.php>.

Authors' Addresses

作者地址

Ayman El-Sherbiny Information and Communication Technology Division ESCWA UN-House P.O. Box 11-8575 Beirut Lebanon

艾曼·谢尔比尼信息和通信技术司西亚经社会联合国之家黎巴嫩贝鲁特邮政信箱11-8575

   EMail: El-sherbiny@un.org
        
   EMail: El-sherbiny@un.org
        

Mansour Farah Information and Communication Technology Division ESCWA UN-House P.O. Box 11-8575 Beirut Lebanon

Mansour Farah信息和通信技术司西亚经社会联合国之家黎巴嫩贝鲁特邮政信箱11-8575

   EMail: farah14@un.org
        
   EMail: farah14@un.org
        

Ibaa Oueichek Syrian Telecom Establishment Damascus Syria

Ibaa Oueichek叙利亚电信公司叙利亚大马士革

   EMail: oueichek@scs-net.org
        
   EMail: oueichek@scs-net.org
        

Abdulaziz H. Al-Zoman, PhD SaudiNIC, General Directorate of Internet Services IT Sector, CITC King Abdulaziz City for Science and Technology PO Box 6086 Riyadh 11442 Saudi Arabia

阿卜杜勒·阿齐兹·祖曼博士,沙特阿拉伯利雅得11442号邮政信箱6086,CITC阿卜杜勒·阿齐兹国王科技城互联网服务IT部门总理事会沙特迪尼博士

   EMail: azoman@citc.gov.sa
        
   EMail: azoman@citc.gov.sa