rfc9233xml2.original.xml   rfc9233.xml 
<?xml version="1.0" encoding="US-ASCII"?> <?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE RFC SYSTEM "rfc2629.dtd" [
<!ENTITY RFC3490 SYSTEM
"http://xml2rfc.ietf.org/public/rfc/bibxml/reference.RFC.3490.xml">
<!ENTITY RFC3454 SYSTEM
"http://xml2rfc.ietf.org/public/rfc/bibxml/reference.RFC.3454.xml">
<!ENTITY RFC3491 SYSTEM
"http://xml2rfc.ietf.org/public/rfc/bibxml/reference.RFC.3491.xml">
<!ENTITY RFC5890 SYSTEM
"http://xml2rfc.ietf.org/public/rfc/bibxml/reference.RFC.5890.xml">
<!ENTITY RFC5891 SYSTEM
"http://xml2rfc.ietf.org/public/rfc/bibxml/reference.RFC.5891.xml">
<!ENTITY RFC5892 SYSTEM
"http://xml2rfc.ietf.org/public/rfc/bibxml/reference.RFC.5892.xml">
<!ENTITY RFC5893 SYSTEM
"http://xml2rfc.ietf.org/public/rfc/bibxml/reference.RFC.5893.xml">
<!ENTITY RFC5894 SYSTEM
"http://xml2rfc.ietf.org/public/rfc/bibxml/reference.RFC.5894.xml">
<!ENTITY RFC5895 SYSTEM
"http://xml2rfc.ietf.org/public/rfc/bibxml/reference.RFC.5895.xml">
<!ENTITY RFC6452 SYSTEM
"http://xml2rfc.ietf.org/public/rfc/bibxml/reference.RFC.6452.xml">
<!ENTITY RFC6912 SYSTEM
"http://xml2rfc.ietf.org/public/rfc/bibxml/reference.RFC.6912.xml">
<!ENTITY RFC8753 SYSTEM
"http://xml2rfc.ietf.org/public/rfc/bibxml/reference.RFC.8753.xml">
<!ENTITY I-D.klensin-idna-5892upd-unicode70 SYSTEM
"http://xml2rfc.ietf.org/public/rfc/bibxml3/reference.I-D.draft-klensin-idna-5892upd-unicode70-05.xml">
]>
<?xml-stylesheet type='text/xsl' href='RFC2629.xslt' ?>
<?rfc compact="yes"?>
<?rfc toc="yes"?>
<?rfc symrefs="yes"?>
<?rfc sortrefs="no"?>
<!-- Expand crefs and put them inline -->
<?rfc comments='yes' ?>
<?rfc inline='yes' ?>
<rfc docName="draft-faltstrom-unicode12-07" ipr="trust200902" category="std"> <!DOCTYPE rfc [
<!ENTITY nbsp "&#160;">
<!ENTITY zwsp "&#8203;">
<!ENTITY nbhy "&#8209;">
<!ENTITY wj "&#8288;">
]>
<rfc xmlns:xi="http://www.w3.org/2001/XInclude" docName="draft-faltstrom-unicode12-07" number="9233" ipr="trust200902" obsoletes="" updates="" submissionType="IETF" category="std" consensus="true" xml:lang="en" tocInclude="true" symRefs="true" sortRefs="false" version="3">
<front> <front>
<title abbrev="IDNA2008 and Unicode 12"> <title abbrev="IDNA2008 and Unicode 12">
IDNA2008 and Unicode 12.0.0 Internationalized Domain Names for Applications 2008 (IDNA2008) and Unicode 12.0.0
</title> </title>
<author fullname="Patrik Faltstrom" initials="P." surname="Faltstrom"> <seriesInfo name="RFC" value="9233"/>
<author fullname="Patrik Fältström" initials="P." surname="Fältström">
<organization abbrev="Netnod">Netnod</organization> <organization abbrev="Netnod">Netnod</organization>
<address> <address>
<email>paf@netnod.se</email> <email>paf@netnod.se</email>
</address> </address>
</author> </author>
<date month="February" year="2022" day="13"/> <date month="March" year="2022"/>
<area>Operations</area>
<area>ART</area>
<keyword>IDN</keyword> <keyword>IDN</keyword>
<keyword>IDNA</keyword> <keyword>IDNA</keyword>
<keyword>IDNA2008</keyword> <keyword>IDNA2008</keyword>
<keyword>RFC</keyword>
<keyword>I-D</keyword>
<keyword>Internet-Draft</keyword>
<abstract> <abstract>
<t> <t>
This document describes the changes between Unicode 6.0.0 and This document describes the changes between Unicode 6.0.0 and
Unicode 12.0.0 in the context of IDNA2008. Some additions and Unicode 12.0.0 in the context of the current version of
Internationalized Domain Names for Applications 2008 (IDNA2008).
Some additions and
changes have been made in the Unicode Standard that affect the changes have been made in the Unicode Standard that affect the
values produced by the algorithm IDNA2008 specifies. IDNA2008 values produced by the algorithm IDNA2008 specifies. IDNA2008
allows adding exceptions to the algorithm for backward allows adding exceptions to the algorithm for backward
compatibility; however, this document does not add any such compatibility; however, this document does not add any such
exceptions. This document provides the necessary tables to exceptions. This document provides the necessary tables to
IANA to make its database consistent with Unicode 12.0.0. IANA to make its database consistent with Unicode 12.0.0.
</t> </t>
<t> <t>
To improve understanding, this document describes systems that To improve understanding, this document describes systems that
are being used as alternatives to those that conform to are being used as alternatives to those that conform to
IDNA2008. IDNA2008.
</t> </t>
<t>
TO BE REMOVED AT TIME OF PUBLICATION AS AN RFC:
</t>
<t>
This document is discussed on the i18n-discuss@ietf.org mailing list
of the IETF.
</t>
</abstract> </abstract>
</front> </front>
<middle> <middle>
<section anchor="intro" title="Introduction"> <section anchor="intro" numbered="true" toc="default">
<name>Introduction</name>
<t> <t>
The current version of Internationalized Domain Names for The current version of Internationalized Domain Names for
Applications (IDNA) was initiated in 2008, and despite not Applications (IDNA) was initiated in 2008, and despite not
being completed until 2010, is widely known as "IDNA2008". It being completed until 2010, is widely known as "IDNA2008". It
is specified in the series of documents listed in is specified in the series of documents listed in
<xref target="idna2008"/>. The IDNA2008 standard includes an <xref target="idna2008" format="default"/>. The IDNA2008 standard includes an
algorithm by which a derived property value is calculated algorithm by which a derived property value is calculated
based on the properties defined from the Unicode Standard. based on the properties defined in the Unicode Standard.
</t> </t>
<t> <t>
The derived property values that can be calculated are defined The derived property values that can be calculated are defined
in <xref target="RFC5892">RFC 5892</xref>. Below is a summary in <xref target="RFC5892" format="default">RFC 5892</xref>. Below is a summary
to aid in the reading of this document. For definition of the to aid in the reading of this document. For definition of the
terms, please see <xref target="RFC5892">RFC 5892</xref>. terms, please see <xref target="RFC5892" format="default">RFC 5892</xref>.
<list style="symbols">
<t> </t>
PROTOCOL VALID: Those that are allowed to be used in IDNs. <dl>
<dt>
PROTOCOL VALID:
</dt>
<dd>
Those that are allowed to be used in IDNs.
Code points with this property value are permitted for Code points with this property value are permitted for
general use in IDNs. However, that a label consists only general use in IDNs. However, the fact that a label consists only
of code points that have this property value does not of code points with this property value does not
imply that the label can be used in DNS. The abbreviated imply that the label can be used in DNS. The abbreviated
term PVALID is used to refer to this value. term PVALID is used to refer to this value.
</t> </dd>
<t> <dt>
CONTEXTUAL RULE REQUIRED: Some characteristics of the CONTEXTUAL RULE REQUIRED:
</dt>
<dd>
Some characteristics of the
character, such as it being invisible in certain contexts character, such as it being invisible in certain contexts
or problematic in others, require that it not be used in or problematic in others, require that it not be used in
labels unless specific other characters or properties are labels unless specific other characters or properties are
present. The abbreviated term CONTEXT is used to refer to present. The abbreviated term CONTEXT is used to refer to
this value. As explained in <xref target="RFC5892">RFC this value. As explained in <xref target="RFC5892" format="default">RFC
5892</xref> CONTEXT is in turn divided into CONTEXTJ and 5892</xref>, CONTEXT is in turn divided into CONTEXTJ and
CONTEXTO. CONTEXTO.
</t> </dd>
<t> <dt>
DISALLOWED: Those that should clearly not be included in DISALLOWED:
</dt>
<dd>
Those that should clearly not be included in
IDNs. Code points with this property value are not IDNs. Code points with this property value are not
permitted in IDNs. permitted in IDNs.
</t> </dd>
<t> <dt>
UNASSIGNED: Those code points that are not designated (i.e., are UNASSIGNED:
</dt>
<dd>
Those code points that are not designated (i.e., are
unassigned) in the Unicode Standard. unassigned) in the Unicode Standard.
</t> </dd>
</list> </dl>
</t>
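<t>
The four derived property values summarized above can be modeled as a small
enumeration. The following is a minimal sketch, not part of IDNA2008 itself:
the names DerivedProperty and permitted_in_idn, and the
context_rule_satisfied flag, are assumptions made for illustration only, and
the contextual rules themselves are not evaluated.
</t>

```python
from enum import Enum


class DerivedProperty(Enum):
    """IDNA2008 derived property values, named by their abbreviated terms."""

    PVALID = "PROTOCOL VALID"
    CONTEXTJ = "CONTEXTUAL RULE REQUIRED (join controls)"
    CONTEXTO = "CONTEXTUAL RULE REQUIRED (other)"
    DISALLOWED = "DISALLOWED"
    UNASSIGNED = "UNASSIGNED"


def permitted_in_idn(value, context_rule_satisfied=False):
    """Return True if a code point with this value may appear in an IDN label.

    context_rule_satisfied is a hypothetical flag standing in for the
    per-code-point contextual rule check; that check is not performed here.
    A True result does not imply the whole label can be used in the DNS.
    """
    if value is DerivedProperty.PVALID:
        return True
    if value in (DerivedProperty.CONTEXTJ, DerivedProperty.CONTEXTO):
        return context_rule_satisfied
    # DISALLOWED and UNASSIGNED code points are not permitted.
    return False
```

<t>
For example, permitted_in_idn(DerivedProperty.CONTEXTJ) is False unless the
caller states that the relevant contextual rule has been satisfied.
</t>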
<t> <t>
When the Unicode Standard is updated, new code points are assigned When the Unicode Standard is updated, new code points are assigned
and already-assigned code points can have their property values changed. and already assigned code points can have their property values changed.
<list style="symbols"> </t>
<t> <ul spacing="normal">
<li>
Assigning code points can create problems if the Assigning code points can create problems if the
newly-assigned code points are compositions of existing newly assigned code points are compositions of existing
code points and because of that the normalization code points and the normalization
relationships associated with those code points should relationships associated with those code points should
have been changed. have been changed because of that.
</t> </li>
<t> <li>
Changing properties for already-assigned code points can Changing properties for already assigned code points can
create problems if the property change results in changes create problems if the property change results in changes
to the derived property value. This might make an earlier to the derived property value. A previously allowed code point
allowed code point whose derived property value is PVALID whose derived property value is PVALID may now be prohibited
to then not be allowed anymore if its derived property if its derived property value changes to DISALLOWED. The
value changes to DISALLOWED. The problem can also happen problem can also happen the other way around: a code point that
the other way around: a code point that was not allowed was not allowed (and thus was prohibited) can suddenly be
(and thus is prohibited) can suddenly end up being allowed.
allowed. </li>
</t> <li>
<t>
Problems can also be created if the properties assigned to Problems can also be created if the properties assigned to
those code points are inconsistent with IDNA2008 those code points are inconsistent with IDNA2008
assumptions about how properties are assigned and/or about assumptions about how properties are assigned and/or about
how code points with those properties are used or behave. how code points with those properties are used or behave.
</t> </li>
</list> </ul>
</t>
<t> <t>
There were three incompatible changes in the Unicode standard There were three incompatible changes in the Unicode Standard
between <xref target="Unicode-5.2.0">Unicode 5.2.0</xref> and between <xref target="Unicode-5.2.0" format="default">Unicode 5.2.0</xref> and
<xref target="Unicode-6.0.0">Unicode 6.0.0</xref>; they are <xref target="Unicode-6.0.0" format="default">Unicode 6.0.0</xref>; they are
described in <xref target="RFC6452">RFC 6452</xref>. The code described in <xref target="RFC6452" format="default">RFC 6452</xref>. The code
points U+0CF1 and U+0CF2 had a derived property value change points U+0CF1 and U+0CF2 had a derived property value change
from DISALLOWED to PVALID, and the code point U+19DA had a from DISALLOWED to PVALID, and the code point U+19DA had a
change in derived property value from PVALID to change in derived property value from PVALID to
DISALLOWED. These changes were examined in great detail, but DISALLOWED. These changes were examined in great detail, but
the IETF concluded that these changes to the Unicode standard the IETF concluded that these changes to the Unicode Standard
did not warrant an update to <xref target="RFC5892">RFC did not warrant an update to <xref target="RFC5892" format="default">RFC
5892</xref>. 5892</xref>.
</t> </t>
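<t>
The three changes between Unicode 5.2.0 and 6.0.0 described above can be
told apart from ordinary new assignments with a simple comparison of old and
new derived property values. The sketch below is illustrative only: the
function name classify_change and its three labels are assumptions, not
terminology from RFC 5892 or RFC 6452.
</t>

```python
def classify_change(old, new):
    """Classify a change in derived property value between two Unicode versions."""
    if old == new:
        return "unchanged"
    if old == "UNASSIGNED":
        # A newly assigned code point receives its first real value.
        return "new assignment"
    # An already assigned code point changed value.
    return "incompatible"


# The three changes between Unicode 5.2.0 and 6.0.0 stated in the text.
observed = {
    0x0CF1: ("DISALLOWED", "PVALID"),
    0x0CF2: ("DISALLOWED", "PVALID"),
    0x19DA: ("PVALID", "DISALLOWED"),
}

incompatible = sorted(
    "U+%04X" % cp
    for cp, (old, new) in observed.items()
    if classify_change(old, new) == "incompatible"
)
```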
<t> <t>
As described in <xref target="changes"/>, more incompatible As described in <xref target="changes" format="default"/>, more incompatible
changes have been made to code points between Unicode 6.0.0 changes have been made to code points between Unicode 6.0.0
and <xref target="Unicode-12.0.0">Unicode 12.0.0</xref>; and <xref target="Unicode-12.0.0" format="default">Unicode 12.0.0</xref>;
however, the changes in the derived property values do not however, the changes in the derived property values do not
result in exceptions (as defined in section 2.6 of <xref result in exceptions (as defined in Section <xref target="RFC5892" sectionFormat="bare" section="2.6"/> of
target="RFC5892">RFC 5892</xref>) being added to <xref <xref target="RFC5892" format="default">RFC 5892</xref>) that would require an update to the
target="RFC5892">RFC 5892</xref>. "IDNA Contextual Rules" registry (which would also be considered an
update to <xref target="RFC5892" format="default">RFC 5892</xref>).
</t> </t>
<t> <t>
Further, in 2015, the Internet Architecture Board (IAB) issued Further, in 2015, the Internet Architecture Board (IAB) issued
<xref target="IAB2005-1">a statement</xref> that advised the <xref target="IAB2005-1" format="default">a statement</xref> that advised the
community to avoid using any of the potentially problematic community to avoid using any of the potentially problematic
code points and asked the IETF to resolve the issues related code points and asked the IETF to resolve the issues related
to the code point ARABIC LETTER BEH WITH HAMZA ABOVE (U+08A1) to the code point ARABIC LETTER BEH WITH HAMZA ABOVE (U+08A1)
that was introduced in <xref target="Unicode-7.0.0">Unicode that was introduced in <xref target="Unicode-7.0.0" format="default">Unicode
7.0.0</xref>. In February of that year, the statement was 7.0.0</xref>. In February of that year, the statement was
<xref target="IAB2005-2">revised</xref> to focus on the latter <xref target="IAB2005-2" format="default">revised</xref> to focus on the latter
request. More details about the problem of code point request. More details about the problem of code point
sequences not normalizing as one might expect appear in <xref sequences not normalizing as one might expect appear in <xref
target="IDNA7">a draft that was part of the target="I-D.klensin-idna-5892upd-unicode70" format="default">a draft that was part of the
discussion</xref>. discussion</xref>.
</t> </t>
<t> <t>
The result of the work in the IETF was that no exception was The result of the work in the IETF was that no exception was
added to <xref target="RFC5892">RFC 5892</xref>; however, it added to <xref target="RFC5892" format="default">RFC 5892</xref>; however, it
should be noted that the review of the issues around U+08A1 should be noted that the review of the issues around U+08A1
indicated that this code point is not an isolated case and indicated that this code point is not an isolated case and
that a number of long-standing PVALID code points may have that a number of long-standing PVALID code points may have
similar issues. While the affected code points remain PVALID similar issues. While the affected code points remain PVALID
in this document, identification of the problem resulted in a in this document, identification of the problem resulted in a
clarification of the review process for new Unicode clarification of the review process for new Unicode
versions. That clarification, which reinforces the original versions. That clarification, which reinforces the original
review plan to capture issues like these, was published as review plan to capture issues like these, was published as
<xref target="RFC8753">RFC 8753</xref>. Any review of Unicode <xref target="RFC8753" format="default">RFC 8753</xref>. Any review of Unicode
versions after 12.0.0 should be made according to <xref versions after 12.0.0 should be made according to <xref target="RFC8753"
target="RFC8753">RFC 8753</xref>; an objective of this format="default">RFC 8753</xref>; an objective of this
document is to ensure that a proper review of such versions document is to ensure that a proper review of such versions
after version 12.0.0 can be made. after version 12.0.0 can be made.
</t> </t>
</section> </section>
<section title="Background"> <section numbered="true" toc="default">
<section anchor="idna2008" title="IDNA2008 Documents"> <name>Background</name>
<t> <section anchor="idna2008" numbered="true" toc="default">
<name>IDNA2008 Documents</name>
<t>
IDNA2008 consists of the following documents. IDNA2008 consists of the following documents.
The documents in the set have informal names. The documents in the set have informal names.
<list style="symbols"> </t>
<t> <ul spacing="normal">
<xref target="RFC5890">Internationalized Domain Names <li>
<xref target="RFC5890" format="default">"Internationalized Domain Names
for Applications (IDNA): Definitions and Document for Applications (IDNA): Definitions and Document
Framework</xref>, informally called "Defs" or Framework"</xref>, informally called "Defs" or
"Definitions", contains definitions and other material "Definitions", contains definitions and other material
that are needed for understanding other documents in the that are needed for understanding other documents in the
set. set.
</t> </li>
<t> <li>
<xref target="RFC5891">Internationalized Domain Names in <xref target="RFC5891" format="default">"Internationalized Domain Names in
Applications (IDNA): Protocol</xref>, informally called Applications (IDNA): Protocol"</xref>, informally called
"Protocol", describes the core IDNA2008 protocol and its "Protocol", describes the core IDNA2008 protocol and its
operations. It needs to be interpreted in combination operations. It needs to be interpreted in combination
with the Bidi document (described below). with the Bidi document (described below).
</t> <xref target="RFC5891" format="default">RFC 5891</xref>
<t> obsoletes <xref target="RFC3491" format="default">RFC 3491</xref>
<xref target="RFC5892">The Unicode Code Points and and, in
particular, the use of the tables to
which <xref target="RFC3491" format="default">RFC 3491</xref> refers.
</li>
<li>
<xref target="RFC5892" format="default">"The Unicode Code Points and
Internationalized Domain Names for Applications Internationalized Domain Names for Applications
(IDNA)</xref>, informally called "Tables", lists the (IDNA)"</xref>, informally called "Tables", lists the
categories and rules that identify the code points categories and rules that identify the code points
allowed in a label written in native character form allowed in a label written in native character form
(called a "U-label"), and is based (called a "U-label"), and is based
on <xref target="Unicode-5.2.0">Unicode 5.2.0</xref> on <xref target="Unicode-5.2.0" format="default">Unicode 5.2.0</xref>
code point assignments and additional rules unique to code point assignments and additional rules unique to
IDNA2008. The Unicode-based rules in RFC 5892 are IDNA2008. The Unicode-based rules in RFC 5892 are
expected to be stable across Unicode updates and hence expected to be stable across Unicode updates and hence
independent of Unicode independent of Unicode
versions. <xref target="RFC5892">RFC 5892</xref> versions.
obsoletes <xref target="RFC3491">RFC 3491</xref>, and in </li>
particular the use of the tables to <li>
which <xref target="RFC3491">RFC 3491</xref> refers. <xref target="RFC5893" format="default">"Right-to-Left Scripts for
</t>
<t>
<xref target="RFC5893">Right-to-Left Scripts for
Internationalized Domain Names for Applications Internationalized Domain Names for Applications
(IDNA)</xref>, informally called "Bidi", specifies (IDNA)"</xref>, informally called "Bidi", specifies
special rules for labels that contain characters that special rules for labels that contain characters that
are written from right to left. are written from right to left.
</t> </li>
<t> <li>
<xref target="RFC5894">Internationalized Domain Names <xref target="RFC5894" format="default">"Internationalized Domain Names
for Applications (IDNA): Background, Explanation, and for Applications (IDNA): Background, Explanation, and
Rationale</xref>, informally called "Rationale", Rationale"</xref>, informally called "Rationale",
provides an overview of the protocol and associated provides an overview of the protocol and associated
tables, and gives explanatory material and some tables, and gives explanatory material and some
rationale for the decisions that led to IDNA2008. It rationale for the decisions that led to IDNA2008. It
also contains advice for DNS registry operators and also contains advice for DNS registry operators and
others who use Internationalized Domain Names (IDNs). others who use Internationalized Domain Names (IDNs).
</t> </li>
<t> <li>
<xref target="RFC5895">Mapping Characters for <xref target="RFC5895" format="default">"Mapping Characters for
Internationalized Domain Names in Applications (IDNA) Internationalized Domain Names in Applications (IDNA)
2008</xref>, informally called "Mapping", discusses the 2008"</xref>, informally called "Mapping", discusses the
issue of mapping characters into other characters and issue of mapping characters into other characters and
provides guidance for doing so when that is appropriate. provides guidance for doing so when that is appropriate.
RFC 5895 provides advice only and is not a required part RFC 5895 provides advice only and is not a required part
of IDNA. of IDNA.
</t> </li>
</list> </ul>
</t>
</section> </section>
<section title="Additional important IDNA2008-related documents"> <section numbered="true" toc="default">
<t> <name>Additional Important IDNA2008-Related Documents</name>
<t>
There are other documents important for the understanding There are other documents important for the understanding
and functioning of IDNA2008, for example this. and functioning of IDNA2008, for example this.
<list style="symbols"> </t>
<t> <ul spacing="normal">
<xref target="RFC6452">The Unicode Code Points and <li>
<xref target="RFC6452" format="default">"The Unicode Code Points and
Internationalized Domain Names for Applications (IDNA) - Internationalized Domain Names for Applications (IDNA) -
Unicode 6.0</xref> describes some changes made Unicode 6.0"</xref> describes some changes made
to <xref target="Unicode-6.0.0">Unicode 6.0.0</xref> to <xref target="Unicode-6.0.0" format="default">Unicode 6.0.0</xref>
that resulted in derived property value change for
the code points U+0CF1, U+0CF2 and U+19DA. U+0CF1 and that resulted in derived property value changes for
the code points U+0CF1, U+0CF2, and U+19DA. U+0CF1 and
U+0CF2 changed from DISALLOWED to PVALID, while U+19DA U+0CF2 changed from DISALLOWED to PVALID, while U+19DA
changed from PVALID to DISALLOWED. The IETF concluded changed from PVALID to DISALLOWED. The IETF concluded
that no update to <xref target="RFC5892">RFC 5892</xref> that no update to <xref target="RFC5892" format="default">RFC 5892</xref>
was needed based on the changes made was needed based on the changes made
in <xref target="Unicode-6.0.0">Unicode 6.0.0</xref>. As in <xref target="Unicode-6.0.0" format="default">Unicode 6.0.0</xref>. As
a result, the derived property value remained aligned a result, the derived property value remained aligned
with the Unicode Standard. Specifically, no exception with the Unicode Standard. Specifically, no exception
was added. was added.
</t> </li>
</list> </ul>
</t>
</section> </section>
<section anchor="deployment" title="Deployment"> <section anchor="deployment" numbered="true" toc="default">
<t> <name>Deployment</name>
<t>
There are many variations on the general IDNA model in use There are many variations on the general IDNA model in use
in the various parts of the community. The following lists in the various parts of the community. The following lists
some of the strategies that implementations that claim to be some of the strategies that implementations that claim to be
IDNA compliant are known to use, but it should be noted the IDNA compliant are known to use, but it should be noted the
list is not complete: list is not complete:
<list style="symbols"> </t>
<t> <ul spacing="normal">
IDNA2003 as specified in <xref target="RFC3490">RFC <li>
3490</xref> and <xref target="RFC3491">RFC 3491</xref>. IDNA2003 as specified in <xref target="RFC3490" format="default">RFC
Those specifications are dependent on case folding and 3490</xref> and <xref target="RFC3491" format="default">RFC 3491</xref>.
NFKC normalization and on tables that specify for each
Those specifications are dependent on case folding,
Normalization Form KC (NFKC), and on tables that specify for each
code point whether it is allowed to be used or not, with code point whether it is allowed to be used or not, with
a distinction made between use for "stored strings" and a distinction made between use for "stored strings" and
"query strings". The tables themselves are dependent on "query strings". The tables themselves are dependent on
<xref target="Unicode-3.2.0">Unicode 3.2</xref>. <xref target="Unicode-3.2.0" format="default">Unicode 3.2</xref>.
</t> </li>
<t> <li>
A number of variations on IDNA2003, sometimes presented A number of variations on IDNA2003, sometimes presented
as "updated IDNA2003" or the like, which follow the as "updated IDNA2003" or the like, which follow the
principles of IDNA2003 as understood by the implementers principles of IDNA2003 as understood by the implementers
but that use tables that represent how the implementers but that use tables that represent how the implementers
believe <xref target="RFC3454">Stringprep</xref> believe <xref target="RFC3454" format="default">Stringprep</xref>
and <xref target="RFC3491">Nameprep</xref> would have and <xref target="RFC3491" format="default">Nameprep</xref> would have
evolved had the IETF not moved in the direction of evolved had the IETF not moved in the direction of
IDNA2008 instead. IDNA2008 instead.
</t> </li>
<t> <li>
A mix between IDNA2003 and IDNA2008 where code points A mix between IDNA2003 and IDNA2008 where code points
assigned to Unicode assigned to Unicode
after <xref target="Unicode-3.2.0">Unicode 3.2.0</xref> after <xref target="Unicode-3.2.0" format="default">Unicode 3.2.0</xref>
have derived property value calculated according to the have derived property value calculated according to the
algorithm specified in IDNA2008. algorithm specified in IDNA2008.
</t> </li>
<t> <li>
A mix between IDNA2003 and IDNA2008 according to A mix between IDNA2003 and IDNA2008 according to
the <xref target="UTS-46">Unicode Technical Standard the <xref target="UTS-46" format="default">Unicode Technical Standard
#46</xref>. Because that document specifies different #46</xref>. Because that document specifies different
profiles, there are several variations that profiles, there are several variations that
leave users with no guarantee that two applications leave users with no guarantee that two applications
claiming conformance to UTS#46 will interoperate well claiming conformance to UTS#46 will interoperate well
with each other much less with conforming IDNA2008 with each other much less with conforming IDNA2008
implementations. UTS#46 is ultimately based on a implementations. UTS#46 is ultimately based on a
normative table very much like the one used by normative table very much like the one used by
<xref target="RFC3454">Stringprep</xref> but updated for <xref target="RFC3454" format="default">Stringprep</xref> but updated for
each new version of Unicode. each new version of Unicode.
</t> </li>
<t> <li>
The (normative) IDNA2008 algorithm applied to whatever The (normative) IDNA2008 algorithm applied to whatever
version of Unicode Standard exists in the operating version of Unicode Standard exists in the operating
system and/or libraries used, independent of whatever system and/or libraries used, independent of whatever
version of tables appears in the (non-normative) IANA version of tables appears in the (non-normative) IANA
database. database.
</t> </li>
</list> </ul>
<t>
In practice, the Unicode Consortium creates a maximum set of In practice, the Unicode Consortium creates a maximum set of
code points by assigning code points in the Unicode code points by assigning code points in the Unicode
Standard. The IDNA2008 rules use the Unicode Standard to Standard. The IDNA2008 rules use the Unicode Standard to
create a further subset of code points and context that are create a further subset of code points and context that are
permitted in DNS labels associated with its PVALID, and permitted in DNS labels associated with its PVALID and
CONTEXT (CONTEXTJ or CONTEXTO) derived property values. DNS CONTEXT (CONTEXTJ or CONTEXTO) derived property values. DNS
registries and other organizations that deal with IDNs are registries and other organizations that deal with IDNs are
supposed to create their own subsets from IDNA2008 for use supposed to create their own subsets from IDNA2008 for use
by those registries and organizations. by those registries and organizations.
</t> </t>
<t> <t>
This progressive subsetting and narrowing of the repertoire This progressive subsetting and narrowing of the repertoire
of code points that can be used in labels is an of code points that can be used in labels is an
implementation of the principles of being conservative when implementation of the principles of being conservative when
deciding what code points to include in such a subset. deciding what code points to include in such a subset.
<xref target="SAC-084">SAC-084</xref> <xref target="SAC-084" format="default">SAC-084</xref>
and <xref target="RFC6912">RFC 6912</xref> recommend to DNS and <xref target="RFC6912" format="default">RFC 6912</xref> recommend to DNS
registries and other organizations to be conservative when registries and other organizations to be conservative when
creating their subsets, and to use the principle of creating creating their subsets and to use the principle of creating
subsets by inclusion. subsets by inclusion.
</t> </t>
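<t>
The progressive subsetting described above, with a registry building its
repertoire by inclusion, can be sketched as follows. Everything here is
hypothetical sample data: IDNA2008_PVALID_SAMPLE is a tiny stand-in for the
real set of PVALID code points, and registry_repertoire is an invented
registry policy, not one recommended by SAC-084 or RFC 6912.
</t>

```python
# Hypothetical, heavily truncated stand-in for the set of PVALID code points.
IDNA2008_PVALID_SAMPLE = set(range(0x61, 0x7B)) | {0xE4, 0xE5, 0xE9, 0xF6}

# Inclusion: the registry starts from nothing and adds only what it supports.
registry_repertoire = set()
registry_repertoire.update(range(0x61, 0x7B))   # a-z
registry_repertoire.update({0xE4, 0xE5, 0xF6})  # U+00E4, U+00E5, U+00F6

# The registry subset must stay inside what IDNA2008 permits.
assert registry_repertoire.issubset(IDNA2008_PVALID_SAMPLE)


def label_within_repertoire(label, repertoire):
    """Return True if every code point of the label is in the repertoire."""
    return all(ord(ch) in repertoire for ch in label)
```

<t>
With this sample policy, a label using U+00E9 is rejected by the registry
even though that code point is in the larger sample set, because the
registry never included it.
</t>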
<t> <t>
See also the Security Considerations section in this See also <xref target="security">Security Considerations</xref> in this
document. document.
</t> </t>
</section> </section>
</section> </section>
<section anchor="changes" title="Notable Changes Between Unicode 6.0.0 and 12.0.0"> <section anchor="changes" numbered="true" toc="default">
<name>Notable Changes between Unicode 6.0.0 and 12.0.0</name>
<t> <t>
Among the changes between the Unicode versions, most code Among the changes between the Unicode versions, most code
points that change derived property value change from points that change derived property value change from
UNASSIGNED to PVALID or from UNASSIGNED to DISALLOWED. The UNASSIGNED to PVALID or from UNASSIGNED to DISALLOWED. The
interesting changes in derived property values include other interesting changes in derived property values include other
changes. All changes between the major versions of Unicode can be changes. All changes between the major versions of Unicode can be
found in <xref target="Appendix-6.0.0"/> (6.0.0-7.0.0), <xref found in <xref target="Appendix-6.0.0" format="default"/> (6.0.0-7.0.0),
target="Appendix-7.0.0"/> (7.0.0-8.0.0), <xref target="Appendix-8.0.0"/> <xref target="Appendix-7.0.0" format="default"/> (7.0.0-8.0.0), <xref target="Appendix-8.0.0" format="default"/>
(8.0.0-9.0.0), (8.0.0-9.0.0),
<xref target="Appendix-9.0.0"/> (9.0.0-10.0.0), <xref <xref target="Appendix-9.0.0" format="default"/> (9.0.0-10.0.0), <xref
target="Appendix-10.0.0"/> (10.0.0-11.0.0) and <xref target="Appendix-10.0.0" format="default"/> (10.0.0-11.0.0), and <xref
target="Appendix-11.0.0"/> (11.0.0-12.0.0). target="Appendix-11.0.0" format="default"/> (11.0.0-12.0.0).
</t> </t>
<section title="Changes between Unicode 6.0.0 and 7.0.0"> <section numbered="true" toc="default">
<t> <name>Changes between Unicode 6.0.0 and 7.0.0</name>
<list> <t>
<t>
Change in number of characters in each category: Change in number of characters in each category:
</t> </t>
<t> <ul spacing="normal">
<li>
PVALID changed from 97418 to 99867 (+2449) PVALID changed from 97418 to 99867 (+2449)
</t> </li>
<t> <li>
UNASSIGNED changed from 865081 to 861509 (-3572) UNASSIGNED changed from 865081 to 861509 (-3572)
</t> </li>
<t> <li>
CONTEXTJ did not change, at 2 CONTEXTJ did not change, at 2
</t> </li>
<t> <li>
CONTEXTO did not change, at 25 CONTEXTO did not change, at 25
</t> </li>
<t> <li>
DISALLOWED changed from 151586 to 152709 (+1123) DISALLOWED changed from 151586 to 152709 (+1123)
</t> </li>
<t> <li>
TOTAL did not change, at 1114112 TOTAL did not change, at 1114112
</t> </li>
<t> </ul>
<t>
There are no changes made to Unicode between version There are no changes made to Unicode between version
6.0.0 and 7.0.0 that impact IDNA2008 calculation of the 6.0.0 and 7.0.0 that impact IDNA2008 calculation of the
derived property values. derived property values.
</t> </t>
</list> <t>
</t>
<t>
The code points U+17B4 KHMER VOWEL INHERENT AQ and U+17B5 The code points U+17B4 KHMER VOWEL INHERENT AQ and U+17B5
KHMER VOWEL INHERENT AA both changed the general category KHMER VOWEL INHERENT AA both changed the General Category
from Cf (Format) to Mn (Nonspacing_Mark), but that did not from Cf (Format) to Mn (Nonspacing_Mark), but that did not
impact the calculation of the derived property value which impact the calculation of the derived property value which
stayed at DISALLOWED. stayed at DISALLOWED.
</t> </t>
<t> <t>
The character ARABIC LETTER BEH WITH HAMZA ABOVE (U+08A1) was The character ARABIC LETTER BEH WITH HAMZA ABOVE (U+08A1) was
introduced in Unicode 7.0.0. This was discussed extensively in the IETF introduced in Unicode 7.0.0. This was discussed extensively in the IETF
, and also by the IAB in <xref target="IAB2005-1" format="default">their
and by the IAB in <xref target="IAB2005-1">their
statement</xref> requesting the IETF to investigate the statement</xref> requesting the IETF to investigate the
issue. Specifically, the IAB stated: issue. Specifically, the IAB stated:
</t> </t>
<t> <blockquote>
<list>
<t>
On the same precautionary principle, the IAB recommends On the same precautionary principle, the IAB recommends
that the Internationalized Domain Names for Applications that the Internationalized Domain Names for Applications
(IDNA) Parameters registry (IDNA) Parameters registry
<https://www.iana.org/assignments/idna-tables/> not be <eref target="https://www.iana.org/assignments/idna-tables/" brackets="angle"/> not be
updated to Unicode 7.0.0 until the IETF has consensus on a updated to Unicode 7.0.0 until the IETF has consensus on a
solution to this problem. solution to this problem.
</t> </blockquote>
</list> <t>
</t>
<t>
The discussion in the IETF concluded that although it is The discussion in the IETF concluded that although it is
possible to create "the same" character in multiple ways, possible to create "the same" character in multiple ways,
the issue with U+08A1 is not unique. The character U+08A1 the issue with U+08A1 is not unique. The character U+08A1
(ARABIC LETTER BEH WITH HAMZA ABOVE) can be represented with (ARABIC LETTER BEH WITH HAMZA ABOVE) can be represented with
the sequence ARABIC LETTER BEH (U+0628) and ARABIC HAMZA the sequence ARABIC LETTER BEH (U+0628) and ARABIC HAMZA
ABOVE (U+0654). This identical to LATIN SMALL LETTER O WITH ABOVE (U+0654). This is identical to LATIN SMALL LETTER O WITH
STROKE (U+00F8), which can be represented with the sequence STROKE (U+00F8), which can be represented with the sequence
LATIN SMALL LETTER O (U+006F) followed by COMBINING SHORT LATIN SMALL LETTER O (U+006F) followed by COMBINING SHORT
SOLIDUS OVERLAY (U+0337). SOLIDUS OVERLAY (U+0337).
</t> </t>
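The point made above can be checked with a short script. This is a minimal sketch, assuming a Python runtime whose bundled Unicode data is version 7.0.0 or later; the helper name nfc_equivalent is hypothetical and not part of any specification. It illustrates that NFC does not unify either precomposed character with its combining sequence, because neither U+08A1 nor U+00F8 has a canonical decomposition.

```python
import unicodedata


def nfc_equivalent(a: str, b: str) -> bool:
    """Return True if the two strings are identical after NFC normalization."""
    return unicodedata.normalize("NFC", a) == unicodedata.normalize("NFC", b)


# Precomposed character versus the combining sequence named in the text.
PAIRS = {
    "U+08A1 vs U+0628 U+0654": ("\u08a1", "\u0628\u0654"),
    "U+00F8 vs U+006F U+0337": ("\u00f8", "\u006f\u0337"),
}

for label, (single, sequence) in PAIRS.items():
    print(label, "NFC-equivalent:", nfc_equivalent(single, sequence))

# Control case: U+00E9 does have a canonical decomposition (e + U+0301),
# so NFC treats the two forms as the same string.
print("U+00E9 vs U+0065 U+0301 NFC-equivalent:",
      nfc_equivalent("\u00e9", "e\u0301"))
```

Both pairs from the text remain distinct strings after normalization, so a comparison based on NFC alone does not catch them, whereas the control pair does compare equal.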
<t> <t>
Although the discussion about this specific code point Although the discussion about this specific code point
resulted in acceptance of the derived property value of resulted in acceptance of the derived property value of
PVALID, the underlying problem with combining sequences is PVALID, the underlying problem with combining sequences is
not understood fully. Therefore, it cannot be claimed that not understood fully. Therefore, it cannot be claimed that
this case can be extrapolated to other situations and other this case can be extrapolated to other situations and other
code points. code points.
</t> </t>
</section> </section>
<section title="Changes between Unicode 7.0.0 and 10.0.0"> <section numbered="true" toc="default">
<t> <name>Changes between Unicode 7.0.0 and 10.0.0</name>
<list> <t>
<t>
Change in number of characters in each category: Change in number of characters in each category:
</t> </t>
<t> <ul spacing="normal">
<li>
Code points that changed derived property value: 0 Code points that changed derived property value: 0
</t> </li>
<t> <li>
PVALID changed from 99867 to 122411 (+22544) PVALID changed from 99867 to 122411 (+22544)
</t> </li>
<t> <li>
UNASSIGNED changed from 861509 to 837775 (-23734) UNASSIGNED changed from 861509 to 837775 (-23734)
</t> </li>
<t> <li>
CONTEXTJ did not change, at 2 CONTEXTJ did not change, at 2
</t> </li>
<t> <li>
CONTEXTO did not change, at 25 CONTEXTO did not change, at 25
</t> </li>
<t> <li>
DISALLOWED changed from 152709 to 153899 (+1190) DISALLOWED changed from 152709 to 153899 (+1190)
</t> </li>
<t> <li>
TOTAL did not change, at 1114112 TOTAL did not change, at 1114112
</t> </li>
<t> </ul>
<t>
There are no changes made to Unicode between version 7.0.0 There are no changes made to Unicode between version 7.0.0
and 10.0.0 that impact IDNA2008 calculation of the derived and 10.0.0 that impact IDNA2008 calculation of the derived
property values. property values.
</t>
</list>
</t> </t>
</section> </section>
<section title="Changes between Unicode 10.0.0 and 11.0.0"> <section numbered="true" toc="default">
<t> <name>Changes between Unicode 10.0.0 and 11.0.0</name>
<list> <t>
<t>
Change in number of characters in each category: Change in number of characters in each category:
</t> </t>
<t> <ul spacing="normal">
<li>
Code points that changed derived property value: 1 Code points that changed derived property value: 1
</t> </li>
<t> <li>
PVALID changed from 122411 to 122734 (+323) PVALID changed from 122411 to 122734 (+323)
</t> </li>
<t> <li>
UNASSIGNED changed from 837775 to 837091 (-684) UNASSIGNED changed from 837775 to 837091 (-684)
</t> </li>
<t> <li>
CONTEXTJ did not change, at 2 CONTEXTJ did not change, at 2
</t> </li>
<t> <li>
CONTEXTO did not change, at 25 CONTEXTO did not change, at 25
</t> </li>
<t> <li>
DISALLOWED changed from 153899 to 154260 (+361) DISALLOWED changed from 153899 to 154260 (+361)
</t> </li>
<t> <li>
TOTAL did not change, at 1114112 TOTAL did not change, at 1114112
</t> </li>
<t> <li>
Georgian letters in the ranges U+10D0..U+10FA and Georgian letters in the ranges U+10D0..U+10FA and
U+10FD..U+10FF had their General Properties changed from Lo to Ll, U+10FD..U+10FF had their General Category changed
to reflect from Lo (Other_Letter) to Ll (Lowercase_Letter) to reflect
their status as the lowercase of new Georgian case their status as the lowercase of new Georgian case
pairs. Case mappings were also added. pairs. Case mappings were also added.
</t> </li>
<t> <li>
SHARADA SANDHI MARK (U+111C9) was changed from Po to Mn, SHARADA SANDHI MARK (U+111C9) General Category was
and from bc=L to bc=NSM. changed from Po (Other_Punctuation) to Mn (Nonspacing_Mark),
</t> and the Bidi property was changed from L (Left to Right) to
<t> NSM (Nonspacing Mark).
</li>
<li>
The properties for ZANABAZAR SQUARE VOWEL SIGN AI (U+11A07) and The properties for ZANABAZAR SQUARE VOWEL SIGN AI (U+11A07) and
ZANABZAR SQUARE VOWEL SIGN AU (U+11A08) were corrected from Mc to ZANABAZAR SQUARE VOWEL SIGN AU (U+11A08) were corrected from Mc to
Mn. Mn.
</t> </li>
<t> <li>
SPHERICAL ANGLE OPENING UP (U+29A1) was changed to SPHERICAL ANGLE OPENING UP (U+29A1) was changed to
Bidi_M=N. Bidi Mirrored to No.
</t> </li>
</list> </ul>
<t>
These changes to the Unicode Standard have the following These changes to the Unicode Standard have the following
implications for these code points: implications for these code points:
<list style="symbols"> </t>
<t> <ul spacing="normal">
<li>
The newly assigned 684 characters are assigned a derived The newly assigned 684 characters are assigned a derived
property value as a result of applying the IDNA2008 property value as a result of applying the IDNA2008
algorithm. algorithm.
</t> </li>
<t> <li>
The Georgian letters in the ranges U+10D0..U+10FA and The Georgian letters in the ranges U+10D0..U+10FA and
U+10FD..U+10FF existed before IDNA2008 was created. U+10FD..U+10FF existed before IDNA2008 was created.
Applying the IDNA2008 algorithm to the code points Applying the IDNA2008 algorithm to the code points
assigned the derived property value PVALID, and that assigned the derived property value PVALID, and that
value is unchanged even if the underlying Unicode value is unchanged even if the underlying Unicode
properties have changed. The newly encoded Mtavruli properties have changed. The newly encoded Mtavruli
letters have general category "Lu" and are therefore letters have General Category Lu (Uppercase_Letter)
DISALLOWED. and are therefore DISALLOWED.
</t> </li>
<t> <li>
The U+111C9 SHARADA SANDHI MARK was added The U+111C9 SHARADA SANDHI MARK was added
to <xref target="Unicode-8.0.0">Unicode to <xref target="Unicode-8.0.0" format="default">Unicode
8.0.0</xref>. Applying the IDNA2008 algorithm to the 8.0.0</xref>. Applying the IDNA2008 algorithm to the
code point assigned the derived property value code point assigned the derived property value
DISALLOWED. The changes in the underlying properties in DISALLOWED. The changes in the underlying properties in
the <xref target="Unicode-11.0.0">Unicode Standard <xref target="Unicode-11.0.0" format="default">Unicode
Version 11.0.0</xref> caused the derived property value 11.0.0</xref> caused the derived property value
to change to PVALID. to change to PVALID.
</t> </li>
<t> <li>
The characters ZANABAZAR SQUARE VOWEL SIGN AI (U+11A07) The characters ZANABAZAR SQUARE VOWEL SIGN AI (U+11A07)
and ZANABZAR SQUARE VOWEL SIGN AU (U+11A08) were added to and ZANABAZAR SQUARE VOWEL SIGN AU (U+11A08) were added to
<xref target="Unicode-10.0.0">Unicode 10.0.0</xref>. <xref target="Unicode-10.0.0" format="default">Unicode 10.0.0</xref>.
Applying the IDNA2008 algorithm to the code points Applying the IDNA2008 algorithm to the code points
assigned the derived property value PVALID, and that value assigned the derived property value PVALID, and that value
is unchanged even if the underlying Unicode properties is unchanged even if the underlying Unicode properties
have changed. have changed.
</t> </li>
<t> <li>
SPHERICAL ANGLE OPENING UP (U+29A1) existed SPHERICAL ANGLE OPENING UP (U+29A1) existed
before IDNA2008 was created. Applying the IDNA2008 before IDNA2008 was created. Applying the IDNA2008
algorithm to the code point assigned the derived algorithm to the code point assigned the derived
property value DISALLOWED, and that value is unchanged even property value DISALLOWED, and that value is unchanged even
if the underlying Unicode properties have changed. if the underlying Unicode properties have changed.
</t> </li>
</list> </ul>
</t>
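The outcomes listed above follow from two of the rules in RFC 5892: Unstable (Section 2.2) and LetterDigits (Section 2.1). The sketch below is a deliberately simplified illustration and not the full IDNA2008 algorithm: it omits the exception, backward-compatibility, unassigned, LDH, join-control, Hangul jamo, and ignorable rules. The function names and the optional category parameter are assumptions introduced here so that an older General Category value can be simulated.

```python
import unicodedata

# General_Category values covered by the LetterDigits rule
# (RFC 5892, Section 2.1).
LETTER_DIGITS = {"Ll", "Lu", "Lo", "Nd", "Lm", "Mn", "Mc"}


def is_unstable(ch: str) -> bool:
    """Unstable rule (RFC 5892, Section 2.2): cp != NFKC(casefold(NFKC(cp)))."""
    nfkc = unicodedata.normalize("NFKC", ch)
    return ch != unicodedata.normalize("NFKC", nfkc.casefold())


def simplified_derived_value(ch: str, category: str = None) -> str:
    """Hypothetical helper applying only the Unstable and LetterDigits rules.

    `category` overrides the General Category reported by the runtime, so the
    effect of a property change between Unicode versions can be simulated.
    """
    gc = category if category is not None else unicodedata.category(ch)
    if is_unstable(ch):
        return "DISALLOWED"
    if gc in LETTER_DIGITS:
        return "PVALID"
    return "DISALLOWED"


SANDHI_MARK = "\U000111C9"  # SHARADA SANDHI MARK
print(simplified_derived_value(SANDHI_MARK, "Po"))  # category before 11.0.0
print(simplified_derived_value(SANDHI_MARK, "Mn"))  # category as of 11.0.0
```

Under these two rules, the change from Po to Mn moves U+111C9 into the LetterDigits set, while an uppercase letter such as a Mtavruli letter is unstable under case folding and so stays DISALLOWED.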
</section> </section>
<section title="Changes between Unicode 11.0.0 and 12.0.0"> <section numbered="true" toc="default">
<t> <name>Changes between Unicode 11.0.0 and 12.0.0</name>
<list> <t>
<t>
Change in number of characters in each category: Change in number of characters in each category:
</t> </t>
<t> <ul spacing="normal">
<li>
Code points that changed derived property value: 0 Code points that changed derived property value: 0
</t> </li>
<t> <li>
PVALID changed from 122734 to 123006 (+272) PVALID changed from 122734 to 123006 (+272)
</t> </li>
<t> <li>
UNASSIGNED changed from 837091 to 836537 (-554) UNASSIGNED changed from 837091 to 836537 (-554)
</t> </li>
<t> <li>
CONTEXTJ did not change, at 2 CONTEXTJ did not change, at 2
</t> </li>
<t> <li>
CONTEXTO did not change, at 25 CONTEXTO did not change, at 25
</t> </li>
<t> <li>
DISALLOWED changed from 154260 to 154542 (+282) DISALLOWED changed from 154260 to 154542 (+282)
</t> </li>
<t> <li>
TOTAL did not change, at 1114112 TOTAL did not change, at 1114112
</t> </li>
</list> </ul>
</t>
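The per-category counts quoted in the subsections above can be cross-checked arithmetically. The following sketch copies the figures from the text into a table (the variable and function names are hypothetical) and verifies that every version sums to 1114112 and that the changes between versions cancel out.

```python
TOTAL = 1114112  # number of code points, U+0000..U+10FFFF

# Counts per derived property value, as quoted in the text above.
COUNTS = {
    "6.0.0":  {"PVALID": 97418,  "UNASSIGNED": 865081, "CONTEXTJ": 2,
               "CONTEXTO": 25, "DISALLOWED": 151586},
    "7.0.0":  {"PVALID": 99867,  "UNASSIGNED": 861509, "CONTEXTJ": 2,
               "CONTEXTO": 25, "DISALLOWED": 152709},
    "10.0.0": {"PVALID": 122411, "UNASSIGNED": 837775, "CONTEXTJ": 2,
               "CONTEXTO": 25, "DISALLOWED": 153899},
    "11.0.0": {"PVALID": 122734, "UNASSIGNED": 837091, "CONTEXTJ": 2,
               "CONTEXTO": 25, "DISALLOWED": 154260},
    "12.0.0": {"PVALID": 123006, "UNASSIGNED": 836537, "CONTEXTJ": 2,
               "CONTEXTO": 25, "DISALLOWED": 154542},
}


def deltas(old: str, new: str) -> dict:
    """Per-category change in count between two versions."""
    return {k: COUNTS[new][k] - COUNTS[old][k] for k in COUNTS[old]}


versions = list(COUNTS)
for old, new in zip(versions, versions[1:]):
    change = deltas(old, new)
    # Code points only move between categories, so the changes sum to zero.
    assert sum(change.values()) == 0
    print(f"{old} -> {new}: {change}")

for version, table in COUNTS.items():
    assert sum(table.values()) == TOTAL, version
```

For example, between 10.0.0 and 11.0.0 the 684 code points that left UNASSIGNED are accounted for by +323 PVALID and +361 DISALLOWED.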
</section> </section>
</section> </section>
<section anchor="SHARADA" title="U+111C9 SHARADA SANDHI MARK"> <section anchor="SHARADA" numbered="true" toc="default">
<name>U+111C9 SHARADA SANDHI MARK</name>
<t> <t>
As one can see in <xref target="changes"/>, an incompatible As one can see in <xref target="changes" format="default"/>, an incompatible
property change was made between Unicode 6.0.0 and 12.0.0, property change was made between Unicode 6.0.0 and 12.0.0,
affecting the code point U+111C9. Its derived property value affecting the code point U+111C9. Its derived property value
thus changed from DISALLOWED to PVALID. In situations like thus changed from DISALLOWED to PVALID. In situations like
these, IDNA2008 allow for addition of rules to <xref these, IDNA2008 allows for addition of rules to <xref target="RFC5892" format="default">RFC 5892</xref>, Section <xref target="RFC5892" section="2.7" sectionFormat="bare"/>. If the code
target="RFC5892">RFC 5892</xref> section 2.7. If the code
point is accepted, it might still be rejected if validated by point is accepted, it might still be rejected if validated by
software based on older versions of Unicode than 12.0.0. As software based on versions of Unicode older than 12.0.0. As
the character is rarely used outside the group of Sharada the character is rarely used outside the group of Sharada
specialists, and used in some records for indicating sandhi specialists but is used in some records for indicating sandhi
breaks, the conclusion is that it could either be added as an breaks, the conclusion was that it could either be added as an
exception or allowed to change its property value, as the use exception or allowed to change its property value. As
of the code point is limited outside a special community. As including an exception would require implementation changes to
including an exception would require implementation changes in deployments of IDNA2008, the IETF has decided not to
deployed implementations of IDNA2008, the IETF has decided to add a BackwardCompatible rule to IDNA2008 (i.e., Section
not add a BackwardCompatible rule to IDNA2008 (i.e. Section <xref target="RFC5892" section="2.7" sectionFormat="bare"/> of <xref target="RFC5892" format="default">RFC 5892</xref>) for this code
2.7 of <xref target="RFC5892">RFC 5892</xref> for this code
point. This also ensures all sandhi marks being treated in an point. This also ensures all sandhi marks are treated equally.
equal way.
</t> </t>
</section> </section>
<section title="Conclusion"> <section numbered="true" toc="default">
<name>Conclusion</name>
<t> <t>
As described in <xref target="changes"/> and <xref As described in Sections <xref target="changes" format="counter"/> and <xref target="SHARADA" format="counter"/>, changes have been made to Unicode between
target="SHARADA"/>, changes have been made to Unicode between
version 6.0.0 and 12.0.0. Some changes to specific characters version 6.0.0 and 12.0.0. Some changes to specific characters
changed their derived property value, whereas other changes changed their derived property value, whereas other changes
did not. Given the deployment considerations described in did not. Given the deployment considerations described in
<xref target="deployment"/> and changes in the Unicode <xref target="deployment" format="default"/> and changes in the Unicode
Standard described in <xref target="changes"/> and <xref Standard described in Sections <xref target="changes" format="counter"/>
target="SHARADA"/>, including implications to normalization, and <xref target="SHARADA" format="counter"/>, including implications to normalization,
the conclusion is to not add any exception rules to IDNA2008. the conclusion is not to add any exception rules to IDNA2008.
</t> </t>
<t> <t>
This document addresses only changes to Unicode between This document addresses only changes to Unicode between
version 6.0.0 and version 12.0.0. Changes in future Unicode version 6.0.0 and version 12.0.0. Changes in future Unicode
versions might result in the conclusion that exception rules versions might result in the conclusion that exception rules
need to be added to IDNA2008 after the review process need to be added to IDNA2008 after the review process
explained in <xref target="RFC8753">RFC 8753</xref>. explained in <xref target="RFC8753" format="default">RFC 8753</xref>.
Separately from any changes in Unicode, the IETF might Separately from any changes in Unicode, the IETF might
conclude that updates to <xref target="RFC5892">RFC conclude that updates to <xref target="RFC5892" format="default">RFC
5892</xref> or other IDNA2008 documents might become 5892</xref> or other IDNA2008 documents might become
necessary; such updates might include changes to the algorithm necessary; such updates might include changes to the algorithm
specified in IDNA2008 as well as additional rules, categories, specified in IDNA2008 as well as additional rules, categories,
or other forms of tuning, like the clarifications in <xref or other forms of tuning, like the clarifications in <xref target="RFC8753" format="default">RFC 8753</xref>.
target="RFC8753">RFC 8753</xref>.
</t> </t>
</section> </section>
<section title="IANA Considerations"> <section numbered="true" toc="default">
<name>IANA Considerations</name>
<t> <t>
IANA is requested to update the <xref target="IANA-IDNA">IDNA IANA updated the <xref target="IANA-IDNA" format="default">"IDNA
Parameters registry</xref> of derived property values, after Rules and Derived Property Values"</xref> registry after
the expert reviewer validates that the derived property values the expert reviewer validated that the derived property values
are calculated correctly. were calculated correctly.
</t> </t>
</section> </section>
<section title="Security Considerations"> <section numbered="true" anchor="security" toc="default">
<name>Security Considerations</name>
<t> <t>
This document makes recommendations regarding the use of the This document makes recommendations regarding the use of the
IDNA2008 algorithm for calculation of derived property values, IDNA2008 algorithm for calculation of derived property values,
based on Unicode version 12.0.0. This recommendation does not based on Unicode version 12.0.0. This recommendation does not
say anything about what recommendations to make for future say anything about what recommendations to make for future
versions of the Unicode Standard. versions of the Unicode Standard.
</t> </t>
<t> <t>
Not following these recommendations can lead to various Not following these recommendations can lead to various
security issues. Specifically, allowing confusable characters security issues. Specifically, allowing confusable characters
may lead to various phishing attacks, as described in the may lead to various phishing attacks, as described in the
Security Considerations sections in the documents listed in Security Considerations sections in the documents listed in
<xref target="idna2008"/>. <xref target="idna2008" format="default"/>.
</t>
</section>
<section title="Acknowledgements">
<t>
Thanks to Harald Alvestrand, Marc Blanchet, Martin Duerst,
Asmus Freytag, Ted Hardie, John Klensin, Erik Nordmark, Pete
Resnick, Peter Saint-Andre, Michel Suignard, Andrew Sullivan
and Suzanne Woolf for input to this document.
</t> </t>
</section> </section>
</middle> </middle>
<back> <back>
<references title='Normative References'> <displayreference target="I-D.klensin-idna-5892upd-unicode70" to="IDNA7"/>
&RFC3491; <references>
&RFC5890; <name>References</name>
&RFC5891; <references>
&RFC5892; <name>Normative References</name>
&RFC5893; <xi:include href="https://xml2rfc.ietf.org/public/rfc/bibxml/reference.R
&RFC6452; FC.3491.xml"/>
</references> <xi:include href="https://xml2rfc.ietf.org/public/rfc/bibxml/reference.R
<references title='Non-normative references'> FC.5890.xml"/>
<reference anchor="IAB2005-1"> <xi:include href="https://xml2rfc.ietf.org/public/rfc/bibxml/reference.R
<front> FC.5891.xml"/>
<title>IAB Statement on Identifiers and Unicode 7.0.0</title> <xi:include href="https://xml2rfc.ietf.org/public/rfc/bibxml/reference.R
<author> FC.5892.xml"/>
<organization>Internet Architecture Board</organization> <xi:include href="https://xml2rfc.ietf.org/public/rfc/bibxml/reference.R
</author> FC.5893.xml"/>
<date month="January" year="2015" day="27"/> <xi:include href="https://xml2rfc.ietf.org/public/rfc/bibxml/reference.R
</front> FC.6452.xml"/>
<seriesInfo name="IAB Statement on Identifiers and Unicode 7.0.0" </references>
value="&lt;https://www.iab.org/documents/correspondence-repor <references>
ts-documents/2015-2/iab-statement-on-identifiers-and-unicode-7-0-0/archive/&gt;"
/> <name>Informative References</name>
</reference> <reference anchor="IAB2005-1" target="https://www.iab.org/documents/corr
<reference anchor="IAB2005-2"> espondence-reports-documents/2015-2/iab-statement-on-identifiers-and-unicode-7-0
<front> -0/archive/">
<title>IAB Statement on Identifiers and Unicode 7.0.0</title> <front>
<author> <title>IAB Statement on Identifiers and Unicode 7.0.0</title>
<organization>Internet Architecture Board</organization> <author>
</author> <organization>Internet Architecture Board</organization>
<date month="February" year="2015" day="11"/> </author>
</front> <date month="January" year="2015" day="27"/>
<seriesInfo name="IAB Statement on Identifiers and Unicode 7.0.0" </front>
value="&lt;https://www.iab.org/documents/correspondence-repor </reference>
ts-documents/2015-2/iab-statement-on-identifiers-and-unicode-7-0-0/&gt;" />
</reference> <reference anchor="IAB2005-2" target="https://www.iab.org/documents/corr
<reference anchor="IANA-IDNA"> espondence-reports-documents/2015-2/iab-statement-on-identifiers-and-unicode-7-0
<front> -0/">
<title>IDNA Rules and Derived Property Values</title> <front>
<author> <title>IAB Statement on Identifiers and Unicode 7.0.0</title>
<organization>IANA</organization> <author>
</author> <organization>Internet Architecture Board</organization>
<date month="April" year="2020" /> </author>
</front> <date month="February" year="2015" day="11"/>
<seriesInfo name="IDNA Rules and Derived Property Values" </front>
value="&lt;https://www.iana.org/assignments/idna-tables-6.0.0 </reference>
/idna-tables-6.0.0.xhtml&gt;" /> <reference anchor="IANA-IDNA" target="https://www.iana.org/assignments/i
</reference> dna-tables-12.0.0/">
<reference anchor="IDNA7"> <front>
<front> <title>IDNA Rules and Derived Property Values</title>
<title>IDNA Update for Unicode 7.0 and Later Versions</title> <author>
<author fullname="John Klensin" initials="J." surname="Klensin"> <organization>IANA</organization>
</author> </author>
<author fullname="Patrik Faltstrom" initials="P." surname="Faltstrom"> <date month="February" year="2022"/>
</author> </front>
<date month="October" year="2017" /> </reference>
</front> <xi:include href="https://datatracker.ietf.org/doc/bibxml3/reference.I-D
<seriesInfo name="draft-klensin-idna-5892upd-unicode70" .klensin-idna-5892upd-unicode70.xml"/>
value="&lt;https://datatracker.ietf.org/doc/draft-klensin-idn <xi:include href="https://xml2rfc.ietf.org/public/rfc/bibxml/reference.R
a-5892upd-unicode70/&gt;" /> FC.3454.xml"/>
</reference> <xi:include href="https://xml2rfc.ietf.org/public/rfc/bibxml/reference.R
&RFC3454; FC.3490.xml"/>
&RFC3490; <xi:include href="https://xml2rfc.ietf.org/public/rfc/bibxml/reference.R
&RFC5894; FC.5894.xml"/>
&RFC5895; <xi:include href="https://xml2rfc.ietf.org/public/rfc/bibxml/reference.R
&RFC6912; FC.5895.xml"/>
&RFC8753; <xi:include href="https://xml2rfc.ietf.org/public/rfc/bibxml/reference.R
<reference anchor="SAC-084"> FC.6912.xml"/>
<front> <xi:include href="https://xml2rfc.ietf.org/public/rfc/bibxml/reference.R
<title>SAC084</title> FC.8753.xml"/>
<author>
<organization>The Security and Stability Advisory Committee</organiz <reference anchor="SAC-084" target="https://www.icann.org/en/system/file
ation> s/files/sac-084-en.pdf">
</author> <front>
<date month="August" year="2016" /> <title>SAC084</title>
</front> <author>
<seriesInfo name="SSAC Comments on Guidelines for the Extended Process Si <organization>The Security and Stability Advisory Committee</organ
milarity Review Panel for the IDN ccTLD Fast Track Process" ization>
value="&lt;https://www.icann.org/en/system/files/files/sac-08 </author>
4-en.pdf&gt;" /> <date month="August" year="2016"/>
</reference> </front>
<reference anchor="Unicode-3.2.0"> <refcontent>SSAC Comments on Guidelines for the Extended Process Simil
<front> arity Review Panel for the IDN ccTLD Fast Track Process</refcontent>
<title>The Unicode Standard, Version 3.2.0</title> </reference>
<author>
<organization>The Unicode Consortium</organization> <reference anchor="Unicode-3.2.0" target="https://www.unicode.org/versio
</author> ns/Unicode3.2.0/">
<date month="March" year="2002" /> <front>
</front> <title>The Unicode Standard, Version 3.2.0</title>
<seriesInfo name="The Unicode Standard, Version 3.2.0" <author>
value="ISBN 0-201-61633-5" /> <organization>The Unicode Consortium</organization>
</reference> </author>
<reference anchor="Unicode-5.2.0"> <date month="March" year="2002"/>
<front> </front>
<title>The Unicode Standard, Version 5.2.0</title> <seriesInfo name="ISBN" value="0-201-61633-5"/>
<author> <refcontent>Mountain View: The Unicode Consortium</refcontent>
<organization>The Unicode Consortium</organization> </reference>
</author> <reference anchor="Unicode-5.2.0" target="https://www.unicode.org/versio
<date month="October" year="2009" /> ns/Unicode5.2.0/">
</front> <front>
<seriesInfo name="The Unicode Standard, Version 5.2.0" <title>The Unicode Standard, Version 5.2.0</title>
value="ISBN 978-1-936213-00-9" /> <author>
</reference> <organization>The Unicode Consortium</organization>
<reference anchor="Unicode-6.0.0"> </author>
<front> <date month="October" year="2009"/>
<title>The Unicode Standard, Version 6.0.0</title> </front>
<author> <seriesInfo name="ISBN" value="978-1-936213-00-9"/>
<organization>The Unicode Consortium</organization> <refcontent>Mountain View: The Unicode Consortium</refcontent>
</author> </reference>
<date month="October" year="2011" /> <reference anchor="Unicode-6.0.0" target="https://www.unicode.org/versio
</front> ns/Unicode6.0.0/">
<seriesInfo name="The Unicode Standard, Version 6.0.0" <front>
value="ISBN 978-1-936213-01-6" /> <title>The Unicode Standard, Version 6.0.0</title>
</reference> <author>
<reference anchor="Unicode-7.0.0"> <organization>The Unicode Consortium</organization>
<front> </author>
<title>The Unicode Standard, Version 7.0.0</title> <date month="October" year="2011"/>
<author> </front>
<organization>The Unicode Consortium</organization> <seriesInfo name="ISBN" value="978-1-936213-01-6"/>
</author> <refcontent>Mountain View: The Unicode Consortium</refcontent>
<date month="June" year="2014" /> </reference>
</front> <reference anchor="Unicode-7.0.0" target="https://www.unicode.org/versio
<seriesInfo name="The Unicode Standard, Version 7.0.0" ns/Unicode7.0.0/">
value="ISBN 978-1-936213-09-2" /> <front>
</reference> <title>The Unicode Standard, Version 7.0.0</title>
<reference anchor="Unicode-8.0.0"> <author>
<front> <organization>The Unicode Consortium</organization>
<title>The Unicode Standard, Version 8.0.0</title> </author>
<author> <date month="June" year="2014"/>
<organization>The Unicode Consortium</organization> </front>
</author> <seriesInfo name="ISBN" value="978-1-936213-09-2"/>
<date month="June" year="2015" /> <refcontent>Mountain View: The Unicode Consortium</refcontent>
</front> </reference>
<seriesInfo name="The Unicode Standard, Version 8.0.0" <reference anchor="Unicode-8.0.0" target="https://www.unicode.org/versio
value="ISBN 978-1-936213-10-8" /> ns/Unicode8.0.0/">
</reference> <front>
<reference anchor="Unicode-10.0.0"> <title>The Unicode Standard, Version 8.0.0</title>
<front> <author>
<title>The Unicode Standard, Version 10.0.0</title> <organization>The Unicode Consortium</organization>
<author> </author>
<organization>The Unicode Consortium</organization> <date month="June" year="2015"/>
</author> </front>
<date month="June" year="2017" /> <seriesInfo name="ISBN" value="978-1-936213-10-8"/>
</front> <refcontent>Mountain View: The Unicode Consortium</refcontent>
<seriesInfo name="The Unicode Standard, Version 10.0.0" </reference>
value="ISBN 978-1-936213-16-0" /> <reference anchor="Unicode-10.0.0" target="https://www.unicode.org/versi
</reference> ons/Unicode10.0.0/">
<reference anchor="Unicode-11.0.0"> <front>
<front> <title>The Unicode Standard, Version 10.0.0</title>
<title>The Unicode Standard, Version 11.0.0</title> <author>
<author> <organization>The Unicode Consortium</organization>
<organization>The Unicode Consortium</organization> </author>
</author> <date month="June" year="2017"/>
<date month="June" year="2018" /> </front>
</front> <seriesInfo name="ISBN" value="978-1-936213-16-0"/>
<seriesInfo name="The Unicode Standard, Version 11.0.0" <refcontent>Mountain View: The Unicode Consortium</refcontent>
value="ISBN 978-1-936213-19-1" /> </reference>
</reference> <reference anchor="Unicode-11.0.0" target="https://www.unicode.org/versi
<reference anchor="Unicode-12.0.0"> ons/Unicode11.0.0/">
<front> <front>
<title>The Unicode Standard, Version 12.0.0</title> <title>The Unicode Standard, Version 11.0.0</title>
<author> <author>
<organization>The Unicode Consortium</organization> <organization>The Unicode Consortium</organization>
</author> </author>
<date month="March" year="2019" /> <date month="June" year="2018"/>
</front> </front>
<seriesInfo name="The Unicode Standard, Version 12.0.0" <seriesInfo name="ISBN" value="978-1-936213-19-1"/>
value="ISBN 978-1-936213-22-1" /> <refcontent>Mountain View: The Unicode Consortium</refcontent>
</reference> </reference>
<reference anchor="UTS-46"> <reference anchor="Unicode-12.0.0" target="https://www.unicode.org/versi
<front> ons/Unicode12.0.0/">
<title>Unicode Technical Standard #46, Version 12.0.0</title> <front>
<author> <title>The Unicode Standard, Version 12.0.0</title>
<organization>The Unicode Consortium</organization> <author>
</author> <organization>The Unicode Consortium</organization>
<date month="March" year="2019" /> </author>
</front> <date month="March" year="2019"/>
<seriesInfo name="UNICODE IDNA COMPATIBILITY PROCESSING" </front>
value="&lt;https://www.unicode.org/reports/tr46/&gt;" /> <seriesInfo name="ISBN" value="978-1-936213-22-1"/>
</reference> <refcontent>Mountain View: The Unicode Consortium</refcontent>
</reference>
<reference anchor="UTS-46" target="https://www.unicode.org/reports/tr46/
tr46-23.html">
<front>
<title>Unicode Technical Standard #46, Version 12.0.0</title>
<author>
<organization>The Unicode Consortium</organization>
</author>
<date month="March" year="2019"/>
</front>
<refcontent>UNICODE IDNA COMPATIBILITY PROCESSING</refcontent>
</reference>
</references>
</references> </references>
<section anchor="Appendix-6.0.0" title="Changes from Unicode 6.0.0 to Unicod <section anchor="Appendix-6.0.0" numbered="true" toc="default">
e 7.0.0"> <name>Changes from Unicode 6.0.0 to Unicode 7.0.0</name>
<t> <t>
Changes from derived property value UNASSIGNED to either Changes from derived property value UNASSIGNED to either
PVALID or DISALLOWED. PVALID or DISALLOWED.
</t> </t>
<figure><artwork> <artwork name="" type="" align="left" alt=""><![CDATA[
037F ; DISALLOWED # GREEK CAPITAL LETTER YOT 037F ; DISALLOWED # GREEK CAPITAL LETTER YOT
0528 ; DISALLOWED # CYRILLIC CAPITAL LETTER EN WITH LEFT HOOK 0528 ; DISALLOWED # CYRILLIC CAPITAL LETTER EN WITH LEFT HOOK
0529 ; PVALID # CYRILLIC SMALL LETTER EN WITH LEFT HOOK 0529 ; PVALID # CYRILLIC SMALL LETTER EN WITH LEFT HOOK
052A ; DISALLOWED # CYRILLIC CAPITAL LETTER DZZHE 052A ; DISALLOWED # CYRILLIC CAPITAL LETTER DZZHE
052B ; PVALID # CYRILLIC SMALL LETTER DZZHE 052B ; PVALID # CYRILLIC SMALL LETTER DZZHE
052C ; DISALLOWED # CYRILLIC CAPITAL LETTER DCHE 052C ; DISALLOWED # CYRILLIC CAPITAL LETTER DCHE
052D ; PVALID # CYRILLIC SMALL LETTER DCHE 052D ; PVALID # CYRILLIC SMALL LETTER DCHE
052E ; DISALLOWED # CYRILLIC CAPITAL LETTER EL WITH DESCENDER 052E ; DISALLOWED # CYRILLIC CAPITAL LETTER EL WITH DESCENDER
052F ; PVALID # CYRILLIC SMALL LETTER EL WITH DESCENDER 052F ; PVALID # CYRILLIC SMALL LETTER EL WITH DESCENDER
058D..058F ; DISALLOWED # RIGHT-FACING ARMENIAN ETERNITY SIGN..ARMENIAN 058D..058F ; DISALLOWED # RIGHT-FACING ARMENIAN ETERNITY SIGN..ARMENIAN
skipping to change at line 953 skipping to change at line 912
2B76..2B95 ; DISALLOWED # NORTH WEST TRIANGLE-HEADED ARROW TO BAR..RIGH 2B76..2B95 ; DISALLOWED # NORTH WEST TRIANGLE-HEADED ARROW TO BAR..RIGH
2B98..2BB9 ; DISALLOWED # THREE-D TOP-LIGHTED LEFTWARDS EQUILATERAL ARR 2B98..2BB9 ; DISALLOWED # THREE-D TOP-LIGHTED LEFTWARDS EQUILATERAL ARR
2BBD..2BC8 ; DISALLOWED # BALLOT BOX WITH LIGHT X..BLACK MEDIUM RIGHT-P 2BBD..2BC8 ; DISALLOWED # BALLOT BOX WITH LIGHT X..BLACK MEDIUM RIGHT-P
2BCA..2BD1 ; DISALLOWED # TOP HALF BLACK CIRCLE..UNCERTAINTY SIGN 2BCA..2BD1 ; DISALLOWED # TOP HALF BLACK CIRCLE..UNCERTAINTY SIGN
2CF2 ; DISALLOWED # COPTIC CAPITAL LETTER BOHAIRIC KHEI 2CF2 ; DISALLOWED # COPTIC CAPITAL LETTER BOHAIRIC KHEI
2CF3 ; PVALID # COPTIC SMALL LETTER BOHAIRIC KHEI 2CF3 ; PVALID # COPTIC SMALL LETTER BOHAIRIC KHEI
2D27 ; PVALID # GEORGIAN SMALL LETTER YN 2D27 ; PVALID # GEORGIAN SMALL LETTER YN
2D2D ; PVALID # GEORGIAN SMALL LETTER AEN 2D2D ; PVALID # GEORGIAN SMALL LETTER AEN
2D66..2D67 ; PVALID # TIFINAGH LETTER YE..TIFINAGH LETTER YO 2D66..2D67 ; PVALID # TIFINAGH LETTER YE..TIFINAGH LETTER YO
2E32..2E42 ; DISALLOWED # TURNED COMMA..DOUBLE LOW-REVERSED-9 QUOTATION 2E32..2E42 ; DISALLOWED # TURNED COMMA..DOUBLE LOW-REVERSED-9 QUOTATION
9FCC ; PVALID # &lt;CJK Ideograph&gt; 9FCC ; PVALID # <CJK Ideograph>
A674..A67B ; PVALID # COMBINING CYRILLIC LETTER UKRAINIAN IE..COMBI A674..A67B ; PVALID # COMBINING CYRILLIC LETTER UKRAINIAN IE..COMBI
A698 ; DISALLOWED # CYRILLIC CAPITAL LETTER DOUBLE O A698 ; DISALLOWED # CYRILLIC CAPITAL LETTER DOUBLE O
A699 ; PVALID # CYRILLIC SMALL LETTER DOUBLE O A699 ; PVALID # CYRILLIC SMALL LETTER DOUBLE O
A69A ; DISALLOWED # CYRILLIC CAPITAL LETTER CROSSED O A69A ; DISALLOWED # CYRILLIC CAPITAL LETTER CROSSED O
A69B ; PVALID # CYRILLIC SMALL LETTER CROSSED O A69B ; PVALID # CYRILLIC SMALL LETTER CROSSED O
A69C..A69D ; DISALLOWED # MODIFIER LETTER CYRILLIC HARD SIGN..MODIFIER A69C..A69D ; DISALLOWED # MODIFIER LETTER CYRILLIC HARD SIGN..MODIFIER
A69F ; PVALID # COMBINING CYRILLIC LETTER IOTIFIED E A69F ; PVALID # COMBINING CYRILLIC LETTER IOTIFIED E
A792 ; DISALLOWED # LATIN CAPITAL LETTER C WITH BAR A792 ; DISALLOWED # LATIN CAPITAL LETTER C WITH BAR
A793..A795 ; PVALID # LATIN SMALL LETTER C WITH BAR..LATIN SMALL LE A793..A795 ; PVALID # LATIN SMALL LETTER C WITH BAR..LATIN SMALL LE
A796 ; DISALLOWED # LATIN CAPITAL LETTER B WITH FLOURISH A796 ; DISALLOWED # LATIN CAPITAL LETTER B WITH FLOURISH
skipping to change at line 1172 skipping to change at line 1131
1F6C6..1F6CF; DISALLOWED # TRIANGLE WITH ROUNDED CORNERS..BED 1F6C6..1F6CF; DISALLOWED # TRIANGLE WITH ROUNDED CORNERS..BED
1F6E0..1F6EC; DISALLOWED # HAMMER AND WRENCH..AIRPLANE ARRIVING 1F6E0..1F6EC; DISALLOWED # HAMMER AND WRENCH..AIRPLANE ARRIVING
1F6F0..1F6F3; DISALLOWED # SATELLITE..PASSENGER SHIP 1F6F0..1F6F3; DISALLOWED # SATELLITE..PASSENGER SHIP
1F780..1F7D4; DISALLOWED # BLACK LEFT-POINTING ISOSCELES RIGHT TRIANGLE. 1F780..1F7D4; DISALLOWED # BLACK LEFT-POINTING ISOSCELES RIGHT TRIANGLE.
1F800..1F80B; DISALLOWED # LEFTWARDS ARROW WITH SMALL TRIANGLE ARROWHEAD 1F800..1F80B; DISALLOWED # LEFTWARDS ARROW WITH SMALL TRIANGLE ARROWHEAD
1F810..1F847; DISALLOWED # LEFTWARDS ARROW WITH SMALL EQUILATERAL ARROWH 1F810..1F847; DISALLOWED # LEFTWARDS ARROW WITH SMALL EQUILATERAL ARROWH
1F850..1F859; DISALLOWED # LEFTWARDS SANS-SERIF ARROW..UP DOWN SANS-SERI 1F850..1F859; DISALLOWED # LEFTWARDS SANS-SERIF ARROW..UP DOWN SANS-SERI
1F860..1F887; DISALLOWED # WIDE-HEADED LEFTWARDS LIGHT BARB ARROW..WIDE- 1F860..1F887; DISALLOWED # WIDE-HEADED LEFTWARDS LIGHT BARB ARROW..WIDE-
1F890..1F8AD; DISALLOWED # LEFTWARDS TRIANGLE ARROWHEAD..WHITE ARROW SHA 1F890..1F8AD; DISALLOWED # LEFTWARDS TRIANGLE ARROWHEAD..WHITE ARROW SHA
</artwork></figure></section> ]]></artwork>
<section anchor="Appendix-7.0.0" title="Changes from Unicode 7.0.0 to Unicod </section>
e 8.0.0"> <section anchor="Appendix-7.0.0" numbered="true" toc="default">
<name>Changes from Unicode 7.0.0 to Unicode 8.0.0</name>
<t> <t>
Changes from derived property value UNASSIGNED to either Changes from derived property value UNASSIGNED to either
PVALID or DISALLOWED. PVALID or DISALLOWED.
</t> </t>
<figure><artwork> <artwork name="" type="" align="left" alt=""><![CDATA[
08B3..08B4 ; PVALID # ARABIC LETTER AIN WITH THREE DOTS BELOW..ARAB 08B3..08B4 ; PVALID # ARABIC LETTER AIN WITH THREE DOTS BELOW..ARAB
08E3 ; PVALID # ARABIC TURNED DAMMA BELOW 08E3 ; PVALID # ARABIC TURNED DAMMA BELOW
0AF9 ; PVALID # GUJARATI LETTER ZHA 0AF9 ; PVALID # GUJARATI LETTER ZHA
0C5A ; PVALID # TELUGU LETTER RRRA 0C5A ; PVALID # TELUGU LETTER RRRA
0D5F ; PVALID # MALAYALAM LETTER ARCHAIC II 0D5F ; PVALID # MALAYALAM LETTER ARCHAIC II
13F5 ; PVALID # CHEROKEE LETTER MV 13F5 ; PVALID # CHEROKEE LETTER MV
13F8..13FD ; DISALLOWED # CHEROKEE SMALL LETTER YE..CHEROKEE SMALL LETT 13F8..13FD ; DISALLOWED # CHEROKEE SMALL LETTER YE..CHEROKEE SMALL LETT
20BE ; DISALLOWED # LARI SIGN 20BE ; DISALLOWED # LARI SIGN
218A..218B ; DISALLOWED # TURNED DIGIT TWO..TURNED DIGIT THREE 218A..218B ; DISALLOWED # TURNED DIGIT TWO..TURNED DIGIT THREE
2BEC..2BEF ; DISALLOWED # LEFTWARDS TWO-HEADED ARROW WITH TRIANGLE ARRO 2BEC..2BEF ; DISALLOWED # LEFTWARDS TWO-HEADED ARROW WITH TRIANGLE ARRO
9FCD..9FD5 ; PVALID # &lt;CJK Ideograph&gt;..&lt;CJK Ideograph&gt; 9FCD..9FD5 ; PVALID # <CJK Ideograph>..<CJK Ideograph>
A69E ; PVALID # COMBINING CYRILLIC LETTER EF A69E ; PVALID # COMBINING CYRILLIC LETTER EF
A78F ; PVALID # LATIN LETTER SINOLOGICAL DOT A78F ; PVALID # LATIN LETTER SINOLOGICAL DOT
A7B2..A7B4 ; DISALLOWED # LATIN CAPITAL LETTER J WITH CROSSED-TAIL..LAT A7B2..A7B4 ; DISALLOWED # LATIN CAPITAL LETTER J WITH CROSSED-TAIL..LAT
A7B5 ; PVALID # LATIN SMALL LETTER BETA A7B5 ; PVALID # LATIN SMALL LETTER BETA
A7B6 ; DISALLOWED # LATIN CAPITAL LETTER OMEGA A7B6 ; DISALLOWED # LATIN CAPITAL LETTER OMEGA
A7B7 ; PVALID # LATIN SMALL LETTER OMEGA A7B7 ; PVALID # LATIN SMALL LETTER OMEGA
A8FC ; DISALLOWED # DEVANAGARI SIGN SIDDHAM A8FC ; DISALLOWED # DEVANAGARI SIGN SIDDHAM
A8FD ; PVALID # DEVANAGARI JAIN OM A8FD ; PVALID # DEVANAGARI JAIN OM
AB60..AB63 ; PVALID # LATIN SMALL LETTER SAKHA YAT..LATIN SMALL LET AB60..AB63 ; PVALID # LATIN SMALL LETTER SAKHA YAT..LATIN SMALL LET
AB70..ABBF ; DISALLOWED # CHEROKEE SMALL LETTER A..CHEROKEE SMALL LETTE AB70..ABBF ; DISALLOWED # CHEROKEE SMALL LETTER A..CHEROKEE SMALL LETTE
skipping to change at line 1255 skipping to change at line 1216
1F37E..1F37F; DISALLOWED # BOTTLE WITH POPPING CORK..POPCORN 1F37E..1F37F; DISALLOWED # BOTTLE WITH POPPING CORK..POPCORN
1F3CF..1F3D3; DISALLOWED # CRICKET BAT AND BALL..TABLE TENNIS PADDLE AND 1F3CF..1F3D3; DISALLOWED # CRICKET BAT AND BALL..TABLE TENNIS PADDLE AND
1F3F8..1F3FF; DISALLOWED # BADMINTON RACQUET AND SHUTTLECOCK..EMOJI MODI 1F3F8..1F3FF; DISALLOWED # BADMINTON RACQUET AND SHUTTLECOCK..EMOJI MODI
1F4FF ; DISALLOWED # PRAYER BEADS 1F4FF ; DISALLOWED # PRAYER BEADS
1F54B..1F54F; DISALLOWED # KAABA..BOWL OF HYGIEIA 1F54B..1F54F; DISALLOWED # KAABA..BOWL OF HYGIEIA
1F643..1F644; DISALLOWED # UPSIDE-DOWN FACE..FACE WITH ROLLING EYES 1F643..1F644; DISALLOWED # UPSIDE-DOWN FACE..FACE WITH ROLLING EYES
1F6D0 ; DISALLOWED # PLACE OF WORSHIP 1F6D0 ; DISALLOWED # PLACE OF WORSHIP
1F910..1F918; DISALLOWED # ZIPPER-MOUTH FACE..SIGN OF THE HORNS 1F910..1F918; DISALLOWED # ZIPPER-MOUTH FACE..SIGN OF THE HORNS
1F980..1F984; DISALLOWED # CRAB..UNICORN FACE 1F980..1F984; DISALLOWED # CRAB..UNICORN FACE
1F9C0 ; DISALLOWED # CHEESE WEDGE 1F9C0 ; DISALLOWED # CHEESE WEDGE
2B820..2CEA1; PVALID # &lt;CJK Ideograph Extension E&gt;..&lt;CJK Ideograph 2B820..2CEA1; PVALID # <CJK Ideograph Extension E>..<CJK Ideograph E
E ]]></artwork>
</artwork></figure></section> </section>
<section anchor="Appendix-8.0.0" title="Changes from Unicode 8.0.0 to Unicod <section anchor="Appendix-8.0.0" numbered="true" toc="default">
e 9.0.0"> <name>Changes from Unicode 8.0.0 to Unicode 9.0.0</name>
<t> <t>
Changes from derived property value UNASSIGNED to either Changes from derived property value UNASSIGNED to either
PVALID or DISALLOWED. PVALID or DISALLOWED.
</t> </t>
<figure><artwork> <artwork name="" type="" align="left" alt=""><![CDATA[
08B6..08BD ; PVALID # ARABIC LETTER BEH WITH SMALL MEEM ABOVE..ARAB 08B6..08BD ; PVALID # ARABIC LETTER BEH WITH SMALL MEEM ABOVE..ARAB
08D4..08E1 ; PVALID # ARABIC SMALL HIGH WORD AR-RUB..ARABIC SMALL H 08D4..08E1 ; PVALID # ARABIC SMALL HIGH WORD AR-RUB..ARABIC SMALL H
08E2 ; DISALLOWED # ARABIC DISPUTED END OF AYAH 08E2 ; DISALLOWED # ARABIC DISPUTED END OF AYAH
0C80 ; PVALID # KANNADA SIGN SPACING CANDRABINDU 0C80 ; PVALID # KANNADA SIGN SPACING CANDRABINDU
0D4F ; DISALLOWED # MALAYALAM SIGN PARA 0D4F ; DISALLOWED # MALAYALAM SIGN PARA
0D54..0D56 ; PVALID # MALAYALAM LETTER CHILLU M..MALAYALAM LETTER C 0D54..0D56 ; PVALID # MALAYALAM LETTER CHILLU M..MALAYALAM LETTER C
0D58..0D5E ; DISALLOWED # MALAYALAM FRACTION ONE ONE-HUNDRED-AND-SIXTIE 0D58..0D5E ; DISALLOWED # MALAYALAM FRACTION ONE ONE-HUNDRED-AND-SIXTIE
0D76..0D78 ; DISALLOWED # MALAYALAM FRACTION ONE SIXTEENTH..MALAYALAM F 0D76..0D78 ; DISALLOWED # MALAYALAM FRACTION ONE SIXTEENTH..MALAYALAM F
1C80..1C88 ; DISALLOWED # CYRILLIC SMALL LETTER ROUNDED VE..CYRILLIC SM 1C80..1C88 ; DISALLOWED # CYRILLIC SMALL LETTER ROUNDED VE..CYRILLIC SM
1DFB ; PVALID # COMBINING DELETION MARK 1DFB ; PVALID # COMBINING DELETION MARK
skipping to change at line 1298 skipping to change at line 1261
11C0A..11C36; PVALID # BHAIKSUKI LETTER E..BHAIKSUKI VOWEL SIGN VOCA 11C0A..11C36; PVALID # BHAIKSUKI LETTER E..BHAIKSUKI VOWEL SIGN VOCA
11C38..11C40; PVALID # BHAIKSUKI VOWEL SIGN E..BHAIKSUKI SIGN AVAGRA 11C38..11C40; PVALID # BHAIKSUKI VOWEL SIGN E..BHAIKSUKI SIGN AVAGRA
11C41..11C45; DISALLOWED # BHAIKSUKI DANDA..BHAIKSUKI GAP FILLER-2 11C41..11C45; DISALLOWED # BHAIKSUKI DANDA..BHAIKSUKI GAP FILLER-2
11C50..11C59; PVALID # BHAIKSUKI DIGIT ZERO..BHAIKSUKI DIGIT NINE 11C50..11C59; PVALID # BHAIKSUKI DIGIT ZERO..BHAIKSUKI DIGIT NINE
11C5A..11C6C; DISALLOWED # BHAIKSUKI NUMBER ONE..BHAIKSUKI HUNDREDS UNIT 11C5A..11C6C; DISALLOWED # BHAIKSUKI NUMBER ONE..BHAIKSUKI HUNDREDS UNIT
11C70..11C71; DISALLOWED # MARCHEN HEAD MARK..MARCHEN MARK SHAD 11C70..11C71; DISALLOWED # MARCHEN HEAD MARK..MARCHEN MARK SHAD
11C72..11C8F; PVALID # MARCHEN LETTER KA..MARCHEN LETTER A 11C72..11C8F; PVALID # MARCHEN LETTER KA..MARCHEN LETTER A
11C92..11CA7; PVALID # MARCHEN SUBJOINED LETTER KA..MARCHEN SUBJOINE 11C92..11CA7; PVALID # MARCHEN SUBJOINED LETTER KA..MARCHEN SUBJOINE
11CA9..11CB6; PVALID # MARCHEN SUBJOINED LETTER YA..MARCHEN SIGN CAN 11CA9..11CB6; PVALID # MARCHEN SUBJOINED LETTER YA..MARCHEN SIGN CAN
16FE0 ; PVALID # TANGUT ITERATION MARK 16FE0 ; PVALID # TANGUT ITERATION MARK
17000..187EC; PVALID # &lt;Tangut Ideograph&gt;..&lt;Tangut Ideograph&gt; 17000..187EC; PVALID # <Tangut Ideograph>..<Tangut Ideograph>
18800..18AF2; PVALID # TANGUT COMPONENT-001..TANGUT COMPONENT-755 18800..18AF2; PVALID # TANGUT COMPONENT-001..TANGUT COMPONENT-755
1E000..1E006; PVALID # COMBINING GLAGOLITIC LETTER AZU..COMBINING GL 1E000..1E006; PVALID # COMBINING GLAGOLITIC LETTER AZU..COMBINING GL
1E008..1E018; PVALID # COMBINING GLAGOLITIC LETTER ZEMLJA..COMBINING 1E008..1E018; PVALID # COMBINING GLAGOLITIC LETTER ZEMLJA..COMBINING
1E01B..1E021; PVALID # COMBINING GLAGOLITIC LETTER SHTA..COMBINING G 1E01B..1E021; PVALID # COMBINING GLAGOLITIC LETTER SHTA..COMBINING G
1E023..1E024; PVALID # COMBINING GLAGOLITIC LETTER YU..COMBINING GLA 1E023..1E024; PVALID # COMBINING GLAGOLITIC LETTER YU..COMBINING GLA
1E026..1E02A; PVALID # COMBINING GLAGOLITIC LETTER YO..COMBINING GLA 1E026..1E02A; PVALID # COMBINING GLAGOLITIC LETTER YO..COMBINING GLA
1E900..1E921; DISALLOWED # ADLAM CAPITAL LETTER ALIF..ADLAM CAPITAL LETT 1E900..1E921; DISALLOWED # ADLAM CAPITAL LETTER ALIF..ADLAM CAPITAL LETT
1E922..1E94A; PVALID # ADLAM SMALL LETTER ALIF..ADLAM NUKTA 1E922..1E94A; PVALID # ADLAM SMALL LETTER ALIF..ADLAM NUKTA
1E950..1E959; PVALID # ADLAM DIGIT ZERO..ADLAM DIGIT NINE 1E950..1E959; PVALID # ADLAM DIGIT ZERO..ADLAM DIGIT NINE
1E95E..1E95F; DISALLOWED # ADLAM INITIAL EXCLAMATION MARK..ADLAM INITIAL 1E95E..1E95F; DISALLOWED # ADLAM INITIAL EXCLAMATION MARK..ADLAM INITIAL
skipping to change at line 1322 skipping to change at line 1285
1F5A4 ; DISALLOWED # BLACK HEART 1F5A4 ; DISALLOWED # BLACK HEART
1F6D1..1F6D2; DISALLOWED # OCTAGONAL SIGN..SHOPPING TROLLEY 1F6D1..1F6D2; DISALLOWED # OCTAGONAL SIGN..SHOPPING TROLLEY
1F6F4..1F6F6; DISALLOWED # SCOOTER..CANOE 1F6F4..1F6F6; DISALLOWED # SCOOTER..CANOE
1F919..1F91E; DISALLOWED # CALL ME HAND..HAND WITH INDEX AND MIDDLE FING 1F919..1F91E; DISALLOWED # CALL ME HAND..HAND WITH INDEX AND MIDDLE FING
1F920..1F927; DISALLOWED # FACE WITH COWBOY HAT..SNEEZING FACE 1F920..1F927; DISALLOWED # FACE WITH COWBOY HAT..SNEEZING FACE
1F930 ; DISALLOWED # PREGNANT WOMAN 1F930 ; DISALLOWED # PREGNANT WOMAN
1F933..1F93E; DISALLOWED # SELFIE..HANDBALL 1F933..1F93E; DISALLOWED # SELFIE..HANDBALL
1F940..1F94B; DISALLOWED # WILTED FLOWER..MARTIAL ARTS UNIFORM 1F940..1F94B; DISALLOWED # WILTED FLOWER..MARTIAL ARTS UNIFORM
1F950..1F95E; DISALLOWED # CROISSANT..PANCAKES 1F950..1F95E; DISALLOWED # CROISSANT..PANCAKES
1F985..1F991; DISALLOWED # EAGLE..SQUID 1F985..1F991; DISALLOWED # EAGLE..SQUID
</artwork></figure></section> ]]></artwork>
<section anchor="Appendix-9.0.0" title="Changes from Unicode 9.0.0 to Unicod </section>
e 10.0.0"> <section anchor="Appendix-9.0.0" numbered="true" toc="default">
<name>Changes from Unicode 9.0.0 to Unicode 10.0.0</name>
<t> <t>
Changes from derived property value UNASSIGNED to either Changes from derived property value UNASSIGNED to either
PVALID or DISALLOWED. PVALID or DISALLOWED.
</t> </t>
<figure><artwork> <artwork name="" type="" align="left" alt=""><![CDATA[
0860..086A ; PVALID # SYRIAC LETTER MALAYALAM NGA..SYRIAC LETTER MA 0860..086A ; PVALID # SYRIAC LETTER MALAYALAM NGA..SYRIAC LETTER MA
09FC ; PVALID # BENGALI LETTER VEDIC ANUSVARA 09FC ; PVALID # BENGALI LETTER VEDIC ANUSVARA
09FD ; DISALLOWED # BENGALI ABBREVIATION SIGN 09FD ; DISALLOWED # BENGALI ABBREVIATION SIGN
0AFA..0AFF ; PVALID # GUJARATI SIGN SUKUN..GUJARATI SIGN TWO-CIRCLE 0AFA..0AFF ; PVALID # GUJARATI SIGN SUKUN..GUJARATI SIGN TWO-CIRCLE
0D00 ; PVALID # MALAYALAM SIGN COMBINING ANUSVARA ABOVE 0D00 ; PVALID # MALAYALAM SIGN COMBINING ANUSVARA ABOVE
0D3B..0D3C ; PVALID # MALAYALAM SIGN VERTICAL BAR VIRAMA..MALAYALAM 0D3B..0D3C ; PVALID # MALAYALAM SIGN VERTICAL BAR VIRAMA..MALAYALAM
1CF7 ; PVALID # VEDIC SIGN ATIKRAMA 1CF7 ; PVALID # VEDIC SIGN ATIKRAMA
1DF6..1DF9 ; PVALID # COMBINING KAVYKA ABOVE RIGHT..COMBINING WIDE 1DF6..1DF9 ; PVALID # COMBINING KAVYKA ABOVE RIGHT..COMBINING WIDE
20BF ; DISALLOWED # BITCOIN SIGN 20BF ; DISALLOWED # BITCOIN SIGN
23FF ; DISALLOWED # OBSERVER EYE SYMBOL 23FF ; DISALLOWED # OBSERVER EYE SYMBOL
2BD2 ; DISALLOWED # GROUP MARK 2BD2 ; DISALLOWED # GROUP MARK
2E45..2E49 ; DISALLOWED # INVERTED LOW KAVYKA..DOUBLE STACKED COMMA 2E45..2E49 ; DISALLOWED # INVERTED LOW KAVYKA..DOUBLE STACKED COMMA
312E ; PVALID # BOPOMOFO LETTER O WITH DOT ABOVE 312E ; PVALID # BOPOMOFO LETTER O WITH DOT ABOVE
9FD6..9FEA ; PVALID # &lt;CJK Ideograph&gt;..&lt;CJK Ideograph&gt; 9FD6..9FEA ; PVALID # <CJK Ideograph>..<CJK Ideograph>
1032D..1032F; PVALID # OLD ITALIC LETTER YE..OLD ITALIC LETTER SOUTH 1032D..1032F; PVALID # OLD ITALIC LETTER YE..OLD ITALIC LETTER SOUTH
11A00..11A3E; PVALID # ZANABAZAR SQUARE LETTER A..ZANABAZAR SQUARE C 11A00..11A3E; PVALID # ZANABAZAR SQUARE LETTER A..ZANABAZAR SQUARE C
11A3F..11A46; DISALLOWED # ZANABAZAR SQUARE INITIAL HEAD MARK..ZANABAZAR 11A3F..11A46; DISALLOWED # ZANABAZAR SQUARE INITIAL HEAD MARK..ZANABAZAR
11A47 ; PVALID # ZANABAZAR SQUARE SUBJOINER 11A47 ; PVALID # ZANABAZAR SQUARE SUBJOINER
11A50..11A83; PVALID # SOYOMBO LETTER A..SOYOMBO LETTER KSSA 11A50..11A83; PVALID # SOYOMBO LETTER A..SOYOMBO LETTER KSSA
11A86..11A99; PVALID # SOYOMBO CLUSTER-INITIAL LETTER RA..SOYOMBO SU 11A86..11A99; PVALID # SOYOMBO CLUSTER-INITIAL LETTER RA..SOYOMBO SU
11A9A..11A9C; DISALLOWED # SOYOMBO MARK TSHEG..SOYOMBO MARK DOUBLE SHAD 11A9A..11A9C; DISALLOWED # SOYOMBO MARK TSHEG..SOYOMBO MARK DOUBLE SHAD
11A9E..11AA2; DISALLOWED # SOYOMBO HEAD MARK WITH MOON AND SUN AND TRIPL 11A9E..11AA2; DISALLOWED # SOYOMBO HEAD MARK WITH MOON AND SUN AND TRIPL
11D00..11D06; PVALID # MASARAM GONDI LETTER A..MASARAM GONDI LETTER 11D00..11D06; PVALID # MASARAM GONDI LETTER A..MASARAM GONDI LETTER
11D08..11D09; PVALID # MASARAM GONDI LETTER AI..MASARAM GONDI LETTER 11D08..11D09; PVALID # MASARAM GONDI LETTER AI..MASARAM GONDI LETTER
skipping to change at line 1372 skipping to change at line 1337
1F6D3..1F6D4; DISALLOWED # STUPA..PAGODA 1F6D3..1F6D4; DISALLOWED # STUPA..PAGODA
1F6F7..1F6F8; DISALLOWED # SLED..FLYING SAUCER 1F6F7..1F6F8; DISALLOWED # SLED..FLYING SAUCER
1F900..1F90B; DISALLOWED # CIRCLED CROSS FORMEE WITH FOUR DOTS..DOWNWARD 1F900..1F90B; DISALLOWED # CIRCLED CROSS FORMEE WITH FOUR DOTS..DOWNWARD
1F91F ; DISALLOWED # I LOVE YOU HAND SIGN 1F91F ; DISALLOWED # I LOVE YOU HAND SIGN
1F928..1F92F; DISALLOWED # FACE WITH ONE EYEBROW RAISED..SHOCKED FACE WI 1F928..1F92F; DISALLOWED # FACE WITH ONE EYEBROW RAISED..SHOCKED FACE WI
1F931..1F932; DISALLOWED # BREAST-FEEDING..PALMS UP TOGETHER 1F931..1F932; DISALLOWED # BREAST-FEEDING..PALMS UP TOGETHER
1F94C ; DISALLOWED # CURLING STONE 1F94C ; DISALLOWED # CURLING STONE
1F95F..1F96B; DISALLOWED # DUMPLING..CANNED FOOD 1F95F..1F96B; DISALLOWED # DUMPLING..CANNED FOOD
1F992..1F997; DISALLOWED # GIRAFFE FACE..CRICKET 1F992..1F997; DISALLOWED # GIRAFFE FACE..CRICKET
1F9D0..1F9E6; DISALLOWED # FACE WITH MONOCLE..SOCKS 1F9D0..1F9E6; DISALLOWED # FACE WITH MONOCLE..SOCKS
2CEB0..2EBE0; PVALID # &lt;CJK Ideograph Extension F&gt;..&lt;CJK Ideograph 2CEB0..2EBE0; PVALID # <CJK Ideograph Extension F>..<CJK Ideograph E
E ]]></artwork>
</artwork></figure></section> </section>
<section anchor="Appendix-10.0.0" title="Changes from Unicode 10.0.0 to Unic <section anchor="Appendix-10.0.0" numbered="true" toc="default">
ode 11.0.0"> <name>Changes from Unicode 10.0.0 to Unicode 11.0.0</name>
<t> <t>
Changes from derived property value DISALLOWED to PVALID. Changes from derived property value DISALLOWED to PVALID.
</t> </t>
<figure><artwork> <artwork name="" type="" align="left" alt=""><![CDATA[
111C9 ; PVALID # SHARADA SANDHI MARK 111C9 ; PVALID # SHARADA SANDHI MARK
</artwork></figure> ]]></artwork>
<t> <t>
Changes from derived property value UNASSIGNED to either Changes from derived property value UNASSIGNED to either
PVALID or DISALLOWED. PVALID or DISALLOWED.
</t> </t>
<figure><artwork> <artwork name="" type="" align="left" alt=""><![CDATA[
0560 ; PVALID # ARMENIAN SMALL LETTER TURNED AYB 0560 ; PVALID # ARMENIAN SMALL LETTER TURNED AYB
0588 ; PVALID # ARMENIAN SMALL LETTER YI WITH STROKE 0588 ; PVALID # ARMENIAN SMALL LETTER YI WITH STROKE
05EF ; PVALID # HEBREW YOD TRIANGLE 05EF ; PVALID # HEBREW YOD TRIANGLE
07FD ; PVALID # NKO DANTAYALAN 07FD ; PVALID # NKO DANTAYALAN
07FE..07FF ; DISALLOWED # NKO DOROME SIGN..NKO TAMAN SIGN 07FE..07FF ; DISALLOWED # NKO DOROME SIGN..NKO TAMAN SIGN
08D3 ; PVALID # ARABIC SMALL LOW WAW 08D3 ; PVALID # ARABIC SMALL LOW WAW
09FE ; PVALID # BENGALI SANDHI MARK 09FE ; PVALID # BENGALI SANDHI MARK
0A76 ; DISALLOWED # GURMUKHI ABBREVIATION SIGN 0A76 ; DISALLOWED # GURMUKHI ABBREVIATION SIGN
0C04 ; PVALID # TELUGU SIGN COMBINING ANUSVARA ABOVE 0C04 ; PVALID # TELUGU SIGN COMBINING ANUSVARA ABOVE
0C84 ; DISALLOWED # KANNADA SIGN SIDDHAM 0C84 ; DISALLOWED # KANNADA SIGN SIDDHAM
1878 ; PVALID # MONGOLIAN LETTER CHA WITH TWO DOTS 1878 ; PVALID # MONGOLIAN LETTER CHA WITH TWO DOTS
1C90..1CBA ; DISALLOWED # GEORGIAN MTAVRULI CAPITAL LETTER AN..GEORGIAN 1C90..1CBA ; DISALLOWED # GEORGIAN MTAVRULI CAPITAL LETTER AN..GEORGIAN
1CBD..1CBF ; DISALLOWED # GEORGIAN MTAVRULI CAPITAL LETTER AEN..GEORGIA 1CBD..1CBF ; DISALLOWED # GEORGIAN MTAVRULI CAPITAL LETTER AEN..GEORGIA
2BBA..2BBC ; DISALLOWED # OVERLAPPING WHITE SQUARES..OVERLAPPING BLACK 2BBA..2BBC ; DISALLOWED # OVERLAPPING WHITE SQUARES..OVERLAPPING BLACK
2BD3..2BEB ; DISALLOWED # PLUTO FORM TWO..STAR WITH RIGHT HALF BLACK 2BD3..2BEB ; DISALLOWED # PLUTO FORM TWO..STAR WITH RIGHT HALF BLACK
2BF0..2BFE ; DISALLOWED # ERIS FORM ONE..REVERSED RIGHT ANGLE 2BF0..2BFE ; DISALLOWED # ERIS FORM ONE..REVERSED RIGHT ANGLE
2E4A..2E4E ; DISALLOWED # DOTTED SOLIDUS..PUNCTUS ELEVATUS MARK 2E4A..2E4E ; DISALLOWED # DOTTED SOLIDUS..PUNCTUS ELEVATUS MARK
312F ; PVALID # BOPOMOFO LETTER NN 312F ; PVALID # BOPOMOFO LETTER NN
9FEB..9FEF ; PVALID # &lt;CJK Ideograph&gt;..&lt;CJK Ideograph&gt; 9FEB..9FEF ; PVALID # <CJK Ideograph>..<CJK Ideograph>
A7AF ; PVALID # LATIN LETTER SMALL CAPITAL Q A7AF ; PVALID # LATIN LETTER SMALL CAPITAL Q
A7B8 ; DISALLOWED # LATIN CAPITAL LETTER U WITH STROKE A7B8 ; DISALLOWED # LATIN CAPITAL LETTER U WITH STROKE
A7B9 ; PVALID # LATIN SMALL LETTER U WITH STROKE A7B9 ; PVALID # LATIN SMALL LETTER U WITH STROKE
A8FE..A8FF ; PVALID # DEVANAGARI LETTER AY..DEVANAGARI VOWEL SIGN A A8FE..A8FF ; PVALID # DEVANAGARI LETTER AY..DEVANAGARI VOWEL SIGN A
10A34..10A35; PVALID # KHAROSHTHI LETTER TTTA..KHAROSHTHI LETTER VHA 10A34..10A35; PVALID # KHAROSHTHI LETTER TTTA..KHAROSHTHI LETTER VHA
10A48 ; DISALLOWED # KHAROSHTHI FRACTION ONE HALF 10A48 ; DISALLOWED # KHAROSHTHI FRACTION ONE HALF
10D00..10D27; PVALID # HANIFI ROHINGYA LETTER A..HANIFI ROHINGYA SIG 10D00..10D27; PVALID # HANIFI ROHINGYA LETTER A..HANIFI ROHINGYA SIG
10D30..10D39; PVALID # HANIFI ROHINGYA DIGIT ZERO..HANIFI ROHINGYA D 10D30..10D39; PVALID # HANIFI ROHINGYA DIGIT ZERO..HANIFI ROHINGYA D
10F00..10F1C; PVALID # OLD SOGDIAN LETTER ALEPH..OLD SOGDIAN LETTER 10F00..10F1C; PVALID # OLD SOGDIAN LETTER ALEPH..OLD SOGDIAN LETTER
10F1D..10F26; DISALLOWED # OLD SOGDIAN NUMBER ONE..OLD SOGDIAN FRACTION 10F1D..10F26; DISALLOWED # OLD SOGDIAN NUMBER ONE..OLD SOGDIAN FRACTION
skipping to change at line 1437 skipping to change at line 1404
11D67..11D68; PVALID # GUNJALA GONDI LETTER EE..GUNJALA GONDI LETTER 11D67..11D68; PVALID # GUNJALA GONDI LETTER EE..GUNJALA GONDI LETTER
11D6A..11D8E; PVALID # GUNJALA GONDI LETTER OO..GUNJALA GONDI VOWEL 11D6A..11D8E; PVALID # GUNJALA GONDI LETTER OO..GUNJALA GONDI VOWEL
11D90..11D91; PVALID # GUNJALA GONDI VOWEL SIGN EE..GUNJALA GONDI VO 11D90..11D91; PVALID # GUNJALA GONDI VOWEL SIGN EE..GUNJALA GONDI VO
11D93..11D98; PVALID # GUNJALA GONDI VOWEL SIGN OO..GUNJALA GONDI OM 11D93..11D98; PVALID # GUNJALA GONDI VOWEL SIGN OO..GUNJALA GONDI OM
11DA0..11DA9; PVALID # GUNJALA GONDI DIGIT ZERO..GUNJALA GONDI DIGIT 11DA0..11DA9; PVALID # GUNJALA GONDI DIGIT ZERO..GUNJALA GONDI DIGIT
11EE0..11EF6; PVALID # MAKASAR LETTER KA..MAKASAR VOWEL SIGN O 11EE0..11EF6; PVALID # MAKASAR LETTER KA..MAKASAR VOWEL SIGN O
11EF7..11EF8; DISALLOWED # MAKASAR PASSIMBANG..MAKASAR END OF SECTION 11EF7..11EF8; DISALLOWED # MAKASAR PASSIMBANG..MAKASAR END OF SECTION
16E40..16E5F; DISALLOWED # MEDEFAIDRIN CAPITAL LETTER M..MEDEFAIDRIN CAP 16E40..16E5F; DISALLOWED # MEDEFAIDRIN CAPITAL LETTER M..MEDEFAIDRIN CAP
16E60..16E7F; PVALID # MEDEFAIDRIN SMALL LETTER M..MEDEFAIDRIN SMALL 16E60..16E7F; PVALID # MEDEFAIDRIN SMALL LETTER M..MEDEFAIDRIN SMALL
16E80..16E9A; DISALLOWED # MEDEFAIDRIN DIGIT ZERO..MEDEFAIDRIN EXCLAMATI 16E80..16E9A; DISALLOWED # MEDEFAIDRIN DIGIT ZERO..MEDEFAIDRIN EXCLAMATI
187ED..187F1; PVALID # &lt;Tangut Ideograph&gt;..&lt;Tangut Ideograph&gt; 187ED..187F1; PVALID # <Tangut Ideograph>..<Tangut Ideograph>
1D2E0..1D2F3; DISALLOWED # MAYAN NUMERAL ZERO..MAYAN NUMERAL NINETEEN 1D2E0..1D2F3; DISALLOWED # MAYAN NUMERAL ZERO..MAYAN NUMERAL NINETEEN
1D372..1D378; DISALLOWED # IDEOGRAPHIC TALLY MARK ONE..TALLY MARK FIVE 1D372..1D378; DISALLOWED # IDEOGRAPHIC TALLY MARK ONE..TALLY MARK FIVE
1EC71..1ECB4; DISALLOWED # INDIC SIYAQ NUMBER ONE..INDIC SIYAQ ALTERNATE 1EC71..1ECB4; DISALLOWED # INDIC SIYAQ NUMBER ONE..INDIC SIYAQ ALTERNATE
1F12F ; DISALLOWED # COPYLEFT SYMBOL 1F12F ; DISALLOWED # COPYLEFT SYMBOL
1F6F9 ; DISALLOWED # SKATEBOARD 1F6F9 ; DISALLOWED # SKATEBOARD
1F7D5..1F7D8; DISALLOWED # CIRCLED TRIANGLE..NEGATIVE CIRCLED SQUARE 1F7D5..1F7D8; DISALLOWED # CIRCLED TRIANGLE..NEGATIVE CIRCLED SQUARE
1F94D..1F94F; DISALLOWED # LACROSSE STICK AND BALL..FLYING DISC 1F94D..1F94F; DISALLOWED # LACROSSE STICK AND BALL..FLYING DISC
1F96C..1F970; DISALLOWED # LEAFY GREEN..SMILING FACE WITH SMILING EYES A 1F96C..1F970; DISALLOWED # LEAFY GREEN..SMILING FACE WITH SMILING EYES A
1F973..1F976; DISALLOWED # FACE WITH PARTY HORN AND PARTY HAT..FREEZING 1F973..1F976; DISALLOWED # FACE WITH PARTY HORN AND PARTY HAT..FREEZING
1F97A ; DISALLOWED # FACE WITH PLEADING EYES 1F97A ; DISALLOWED # FACE WITH PLEADING EYES
1F97C..1F97F; DISALLOWED # LAB COAT..FLAT SHOE 1F97C..1F97F; DISALLOWED # LAB COAT..FLAT SHOE
1F998..1F9A2; DISALLOWED # KANGAROO..SWAN 1F998..1F9A2; DISALLOWED # KANGAROO..SWAN
1F9B0..1F9B9; DISALLOWED # EMOJI COMPONENT RED HAIR..SUPERVILLAIN 1F9B0..1F9B9; DISALLOWED # EMOJI COMPONENT RED HAIR..SUPERVILLAIN
1F9C1..1F9C2; DISALLOWED # CUPCAKE..SALT SHAKER 1F9C1..1F9C2; DISALLOWED # CUPCAKE..SALT SHAKER
1F9E7..1F9FF; DISALLOWED # RED GIFT ENVELOPE..NAZAR AMULET 1F9E7..1F9FF; DISALLOWED # RED GIFT ENVELOPE..NAZAR AMULET
1FA60..1FA6D; DISALLOWED # XIANGQI RED GENERAL..XIANGQI BLACK SOLDIER 1FA60..1FA6D; DISALLOWED # XIANGQI RED GENERAL..XIANGQI BLACK SOLDIER
</artwork></figure></section> ]]></artwork>
<section anchor="Appendix-11.0.0" title="Changes from Unicode 11.0.0 to Unicode </section>
12.0.0"> <section anchor="Appendix-11.0.0" numbered="true" toc="default">
<t> <name>Changes from Unicode 11.0.0 to Unicode 12.0.0</name>
<t>
Changes from derived property value UNASSIGNED to either Changes from derived property value UNASSIGNED to either
PVALID or DISALLOWED. PVALID or DISALLOWED.
</t> </t>
<figure><artwork> <artwork name="" type="" align="left" alt=""><![CDATA[
0C77 ; DISALLOWED # TELUGU SIGN SIDDHAM 0C77 ; DISALLOWED # TELUGU SIGN SIDDHAM
0E86 ; PVALID # LAO LETTER PALI GHA 0E86 ; PVALID # LAO LETTER PALI GHA
0E89 ; PVALID # LAO LETTER PALI CHA 0E89 ; PVALID # LAO LETTER PALI CHA
0E8C ; PVALID # LAO LETTER PALI JHA 0E8C ; PVALID # LAO LETTER PALI JHA
0E8E..0E93 ; PVALID # LAO LETTER PALI NYA..LAO LETTER PALI NNA 0E8E..0E93 ; PVALID # LAO LETTER PALI NYA..LAO LETTER PALI NNA
0E98 ; PVALID # LAO LETTER PALI DHA 0E98 ; PVALID # LAO LETTER PALI DHA
0EA0 ; PVALID # LAO LETTER PALI BHA 0EA0 ; PVALID # LAO LETTER PALI BHA
0EA8..0EA9 ; PVALID # LAO LETTER SANSKRIT SHA..LAO LETTER SANSKRIT 0EA8..0EA9 ; PVALID # LAO LETTER SANSKRIT SHA..LAO LETTER SANSKRIT
0EAC ; PVALID # LAO LETTER PALI LLA 0EAC ; PVALID # LAO LETTER PALI LLA
0EBA ; PVALID # LAO SIGN PALI VIRAMA 0EBA ; PVALID # LAO SIGN PALI VIRAMA
skipping to change at line 1502 skipping to change at line 1471
119E3..119E4; PVALID # NANDINAGARI HEADSTROKE..NANDINAGARI VOWEL SIG 119E3..119E4; PVALID # NANDINAGARI HEADSTROKE..NANDINAGARI VOWEL SIG
11A84..11A85; PVALID # SOYOMBO SIGN JIHVAMULIYA..SOYOMBO SIGN UPADHM 11A84..11A85; PVALID # SOYOMBO SIGN JIHVAMULIYA..SOYOMBO SIGN UPADHM
11FC0..11FF1; DISALLOWED # TAMIL FRACTION ONE THREE-HUNDRED-AND-TWENTIET 11FC0..11FF1; DISALLOWED # TAMIL FRACTION ONE THREE-HUNDRED-AND-TWENTIET
11FFF ; DISALLOWED # TAMIL PUNCTUATION END OF TEXT 11FFF ; DISALLOWED # TAMIL PUNCTUATION END OF TEXT
13430..13438; DISALLOWED # EGYPTIAN HIEROGLYPH VERTICAL JOINER..EGYPTIAN 13430..13438; DISALLOWED # EGYPTIAN HIEROGLYPH VERTICAL JOINER..EGYPTIAN
16F45..16F4A; PVALID # MIAO LETTER BRI..MIAO LETTER RTE 16F45..16F4A; PVALID # MIAO LETTER BRI..MIAO LETTER RTE
16F4F ; PVALID # MIAO SIGN CONSONANT MODIFIER BAR 16F4F ; PVALID # MIAO SIGN CONSONANT MODIFIER BAR
16F7F..16F87; PVALID # MIAO VOWEL SIGN UOG..MIAO VOWEL SIGN UI 16F7F..16F87; PVALID # MIAO VOWEL SIGN UOG..MIAO VOWEL SIGN UI
16FE2 ; DISALLOWED # OLD CHINESE HOOK MARK 16FE2 ; DISALLOWED # OLD CHINESE HOOK MARK
16FE3 ; PVALID # OLD CHINESE ITERATION MARK 16FE3 ; PVALID # OLD CHINESE ITERATION MARK
187F2..187F7; PVALID # &lt;Tangut Ideograph&gt;..&lt;Tangut Ideograph&gt; 187F2..187F7; PVALID # <Tangut Ideograph>..<Tangut Ideograph>
1B150..1B152; PVALID # HIRAGANA LETTER SMALL WI..HIRAGANA LETTER SMA 1B150..1B152; PVALID # HIRAGANA LETTER SMALL WI..HIRAGANA LETTER SMA
1B164..1B167; PVALID # KATAKANA LETTER SMALL WI..KATAKANA LETTER SMA 1B164..1B167; PVALID # KATAKANA LETTER SMALL WI..KATAKANA LETTER SMA
1E100..1E12C; PVALID # NYIAKENG PUACHUE HMONG LETTER MA..NYIAKENG PU 1E100..1E12C; PVALID # NYIAKENG PUACHUE HMONG LETTER MA..NYIAKENG PU
1E130..1E13D; PVALID # NYIAKENG PUACHUE HMONG TONE-B..NYIAKENG PUACH 1E130..1E13D; PVALID # NYIAKENG PUACHUE HMONG TONE-B..NYIAKENG PUACH
1E140..1E149; PVALID # NYIAKENG PUACHUE HMONG DIGIT ZERO..NYIAKENG P 1E140..1E149; PVALID # NYIAKENG PUACHUE HMONG DIGIT ZERO..NYIAKENG P
1E14E ; PVALID # NYIAKENG PUACHUE HMONG LOGOGRAM NYAJ 1E14E ; PVALID # NYIAKENG PUACHUE HMONG LOGOGRAM NYAJ
1E14F ; DISALLOWED # NYIAKENG PUACHUE HMONG CIRCLED CA 1E14F ; DISALLOWED # NYIAKENG PUACHUE HMONG CIRCLED CA
1E2C0..1E2F9; PVALID # WANCHO LETTER AA..WANCHO DIGIT NINE 1E2C0..1E2F9; PVALID # WANCHO LETTER AA..WANCHO DIGIT NINE
1E2FF ; DISALLOWED # WANCHO NGUN SIGN 1E2FF ; DISALLOWED # WANCHO NGUN SIGN
1E94B ; PVALID # ADLAM NASALIZATION MARK 1E94B ; PVALID # ADLAM NASALIZATION MARK
skipping to change at line 1532 skipping to change at line 1501
1F9A5..1F9AA; DISALLOWED # SLOTH..OYSTER 1F9A5..1F9AA; DISALLOWED # SLOTH..OYSTER
1F9AE..1F9AF; DISALLOWED # GUIDE DOG..PROBING CANE 1F9AE..1F9AF; DISALLOWED # GUIDE DOG..PROBING CANE
1F9BA..1F9BF; DISALLOWED # SAFETY VEST..MECHANICAL LEG 1F9BA..1F9BF; DISALLOWED # SAFETY VEST..MECHANICAL LEG
1F9C3..1F9CA; DISALLOWED # BEVERAGE BOX..ICE CUBE 1F9C3..1F9CA; DISALLOWED # BEVERAGE BOX..ICE CUBE
1F9CD..1F9CF; DISALLOWED # STANDING PERSON..DEAF PERSON 1F9CD..1F9CF; DISALLOWED # STANDING PERSON..DEAF PERSON
1FA00..1FA53; DISALLOWED # NEUTRAL CHESS KING..BLACK CHESS KNIGHT-BISHOP 1FA00..1FA53; DISALLOWED # NEUTRAL CHESS KING..BLACK CHESS KNIGHT-BISHOP
1FA70..1FA73; DISALLOWED # BALLET SHOES..SHORTS 1FA70..1FA73; DISALLOWED # BALLET SHOES..SHORTS
1FA78..1FA7A; DISALLOWED # DROP OF BLOOD..STETHOSCOPE 1FA78..1FA7A; DISALLOWED # DROP OF BLOOD..STETHOSCOPE
1FA80..1FA82; DISALLOWED # YO-YO..PARACHUTE 1FA80..1FA82; DISALLOWED # YO-YO..PARACHUTE
1FA90..1FA95; DISALLOWED # RINGED PLANET..BANJO 1FA90..1FA95; DISALLOWED # RINGED PLANET..BANJO
</artwork></figure></section> ]]></artwork>
</back> </section>
<section numbered="false" toc="default">
<name>Acknowledgments</name>
<t>
Thanks to <contact fullname="Harald Alvestrand"/>, <contact fullname="Marc Blanchet"/>,
<contact fullname="Martin Dürst"/>, <contact fullname="Asmus Freytag"/>,
<contact fullname="Ted Hardie"/>, <contact fullname="John Klensin"/>,
<contact fullname="Erik Nordmark"/>, <contact fullname="Pete Resnick"/>,
<contact fullname="Peter Saint-Andre"/>, <contact fullname="Michel Suignard"/>,
<contact fullname="Andrew Sullivan"/>, and <contact fullname="Suzanne Woolf"/>
for input to this document.
</t>
</section>
</back>
</rfc> </rfc>
 End of changes. 173 change blocks. 
609 lines changed or deleted 637 lines changed or added
