@@ -2506,7 +2517,7 @@ Appendix D: Scripts and Spacing</h2>
2506
2517
The following <a>Unicode scripts</a> are included:
2507
2518
Bopomofo, Han, Hangul, Hiragana, Katakana, and Yi.
2508
2519
Characters of the <a>East Asian Width property</a> <code>W</code> and <code>F</code> are also included,
2509
-
but <code>A</code> characters are included only if the <a>content language</a> is Chinese, Korean, or Japanese.
2520
+
but <code>A</code> characters are included only if the <a>writing system</a> is <a for=writing-system>Chinese</a>, <a for=writing-system>Korean</a>, or <a for=writing-system>Japanese</a>.
2510
2521
<dt><dfn>clustered scripts</dfn></dt>
2511
2522
<dd>Clustered scripts have discrete units
2512
2523
and break only at word boundaries,
@@ -2579,6 +2590,8 @@ Characters and Properties</h2>
2579
2590
<h2 id="script-tagging" class="no-num">Appendix F.
2580
2591
Tagging Content by Writing System</h2>
2581
2592
2593
+
<p><em>This appendix is normative.</em></p>
2594
+
2582
2595
While most languages have a preferred writing system,
2583
2596
many can also be transcribed into a different writing system.
2584
2597
As a common example, most languages have at least one Latin transcription,
@@ -2591,7 +2604,8 @@ Tagging Content by Writing System</h2>
2591
2604
does not use word spaces,
2592
2605
and should therefore be typeset as for Chinese.
2593
2606
2594
-
Authors can indicate the use of an atypical writing system
2607
+
In [[HTML]] or any other <a>document language</a> using [[BCP47]] to identify the [=content language=],
2608
+
authors can indicate the use of an atypical writing system
2595
2609
with script subtags.
2596
2610
For example, to indicate use of the Latin writing system
2597
2611
for languages which don't natively use it,
@@ -2629,6 +2643,26 @@ Tagging Content by Writing System</h2>
2629
2643
not the conventions of that language in a different writing system,
2630
2644
which would be inappropriate to the writing system used in this case.
2631
2645
2646
+
The full correspondence between languages and their most common writing system
2647
+
is out of scope for this document.
2648
+
However, User Agents must assume at least the following:
2649
+
2650
+
* If the [=content language=] is Chinese and the [=writing system=] is unspecified,
2651
+
or for any [=content language=] if the [=writing system=] is specified to be one of the ''Hant'', ''Hans'', ''Hani'', ''Hanb'', or ''Bopo'' [[ISO15924]] codes,
2652
+
then the [=writing system=] is <dfn no-export for=writing-system>Chinese</dfn>.
2653
+
* If the [=content language=] is Japanese and the [=writing system=] is unspecified,
2654
+
or for any [=content language=] if the [=writing system=] is specified to be one of the ''Jpan'', ''Hrkt'', ''Hira'', or ''Kana'' [[ISO15924]] codes,
2655
+
then the [=writing system=] is <dfn no-export for=writing-system>Japanese</dfn>.
2656
+
* If the [=content language=] is Korean and the [=writing system=] is unspecified,
2657
+
or for any [=content language=] if the [=writing system=] is specified to be one of the ''Kore'', ''Hang'', or ''Jamo'' [[ISO15924]] codes,
2658
+
then the [=writing system=] is <dfn no-export for=writing-system>Korean</dfn>.
2659
+
* The [=writing system=] is only considered to be <dfn for=writing-system lt='known | unknown'>unknown</dfn>
2660
+
if the [=content language=] itself is unknown,
2661
+
or if it explicitly indicates an unknown writing system.
2662
+
2663
+
Note: Mere omission of the [=writing system=] information when the [=content language=] is specified
2664
+
means that the [=writing system=] is implied, not unknown.
2665
+
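The minimum assumptions listed above can be read as a small lookup procedure. The following is a non-normative sketch of one possible reading, not the algorithm of any particular User Agent. The names `writing_system`, `script_subtag`, and the `"other"` return value are hypothetical. The sketch also assumes that a script subtag is checked before the language subtag, that only the primary language subtags `zh`, `ja`, and `ko` stand for Chinese, Japanese, and Korean, that `und` or an empty tag means the content language is unknown, and that the ISO 15924 codes `Zyyy` and `Zzzz` are how a tag explicitly indicates an unknown writing system. The tag parsing is deliberately simplified and does not validate against [[BCP47]].

```python
# Script subtags that determine the writing system for any content language.
SCRIPT_TO_WRITING_SYSTEM = {
    **dict.fromkeys(["hant", "hans", "hani", "hanb", "bopo"], "chinese"),
    **dict.fromkeys(["jpan", "hrkt", "hira", "kana"], "japanese"),
    **dict.fromkeys(["kore", "hang", "jamo"], "korean"),
}

# Writing system implied by the language when no script subtag is given.
# Assumption: only these primary language subtags are recognized.
LANGUAGE_DEFAULT = {"zh": "chinese", "ja": "japanese", "ko": "korean"}

# Assumption: these codes count as "explicitly unknown" writing systems.
UNKNOWN_SCRIPTS = {"zyyy", "zzzz"}


def script_subtag(subtags):
    """Return the first four-letter alphabetic subtag after the language,
    stopping at a singleton (extension or private-use marker)."""
    for sub in subtags[1:]:
        if len(sub) == 1:
            break
        if len(sub) == 4 and sub.isalpha():
            return sub
    return None


def writing_system(lang_tag):
    """Map a language tag to 'chinese', 'japanese', 'korean',
    'unknown', or 'other' (known, but none of the three)."""
    if not lang_tag:
        return "unknown"  # content language itself is unknown
    subtags = lang_tag.lower().split("-")
    script = script_subtag(subtags)
    if script in UNKNOWN_SCRIPTS:
        return "unknown"  # explicitly unknown writing system
    if script is not None:
        return SCRIPT_TO_WRITING_SYSTEM.get(script, "other")
    if subtags[0] == "und":
        return "unknown"  # undetermined content language
    # Script omitted: the writing system is implied by the language.
    return LANGUAGE_DEFAULT.get(subtags[0], "other")
```

Under these assumptions, `writing_system("zh-TW")` yields `"chinese"` because the script is implied by the language, while `writing_system("ja-Latn")` yields `"other"` because the explicit script subtag overrides the default for Japanese.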
2632
2666
More advice on language tagging can be found in
2633
2667
the <a href="https://www.w3.org/International/core/">Internationalization Working Group</a>’s
2634
2668
<a href="https://www.w3.org/International/articles/language-tags/">“Language tags in HTML and XML”</a>