@@ -191,7 +191,8 @@ Languages and Typesetting</h3>
191191 or content in a <a>document language</a> that does not have a language-tagging facility
192192 is considered to have an unknown <a>content language</a> .
193193
194- Note: Authors can tag content using the global <code> lang</code> attribute in HTML
194+ Note: Authors can declare the [=content language=]
195+ using the global <code> lang</code> attribute in HTML
195196 or the universal <code> xml:lang</code> attribute in XML.
196197 See the <a href="https://html.spec.whatwg.org/multipage/dom.html#language">rules
197198 for determining the content language of an HTML element</a> in [[HTML]] ,
@@ -205,6 +206,40 @@ Languages and Typesetting</h3>
205206 this information can be explicit or implied.
206207 See the normative [[#script-tagging]] .
207208
209+ Note: Some languages have more than one writing system tradition;
210+ in other cases a language can be transliterated into a foreign writing system.
211+ Authors should <a href="#script-tagging">subtag</a> such cases
212+ so that the UA can adapt appropriately.
213+
214+ <!--
215+ <div class=example>
216+ For example, Korean (<code> ko</code> ) can be written in
217+ Hangul (<code> -Hang</code> ),
218+ Hanja (<code> -Hani</code> ),
219+ or a combination (<code> -Kore</code> ).
220+ Historical documents written solely in Hanja
221+ do not use word spaces and
222+ are formatted more like modern Chinese than modern Korean.
223+ In other words, for typographic purposes <code> ko-Hani</code>
224+ behaves more like <code> zh-Hant</code>
225+ than <code> ko</code> (<code> ko-Kore</code> ).
226+
227+ As another example Japanese (<code> ja</code> ) is typically written
228+ in a combination (<code> -Japn</code> ) of Hiragana (<code> -Hira</code> ),
229+ Katakana (<code> -Kana</code> ), and Kanji (<code> -Hani</code> ).
230+ However, it can also be ”romanized” into Latin (<code> -Latn</code> )
231+ for special purposes like language-learning textbooks,
232+ in which case it should be formatted more like English than Japanese.
233+
234+ As a third example contemporary Mongolian is written in two scripts:
235+ Cyrillic (<code> -Cyrl</code> , officially used in Mongolia)
236+ and Mongolian (<code> -Mong</code> , more common in Inner Mongolia, part of China).
237+ These have very different formatting requirements,
238+ with Cyrillic behaving similar to Latin and Greek,
239+ and Mongolian deriving from both Arabic and Chinese writing conventions.
240+ </div>
241+ -->
242+
208243<h3 id="characters">
209244Characters and Letters</h3>
210245
0 commit comments