@@ -618,19 +618,6 @@ <h3 id=text-transform-property><span class=secno>2.1. </span> <a
618
618
< dd > Puts all characters in fullwidth form. If the character does not have
619
619
a corresponding fullwidth form, it is left as is. This value is typically
620
620
used to typeset Latin characters and digits like ideographic characters.
621
- <!--
622
- <dt><dfn title="full-size-kana!!text-transform">''full-size-kana''</dfn></dt>
623
- <dd>Converts all small Kana characters to normal Kana.
624
- The mappings for small Kana to normal Kana are defined in
625
- <a href="#small-kana">Small Kana Mappings</a>.</p>
626
- <p class=note>
627
- This value is typically used for ruby annotation text,
628
- where due to the small type size, small Kana
629
- is often drawn as large Kana.
630
- This value allows such underlying text to use correct orthography
631
- so that it is accessible and can be styled correctly
632
- when not presented as ruby.
633
- -->
634
621
</ dl >
635
622
636
623
< p > The definition of “word“ used for ‘< code
@@ -673,15 +660,6 @@ <h3 id=text-transform-property><span class=secno>2.1. </span> <a
673
660
This rule is thus conditionally defined in Unicode's SpecialCasing.txt
674
661
file.
675
662
</ div >
676
- <!--
677
- <div class="example">
678
- <p>An example where the UA may choose to include rules beyond those
679
- in Unicode is Greek. In Greek, if the entire word is in upper case
680
- (''text-transform: capitalize'' vs. ''text-transform: uppercase''),
681
- accents are dropped or transformed according to language-specific rules.
682
- http://blogs.msdn.com/b/michkap/archive/2006/08/18/706383.aspx
683
- </div>
684
- -->
685
663
686
664
< p > The definition of fullwidth and halfwidth forms can be found on the
687
665
Unicode consortium web site at < a href ="#UAX11 "
@@ -705,11 +683,6 @@ <h3 id=text-transform-property><span class=secno>2.1. </span> <a
705
683
‘< code class =css > @text-transform</ code > ’ rule similar to ‘< code
706
684
class =css > @counter-style</ code > ’ from < a href ="#CSS-COUNTER-STYLES-3 "
707
685
rel =biblioentry > [CSS-COUNTER-STYLES-3]<!--{{CSS-COUNTER-STYLES-3}}--> </ a > .
708
- <!--
709
- This mechanism may be used to replace ''full-size-kana''. This would
710
- require authors needing this functionality to copy out the conversion
711
- tables, however.
712
- -->
713
686
714
687
< h2 id =white-space-property > < span class =secno > 3. </ span > < a
715
688
id =white-space-collapsing > </ a > < a id =text-wrap > </ a > White Space and
@@ -1226,15 +1199,9 @@ <h2 id=line-breaking><span class=secno>5. </span> Line Breaking and Word
1226
1199
Japanese, < a href ="#ZHMARK " rel =biblioentry > [ZHMARK]<!--{{ZHMARK}}--> </ a >
1227
1200
for Chinese, and in < a href ="#UAX14 "
1228
1201
rel =biblioentry > [UAX14]<!--{{!UAX14}}--> </ a > for all scripts in Unicode.
1229
- <!-- The CSS Working Group notes that although UAX 14 contains a wealth of
1230
- information about line breaking conventions, a literal implementation
1231
- of its algorithm has been found to be inadequate in multiple situations. -->
1232
1202
1233
1203
< p class ="feedback issue "> Any guidance for appropriate references here
1234
1204
would be much appreciated.
1235
- <!-- Additionally, some guidance could be provided on how
1236
- to break or not break Southeast Asian in the absence of a dictionary.
1237
- (See e.g. <a href="http://lists.w3.org/Archives/Public/www-style/2011Feb/0126.html">notes on Thai</a>.) -->
1238
1205
</ div >
1239
1206
1240
1207
< h3 id =line-break-details > < span class =secno > 5.1. </ span > Line Breaking
@@ -1366,11 +1333,7 @@ <h3 id=line-break-property><span class=secno>5.2. </span> Breaking Rules
1366
1333
1367
1334
< dd > The UA determines the set of line-breaking restrictions to use, and it
1368
1335
may vary the restrictions based on the length of the line; e.g., use a
1369
- less restrictive set of line-break rules for short lines. <!--
1370
- However, if 'word-break' is ''keep-all'',
1371
- then all the line-breaking restrictions listed below for ''line-break: strict''
1372
- must be applied.
1373
- -->
1336
+ less restrictive set of line-break rules for short lines.
1374
1337
1375
1338
< dt > < dfn id =loose title ="loose!!line-break "> ‘< code
1376
1339
class =css > loose</ code > ’</ dfn >
@@ -1733,8 +1696,6 @@ <h3 id=hyphens-property><span class=secno>6.1. </span>Hyphenation Control:
1733
1696
1734
1697
< dd > Words are only hyphenated where there are characters inside the word
1735
1698
that explicitly suggest hyphenation opportunities.
1736
- <!-- Characters can be <del>explicit</del><ins>???</ins> or conditional. -->
1737
-
1738
1699
< div class =example >
1739
1700
< p > In Unicode, U+00AD is a conditional "soft hyphen" and U+2010 is an
1740
1701
unconditional hyphen. Unicode Standard Annex #14 describes the < a
@@ -1773,7 +1734,6 @@ <h3 id=hyphens-property><span class=secno>6.1. </span>Hyphenation Control:
1773
1734
< p > For example, if the word “نوشتنن” were hyphenated, it would
1774
1735
appear as “ﻧﻮﺷ-ﺘﻦ” not as “ﻧﻮﺵ-ﺗﻦ”.
1775
1736
</ div >
1776
- <!-- add a sample prioritization algorithm -->
1777
1737
1778
1738
< h3 id =overflow-wrap-property > < span class =secno > 6.2. </ span > Overflow
1779
1739
Wrapping: the ‘< a href ="#word-wrap "> < code
@@ -1847,9 +1807,8 @@ <h3 id=overflow-wrap-property><span class=secno>6.2. </span> Overflow
1847
1807
< dd > Lines may break only at allowed break points. However, the
1848
1808
restrictions introduced by ‘< code class =css > word-break:
1849
1809
keep-all</ code > ’ may be relaxed to match ‘< code class =css > word-break:
1850
- normal</ code > ’ <!-- and the various hyphenation limit properties
1851
- may be ignored -->
1852
- if there are no otherwise-acceptable break points in the line.
1810
+ normal</ code > ’ if there are no otherwise-acceptable break points in the
1811
+ line.
1853
1812
1854
1813
< dt > < dfn id =break-word title ="break-word!!overflow-wrap "> ‘< code
1855
1814
class =css > break-word</ code > ’</ dfn >
@@ -2017,18 +1976,7 @@ <h3 id=text-align-property><span class=secno>7.1. </span> Text Alignment:
2017
1976
2018
1977
< dd > Text is justified according to the method specified by the ‘< a
2019
1978
href ="#text-justify "> < code class =property > text-justify</ code > </ a > ’
2020
- property, in order to exactly fill the line box. <!--
2021
- <dt><dfn title="<string>">''<span title="<string>!!text-align"><a
2022
- class="noxref" href="http://www.w3.org/TR/CSS21/syndata.html#value-def-string"
2023
- ><span class="value-inst-string"><string></span></a></span>''</dfn></dt>
2024
- <dd>The string must be a single <i>character</i>; otherwise the declaration
2025
- must be <a href="http://www.w3.org/TR/CSS21/conform.html#ignore">ignored</a>.
2026
- When applied to a table cell, specifies the <dfn>alignment character</dfn>
2027
- around which the cell's contents will align. See
2028
- <a href="#character-alignment">below</a> for further details and
2029
- how this value combines with keywords.
2030
- </dd>
2031
- -->
1979
+ property, in order to exactly fill the line box.
2032
1980
2033
1981
< dt > < dfn id =match-parent title ="match-parent!!text-align "> ‘< code
2034
1982
class =css > match-parent</ code > ’</ dfn >
@@ -2287,18 +2235,6 @@ <h3 id=text-justify-property><span class=secno>7.3. </span> Justification
2287
2235
< p class =caption > Mixed-script text with ‘< code class =css > text-justify:
2288
2236
inter-word</ code > ’
2289
2237
</ div >
2290
- <!--
2291
- <dt><dfn title="inter-ideographic!!text-justify">''inter-ideograph''</dfn></dt>
2292
- <dd>Justification primarily changes spacing at word separators and
2293
- between characters in <a href="#block-scripts">block scripts</a>.
2294
- This value is typically used for <abbr title="Chinese/Japanese/Korean">CJK</abbr>
2295
- languages.</dd>
2296
- <dt><dfn title="inter-cluster!!text-justify">''inter-cluster''</dfn></dt>
2297
- <dd>Justification primarily changes spacing at word separators and
2298
- between characters in <a href="#clustered-scripts">clustered
2299
- scripts</a>.
2300
- This value is typically used for Southeast Asian scripts such as Thai.</dd>
2301
- -->
2302
2238
2303
2239
< dt > < dfn id =distribute title ="distribute!!text-justify "> ‘< code
2304
2240
class =css > distribute</ code > ’</ dfn >
@@ -2316,26 +2252,7 @@ <h3 id=text-justify-property><span class=secno>7.3. </span> Justification
2316
2252
< p class =caption > Mixed-script text with ‘< code class =css > text-justify:
2317
2253
distribute</ code > ’
2318
2254
</ div >
2319
- <!--
2320
- <dt><a name="kashida-prop"></a><a name="text-kashida-space"></a>
2321
- <dfn title="kashida!!text-justify">''kashida''</dfn></dt>
2322
- <dd>Justification primarily stretches
2323
- <a href="#cursive-scripts">cursive scripts</a> through the use of
2324
- kashida or other calligraphic elongation.
2325
- This value is <em>optional</em> for conformance to CSS3 Text.
2326
- (UAs that do not support cursive elongation must
2327
- <a href="http://www.w3.org/TR/css-2010/#partial">treat the value
2328
- as invalid</a>.)
2329
- -->
2330
2255
</ dl >
2331
- <!--
2332
- <div class="figure" id="fig-text-justify">
2333
- <p>
2334
- <img alt="Examples of text-justify values commonly used in East Asian scripts"
2335
- src="text-justify-east-asia.png" height="376" width="491"></p>
2336
- <p class="caption">Values of 'text-justify': ''inter-word'', ''auto'', and ''distribute''</p>
2337
- </div>
2338
- -->
2339
2256
2340
2257
< p > The exact justification algorithm is UA-dependent; however, CSS provides
2341
2258
some general guidelines...
@@ -3411,114 +3328,6 @@ <h2 class=no-num id=script-groups> Appendix C: Cursive Scripts</h2>
3411
3328
< p > User agents should update this list as they update their Unicode support
3412
3329
to handle as-yet-unencoded cursive scripts in future versions of Unicode,
3413
3330
and are encourage to request the CSSWG to update this spec accordingly.
3414
- <!--
3415
- <p>Typographic behavior varies somewhat by language, but varies drastically
3416
- by writing system. This appendix categorizes some common scripts in
3417
- Unicode 6.0 according to their justification and spacing behavior. Category
3418
- descriptions are descriptive, not prescriptive; the determining factor is
3419
- the prioritization of <i>expansion opportunities</i>.
3420
-
3421
- <dl>
3422
- <dt id="block-scripts"><dfn>block scripts</dfn></dt>
3423
- <dd>CJK and by extension all Wide characters. (See [[!UAX11]])
3424
- The following scripts are included:
3425
- Bopomofo,
3426
- Han,
3427
- Hangul,
3428
- Hiragana,
3429
- Katakana,
3430
- Yi
3431
- </dd>
3432
- <dt id="clustered-scripts"><dfn>clustered scripts</dfn></dt>
3433
- <dd>Scripts that have discrete units but do not use spaces between words,
3434
- such as many Southeast Asian systems.
3435
- The following scripts are included:
3436
- Javanese,
3437
- Khmer,
3438
- Lao,
3439
- Myanmar,
3440
- Thai,
3441
- <span class="issue">This list is likely incomplete. What else fits here?</span>
3442
- </dd>
3443
- <dt id="connected-scripts"><dfn>connected scripts</dfn></dt>
3444
- <dd>Devanagari, Ogham, and other scripts that use spaces between
3445
- words and baseline connectors within words.
3446
- By extension this category also includes any other Indic scripts
3447
- whose typographic behavior is similar to Devanagari.
3448
- The following scripts are included:
3449
- Bengali,
3450
- Brahmi,
3451
- Devanagari,
3452
- Gujarati,
3453
- Gurmukhi,
3454
- Kannada,
3455
- Malayalam,
3456
- Oriya?,
3457
- Ogham,
3458
- Tamil?,
3459
- Telugu
3460
- </dd>
3461
- <dt id="cursive-scripts"><dfn>cursive scripts</dfn></dt>
3462
- <dd>Arabic and similar inherently cursive scripts.
3463
- The following scripts are included:
3464
- Arabic,
3465
- Mongolian,
3466
- N'Ko,
3467
- Phags Pa,
3468
- Syriac
3469
- </dd>
3470
- <dt id="discrete-scripts"><dfn>discrete scripts</dfn></dt>
3471
- <dd>Scripts that use spaces or visible word-separating
3472
- punctuation between words and have discrete,
3473
- unconnected (in print) units within words.
3474
- The following scripts are included:
3475
- Armenian,
3476
- Bamum?,
3477
- Braille,
3478
- Canadian Aboriginal,
3479
- Cherokee,
3480
- Coptic,
3481
- Cyrillic,
3482
- Deseret,
3483
- Ethiopic
3484
- Greek,
3485
- Hebrew,
3486
- Kharoshthi,
3487
- Latin,
3488
- Lisu,
3489
- Osmanya,
3490
- Shavian,
3491
- Tifinagh,
3492
- Vai?
3493
- </dd>
3494
- </dl>
3495
-
3496
- <p>UAs should treat unrecognized scripts as <i>discrete</i>.
3497
-
3498
- <p class="issue">This listing should ideally be exhaustive wrt Unicode.
3499
- Please <a href="#status">send</a> suggestions and corrections to the CSS
3500
- Working Group.</p>
3501
-
3502
- <div class="note">
3503
- <p>Guidelines for classification consider letter-spacing and justification:
3504
- <ol>
3505
- <li>If the script is cursive and may expand cursively but must not
3506
- space between letters, it is <i>cursive</i>.
3507
- <li>If the script primarily flexes word separators, it is either
3508
- <i>discrete</i> or <i>connected</i>. <i>Discrete</i> scripts can
3509
- space between letters. <i>Connected</i> scripts must not space
3510
- between letters (typically because that would break the connections
3511
- or otherwise look bad).
3512
- <li>If the script commonly expands equally between its "letters"
3513
- in native typesettings, it is either <i>block</i> or
3514
- <i>clustered</i>. The exact classification depends on whether it
3515
- always spaces when mixed with CJK and sometimes stays together
3516
- when mixed with Thai and related scripts (<i>block</i>) or
3517
- sometimes spaces when mixed with CJK and always spaces with Thai
3518
- (<i>clustered</i>).
3519
- </ol>
3520
- </div>
3521
- -->
3522
3331
3523
3332
< h2 class =no-num id =acknowledgements > Acknowledgements</ h2 >
3524
3333
0 commit comments