[css-text-decor] Switch text-emphasis skipped punctuation to all of P… · w3c/csswg-drafts@81d1a51 · GitHub
Commit 81d1a51

[css-text-decor] Switch text-emphasis skipped punctuation to all of P* except a small blacklist. #839
1 parent e3fc364 commit 81d1a51

2 files changed

Lines changed: 44 additions & 19 deletions

css-text-decor-3/Overview.bs

Lines changed: 20 additions & 11 deletions
@@ -732,17 +732,26 @@ Emphasis Mark Style: the 'text-emphasis-style' property</h3>
 that combines with any combining characters.)
 
 <li>Punctuation--specifically,
-any <a>characters<a> that belong to the
-Unicode Pc, Pd, Ps, Pe, Pi, or Pf <a>general categories</a> or
-that belong to the Po <a>general category</a> and
-have a <code>Sentence_Break</code> property of
-<code>ATerm</code>, <code>Close</code>, <code>SContinue</code>, or <code>STerm</code>.
-
-ISSUE: Semicolons should likely be categorized with punctuation here,
-but the Unicode data file places them under <code>Sentence_Break=Other</code>.
-
-ISSUE: This list needs review by JLTF/CLTF.
-See <a href="https://unicode.org/cldr/utility/list-unicodeset.jsp?a=%5B%3AGeneral_category%3DPo%3A%5D%26%5B%5B%3ASentence_Break%3DATerm%3A%5D%5B%3ASentence_Break%3DSTerm%3A%5D%5B%3ASentence_Break%3DSContinue%3A%5D%5B%3ASentence_Break%3DClose%3A%5D%5D&g=Sentence_Break&i=">full list of characters</a>.
+any <a>characters</a> that belong to the
+Unicode P* <a>general category</a> and
+do not <code>NFKD</code> normalize [[!UAX15]] to
+any of the following symbols:
+<table class="data">
+<tr><td># <td>U+0023 <td>NUMBER SIGN
+<tr><td>% <td>U+0025 <td>PERCENT SIGN
+<tr><td>‰ <td>U+2030 <td>PER MILLE SIGN
+<tr><td>‱ <td>U+2031 <td>PER TEN THOUSAND SIGN
+<tr><td>٪ <td>U+066A <td>ARABIC PERCENT SIGN
+<tr><td>؉ <td>U+0609 <td>ARABIC-INDIC PER MILLE SIGN
+<tr><td>؊ <td>U+060A <td>ARABIC-INDIC PER TEN THOUSAND SIGN
+<tr><td>& <td>U+0026 <td>AMPERSAND
+<tr><td>⁊ <td>U+204A <td>TIRONIAN SIGN ET
+<tr><td>@ <td>U+0040 <td>COMMERCIAL AT
+<tr><td>§ <td>U+00A7 <td>SECTION SIGN
+<tr><td>¶ <td>U+00B6 <td>PILCROW SIGN
+<tr><td>⁋ <td>U+204B <td>REVERSED PILCROW SIGN
+<tr><td>〽️ <td>U+303D <td>PART ALTERNATION MARK
+</table>
 
 <li>Characters belonging to the Unicode classes for control codes
 and unassigned characters (Cc, Cf, Cn).
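
Reviewer sketch (not part of the commit): the new level 3 rule can be read as a two-step test, first "is the general category P*?", then "does the NFKD form match one of the listed exceptions?". The Python below is a minimal illustration under stated assumptions: each character is treated as a single code point, the standard `unicodedata` module supplies the category and normalization data, and the names `EXCEPTIONS` and `is_skipped_punctuation` are hypothetical. Results depend on the Unicode version bundled with the interpreter.

```python
import unicodedata

# Code points copied from the table added in this commit.
# The set name is hypothetical, chosen for this illustration only.
EXCEPTIONS = frozenset(
    "\u0023\u0025\u2030\u2031\u066A\u0609\u060A"
    "\u0026\u204A\u0040\u00A7\u00B6\u204B\u303D"
)


def is_skipped_punctuation(ch: str) -> bool:
    """Return True if `ch` would be skipped as punctuation under the new text.

    Assumption: `ch` is a single code point; the spec text speaks of
    characters, which may be broader than one code point.
    """
    # Step 1: the general category must be one of the P* categories.
    if not unicodedata.category(ch).startswith("P"):
        return False
    # Step 2: the NFKD form must not be one of the listed exceptions.
    return unicodedata.normalize("NFKD", ch) not in EXCEPTIONS


if __name__ == "__main__":
    # ASCII comma, semicolon, number sign, fullwidth number sign, ideographic full stop.
    for sample in [",", ";", "#", "\uFF03", "\u3002"]:
        print(f"U+{ord(sample):04X}", is_skipped_punctuation(sample))
```

Under this reading a semicolon is skipped, which addresses the ISSUE in the removed text, while compatibility variants such as U+FF03 FULLWIDTH NUMBER SIGN follow their ASCII counterparts and are not skipped.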

css-text-decor-4/Overview.bs

Lines changed: 24 additions & 8 deletions
Original file line numberDiff line numberDiff line change
@@ -441,17 +441,33 @@ Emphasis Mark Skip: the 'text-emphasis-skip' property</h3>
441441
<dd>
442442
Skip punctuation.
443443
Punctuation in this definition includes characters belonging to
444-
the Unicode Pc, Pd, Ps, Pe, Pi, or Pf categories.
445-
It also includes characters where the Unicode category is Po and
446-
the <code>Sentence_Break</code> property [[!UAX29]] of the Unicode database
447-
[[!UAX44]] is <code>ATerm</code>, <code>Close</code>, <code>SContinue</code>, or <code>STerm</code>.
444+
the Unicode P* category
445+
that are not defined as ''symbols'' (see below).
448446

449447
<dt><dfn>symbols</dfn>
450448
<dd>Skip symbols.
451-
Symbols in this definition includes characters belonging to
452-
the Unicode S* category.
453-
It also includes the Unicode Po category
454-
that are not defined as ''<a>punctuation</a>'' above.
449+
Symbols in this definition includes
450+
all <a>typographic character units</a> belonging to
451+
the Unicode S* <a>general category</a>
452+
as well as any which are <code>NFKD</code>-equivalent [[!UAX15]]
453+
to the following <a>characters</a> from the Unicode Po category:
454+
455+
<table class="data">
456+
<tr><td># <td>U+0023 <td>NUMBER SIGN
457+
<tr><td>% <td>U+0025 <td>PERCENT SIGN
458+
<tr><td><td>U+2030 <td>PER MILLE SIGN
459+
<tr><td><td>U+2031 <td>PER TEN THOUSAND SIGN
460+
<tr><td>٪ <td>U+066A <td>ARABIC PERCENT SIGN
461+
<tr><td>؉ <td>U+0609 <td>ARABIC-INDIC PER MILLE SIGN
462+
<tr><td>؊ <td>U+060A <td>ARABIC-INDIC PER TEN THOUSAND SIGN
463+
<tr><td>& <td>U+0026 <td>AMPERSAND
464+
<tr><td><td>U+204A <td>TIRONIAN SIGN ET
465+
<tr><td>@ <td>U+0040 <td>COMMERCIAL AT
466+
<tr><td>§ <td>U+00A7 <td>SECTION SIGN
467+
<tr><td><td>U+00B6 <td>PILCROW SIGN
468+
<tr><td><td>U+204B <td>REVERSED PILCROW SIGN
469+
<tr><td>〽️ <td>U+303D <td>PART ALTERNATION MARK
470+
</table>
455471

456472
<dt><dfn>narrow</dfn>
457473
<dd>Skip characters where the <code>East_Asian_Width</code> property [[!UAX11]]
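
Reviewer sketch (not part of the commit): in level 4 the same table moves to the ''symbols'' side, and ''punctuation'' becomes "P* minus symbols". The Python below illustrates that split under stated assumptions: a typographic character unit is approximated by a single code point, the standard `unicodedata` module supplies the data, and the names `PO_SYMBOLS` and `skip_class` are hypothetical.

```python
import unicodedata
from typing import Optional

# Po characters that the new text classifies as symbols
# (copied from the table in this commit; the set name is hypothetical).
PO_SYMBOLS = frozenset(
    "\u0023\u0025\u2030\u2031\u066A\u0609\u060A"
    "\u0026\u204A\u0040\u00A7\u00B6\u204B\u303D"
)


def skip_class(ch: str) -> Optional[str]:
    """Classify `ch` as "symbols", "punctuation", or None.

    Assumption: `ch` is one code point standing in for a typographic
    character unit, which the spec may define more broadly.
    """
    category = unicodedata.category(ch)
    nfkd = unicodedata.normalize("NFKD", ch)
    # Symbols: any S* character, plus anything NFKD-equivalent to the table.
    if category.startswith("S") or nfkd in PO_SYMBOLS:
        return "symbols"
    # Punctuation: the rest of the P* categories.
    if category.startswith("P"):
        return "punctuation"
    return None


if __name__ == "__main__":
    # Dollar sign, fullwidth commercial at, ideographic comma, a CJK ideograph.
    for sample in ["$", "\uFF20", "\u3001", "\u6F22"]:
        print(f"U+{ord(sample):04X}", skip_class(sample))
```

Checking symbols first keeps the two keywords disjoint, which matches the "not defined as ''symbols''" wording in the punctuation definition.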
