55 < head >
66 < title > CSS3 Speech Module</ title >
77 < meta content ="text/html; charset=us-ascii " http-equiv =Content-Type >
8- < link href =default.css rel =stylesheet type ="text/css ">
8+ < link href ="http://dev.w3.org/csswg/default.css " rel =stylesheet
9+ type ="text/css ">
910
1011 < style type ="text/css ">
1112 .prod { font-family : inherit; font-size : inherit }
2829 }
2930/*]]>*/
3031</ style >
31- < link href ="http://www.w3.org/StyleSheets/TR/W3C-ED.css " rel =stylesheet
32+ <!-- link rel="stylesheet" type="text/css" href="http://www.w3.org/StyleSheets/TR/W3C-ED.css" -->
33+ < link href ="http://www.w3.org/StyleSheets/TR/W3C-ED " rel =stylesheet
3234 type ="text/css ">
3335
3436 < body >
3840
3941 < h1 id =top > CSS3 Speech Module</ h1 >
4042
41- < h2 class ="no-num no-toc " id =longstatus-date > Editor's Draft 07 February
42- 2011</ h2 >
43+ < h2 class ="no-num no-toc " id =longstatus-date > Editor's Draft 05 April 2011</ h2 >
4344
4445 < dl >
4546 < dt > This version:
4647
47- < dd > <!--<a href="http://www.w3.org/TR/2011/WD-css3-speech-20110207 ">http://www.w3.org/TR/2011/ED-css3-speech-20110207 </a>-->
48+ < dd > <!--<a href="http://www.w3.org/TR/2011/WD-css3-speech-20110405 ">http://www.w3.org/TR/2011/ED-css3-speech-20110405 </a>-->
4849 < a
4950 href ="http://dev.w3.org/csswg/css3-speech "> http://dev.w3.org/csswg/css3-speech</ a >
5051
@@ -158,20 +159,32 @@ <h2 class="no-num no-toc" id=status>Status of this document</h2>
158159 href ="/Voice/Group/ "> Voice Browser working group</ a > (< a
159160 href ="http://cgi.w3.org/MemberAccess/AccessRequest "> W3C Members only</ a > ).
160161
161- < p > The following features are at-risk and may be dropped at the end of the
162- CR period if there has not been enough interest from implementers:
162+ < p > </ p >
163+ <!-- p>The following features are judged "at risk" and may be dropped at the
164+ end of the CR period,
165+ especially if there has not been enough interest from implementers:
166+ <ul>
167+
168+ </ul>
169+ </p -->
170+ <!-- li><a href="#issue-mark">mark</a></li -->
171+
172+ < p > The following issues need to be discussed and require working group
173+ resolutions:
163174
164175 < ul >
165- < li > < a href ="#issue-phonemes "> phonemes</ a >
176+ < li > < a href ="#issue-phonemes "> ‘< code
177+ class =property > phonemes</ code > ’ property breaks principle of
178+ content/presentation separation</ a >
166179
167- < li > < a href ="#issue-voice-cues-volume "> audio cues volume level relative to
168- voice-volume</ a >
180+ < li > < a href ="#issue-speakability-name "> ‘< code
181+ class =property > speakability</ code > ’ not an ideal property name
182+ ?</ a >
169183
170- < li > < a href ="#issue-speakability-name "> "speakability" not an ideal name
171- ?</ a > </ li >
172- <!-- li><a href="#issue-mark">mark</a></li -->
173- <!-- li><a href="#issue-speak_none">speak:none</a></li -->
184+ < li > < a href ="#issue-lists "> handling of list styles, numbered items</ a >
174185 </ ul >
186+ <!-- li><a href="#issue-voice-cues-volume">audio cues volume level relative to voice-volume</a></li -->
187+ <!-- li><a href="#issue-speak_none">speak:none</a></li -->
175188
176189 < p > The CSS WG maintains an < a
177190 href ="http://www.w3.org/Style/CSS/Tracker/products/29 "> issues list</ a > for
@@ -462,23 +475,19 @@ <h2 id=mixing-props><span class=secno>4. </span>Mixing properties:
462475 class =property > voice-volume</ code > </ a > ’ refers to the amplitude of
463476 the waveform output by the speech synthesiser. This may be mixed with
464477 other audio sources, influencing the perceived loudness of synthetic
465- speech relative to these sources.
466-
467- < p > ‘< a href ="#voice-volume "> < code
468- class =property > voice-volume</ code > </ a > ’ does not apply to < a
469- href ="#cue-props "> audio cues</ a > for which there is a separate means to
470- set loudness (absolute, or relative to ‘< a
471- href ="#voice-volume "> < code class =property > voice-volume</ code > </ a > ’).
472- < span class =issue id =issue-voice-cues-volume > ISSUE: shouldn't audio cues
473- be subjected to the voice-volume, so that an inaudible element gets its
474- entire content (within the aural "box" model) muted as well ? The
475- rationale is that voice-volume:silent is largely equivalent to
476- visibility:hidden, which affects border, padding and spacing (i.e. the
477- entire content with regards to the CSS "visual" box model). The audio cues
478- should therefore only be allowed to set their volume levels
479- < em > relatively</ em > to the inherited voice-volume, not with absolute
480- values as these may incorrectly enable audio output when the voice-volume
481- itself is muted. </ span >
478+ speech relative to these sources.</ p >
479+ <!-- p>
480+ ‘<code class=property>voice-volume</code>’ does not apply to <a href
481+ ="#cue-props">audio cues</a> for which there is a separate means
482+ to set loudness (absolute, or relative to ‘<code class=property>voice-volume</code>’).
483+ <span class="issue" id="issue-voice-cues-volume">
484+ ISSUE: shouldn't audio cues be subjected to the voice-volume,
485+ so that an inaudible element gets its entire content (within the aural "box" model) muted as well ?
486+ The rationale is that voice-volume:silent is largely equivalent to visibility:hidden,
487+ which affects border, padding and spacing (i.e. the entire content with regards to the CSS "visual" box model).
488+ The audio cues should therefore only be allowed to set their volume levels <em>relatively</em> to the inherited voice-volume, not with absolute values as these may incorrectly enable audio output when the voice-volume itself is muted.
489+ </span>
490+ </p -->
482491
483492 < p > Values have the following meanings:</ p >
484493 <!-- dl>
@@ -637,11 +646,9 @@ <h2 id=mixing-props><span class=secno>4. </span>Mixing properties:
637646 < p > Many speech synthesizers only support a single channel. The ‘< a
638647 href ="#voice-balance "> < code class =property > voice-balance</ code > </ a > ’
639648 property can then be treated as part of a post synthesis mixing step. This
640- is where speech is mixed with other audio sources. Unlike ‘< a
641- href ="#voice-volume "> < code class =property > voice-volume</ code > </ a > ’,
642- ‘< a href ="#voice-balance "> < code
643- class =property > voice-balance</ code > </ a > ’ < em > does</ em > apply to < a
644- href ="#cue-props "> audio cues</ a > .
649+ is where speech is mixed with other audio sources.</ p >
650+ <!-- p>Unlike ‘<code class=property>voice-volume</code>’, ‘<code class=property>voice-balance</code>’
651+ <em>does</em> apply to <a href="#cue-props">audio cues</a>.</p -->
645652
646653 < h2 id =speaking-props > < span class =secno > 5. </ span > Speaking properties:
647654 ‘< a href ="#speakability "> < code
@@ -1282,10 +1289,11 @@ <h2 id=cue-props><span class=secno>8. </span>Cue properties: ‘<a
12821289 < td > < dfn id =cue-before > cue-before</ dfn >
12831290
12841291 < tr >
1285- < td width ="20% "> < em > Value:</ em >
1292+ < td width ="20% "> < em > Value:</ em > </ td >
1293+ <!-- td><uri> [<non-negative number> | <percentage> | silent
1294+ | x-soft | soft | medium | loud | x-loud] | none | inherit</td -->
12861295
1287- < td > <uri> [<non-negative number> | <percentage> |
1288- silent | x-soft | soft | medium | loud | x-loud] | none | inherit
1296+ < td > <uri> [<percentage>] | none | inherit
12891297
12901298 < tr >
12911299 < td width ="20% "> < em > Initial:</ em >
@@ -1324,10 +1332,11 @@ <h2 id=cue-props><span class=secno>8. </span>Cue properties: ‘<a
13241332 < td > < dfn id =cue-after > cue-after</ dfn >
13251333
13261334 < tr >
1327- < td width ="20% "> < em > Value:</ em >
1335+ < td width ="20% "> < em > Value:</ em > </ td >
1336+ <!-- td><uri> [<non-negative number> | <percentage> | silent
1337+ | x-soft | soft | medium | loud | x-loud] | none | inherit</td -->
13281338
1329- < td > <uri> [<non-negative number> | <percentage> |
1330- silent | x-soft | soft | medium | loud | x-loud] | none | inherit
1339+ < td > <uri> [<percentage>] | none | inherit
13311340
13321341 < tr >
13331342 < td width ="20% "> < em > Initial:</ em >
@@ -1371,43 +1380,44 @@ <h2 id=cue-props><span class=secno>8. </span>Cue properties: ‘<a
13711380
13721381 < dt > < strong > none</ strong >
13731382
1374- < dd > No auditory icon is specified.
1375-
1376- < dt > < strong > <non-negative number></ strong >
1383+ < dd > No auditory icon is specified.</ dd >
1384+ <!-- dt><strong><non-negative number></strong></dt>
13771385
1378- < dd > An integer or floating point number in the range ‘< code
1379- class =css > 0</ code > ’ to ‘< code class =css > 100</ code > ’.
1380- ‘< code class =css > 0</ code > ’ represents silence (the
1381- < em > minimum</ em > level), and 100 corresponds to the < em > maximum</ em >
1382- level. The volume scale is linear amplitude.
1386+ <dd>An integer or floating point number in the range ‘<code class=css>0</code>’ to ‘<code class=css>100</code>’.
1387+ ‘<code class=css>0</code>’ represents silence (the <em>minimum</em> level), and 100
1388+ corresponds to the <em>maximum</em> level. The volume scale is
1389+ linear amplitude.</dd -->
13831390
13841391 < dt > < strong > <percentage></ strong >
13851392
13861393 < dd > Only positive values are allowed. Computed values are calculated
13871394 relative to the inherited value of the ‘< a
13881395 href ="#voice-volume "> < code class =property > voice-volume</ code > </ a > ’
13891396 property, and are then clipped to the range ‘< code
1390- class =css > 0</ code > ’ to ‘< code class =css > 100</ code > ’. By
1391- basing the percentage upon the inherited value for ‘< a
1392- href ="#voice-volume "> < code class =property > voice-volume</ code > </ a > ’,
1393- it is easy to adjust the relative loudness of cues compared to synthetic
1394- speech for whatever volume setting has been provided for that speech.
1397+ class =css > 0</ code > ’ to ‘< code class =css > 100</ code > ’.
1398+ ‘< code class =css > 0</ code > ’ represents silence (the
1399+ < em > minimum</ em > level), and 100 corresponds to the < em > maximum</ em >
1400+ level. The volume scale is linear amplitude. By basing the percentage
1401+ upon the inherited value for ‘< a href ="#voice-volume "> < code
1402+ class =property > voice-volume</ code > </ a > ’, it is easy to adjust the
1403+ relative loudness of cues compared to synthetic speech for whatever
1404+ volume setting has been provided for that speech.
13951405 < p class =note > Note that a leading "+" sign does not denote an increment.
13961406 For example, +50% is equivalent to 50%, so the computed value equals the
13971407 inherited value times 0.5 (divided by 2), then clipped to [0,100].</ p >
1398-
1399- < dt > < strong > silent</ strong > , < strong > x-soft</ strong > ,
1400- < strong > soft</ strong > , < strong > medium</ strong > , < strong > loud</ strong > ,
1401- and < strong > x-loud</ strong >
1402-
1403- < dd > A sequence of monotonically non-decreasing volume levels. The value of
1404- ‘< code class =property > silent</ code > ’ is mapped to
1405- ‘< code class =css > 0</ code > ’ and ‘< code
1406- class =property > x-loud</ code > ’ is mapped to ‘< code
1407- class =css > 100</ code > ’. The mapping of other values to numerical
1408- volume levels is implementation dependent, but the intention is to match
1409- the corresponding levels for ‘< a href ="#voice-volume "> < code
1410- class =property > voice-volume</ code > </ a > ’.
1408+ </ dd >
1409+ <!-- dt><strong>silent</strong>,
1410+ <strong>x-soft</strong>,
1411+ <strong>soft</strong>,
1412+ <strong>medium</strong>,
1413+ <strong>loud</strong>, and
1414+ <strong>x-loud</strong></dt>
1415+
1416+ <dd>A sequence of monotonically non-decreasing volume levels.
1417+ The value of ‘<code class=property>silent</code>’ is mapped to ‘<code class=css>0</code>’ and ‘<code class=property>x-loud</code>’ is mapped
1418+ to ‘<code class=css>100</code>’. The mapping of other values to numerical volume levels
1419+ is implementation dependent, but the intention is to match the
1420+ corresponding levels for ‘<code class=property>voice-volume</code>’.</dd-->
14111421 </ dl >
14121422
14131423 < div class =example >
@@ -1416,9 +1426,9 @@ <h2 id=cue-props><span class=secno>8. </span>Cue properties: ‘<a
14161426 < pre >
14171427a { cue-before: url(bell.aiff); cue-after: url(dong.wav) }
14181428
1419- h1 { cue-before: url(pop.au) 80; cue-after: url(pop.au) 50% }
1429+ h1 { cue-before: url(pop.au) 80%; cue-after: url(pop.au) 50% }
14201430
1421- div.caution { cue-before: url(caution.wav) loud }
1431+ div.caution { cue-before: url(caution.wav) 130% }
14221432</ pre >
14231433 </ div >
14241434
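The percentage rule described in the hunk above (computed value relative to the inherited voice-volume, then clipped to the range 0 to 100) can be sketched as follows. This is only an illustrative sketch and not part of the commit: the function name `cue_volume` is hypothetical, the inherited voice-volume is assumed to already be a number on the 0-100 linear amplitude scale, and rejecting only negative percentages is one reading of "Only positive values are allowed".

```python
def cue_volume(inherited_voice_volume: float, percentage: float) -> float:
    """Return the computed cue volume on the 0-100 linear amplitude scale.

    Hypothetical helper: `inherited_voice_volume` is assumed to be the
    inherited voice-volume expressed as a number in [0, 100], and
    `percentage` is the cue's percentage value (e.g. 50 for "50%").
    """
    if percentage < 0:
        # Assumption: negative percentages are treated as invalid input.
        raise ValueError("negative percentages are not allowed")
    # A leading "+" is not an increment, so 50% simply means "times 0.5".
    computed = inherited_voice_volume * percentage / 100.0
    # Clip the result to the [0, 100] range.
    return max(0.0, min(100.0, computed))
```

Under these assumptions, a cue declared with 130% on an element whose inherited voice-volume is 80 would compute to 104 and be clipped to 100, while a cue on an element with voice-volume 0 stays silent for any percentage.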
@@ -2206,8 +2216,15 @@ <h2 id=phonetic-props><span class=secno>11. </span>Phonetics: ‘<a
22062216
22072217 < p class =issue id =issue-phonemes > ISSUE: the ‘< a
22082218 href ="#phonemes "> < code class =property > phonemes</ code > </ a > ’ property
2209- covers functionality that is arguably not strictly related to styling (it
2210- relates to the broader aspect of pronunciation lexicons). There is a < a
2219+ covers functionality that arguably doesn't belong in CSS, as it breaks the
2220+ principle of separation between content and presentation (i.e. any change
2221+ in the source text must be echoed by a corresponding change in the
2222+ content of the aural ‘< a href ="#phonemes "> < code
2223+ class =property > phonemes</ code > </ a > ’ property). The broader aspect of
2224+ pronunciation lexicons would be better addressed in the content markup.
2225+ For example, < a href ="http://wiki.whatwg.org/wiki/RelExtensions "> it is
2226+ proposed</ a > that formats like the W3C's own PLS standard should be
2227+ supported by the link/rel mechanism. There is a < a
22112228 href ="http://lists.w3.org/Archives/Public/www-style/2011Jan/0248.html "> related
22122229 discussion</ a > on the public mailing-list.
22132230
@@ -2320,9 +2337,9 @@ <h2 id=content><span class=secno>12. </span>Inserted and replaced content
23202337 the text-to-speech with its own synthesizer.</ p >
23212338 </ div >
23222339
2323- < p > Furthermore authors, or users in a user stylesheet, may want to add some
2324- information to ease understanding the structure for non-visual interaction
2325- with the document. They can do so by using the ‘< code
2340+ < p > Furthermore, authors (or users in a user stylesheet) may want to add
2341+ some information to ease understanding the structure for non-visual
2342+ interaction with the document. They can do so by using the ‘< code
23262343 class =css > ::before</ code > ’ and ‘< code
23272344 class =css > ::after</ code > ’ pseudo-elements that will be inserted
23282345 between the element's contents and the ‘< a href ="#rest "> < code
@@ -2347,7 +2364,17 @@ <h2 id=content><span class=secno>12. </span>Inserted and replaced content
23472364
23482365 < p class =note > Note that detailed information can be found in the CSS3
23492366 Generated and Replaced Content Module < a href ="#CSS3GENCON "
2350- rel =biblioentry > [CSS3GENCON]<!--{{CSS3GENCON}}--> </ a > .</ p >
2367+ rel =biblioentry > [CSS3GENCON]<!--{{CSS3GENCON}}--> </ a > .
2368+
2369+ < p class =issue id =issue-lists > ISSUE: the speech handling of list items is
2370+ under-specified. One suggestion is to ignore list-style-type (which can
2371+ represent visual glyphs not suitable for aural processing anyway) and to
2372+ define a standard way to read < a
2373+ href ="http://dev.w3.org/csswg/css3-lists/#marker-pseudoelement "> list
2374+ "markers"</ a > . Unfortunately this introduces a dependency on a feature
2375+ specific to CSS3-Lists. There is a < a
2376+ href ="http://lists.w3.org/Archives/Public/www-style/2011Feb/0195.html "> related
2377+ discussion</ a > on the public mailing-list.</ p >
23512378 <!--
23522379<h2 id="interpret-as-props">Interpretation property:
23532380 <a href="#interpret-as">‘<code class=property>interpret-as</code>’</a></h2>
@@ -2486,8 +2513,7 @@ <h2 class=no-num id=property-index>Appendix B — Property index</h2>
24862513 < tr valign =baseline >
24872514 < td > < a class =property href ="#cue-after "> cue-after</ a >
24882515
2489- < td > <uri> [<non-negative number> | <percentage> |
2490- silent | x-soft | soft | medium | loud | x-loud] | none | inherit
2516+ < td > <uri> [<percentage>] | none | inherit
24912517
24922518 < td > none
24932519
@@ -2502,8 +2528,7 @@ <h2 class=no-num id=property-index>Appendix B — Property index</h2>
25022528 < tr valign =baseline >
25032529 < td > < a class =property href ="#cue-before "> cue-before</ a >
25042530
2505- < td > <uri> [<non-negative number> | <percentage> |
2506- silent | x-soft | soft | medium | loud | x-loud] | none | inherit
2531+ < td > <uri> [<percentage>] | none | inherit
25072532
25082533 < td > none
25092534
@@ -3116,8 +3141,13 @@ <h2 class=no-num id=changes>Appendix F — Changes from previous draft</h2>
31163141 < li > Fixed ‘< a href ="#voice-family "> < code
31173142 class =property > voice-family</ code > </ a > ’ grammar as per < a
31183143 href ="http://lists.w3.org/Archives/Public/www-style/2010Dec/0231.html "> this
3119- discussion</ a > </ li >
3120- <!-- li>Added the default ‘<code class=property>auto</code>’ value for the ‘<code class=property>voice-volume</code>’ property, in order to inherit aural rendering behavior from the ‘<code class=property>visibility</code>’ property.</li -->
3144+ discussion</ a >
3145+
3146+ < li > The volume level of audio cues can only be set relative to the
3147+ inherited ‘< a href ="#voice-volume "> < code
3148+ class =property > voice-volume</ code > </ a > ’ property (to avoid cues
3149+ being played when the main element is silent, which would contradict the
3150+ "aural box model").
31213151
31223152 < li > Added "HTML" to "CSS defines aural properties that give control over
31233153 rendering XML to speech" in the abstract.
@@ -3252,10 +3282,10 @@ <h3 class=no-num id=other-references>Other references</h3>
32523282 < dt id =CSS21 > [CSS21]
32533283
32543284 < dd > Bert Bos; et al. < a
3255- href ="http://www.w3.org/TR/2009/CR-CSS2-20090908 "> < cite > Cascading Style
3256- Sheets Level 2 Revision 1 (CSS 2.1) Specification.</ cite > </ a > 8 September
3257- 2009. W3C Candidate Recommendation. (Work in progress.) URL: < a
3258- href ="http://www.w3.org/TR/2009/CR-CSS2-20090908 "> http://www.w3.org/TR/2009/CR-CSS2-20090908</ a >
3285+ href ="http://www.w3.org/TR/2010/WD-CSS2-20101207 "> < cite > Cascading Style
3286+ Sheets Level 2 Revision 1 (CSS 2.1) Specification.</ cite > </ a > 7 December
3287+ 2010. W3C Working Draft. (Work in progress.) URL: < a
3288+ href ="http://www.w3.org/TR/2010/WD-CSS2-20101207 "> http://www.w3.org/TR/2010/WD-CSS2-20101207</ a >
32593289 </ dd >
32603290 <!---->
32613291