xfq
diff --git a/‎css3-speech/Overview.html‎
Lines changed: 81 additions & 91 deletions b/‎css3-speech/Overview.html‎
Lines changed: 81 additions & 91 deletions
@@ -90,13 +90,13 @@
 
    <h1 id=top>CSS Speech Module</h1>
 
-   <h2 class="no-num no-toc" id=longstatus-date>Editor's Draft 06 July 2011</h2>
+   <h2 class="no-num no-toc" id=longstatus-date>Editor's Draft 07 July 2011</h2>
 
    <dl>
     <dt>This version:
 
     <dd>
-     <!--<a href="http://www.w3.org/TR/2011/WD-css3-speech-20110706">http://www.w3.org/TR/2011/ED-css3-speech-20110706/</a>-->
+     <!--<a href="http://www.w3.org/TR/2011/WD-css3-speech-20110707">http://www.w3.org/TR/2011/ED-css3-speech-20110707/</a>-->
      <a
      href="http://dev.w3.org/csswg/css3-speech">http://dev.w3.org/csswg/css3-speech</a>
 
@@ -442,6 +442,7 @@ <h2 id=example><span class=secno>3. </span>Example</h2>
   voice-family: paul;
   voice-stress: moderate;
   cue-before: url(../audio/ping.wav);
+  voice-volume: medium 6dB;
 }
 p.heidi
 {
@@ -516,13 +517,13 @@ <h3 id=mixing-props-voice-volume><span class=secno>5.1. </span>The
     <tr>
      <td> <em>Value:</em>
 
-     <td>normal | silent | x-soft | soft | medium | loud | x-loud |
-      &lt;decibel&gt;
+     <td>silent | [[x-soft | soft | medium | loud | x-loud] ||
+      &lt;decibel&gt;]
 
     <tr>
      <td> <em>Initial:</em>
 
-     <td>normal
+     <td>medium
 
     <tr>
      <td> <em>Applies&nbsp;to:</em>
@@ -547,7 +548,7 @@ <h3 id=mixing-props-voice-volume><span class=secno>5.1. </span>The
     <tr>
      <td> <em>Computed value:</em>
 
-     <td>specified value
+     <td>keyword value, and decibel offset (if not zero)
   </table>
 
   <p>The &lsquo;<a href="#voice-volume"><code
@@ -563,12 +564,13 @@ <h3 id=mixing-props-voice-volume><span class=secno>5.1. </span>The
    attribute of the <code>prosody</code> element</a> from the SSML markup
    language <a href="#SSML" rel=biblioentry>[SSML]<!--{{!SSML}}--></a>.
 
-  <dl>
-   <dt> <strong>normal</strong>
-
-   <dd>
-    <p> Corresponds to +0.0dB, which means that there is no modification of
-     volume level. This value overrides the inherited value.</p>
+  <dl><!-- dt>
+        <strong>normal</strong>
+      </dt>
+      <dd>
+        <p> Corresponds to +0.0dB, which means that there is no modification of volume level. This
+          value overrides the inherited value.</p>
+      </dd -->
 
    <dt> <strong>silent</strong>
 
@@ -582,9 +584,9 @@ <h3 id=mixing-props-voice-volume><span class=secno>5.1. </span>The
      &lsquo;<code class=property>silent</code>&rsquo;, and an element whose
      &lsquo;<a href="#speak"><code class=property>speak</code></a>&rsquo;
      property has the value &lsquo;<code class=property>none</code>&rsquo;.
-     With the former, the selected takes up the same time as if it had been
-     spoken, including any pause before and after the element, but no sound
-     is generated (descendants can override the &lsquo;<a
+     With the former, the selected element takes up the same time as if it
+     was spoken, including any pause before and after the element, but no
+     sound is generated (descendants can override the &lsquo;<a
      href="#voice-volume"><code class=property>voice-volume</code></a>&rsquo;
      value and may therefore generate audio output). With the latter, the
      selected element is not rendered in the aural dimension and no time is
@@ -598,8 +600,8 @@ <h3 id=mixing-props-voice-volume><span class=secno>5.1. </span>The
    <dd>
     <p> This sequence of keywords corresponds to monotonically non-decreasing
-     by the user-agent) that meet user's requirements in terms of perceived
-     sound loudness . The keyword &lsquo;<code
+     by the user-agent) that meet the user's requirements in terms of
+     perceived sound loudness . The keyword &lsquo;<code
      class=property>x-soft</code>&rsquo; maps to the user's <em>minimum
      audible</em> volume level, &lsquo;<code
      class=property>x-loud</code>&rsquo; maps to the user's <em>maximum
@@ -614,10 +616,17 @@ <h3 id=mixing-props-voice-volume><span class=secno>5.1. </span>The
    <dd>
     <p>A <a href="#number-def">number</a> immediately followed by "dB"
      (decibel unit). This represents a change (positive or negative) relative
-     to the default value for the root element, or to the inherited volume
-     level otherwise. This is expressed as the ratio of the squares of the
-     new signal amplitude (a1) and the current amplitude (a0), as per the
-     following logarithmic equation: volume(dB) = 20 log10 (a1 / a0)</p>
+     to the given keyword value (see enumeration above), or to the default
+     value for the root element, or otherwise to the inherited volume level
+     (which may itself be be a combination of a keyword value and of a
+     decibel offset). When the inherited volume level is &lsquo;<code
+     class=property>silent</code>&rsquo;, this &lsquo;<a
+     href="#voice-volume"><code class=property>voice-volume</code></a>&rsquo;
+     resolves to &lsquo;<code class=property>silent</code>&rsquo; too,
+     regardless of the provided &lt;decibel&gt; value. Decibels express the
+     ratio
F882
 of the squares of the new signal amplitude (a1) and the current
+     amplitude (a0), as per the following logarithmic equation: volume(dB) =
+     20 log10 (a1 / a0)</p>
 
     <p class=note> Note that -6.0dB is approximately half the amplitude of
      the audio signal, and +6.0dB is approximately twice the amplitude.</p>
@@ -1369,9 +1378,8 @@ <h3 id=rest-props-rest-before-after><span class=secno>8.1. </span>The
    <dt> <strong>none</strong>
 
    <dd>
-    <p> Equivalent to 0ms (no prosodic break in the speech output). This
-     value can be used to inhibit a prosodic break which the processor would
-     otherwise produce.</p>
+    <p> Equivalent to 0ms (no prosodic break is produced by the speech
+     processor).</p>
 
    <dt> <strong>x-weak</strong>, <strong>weak</strong>,
     <strong>medium</strong>, <strong>strong</strong>, and
@@ -1579,23 +1587,18 @@ <h3 id=cue-props-cue-before-after><span class=secno>9.1. </span>The
    <dd>
     <p>A <a href="#number-def">number</a> immediately followed by "dB"
      (decibel unit). This represents a change (positive or negative) relative
-     to the default sound level of audio clip. This is expressed as the ratio
+     to the computed value of the &lsquo;<a href="#voice-volume"><code
+     class=property>voice-volume</code></a>&rsquo; property within the <a
+     href="#aural-model">aural "box" model</a> of the selected element. When
+     the &lsquo;<a href="#voice-volume"><code
+     class=property>voice-volume</code></a>&rsquo; property is set to
+     &lsquo;<code class=property>silent</code>&rsquo;, the audio cue is also
+     set to &lsquo;<code class=property>silent</code>&rsquo; (regardless of
+     the value provided for this &lt;decibel&gt;). Decibels express the ratio
      of the squares of the new signal amplitude (a1) and the current
      amplitude (a0), as per the following logarithmic equation: volume(dB) =
      20 log10 (a1 / a0)</p>
 
-    <p>Audio cues apply to the selected element within the <a
-     href="#aural-model">audio "box" model</a>, so when the inherited value
-     from the &lsquo;<a href="#voice-volume"><code
-     class=property>voice-volume</code></a>&rsquo; property is &lsquo;<code
-     class=property>silent</code>&rsquo;, the volume level for the audio cue
-     is resolved to -infinity decibels (which effectively silences the audio
-     cue), regardless of the value provided for this &lt;decibel&gt;. In
-     other words, a selected element can be entirely silenced (i.e. including
-     its associated audio cues) by setting the &lsquo;<a
-     href="#voice-volume"><code class=property>voice-volume</code></a>&rsquo;
-     property to &lsquo;<code class=property>silent</code>&rsquo;.</p>
-
     <p class=note> Note that -6.0dB is approximately half the amplitude of
      the audio signal, and +6.0dB is approximately twice the amplitude.</p>
 
@@ -1802,6 +1805,12 @@ <h3 id=voice-props-voice-family><span class=secno>10.1. </span>The
      rel=biblioentry>[SSML]<!--{{!SSML}}--></a>, voice names are
      space-separated and cannot contain whitespace characters.</p>
 
+    <p> It is recommended to quote voice names that contain white space,
+     digits, or punctuation characters other than hyphens - even if these
+     voice names are valid in unquoted form - in order to improve code
+     clarity. For example: <code>voice-family: "john doe", "Henry
+     the-8th";</code></p>
+
    <dt> <strong>&lt;age&gt;</strong>
 
    <dd>
@@ -1855,15 +1864,6 @@ <h3 id=voice-props-voice-family><span class=secno>10.1. </span>The
 voice-family: john 1st; /* identifier cannot start with digit */</pre>
   </div>
 
-  <div class=example>
-   <p> This is an example of valid voice names that contain white space,
-    digits, or punctuation characters other than hyphens, but which are
-    quoted nonetheless, for reading clarity.</p>
-
-   <pre>
-voice-family: "john doe", "Henry the-8th";</pre>
-  </div>
-
   <h4 class=no-toc id=voice-selection><span class=secno>10.1.1. </span>Voice
    selection, content language</h4>
 
@@ -2079,10 +2079,12 @@ <h3 id=voice-props-voice-pitch><span class=secno>10.3. </span>The &lsquo;<a
 
   <p>The &lsquo;<a href="#voice-pitch"><code
    class=property>voice-pitch</code></a>&rsquo; property specifies the
-   average pitch of generated speech output, and depends on the &lsquo;<a
-   href="#voice-family"><code class=property>voice-family</code></a>&rsquo;.
-   For example, the default average pitch for a common male voice is around
-   120Hz, whereas it is around 210Hz for a female voice.
+   "baseline" pitch of the generated speech output, which depends on the used
+   &lsquo;<a href="#voice-family"><code
+   class=property>voice-family</code></a>&rsquo; instance, and varies across
+   speech synthesis processors (it approximately corresponds to the average
+   pitch of the output). For example, the common pitch for a male voice is
+   around 120Hz, whereas it is around 210Hz for a female voice.
 
   <p class=note> Note that the functionality provided by this property is
    related to the <a
@@ -2095,24 +2097,18 @@ <h3 id=voice-props-voice-pitch><span class=secno>10.3. </span>The &lsquo;<a
 
    <dd>
     <p> A value in <a href="#frequency-def">frequency</a> units (Hertz or
-     kiloHertz, e.g. "100Hz", "+2kHz"). Unless the &lsquo;<code
-     class=property>relative</code>&rsquo; keyword is used, values are
-     restricted to positive numbers (using negative numbers results in the
-     property value being ignored). When the &lsquo;<code
-     class=property>relative</code>&rsquo; keyword is used, the provided
-     value specifies a relative change (decrement or increment) to the
-     inherited value. When the &lsquo;<code
-     class=property>relative</code>&rsquo; keyword is not used, the provided
-     value specifies the average pitch of the speaking voice, expressed as an
-     absolute frequency.</p>
+     kiloHertz, e.g. "100Hz", "+2kHz"). Values are restricted to positive
+     numbers (unless the &lsquo;<code class=property>relative</code>&rsquo;
+     keyword is used), and using negative numbers results in the property
+     value being ignored.</p>
 
    <dt> <strong>relative</strong>
 
    <dd>
     <p> This keyword specifies that the provided frequency value is expressed
-     relatively to another base value. This disambiguates absolute positive
-     &lt;frequency&gt; values from increments (e.g. "+2kHz" can either be an
-     increment or an absolute value).</p>
+     relatively to the inherited value, with positive or negative numbers.
+     For example, "+2kHz relative" is an increment, unlike "+2kHz" which is a
+     positive absolute value.</p>
 
    <dt> <strong>&lt;semitones&gt;</strong>
 
     <p> Only non-negative <a href="#percentage-def">percentage</a> values are
      allowed. Computed values are calculated relative to the inherited value.
      For example, 50% means that the inherited value gets multiplied by 0.5,
-     which results in half the inherited average pitch of the voice.</p>
+     which results in half the inherited pitch of the voice.</p>
 
    <dt><strong>x-low</strong>, <strong>low</strong>, <strong>medium</strong>,
     <strong>high</strong>, <strong>x-high</strong>
@@ -2150,8 +2146,10 @@ <h3 id=voice-props-voice-pitch><span class=secno>10.3. </span>The &lsquo;<a
 h1 { voice-pitch: +250Hz; } /* identical to the line above */
 h2 { voice-pitch: +30Hz relative; }
 h2 { voice-pitch: 30Hz relative; } /* identical to the line above */
-h3 { voice-pitch: relative -2st; } /* the swapped keyword placement is a legal syntax */
-h4 { voice-pitch: -2st; } /* Illegal syntax ! ("relative" keyword is missing) */</pre>
+h3 { voice-pitch: relative -20Hz; } /* the swapped keyword placement is a legal syntax */
+h4 { voice-pitch: -20Hz; } /* Illegal syntax ! ("relative" keyword is missing for negative frequency) */
+h4 { voice-pitch: -3.5st; } /* Legal syntax: semitones are always relative, no need for the keyword. */
+      </pre>
   </div>
 
   <h3 id=voice-props-voice-pitch-range><span class=secno>10.4. </span>The
@@ -2204,11 +2202,12 @@ <h3 id=voice-props-voice-pitch-range><span class=secno>10.4. </span>The
 
   <p> The &lsquo;<a href="#voice-pitch-range"><code
    class=property>voice-pitch-range</code></a>&rsquo; property specifies the
-   variability in average pitch, i.e. how much the fundamental frequency may
-   deviate from the average pitch. The dynamic pitch range of the generated
-   speech output typically increases for a highly animated voice, for example
-   when variations in inflection are used to convey meaning and emphasis in
-   speech.
+   variability in the "baseline" pitch, i.e. how much the fundamental
+   frequency may deviate from the average pitch of the speech output. The
+   dynamic pitch range of the generated speech generally increases for a
+   highly animated voice, for example when variations in inflection are used
+   to convey meaning and emphasis in speech. Typically, a low range produces
+   a flat, monotonic voice, whereas a high range produces an animated voice.
 
   <p class=note> Note that the functionality provided by this property is
    related to the <a
@@ -2221,27 +2220,18 @@ <h3 id=voice-props-voice-pitch-range><span class=secno>10.4. </span>The
 
    <dd>
     <p> A value in <a href="#frequency-def">frequency</a> units (Hertz or
-     kiloHertz, e.g. "100Hz", "+2kHz"). Unless the &lsquo;<code
-     class=property>relative</code>&rsquo; keyword is used, values are
-     restricted to positive numbers (using negative numbers results in the
-     property value being ignored). When the &lsquo;<code
-     class=property>relative</code>&rsquo; keyword is used, the provided
-     value specifies a relative change (decrement or increment) to the
-     inherited value. When the &lsquo;<code
-     class=property>relative</code>&rsquo; keyword is not used, the provided
-     value specifies the average pitch of the speaking voice, expressed as an
-     absolute frequency.</p>
-
-    <p class=note> Low ranges produce a flat, monotonic voice. A high range
-     produces animated voices.</p>
+     kiloHertz, e.g. "100Hz", "+2kHz"). Values are restricted to positive
+     numbers (unless the &lsquo;<code class=property>relative</code>&rsquo;
+     keyword is used), and using negative numbers results in the property
+     value being ignored.</p>
 
    <dt> <strong>relative</strong>
 
    <dd>
     <p> This keyword specifies that the provided frequency value is expressed
-     relatively to another base value. This disambiguates absolute positive
-     &lt;frequency&gt; values from increments (e.g. "+2kHz" can either be an
-     increment or an absolute value).</p>
+     relatively to the inherited value, with positive or negative numbers.
+     For example, "+2kHz relative" is an increment, unlike "+2kHz" which is a
+     positive absolute value.</p>
 
    <dt> <strong>&lt;semitones&gt;</strong>
 
@@ -2260,7 +2250,7 @@ <h3 id=voice-props-voice-pitch-range><span class=secno>10.4. </span>The
     <p> Only non-negative <a href="#percentage-def">percentage</a> values are
      allowed. Computed values are calculated relative to the inherited value.
      For example, 50% means that the inherited value gets multiplied by 0.5,
-     which results in half the inherited average pitch range of the voice.</p>
+     which results in half the inherited pitch range of the voice.</p>
 
    <dt><strong>x-low</strong>, <strong>low</strong>, <strong>medium</strong>,
     <strong>high</strong> and <strong>x-high</strong>
@@ -2958,10 +2948,10 @@ <h2 class=no-num id=property-index>Appendix A &mdash; Property index</h2>
     <tr>
      <td><a class=property href="#voice-volume">voice-volume</a>
 
-     <td>normal | silent | x-soft | soft | medium | loud | x-loud |
-      &lt;decibel&gt;
+     <td>silent | [[x-soft | soft | medium | loud | x-loud] ||
+      &lt;decibel&gt;]
 
-     <td>normal
+     <td>medium
 
      <td>all elements