flackr
diff --git a/css3-speech/Overview.html b/css3-speech/Overview.html
Lines changed: 70 additions & 46 deletions
@@ -436,8 +436,8 @@ <h2 id=ssml-rel><span class=secno>3. </span>Relationship with SSML</h2>
    However, the specificities of the CSS model mean that compatibility with
    SSML in terms of syntax and/or semantics is only partially achievable. The
    definition of each property in the Speech module includes informative
-   statements, wherever necessary, to clarify the relationship with similar
-   features in SSML.
+   statements, wherever necessary, to clarify that property's relationship
+   with similar functionality in SSML.
 
   <h2 id=css-values><span class=secno>4. </span>CSS values</h2>
 
@@ -1206,7 +1206,8 @@ <h3 id=pause-props-pause-before-after><span class=secno>9.1. </span>The
    property is similar to the <a
    href="http://www.w3.org/TR/speech-synthesis11/#edef_break"><code>break</code>
    element</a> from the SSML markup language <a href="#SSML"
-   rel=biblioentry>[SSML]<!--{{!SSML}}--></a>, the application of prosodic
+   rel=biblioentry>[SSML]<!--{{!SSML}}--></a>, the application of &lsquo;<a
+   href="#pause"><code class=property>pause</code></a>&rsquo; prosodic
    boundaries within the <a href="#aural-model">aural "box" model</a> of CSS
    Speech requires special considerations (e.g. <a
    href="#collapsed-pauses">"collapsed" pauses</a>).
@@ -1482,11 +1483,15 @@ <h3 id=rest-props-rest-before-after><span class=secno>10.1. </span>The
    that occurs before (or after) the speech synthesis rendition of an element
    within the <a href="#aural-model">audio "box" model</a>.
 
-  <p class=note> Note that the functionality provided by this property is
-   related to the <a
+  <p class=note> Note that although the functionality provided by this
+   property is similar to the <a
    href="http://www.w3.org/TR/speech-synthesis11/#edef_break"><code>break</code>
    element</a> from the SSML markup language <a href="#SSML"
-   rel=biblioentry>[SSML]<!--{{!SSML}}--></a>.
+   rel=biblioentry>[SSML]<!--{{!SSML}}--></a>, the application of &lsquo;<a
+   href="#rest"><code class=property>rest</code></a>&rsquo; prosodic
+   boundaries within the <a href="#aural-model">aural "box" model</a> of CSS
+   Speech requires special considerations (e.g. interspersed audio cues,
+   additive adjacent rests).
 
   <dl>
    <dt> <strong>&lt;time&gt;</strong>
@@ -1683,11 +1688,15 @@ <h3 id=cue-props-cue-before-after><span class=secno>11.1. </span>The
    clips) to be played before (or after) the selected element within the <a
    href="#aural-model">audio "box" model</a>.
 
-  <p class=note> Note that the functionality provided by this property is
-   related to the <a
+  <p class=note> Note that although the functionality provided by this
+   property may appear related to the <a
    href="http://www.w3.org/TR/speech-synthesis11/#edef_audio"><code>audio</code>
    element</a> from the SSML markup language <a href="#SSML"
-   rel=biblioentry>[SSML]<!--{{!SSML}}--></a>.
+   rel=biblioentry>[SSML]<!--{{!SSML}}--></a>, there are in fact major
+   discrepancies. For example, the <a href="#aural-model">aural "box"
+   model</a> means that audio cues are associated with the selected element's
+   volume level, and CSS Speech's auditory icons provide limited
+   functionality compared to SSML's <code>audio</code> element.
 
   <dl>
    <dt> <strong>&lt;uri&gt;</strong>
@@ -1936,11 +1945,14 @@ <h3 id=voice-props-voice-family><span class=secno>12.1. </span>The
   <p> <strong>&lt;generic-voice&gt;</strong> = [&lt;age&gt;? &lt;gender&gt;
    &lt;integer&gt;?]
 
-  <p class=note> Note that the functionality provided by this property is
-   related to the <a
+  <p class=note> Note that although the functionality provided by this
+   property is similar to the <a
    href="http://www.w3.org/TR/speech-synthesis11/#edef_voice"><code>voice</code>
    element</a> from the SSML markup language <a href="#SSML"
-   rel=biblioentry>[SSML]<!--{{!SSML}}--></a>.
+   rel=biblioentry>[SSML]<!--{{!SSML}}--></a>, CSS Speech does not provide an
+   equivalent to SSML's sophisticated voice language selection. This
+   technical limitation may be alleviated in a future revision of the Speech
+   module.
 
   <dl>
    <dt> <strong>&lt;name&gt;</strong>
@@ -1986,24 +1998,14 @@ <h3 id=voice-props-voice-family><span class=secno>12.1. </span>The
     <p> Possible values are &lsquo;<code class=property>child</code>&rsquo;,
      &lsquo;<code class=property>young</code>&rsquo; and &lsquo;<code
      class=property>old</code>&rsquo;, indicating the preferred age category
-     to match during voice selection. The mapping with <a href="#SSML"
-     rel=biblioentry>[SSML]<!--{{!SSML}}--></a> ages is defined as follows:
-     &lsquo;<code class=property>child</code>&rsquo; = 6 y/o, &lsquo;<code
-     class=property>young</code>&rsquo; = 24 y/o, &lsquo;<code
-     class=property>old</code>&rsquo; = 75 y/o (note that more flexible age
-     ranges may be used by the processor-dependent voice-matching algorithm).
-     </p>
+     to match during voice selection.</p>
 
-    <p class=note> Note that the interpretation of the relationship between a
-     person's age and a recognizable type of voice cannot realistically be
-     defined in a universal manner, as it effectively depends on numerous
-     criteria (cultural, linguistic, biological, etc.). The values provided
-     by this specification therefore represent a simplified model that can be
-     reasonably applied to a broad variety of speech contexts, albeit at the
-     cost of a certain degree of approximation. Future versions of this
-     specification may refine the level of precision of the voice-matching
-     algorithm, as speech processor implementations become more standardized.
-     </p>
+    <p class=note> Note that a recommended mapping with <a href="#SSML"
+     rel=biblioentry>[SSML]<!--{{!SSML}}--></a> ages is: &lsquo;<code
+     class=property>child</code>&rsquo; = 6 y/o, &lsquo;<code
+     class=property>young</code>&rsquo; = 24 y/o, &lsquo;<code
+     class=property>old</code>&rsquo; = 75 y/o. More flexible age ranges may
+     be used by the processor-dependent voice-matching algorithm.</p>
 
    <dt> <strong>&lt;gender&gt;</strong>
 
@@ -2013,6 +2015,17 @@ <h3 id=voice-props-voice-family><span class=secno>12.1. </span>The
      class=property>neutral</code>&rsquo;, specifying a male, female, or
      neutral voice, respectively.</p>
 
+    <p class=note> Note that the interpretation of the relationship between a
+     person's age or gender, and a recognizable type of voice, cannot
+     realistically be defined in a universal manner as it effectively depends
+     on numerous criteria (cultural, linguistic, biological, etc.). The
+     functionality provided by this specification therefore represents a
+     simplified model that can be reasonably applied to a broad variety of
+     speech contexts, albeit at the cost of a certain degree of
+     approximation. Future versions of this specification may refine the
+     level of precision of the voice-matching algorithm, as speech processor
+     implementations become more standardized.</p>
+
    <dt> <strong>&lt;integer&gt;</strong>
 
    <dd>
@@ -2184,11 +2197,14 @@ <h3 id=voice-props-voice-rate><span class=secno>12.2. </span>The &lsquo;<a
    class=property>voice-rate</code></a>&rsquo; property manipulates the rate
    of generated synthetic speech in terms of words per minute.
 
-  <p class=note> Note that the functionality provided by this property is
-   related to the <a
+  <p class=note> Note that although the functionality provided by this
+   property is similar to the <a
    href="http://www.w3.org/TR/speech-synthesis11/#edef_prosody"><code>rate</code>
    attribute of the <code>prosody</code> element</a> from the SSML markup
-   language <a href="#SSML" rel=biblioentry>[SSML]<!--{{!SSML}}--></a>.
+   language <a href="#SSML" rel=biblioentry>[SSML]<!--{{!SSML}}--></a>, there
+   are notable discrepancies. For example, CSS Speech rate keywords and
+   percentage modifiers are not mutually exclusive, due to how values are
+   inherited and combined for selected elements.
 
   <dl>
    <dt> <strong>normal</strong>
@@ -2323,11 +2339,15 @@ <h3 id=voice-props-voice-pitch><span class=secno>12.3. </span>The &lsquo;<a
    pitch of the output). For example, the common pitch for a male voice is
    around 120Hz, whereas it is around 210Hz for a female voice.
 
-  <p class=note> Note that the functionality provided by this property is
-   related to the <a
+  <p class=note> Note that although the functionality provided by this
+   property is similar to the <a
    href="http://www.w3.org/TR/speech-synthesis11/#edef_prosody"><code>pitch</code>
    attribute of the <code>prosody</code> element</a> from the SSML markup
-   language <a href="#SSML" rel=biblioentry>[SSML]<!--{{!SSML}}--></a>.
+   language <a href="#SSML" rel=biblioentry>[SSML]<!--{{!SSML}}--></a>, there
+   are notable discrepancies. For example, CSS Speech pitch keywords and
+   relative changes (frequency, semitone or percentage) are not
+   mutually exclusive, due to how values are inherited and combined for
+   selected elements.
 
   <dl>
    <dt> <strong>&lt;frequency&gt;</strong>
@@ -2483,11 +2503,15 @@ <h3 id=voice-props-voice-range><span class=secno>12.4. </span>The &lsquo;<a
    to convey meaning and emphasis in speech. Typically, a low range produces
    a flat, monotonic voice, whereas a high range produces an animated voice.
 
-  <p class=note> Note that the functionality provided by this property is
-   related to the <a
+  <p class=note> Note that although the functionality provided by this
+   property is similar to the <a
    href="http://www.w3.org/TR/speech-synthesis11/#edef_prosody"><code>range</code>
    attribute of the <code>prosody</code> element</a> from the SSML markup
-   language <a href="#SSML" rel=biblioentry>[SSML]<!--{{!SSML}}--></a>.
+   language <a href="#SSML" rel=biblioentry>[SSML]<!--{{!SSML}}--></a>, there
+   are notable discrepancies. For example, CSS Speech pitch range keywords
+   and relative changes (frequency, semitone or percentage) are not
+   mutually exclusive, due to how values are inherited and combined for
+   selected elements.
 
   <dl>
    <dt> <strong>&lt;frequency&gt;</strong>
@@ -2689,7 +2713,7 @@ <h3 id=voice-props-voice-stress><span class=secno>12.5. </span>The
    spoken.
 
   <p class=note> Note that the functionality provided by this property is
-   related to the <a
+   similar to the <a
    href="http://www.w3.org/TR/speech-synthesis11/#edef_emphasis"><code>emphasis</code>
    element</a> from the SSML markup language <a href="#SSML"
    rel=biblioentry>[SSML]<!--{{!SSML}}--></a>.
@@ -2817,7 +2841,7 @@ <h3 id=mixing-props-voice-duration><span class=secno>13.1. </span>The
    property).
 
   <p class=note> Note that the functionality provided by this property is
-   related to the <a
+   similar to the <a
    href="http://www.w3.org/TR/speech-synthesis11/#edef_prosody"><code>duration</code>
    attribute of the <code>prosody</code> element</a> from the SSML markup
    language <a href="#SSML" rel=biblioentry>[SSML]<!--{{!SSML}}--></a>.
@@ -2916,7 +2940,7 @@ <h2 id=content><span class=secno>15. </span>Inserted and replaced content</h2>
    unlikely to be recognized by the synthesizer. The &lsquo;<a
    href="#content-def"><code class=property>content</code></a>&rsquo;
    property can be used to replace one string by another. The functionality
-   provided by this property is related to the <a
+   provided by this property is similar to the <a
    href="http://www.w3.org/TR/speech-synthesis11/#edef_sub"><code>alias</code>
    attribute of the <code>sub</code> element</a> from the SSML markup
    language <a href="#SSML" rel=biblioentry>[SSML]<!--{{!SSML}}--></a>.
@@ -3002,11 +3026,11 @@ <h2 id=pronunciation><span class=secno>16. </span> Pronunciation, phonemes</h2>
   <p> Additionally, an attribute-based mechanism can be used within the
    markup to author text-pronunciation associations. At the time of writing,
    such mechanism isn't formally defined in the W3C HTML standard(s).
-   However, the <a href="http://idpf.org/epub/30">EPUB 3.0 draft
-   specification</a> allows (x)HTML5 documents to contain attributes derived
-   from the <a href="#SSML" rel=biblioentry>[SSML]<!--{{!SSML}}--></a>
-   specification, that describe how to pronounce text based on a particular
-   phonetic alphabet.</p>
+   However, the <a href="http://idpf.org/epub/30">EPUB 3.0 specification</a>
+   allows (x)HTML5 documents to contain attributes derived from the <a
+   href="#SSML" rel=biblioentry>[SSML]<!--{{!SSML}}--></a> specification,
+   that describe how to pronounce text based on a particular phonetic
+   alphabet.</p>
   <!-- p> 
       One avenue to explore is the use CSS to "bind" HTML text with a   
       phoneme (also declared in the HTML document). This would maintain a