[css2] Comments from TVRaman, see 0125

ianbjacobs · ianbjacobs · commit 9791da85511f · 1998-02-09T23:08:09.000Z
--HG--
extra : convert_revision : svn%3A73dc7c4b-06e6-40f3-b4f7-9ed1dbc14bfc/trunk%40834
diff --git a/css2/aural.src b/css2/aural.src
@@ -1,6 +1,6 @@
 <!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.0 Transitional//EN">
 <html lang="en">
-<!-- $Id: aural.src,v 2.1 1998-02-07 01:59:54 ijacobs Exp $ -->
+<!-- $Id: aural.src,v 2.2 1998-02-09 23:08:09 ijacobs Exp $ -->
 <HEAD>
 <meta http-equiv="Content-Type" content="text/html; charset=ISO-8859-1">
 <TITLE>Aural style sheets</TITLE>
@@ -18,19 +18,23 @@ text and feeding this to a <span class="index-def" title="screen
 reader"><dfn>screen reader</dfn></span> -- software or hardware that
 simply reads all the characters on the screen.  This results in less
 effective presentation than would be the case if the document
-structure were retained.  Style Sheet properties for aural presentation
+structure were retained.  Style sheet properties for aural presentation
 may be used together with visual properties (mixed media) or as an
 aural alternative to visual presentation.
 
 <p>Besides the obvious accessibility advantages, there are other large
-markets for aural presentation, including in-car use, industrial and
-medical documentation systems (intranets), home entertainment, and to
-help users learning to read or who have difficulty reading.
+markets for listening to information, including in-car use, industrial
+and medical documentation systems (intranets), home entertainment, and
+to help users learning to read or who have difficulty reading.
 
-<!-- Talk about aural canvas here. Space, time, frequency, etc. -IJ
--->
+<p>When using aural properties, the <span class="index-inst"
+title="canvas">canvas</span> consists of a three-dimensional physical
+space (sound surrounds) and a temporal space (one may specify sounds
+before, during, and after other sounds). The CSS properties also
+allow authors to vary the quality of synthesized speech (voice type,
+frequency, inflection, etc.).
 
-<!-- Give examples! -->
+<!-- Give examples! -IJ -->
 
 
 <H2><a name="volume-props">Volume properties</a>: <span
@@ -242,9 +246,12 @@ The following two rules are equivalent:
 </pre>
 </div>
 
-<!-- What do UAs do when the auditory icon is not found
-or they cannot render the auditory icon? -IJ -->
+<!-- Proposed, see mail from T.V. Raman -->
 
+<P>If a user agent cannot render an auditory icon (e.g., the user's
+environment does not permit it), we recommend that it produce an
+alternative cue (e.g., popping up a warning, emitting a warning sound,
+etc.)
 
 <H2><a name="mixing-props">Mixing properties</a>: <span
 class="propinst-play-during">'play-during'</span></H2>
@@ -448,23 +455,23 @@ class="value-inst-number"><strong>&lt;number&gt;</strong></span></span>
 somewhat by language but is nevertheless widely supported by speech
 synthesizers.
 <dt><strong>x-slow</strong>
-<dd>Same as ?
+<dd>Same as 80 words per minute.
 <dt><strong>slow</strong>
-<dd>Same as ?
+<dd>Same as 120 words per minute
 <dt><strong>medium</strong>
-<dd>Same as ? Refers to the user's preferred
-speech-rate setting.
+<dd>Same as 180 - 200 words per minute.
 <dt><strong>fast</strong>
-<dd>Same as ?
+<dd>Same as 300 words per minute.
 <dt><strong>x-fast</strong>
-<dd>Same as ?
+<dd>Same as 500 words per minute.
 <dt><strong>faster</strong>
-<dd>Adds ? to current speech rate.
+<dd>Adds 40 words per minute to the current speech rate.
 <dt><strong>slower</strong>
-<dd>Subtracts ? to current speech rate.
+<dd>Subtracts 40 words per minutes from the current speech rate.
 </dl>
 
-<!-- These need completion! -IJ -->
+<!-- Some values are those suggested by T.V. Raman,
+others proposed by Ian in light of TV's values  -IJ -->
 
 <!-- #include src=properties/voice-family.srb -->
 
@@ -503,30 +510,40 @@ in a way that is independent of speech synthesizer?
 
 <!-- #include src=properties/pitch.srb -->
 
-<p>Specifies the average pitch of the speaking voice.
-Values have the following meanings:</P>
+<p>Specifies the average pitch (in hertz) of the speaking voice.  The
+average pitch of a voice depends on the voice family.  For example,
+the average pitch for a standard male voice is around 120hz,
+but for a female voice, it's around 210hz.</p>
+
+<P>Values have the following meanings:</P>
 
 <dl>
 <dt><span class="index-inst" title="&lt;frequency&gt;"><span class="value-inst-frequency"><strong>&lt;frequency&gt;</strong></span></span>
 <dd>Specifies the average pitch of the speaking voice in hertz (Hz).
-<dt><strong>x-low</strong>
-<dd>Same as ?
-<dt><strong>low</strong>
-<dd>Same as ?
-<dt><strong>medium</strong>
-<dd>Same as ?
-<dt><strong>high</strong>
-<dd>Same as ?
-<dt><strong>x-high</strong>
-<dd>Same as ?
+<dt><strong>x-low</strong>, <strong>low</strong>,
+<strong>medium</strong>, <strong>high</strong>, <strong>x-high</strong>
+<dd>These values do not map to absolute frequencies since 
+these values depend on the voice family. User agents should map
+these values to appropriate frequencies based on the voice family
+and user environment. However, user agents must map these values in 
+order (i.e., 'x-low' is a lower frequency than 'low', etc.).
 </dl>
 
-<!-- Needs completion! -IJ -->
+<!-- Give examples! -IJ -->
 
 <!-- #include src=properties/pitch-range.srb -->
 
-<p>Specifies variation in average pitch. Values have the
-following meanings:</p>
+<p>Specifies variation in average pitch.  The perceived pitch of a
+human voice is determined by the fundamental frequency and typically
+has a value of 120hz for a male voice and 200hz for a female voice.
+Human languages are spoken with varying inflection and pitch; these
+variations convey additional meaning and emphasis.  Thus, a highly
+animated voice, i.e., one that is heavily inflected, displays a high
+pitch range. This property specifies the range over which these
+variations occur, i.e., how much the fundamental frequency may deviate
+from the average pitch.
+
+<P>Values have the following meanings:</p>
 
 <dl>
 <dt><span class="index-inst" title="&lt;number&gt;"><span class="value-inst-number"><strong>&lt;number&gt;</strong></span></span>
@@ -535,38 +552,43 @@ a flat, monotonic voice. A pitch range of 50 produces normal
 inflection.  Pitch ranges greater than 50 produce animated voices.
 </dl>
 
-<!-- Needs completion -IJ -->
 
 <!-- #include src=properties/stress.srb -->
 
-<p>Specifies the level of stress (assertiveness or emphasis) of the
-speaking voice.  English is a <strong>stressed</strong> language, and
-different parts of a sentence are assigned primary, secondary or
-tertiary stress. The value of <span
+<p>Specifies the the height of "local peaks" in the intonation contour
+of a voice. For example, English is a <strong>stressed</strong>
+language, and different parts of a sentence are assigned primary,
+secondary, or tertiary stress. The value of <span
 class="propinst-stress">'stress'</span> controls the amount of
-inflection that results from these stress markers. Values
-have the following meanings:</p>
+inflection that results from these stress markers.  This property is a
+companion to the <span
+class="propinst-pitch-range">'pitch-range'</span> property and is
+provided to allow developers to exploit higher-end auditory displays.
+
+<P>Values have the following meanings:</p>
 
 <dl>
 <dt><span class="index-inst" title="&lt;number&gt;"><span class="value-inst-number"><strong>&lt;number&gt;</strong></span></span>
-<dd>Increasing the value of this property results in the speech being
-more strongly inflected.  It is, in a sense, a companion to the <span
-class="propinst-pitch-range">'pitch-range'</span> property and is
-provided to allow developers to exploit higher-end auditory displays.
+<dd>The greater the value, the more inflected the voice. For
+example, a value of 30 or 40Hz corresponds to 
+a standard, English-speaking male voice (average pitch = 122Hz), speaking
+with normal intonation and emphasis. The fundamental frequency
+may go up to, but never higher than, around 155hz for stressed 
+parts of the speech.
 </dl>
 
-<!-- Needs completion -IJ -->
-
 <!-- #include src=properties/richness.srb -->
 
-<P>Specifies the richness (brightness) of the speaking voice. 
-Values have the following meanings:</p>
+<P>Specifies the richness, or brightness, of the speaking voice.  A
+rich voice will "carry" in a large room, a smooth voice will not.
+(The term "smooth" refers to how the wave form looks when drawn.)
+
+<P>Values have the following meanings:</p>
 
 <dl>
 <dt><span class="index-inst" title="&lt;number&gt;"><span class="value-inst-number"><strong>&lt;number&gt;</strong></span></span>
-<dd>The effect of increasing richness is to produce a voice that
-<em>carries</em>. Reducing richness produces a soft, mellifluous
-voice.
+<dd>The higher the value, the more the voice will carry.
+A lower value will produce a soft, mellifluous voice.
 </dl>
 
 <!-- Needs completion -IJ -->