11<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.0 Transitional//EN">
22<html lang="en">
3- <!-- $Id: aural.src,v 2.1 1998-02-07 01:59:54 ijacobs Exp $ -->
3+ <!-- $Id: aural.src,v 2.2 1998-02-09 23:08:09 ijacobs Exp $ -->
44<HEAD>
55<meta http-equiv="Content-Type" content="text/html; charset=ISO-8859-1">
66<TITLE>Aural style sheets</TITLE>
@@ -18,19 +18,23 @@ text and feeding this to a <span class="index-def" title="screen
1818reader"><dfn>screen reader</dfn></span> -- software or hardware that
1919simply reads all the characters on the screen. This results in less
2020effective presentation than would be the case if the document
21- structure were retained. Style Sheet properties for aural presentation
21+ structure were retained. Style sheet properties for aural presentation
2222may be used together with visual properties (mixed media) or as an
2323aural alternative to visual presentation.
2424
2525<p>Besides the obvious accessibility advantages, there are other large
26- markets for aural presentation , including in-car use, industrial and
27- medical documentation systems (intranets), home entertainment, and to
28- help users learning to read or who have difficulty reading.
26+ markets for listening to information, including in-car use, industrial
27+ and medical documentation systems (intranets), home entertainment, and
28+ help for users who are learning to read or who have difficulty reading.
2929
30- <!-- Talk about aural canvas here. Space, time, frequency, etc. -IJ
31- -->
30+ <p>When using aural properties, the <span class="index-inst"
31+ title="canvas">canvas</span> consists of a three-dimensional physical
32+ space (sound surrounds) and a temporal space (one may specify sounds
33+ before, during, and after other sounds). The CSS properties also
34+ allow authors to vary the quality of synthesized speech (voice type,
35+ frequency, inflection, etc.).
3236
33- <!-- Give examples! -->
37+ <!-- Give examples! -IJ -->
3438
3539
3640<H2><a name="volume-props">Volume properties</a>: <span
@@ -242,9 +246,12 @@ The following two rules are equivalent:
242246</pre>
243247</div>
244248
245- <!-- What do UAs do when the auditory icon is not found
246- or they cannot render the auditory icon? -IJ -->
249+ <!-- Proposed, see mail from T.V. Raman -->
247250
251+ <P>If a user agent cannot render an auditory icon (e.g., the user's
252+ environment does not permit it), we recommend that it produce an
253+ alternative cue (e.g., popping up a warning or emitting a warning
254+ sound).</P>
248255
249256<H2><a name="mixing-props">Mixing properties</a>: <span
250257class="propinst-play-during">'play-during'</span></H2>
@@ -448,23 +455,23 @@ class="value-inst-number"><strong><number></strong></span></span>
448455somewhat by language but is nevertheless widely supported by speech
449456synthesizers.
450457<dt><strong>x-slow</strong>
451- <dd>Same as ?
458+ <dd>Same as 80 words per minute.
452459<dt><strong>slow</strong>
453- <dd>Same as ?
460+ <dd>Same as 120 words per minute.
454461<dt><strong>medium</strong>
455- <dd>Same as ? Refers to the user's preferred
456- speech-rate setting.
462+ <dd>Same as 180 to 200 words per minute.
457463<dt><strong>fast</strong>
458- <dd>Same as ?
464+ <dd>Same as 300 words per minute.
459465<dt><strong>x-fast</strong>
460- <dd>Same as ?
466+ <dd>Same as 500 words per minute.
461467<dt><strong>faster</strong>
462- <dd>Adds ? to current speech rate.
468+ <dd>Adds 40 words per minute to the current speech rate.
463469<dt><strong>slower</strong>
464- <dd>Subtracts ? to current speech rate.
470+ <dd>Subtracts 40 words per minute from the current speech rate.
465471</dl>
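
<div class="example"><P>
The keyword values above can be illustrated with a short style sheet.
This is only a sketch: the selectors and the class name are
hypothetical, and it assumes that the "current speech rate" used by
'faster' is the rate inherited from the parent element.

<PRE>
BODY         { speech-rate: medium }  /* 180 to 200 words per minute */
H1           { speech-rate: slow }    /* 120 words per minute */
H1 EM        { speech-rate: faster }  /* assumed: 120 + 40 = 160 words per minute */
P.disclaimer { speech-rate: 300 }     /* hypothetical class; same rate as 'fast' */
</PRE>
</div>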
466472
467- <!-- These need completion! -IJ -->
473+ <!-- Some values are those suggested by T.V. Raman,
474+ others proposed by Ian in light of TV's values -IJ -->
468475
469476<!-- #include src=properties/voice-family.srb -->
470477
@@ -503,30 +510,40 @@ in a way that is independent of speech synthesizer?
503510
504511<!-- #include src=properties/pitch.srb -->
505512
506- <p>Specifies the average pitch of the speaking voice.
507- Values have the following meanings:</P>
513+ <p>Specifies the average pitch (in hertz) of the speaking voice. The
514+ average pitch of a voice depends on the voice family. For example,
515+ the average pitch for a standard male voice is around 120Hz,
516+ but for a female voice, it's around 210Hz.</p>
517+
518+ <P>Values have the following meanings:</P>
508519
509520<dl>
510521<dt><span class="index-inst" title="<frequency>"><span class="value-inst-frequency"><strong><frequency></strong></span></span>
511522<dd>Specifies the average pitch of the speaking voice in hertz (Hz).
512- <dt><strong>x-low</strong>
513- <dd>Same as ?
514- <dt><strong>low</strong>
515- <dd>Same as ?
516- <dt><strong>medium</strong>
517- <dd>Same as ?
518- <dt><strong>high</strong>
519- <dd>Same as ?
520- <dt><strong>x-high</strong>
521- <dd>Same as ?
523+ <dt><strong>x-low</strong>, <strong>low</strong>,
524+ <strong>medium</strong>, <strong>high</strong>, <strong>x-high</strong>
525+ <dd>These values do not map to absolute frequencies because
526+ the appropriate frequencies depend on the voice family. User agents should map
527+ these values to appropriate frequencies based on the voice family
528+ and user environment. However, user agents must map these values in
529+ order (i.e., 'x-low' is a lower frequency than 'low', which is lower
529+ than 'medium', and so on).
522530</dl>
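
<div class="example"><P>
As a hedged illustration of the two kinds of value, the following
rules set one absolute pitch and two keyword pitches. The selectors
and class name are hypothetical, and the frequencies a user agent
chooses for the keywords are not defined here.

<PRE>
P.narrator    { pitch: 120Hz }  /* hypothetical class; explicit average pitch in hertz */
BLOCKQUOTE    { pitch: low }    /* frequency chosen by the user agent */
BLOCKQUOTE EM { pitch: high }   /* must be a higher frequency than 'low' */
</PRE>
</div>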
523531
524- <!-- Needs completion ! -IJ -->
532+ <!-- Give examples ! -IJ -->
525533
526534<!-- #include src=properties/pitch-range.srb -->
527535
528- <p>Specifies variation in average pitch. Values have the
529- following meanings:</p>
536+ <p>Specifies variation in average pitch. The perceived pitch of a
537+ human voice is determined by the fundamental frequency and typically
538+ has a value of 120Hz for a male voice and 200Hz for a female voice.
539+ Human languages are spoken with varying inflection and pitch; these
540+ variations convey additional meaning and emphasis. Thus, a highly
541+ animated voice, i.e., one that is heavily inflected, displays a high
542+ pitch range. This property specifies the range over which these
543+ variations occur, i.e., how much the fundamental frequency may deviate
544+ from the average pitch.
545+
546+ <P>Values have the following meanings:</p>
530547
531548<dl>
532549<dt><span class="index-inst" title="<number>"><span class="value-inst-number"><strong><number></strong></span></span>
@@ -535,38 +552,43 @@ a flat, monotonic voice. A pitch range of 50 produces normal
535552inflection. Pitch ranges greater than 50 produce animated voices.
536553</dl>
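
<div class="example"><P>
A minimal sketch of how the numeric values might be applied. The
class names are hypothetical, and the value 80 is simply one
arbitrary choice above 50.

<PRE>
P.legal   { pitch-range: 0 }   /* flat, monotonic voice */
P         { pitch-range: 50 }  /* normal inflection */
P.excited { pitch-range: 80 }  /* animated voice (any value above 50) */
</PRE>
</div>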
537554
538- <!-- Needs completion -IJ -->
539555
540556<!-- #include src=properties/stress.srb -->
541557
542- <p>Specifies the level of stress (assertiveness or emphasis) of the
543- speaking voice. English is a <strong>stressed</strong> language, and
544- different parts of a sentence are assigned primary, secondary or
545- tertiary stress. The value of <span
558+ <p>Specifies the height of "local peaks" in the intonation contour
559+ of a voice. For example, English is a <strong>stressed</strong>
560+ language, and different parts of a sentence are assigned primary,
561+ secondary, or tertiary stress. The value of <span
546562class="propinst-stress">'stress'</span> controls the amount of
547- inflection that results from these stress markers. Values
548- have the following meanings:</p>
563+ inflection that results from these stress markers. This property is a
564+ companion to the <span
565+ class="propinst-pitch-range">'pitch-range'</span> property and is
566+ provided to allow developers to exploit higher-end auditory displays.
567+
568+ <P>Values have the following meanings:</p>
549569
550570<dl>
551571<dt><span class="index-inst" title="<number>"><span class="value-inst-number"><strong><number></strong></span></span>
552- <dd>Increasing the value of this property results in the speech being
553- more strongly inflected. It is, in a sense, a companion to the <span
554- class="propinst-pitch-range">'pitch-range'</span> property and is
555- provided to allow developers to exploit higher-end auditory displays.
572+ <dd>The greater the value, the more inflected the voice. For
573+ example, a value of 30 or 40Hz corresponds to
574+ a standard, English-speaking male voice (average pitch = 122Hz), speaking
575+ with normal intonation and emphasis. The fundamental frequency
576+ may rise to around 155Hz, but no higher, for stressed
577+ parts of the speech.
556578</dl>
557579
558- <!-- Needs completion -IJ -->
559-
560580<!-- #include src=properties/richness.srb -->
561581
562- <P>Specifies the richness (brightness) of the speaking voice.
563- Values have the following meanings:</p>
582+ <P>Specifies the richness, or brightness, of the speaking voice. A
583+ rich voice will "carry" in a large room; a smooth voice will not.
584+ (The term "smooth" refers to how the wave form looks when drawn.)
585+
586+ <P>Values have the following meanings:</p>
564587
565588<dl>
566589<dt><span class="index-inst" title="<number>"><span class="value-inst-number"><strong><number></strong></span></span>
567- <dd>The effect of increasing richness is to produce a voice that
568- <em>carries</em>. Reducing richness produces a soft, mellifluous
569- voice.
590+ <dd>The higher the value, the more the voice will carry.
591+ A lower value will produce a soft, mellifluous voice.
570592</dl>
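
<div class="example"><P>
The contrast between a high and a low value could be written as
follows. This is a sketch only: the class names are hypothetical, and
the numbers assume a 0 to 100 scale like the one used for
'pitch-range', which this section does not state.

<PRE>
P.announcement { richness: 90 }  /* assumed scale; a voice that carries */
P.aside        { richness: 20 }  /* assumed scale; a soft, mellifluous voice */
</PRE>
</div>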
571593
572594<!-- Needs completion -IJ -->