Skip to content

Commit 9791da8

Browse files
committed
[css2] Comments from TVRaman, see 0125
--HG-- extra : convert_revision : svn%3A73dc7c4b-06e6-40f3-b4f7-9ed1dbc14bfc/trunk%40834
1 parent 04a0c8d commit 9791da8

1 file changed

Lines changed: 74 additions & 52 deletions

File tree

css2/aural.src

Lines changed: 74 additions & 52 deletions
Original file line numberDiff line numberDiff line change
@@ -1,6 +1,6 @@
11
<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.0 Transitional//EN">
22
<html lang="en">
3-
<!-- $Id: aural.src,v 2.1 1998-02-07 01:59:54 ijacobs Exp $ -->
3+
<!-- $Id: aural.src,v 2.2 1998-02-09 23:08:09 ijacobs Exp $ -->
44
<HEAD>
55
<meta http-equiv="Content-Type" content="text/html; charset=ISO-8859-1">
66
<TITLE>Aural style sheets</TITLE>
@@ -18,19 +18,23 @@ text and feeding this to a <span class="index-def" title="screen
1818
reader"><dfn>screen reader</dfn></span> -- software or hardware that
1919
simply reads all the characters on the screen. This results in less
2020
effective presentation than would be the case if the document
21-
structure were retained. Style Sheet properties for aural presentation
21+
structure were retained. Style sheet properties for aural presentation
2222
may be used together with visual properties (mixed media) or as an
2323
aural alternative to visual presentation.
2424

2525
<p>Besides the obvious accessibility advantages, there are other large
26-
markets for aural presentation, including in-car use, industrial and
27-
medical documentation systems (intranets), home entertainment, and to
28-
help users learning to read or who have difficulty reading.
26+
markets for listening to information, including in-car use, industrial
27+
and medical documentation systems (intranets), home entertainment, and
28+
to help users learning to read or who have difficulty reading.
2929

30-
<!-- Talk about aural canvas here. Space, time, frequency, etc. -IJ
31-
-->
30+
<p>When using aural properties, the <span class="index-inst"
31+
title="canvas">canvas</span> consists of a three-dimensional physical
32+
space (sound surrounds) and a temporal space (one may specify sounds
33+
before, during, and after other sounds). The CSS properties also
34+
allow authors to vary the quality of synthesized speech (voice type,
35+
frequency, inflection, etc.).
3236

33-
<!-- Give examples! -->
37+
<!-- Give examples! -IJ -->
3438

3539

3640
<H2><a name="volume-props">Volume properties</a>: <span
@@ -242,9 +246,12 @@ The following two rules are equivalent:
242246
</pre>
243247
</div>
244248

245-
<!-- What do UAs do when the auditory icon is not found
246-
or they cannot render the auditory icon? -IJ -->
249+
<!-- Proposed, see mail from T.V. Raman -->
247250

251+
<P>If a user agent cannot render an auditory icon (e.g., the user's
252+
environment does not permit it), we recommend that it produce an
253+
alternative cue (e.g., popping up a warning, emitting a warning sound,
254+
etc.)
248255

249256
<H2><a name="mixing-props">Mixing properties</a>: <span
250257
class="propinst-play-during">'play-during'</span></H2>
@@ -448,23 +455,23 @@ class="value-inst-number"><strong>&lt;number&gt;</strong></span></span>
448455
somewhat by language but is nevertheless widely supported by speech
449456
synthesizers.
450457
<dt><strong>x-slow</strong>
451-
<dd>Same as ?
458+
<dd>Same as 80 words per minute.
452459
<dt><strong>slow</strong>
453-
<dd>Same as ?
460+
<dd>Same as 120 words per minute
454461
<dt><strong>medium</strong>
455-
<dd>Same as ? Refers to the user's preferred
456-
speech-rate setting.
462+
<dd>Same as 180 - 200 words per minute.
457463
<dt><strong>fast</strong>
458-
<dd>Same as ?
464+
<dd>Same as 300 words per minute.
459465
<dt><strong>x-fast</strong>
460-
<dd>Same as ?
466+
<dd>Same as 500 words per minute.
461467
<dt><strong>faster</strong>
462-
<dd>Adds ? to current speech rate.
468+
<dd>Adds 40 words per minute to the current speech rate.
463469
<dt><strong>slower</strong>
464-
<dd>Subtracts ? to current speech rate.
470+
<dd>Subtracts 40 words per minutes from the current speech rate.
465471
</dl>
466472

467-
<!-- These need completion! -IJ -->
473+
<!-- Some values are those suggested by T.V. Raman,
474+
others proposed by Ian in light of TV's values -IJ -->
468475

469476
<!-- #include src=properties/voice-family.srb -->
470477

@@ -503,30 +510,40 @@ in a way that is independent of speech synthesizer?
503510

504511
<!-- #include src=properties/pitch.srb -->
505512

506-
<p>Specifies the average pitch of the speaking voice.
507-
Values have the following meanings:</P>
513+
<p>Specifies the average pitch (in hertz) of the speaking voice. The
514+
average pitch of a voice depends on the voice family. For example,
515+
the average pitch for a standard male voice is around 120hz,
516+
but for a female voice, it's around 210hz.</p>
517+
518+
<P>Values have the following meanings:</P>
508519

509520
<dl>
510521
<dt><span class="index-inst" title="&lt;frequency&gt;"><span class="value-inst-frequency"><strong>&lt;frequency&gt;</strong></span></span>
511522
<dd>Specifies the average pitch of the speaking voice in hertz (Hz).
512-
<dt><strong>x-low</strong>
513-
<dd>Same as ?
514-
<dt><strong>low</strong>
515-
<dd>Same as ?
516-
<dt><strong>medium</strong>
517-
<dd>Same as ?
518-
<dt><strong>high</strong>
519-
<dd>Same as ?
520-
<dt><strong>x-high</strong>
521-
<dd>Same as ?
523+
<dt><strong>x-low</strong>, <strong>low</strong>,
524+
<strong>medium</strong>, <strong>high</strong>, <strong>x-high</strong>
525+
<dd>These values do not map to absolute frequencies since
526+
these values depend on the voice family. User agents should map
527+
these values to appropriate frequencies based on the voice family
528+
and user environment. However, user agents must map these values in
529+
order (i.e., 'x-low' is a lower frequency than 'low', etc.).
522530
</dl>
523531

524-
<!-- Needs completion! -IJ -->
532+
<!-- Give examples! -IJ -->
525533

526534
<!-- #include src=properties/pitch-range.srb -->
527535

528-
<p>Specifies variation in average pitch. Values have the
529-
following meanings:</p>
536+
<p>Specifies variation in average pitch. The perceived pitch of a
537+
human voice is determined by the fundamental frequency and typically
538+
has a value of 120hz for a male voice and 200hz for a female voice.
539+
Human languages are spoken with varying inflection and pitch; these
540+
variations convey additional meaning and emphasis. Thus, a highly
541+
animated voice, i.e., one that is heavily inflected, displays a high
542+
pitch range. This property specifies the range over which these
543+
variations occur, i.e., how much the fundamental frequency may deviate
544+
from the average pitch.
545+
546+
<P>Values have the following meanings:</p>
530547

531548
<dl>
532549
<dt><span class="index-inst" title="&lt;number&gt;"><span class="value-inst-number"><strong>&lt;number&gt;</strong></span></span>
@@ -535,38 +552,43 @@ a flat, monotonic voice. A pitch range of 50 produces normal
535552
inflection. Pitch ranges greater than 50 produce animated voices.
536553
</dl>
537554

538-
<!-- Needs completion -IJ -->
539555

540556
<!-- #include src=properties/stress.srb -->
541557

542-
<p>Specifies the level of stress (assertiveness or emphasis) of the
543-
speaking voice. English is a <strong>stressed</strong> language, and
544-
different parts of a sentence are assigned primary, secondary or
545-
tertiary stress. The value of <span
558+
<p>Specifies the the height of "local peaks" in the intonation contour
559+
of a voice. For example, English is a <strong>stressed</strong>
560+
language, and different parts of a sentence are assigned primary,
561+
secondary, or tertiary stress. The value of <span
546562
class="propinst-stress">'stress'</span> controls the amount of
547-
inflection that results from these stress markers. Values
548-
have the following meanings:</p>
563+
inflection that results from these stress markers. This property is a
564+
companion to the <span
565+
class="propinst-pitch-range">'pitch-range'</span> property and is
566+
provided to allow developers to exploit higher-end auditory displays.
567+
568+
<P>Values have the following meanings:</p>
549569

550570
<dl>
551571
<dt><span class="index-inst" title="&lt;number&gt;"><span class="value-inst-number"><strong>&lt;number&gt;</strong></span></span>
552-
<dd>Increasing the value of this property results in the speech being
553-
more strongly inflected. It is, in a sense, a companion to the <span
554-
class="propinst-pitch-range">'pitch-range'</span> property and is
555-
provided to allow developers to exploit higher-end auditory displays.
572+
<dd>The greater the value, the more inflected the voice. For
573+
example, a value of 30 or 40Hz corresponds to
574+
a standard, English-speaking male voice (average pitch = 122Hz), speaking
575+
with normal intonation and emphasis. The fundamental frequency
576+
may go up to, but never higher than, around 155hz for stressed
577+
parts of the speech.
556578
</dl>
557579

558-
<!-- Needs completion -IJ -->
559-
560580
<!-- #include src=properties/richness.srb -->
561581

562-
<P>Specifies the richness (brightness) of the speaking voice.
563-
Values have the following meanings:</p>
582+
<P>Specifies the richness, or brightness, of the speaking voice. A
583+
rich voice will "carry" in a large room, a smooth voice will not.
584+
(The term "smooth" refers to how the wave form looks when drawn.)
585+
586+
<P>Values have the following meanings:</p>
564587

565588
<dl>
566589
<dt><span class="index-inst" title="&lt;number&gt;"><span class="value-inst-number"><strong>&lt;number&gt;</strong></span></span>
567-
<dd>The effect of increasing richness is to produce a voice that
568-
<em>carries</em>. Reducing richness produces a soft, mellifluous
569-
voice.
590+
<dd>The higher the value, the more the voice will carry.
591+
A lower value will produce a soft, mellifluous voice.
570592
</dl>
571593

572594
<!-- Needs completion -IJ -->

0 commit comments

Comments
 (0)