"prosody" Element

<prosody>

 

Description

Designates the prosodic attributes (pitch, speed, volume) of the text surrounded with tags.

Syntax  

  <prosody

pitch = “x-low” | “low” | “medium” | “high” | “x-high” | “default” | relative change(unit : Hz)

contour = (X1%,pitch1)(X2%,pitch2)...(Xn%,pitchn)

range = “x-low” | “low” | “medium” | “high” | “x-high” | “default” | relative change (unit : Hz)

rate = “x-slow” | “slow” | “medium” | “fast” | “x-fast” | “default” | relative change 

duration = time interval (unit : s | ms)

volume = “silent” | “x-soft” | “soft” | “medium” | “loud” | “x-loud” | “default” | relative change >

    child element

</prosody>  

Attributes        

  Attribute  Description
  pitch  Pitch of the sound.  optional field
  contour  Contour of the sound pitch.  optional field
  range  Range of the sound.  optional field
  rate  Speaking rate.  optional field
  duration  Duration of the time it takes for the text to play.  optional field
  volume  Volume.  optional field

Parent

<audio>, <emphasis>, <p>, <prosody>, <s>, <speak>, <voice>

Children

<audio>, <break>, <emphasis>, <mark>, <p>, <phoneme>, <prosody>, <say-as>, <sub>, <s>, <voice>, #TEXT, #CDATA-SECTION

Limitations/Restrictions

Contour/range/duration are not supported and will be disregarded.

The prosody attributes have characteristics described below.

  pitch    Absolute value  “x-high”, “high”, “medium”, “low”, “x-low”, “default”
  Relative value  -99% ~ +99%
Note 
  

rate

  Absolute value  “x-fast”, “fast”, “medium”, “slow”, “x-slow”, “default” or 0.5~4.0 (default: 1.0)
  Relative value  -99% ~ +99%
Note 
  

volume

  
  Absolute value  “silent”, “x-soft”, “soft”, “medium”, “loud”, “x-loud”,  “default” or 0.0 ~ 100 (default: 50)
  Relative value  -99 ~ +99 or -99% ~ +99%
  Note   +50 and +50% have the same meaning.
* Any value lower than 0 is set to 0, and any value higher  than 100 is set to 100.  

Example  

  <?xml version="1.0"?>

<speak version="1.0" xmlns="http://www.w3.org/2001/10/synthesis"

         xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"

         xsi:schemaLocation="http://www.w3.org/2001/10/synthesis

                   http://www.w3.org/TR/speech-synthesis/synthesis.xsd"

         xml:lang="en-US">

  <prosody pitch=”high” rate=”+50%”>Hello, World!</prosody>

</speak> 


Was this article helpful?
Copyright (C) 2001-2024, Ai Software, LLC d/b/a LumenVox