Embedded Speech Commands


Embedded commands are used to fine-tune the pronunciation of individual words in the text you pass to a synthesizer.

An embedded command is enclosed in “[[“ and “]]” characters, so called delimiters.
All embedded commands consist of a 4-character command code and a parameter.

Separated by semicolons, more than one command may occur within a single pair of delimiters.
For example: Do
[[emph 100; rate 100]] not [[rate 175]] over tighten the screw.

A parameter may consist of a string or numeric type, and may be accompanied by the + or - characters.
Some commands allow you to use the parameter to specify either an absolute value or a relative value.
For example, the
volm command allows you to specify a particular volume or an amount by which to increase or decrease the current volume, as shown below:
[[volm 0.3]] This command sets the volume with which the following words are spoken to 0.3.
[[volm +0.1]] This command increases the volume with which the following words are spoken by 0.1.

Please not that not all commands are supported by all synthesizers nor by all voices. Also the implementation may vary greatly. Some use an effects until it’s switched off again, while others, apply an effect only to the word immediately preceded by the embedded command. Also, remember to URL parameter you parameter submission, e.g. “+” turns into “%2B”. Btw, the “Alex” voice seems to be the most compliant, with regards to the Embedded Speech Commands.