T

tags

See text-to-speech control tags.

text-to-speech

Technologies for converting textual (ASCII) information into synthetic speech output. Text-to-speech is used in voice-processing applications that must speak broad, unrelated, and unpredictable vocabularies, such as products in a catalog or names and addresses. It is appropriate when system design constraints rule out the more efficient approach of relying on speech concatenation alone.

text-to-speech control tags

Instructions that can be embedded in text sent to a text-to-speech engine to improve the prosody of the spoken text.
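
As a minimal sketch of how an application might embed such instructions, the following Python builds tagged text before handing it to an engine. The backslash-delimited tag form and the tag names `Pau` and `Emp` are assumptions made for illustration; the actual syntax depends on the engine, and the helper functions are hypothetical.

```python
# Sketch only: the \Name=value\ tag form is an assumed syntax, not a given
# engine's documented format. Check the engine's own tag reference.

def tag(name, value=None):
    # Build one control tag, with or without a value.
    return f"\\{name}\\" if value is None else f"\\{name}={value}\\"


def pause_between_sentences(text, milliseconds):
    # Insert a pause tag after each period that is followed by a space.
    pause = tag("Pau", milliseconds)
    return text.replace(". ", f". {pause} ")


tagged = pause_between_sentences("Order received. Shipping in two days.", 300)
print(tagged)
```

The tagged string is what would be sent to the engine in place of the plain text; an engine that does not understand a tag may ignore it or speak it aloud, so tags are best generated per engine.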

text-to-speech engine

An OLE Component Object Model dynamic-link library (DLL) or executable file (.exe) that provides functionality for converting text to digital-audio speech. Text-to-speech engines are supplied by vendors who specialize in the software.

text-to-speech enumerator

A component that enumerates the text-to-speech modes provided by all of the engines available to the application.

text-to-speech mode

A voice offered by a text-to-speech engine, analogous to a voice quality or personality. Each mode is distinct and has its own properties, such as timbre, accent, language, and digital-audio sampling rate.
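
The relationship between engines, modes, and the enumerator can be sketched in Python as follows. This is an illustration under stated assumptions, not a real speech API: the `TtsMode` record, its field names, the `enumerate_modes` function, and the engine and mode names are all hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class TtsMode:
    # Hypothetical record for one mode; field names are illustrative.
    engine: str
    name: str
    language: str
    sample_rate_hz: int


def enumerate_modes(engines):
    # Yield every mode of every engine, in engine order.
    for engine_name, modes in engines.items():
        for name, language, rate in modes:
            yield TtsMode(engine_name, name, language, rate)


# Made-up engines and modes, for illustration only.
installed = {
    "EngineA": [("Anna", "en-US", 22050), ("Karl", "de-DE", 22050)],
    "EngineB": [("Robot", "en-US", 11025)],
}

# An application typically filters the enumerated modes by the
# properties it needs, here by language.
english = [m.name for m in enumerate_modes(installed) if m.language == "en-US"]
print(english)
```

Flattening all engines into one sequence lets the application choose a mode by its properties without caring which vendor's engine supplies it.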

threshold

The confidence level below which an utterance is rejected as unrecognized.
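
One common reading of this definition treats the threshold as a minimum confidence score. The Python sketch below assumes each recognition candidate arrives as a (phrase, confidence) pair with confidence between 0 and 1; that scale and the `accept` function are assumptions for illustration, not part of any particular engine.

```python
def accept(candidates, threshold):
    # candidates: list of (phrase, confidence) pairs; the 0-to-1
    # confidence scale is an assumption of this sketch.
    best = max(candidates, key=lambda c: c[1], default=None)
    # Reject when nothing was heard or the best score is below threshold.
    if best is None or best[1] < threshold:
        return None
    return best[0]


print(accept([("yes", 0.82), ("guess", 0.40)], 0.6))
print(accept([("yes", 0.50)], 0.6))
```

Raising the threshold reduces false acceptances at the cost of rejecting more valid utterances, so applications usually tune it against recorded speech.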

training

The process of speaking a series of preselected phrases for the engine. This provides the engine with more information about the voice of the speaker and can improve speech recognition.