tags
See text-to-speech control tags.
text-to-speech
Technologies for converting textual (ASCII) information into synthetic speech output. Used in voice-processing applications requiring production of broad, unrelated, and unpredictable vocabularies, such as products in a catalog or names and addresses. This technology is appropriate when system design constraints prevent the more efficient use of speech concatenation alone.
text-to-speech control tags
Instructions that can be embedded in text sent to a text-to-speech engine to improve the prosody of the spoken text.
text-to-speech engine
An OLE Component Object Model dynamic-link library (DLL) or executable file (.exe) that provides functionality for converting text to digital-audio speech. Text-to-speech engines are supplied by vendors who specialize in the software.
text-to-speech enumerator
Enumerates the text-to-speech modes provided by all of the engines available to the application.
text-to-speech mode
Analogous to voice quality or personality. Every text-to-speech mode is different, and each allows for different properties such as timbre, accent, language, and digital-audio sampling rate.
threshold
The point below which an utterance is rejected as unrecognized.
training
The process of speaking a series of preselected phrases for the engine. This provides the engine with more information about the voice of the speaker and can improve speech recognition.