Interfaces and Member Functions

Speech-recognition interfaces allow an application to recognize the user's speech using speech-recognition engines and grammars. These interfaces include the following:

Interface

Description

IEnumSRShare

Enumerates, selects, or creates a new shared engine/audio-source pair.

ISRAttributes

Controls the attributes of a speech-recognition engine.

ISRCentral

Controls recognition of speech by a speech-recognition engine, including loading a grammar, pausing and resuming the engine, getting information about the engine or the speech being received, and registering a notification sink.

ISRDialogs

Displays Windows dialog boxes that allow the user to configure the speech-recognition engine.

ISREnum

Enumerates and selects a speech-recognition mode.

ISRFind

Finds and selects the speech-recognition mode that most closely matches the requested characteristics.

ISRGramCFG

Controls a context-free grammar.

ISRGramCommon

Controls features common to both context-free and dictation grammars.

ISRGramDictation

Controls a dictation grammar.

ISRGramInsertionGUI

Displays a small status window next to the insertion point in a document or edit field.

ISRGramNotifySink

Notifies an application that the speaker has finished a phrase, that the engine has a hypothesis about a phrase, or that the engine needs training from the current speaker.

ISRNotifySink

Notifies an application of interference, noise, a possible change of speaker, the beginning or end of an utterance, or the current VU level.

ISRResAudio

Gets an audio recording of an utterance from an engine.

ISRResBasic

Provides general information about a speech-recognition results object.

ISRResCorrection

Confirms that an utterance was recognized correctly, or provides the correct interpretation of an incorrectly recognized utterance.

ISRResEval

Reevaluates the recognition.

ISRResGraph

Provides information from a recognition/alternative graph generated by an engine.

ISRResMemory

Gets the amount of memory used by a particular kind of information in a recognition/alternative graph, or frees memory used by the information.

ISRResMerge

Merges information from two temporally adjacent speech-recognition results objects into a new object.

ISRResModifyGUI

Displays a small status window next to selected text for specific recognition results.

ISRResScore

Provides scores for recognition alternatives.

ISRResSpeaker

Identifies the speaker for a recognition.

ISRSpeaker

Transfers speaker information from one system to another.


This section describes, in alphabetical order, the interfaces and member functions that support speech recognition.

In addition to the interfaces described in this section, each object supports the IUnknown interface, which is a common interface supported by all OLE component objects. A speech-recognition engine may also support ILexPronounce, an optional interface that allows a user to correct words in the engine's pronunciation lexicon. For more information about IUnknown and ILexPronounce, see the section, "Common Elements of the Speech Application Programming Interface."