circuitcellar.com

September 1999, Issue 110

Taking Orders:
A Speech Recognition Module


by John Iovine

TRAINING THE MODULE

Training begins when the Train pin is pulled to ground for at least 100 ms. Pressing the momentary-contact switch marked Train does this, and the module then prompts you to say the first word to be trained.
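If a microcontroller or test script drives the Train pin in place of the push button, the 100-ms hold time is the number to respect. The following Python sketch checks whether a series of sampled pin levels contains a low pulse of that length. The function name and the sample format are assumptions made for illustration; they are not part of the module's interface.

```python
MIN_LOW_MS = 100  # minimum time the Train pin must stay at ground (from the article)


def train_requested(samples, min_low_ms=MIN_LOW_MS):
    """Return True if the pin stayed low for at least min_low_ms.

    samples: iterable of (time_ms, level) pairs in time order,
             where level 0 means the pin is pulled to ground.
    """
    low_since = None  # time at which the current low period started
    for time_ms, level in samples:
        if level == 0:
            if low_since is None:
                low_since = time_ms
            if time_ms - low_since >= min_low_ms:
                return True
        else:
            low_since = None  # pin released, start over
    return False
```

A pin sampled every 10 ms and held low from 0 to 100 ms qualifies; one released after 90 ms does not.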

Speak the word or phrase you want the circuit to recognize into the microphone. It may be up to 3.2 s long but may not contain silences longer than 0.5 s. For example, "Circuit Cellar" is acceptable as long as the two words are not separated by a long pause.
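The two limits above (3.2 s total, no pause longer than 0.5 s) can be expressed as a simple check. The sketch below assumes the audio has already been split into fixed 20-ms frames, each marked as voiced or silent; the frame length and the function name are illustrative assumptions, and the module's internal endpoint detection is not described in this article.

```python
MAX_UTTERANCE_MS = 3200  # longest word or phrase the module accepts
MAX_PAUSE_MS = 500       # longest silence allowed inside the phrase


def utterance_ok(voiced, frame_ms=20):
    """Check a list of per-frame flags (True = speech, False = silence)."""
    if True not in voiced:
        return False  # nothing was said

    # Ignore silence before the first and after the last voiced frame.
    first = voiced.index(True)
    last = len(voiced) - 1 - voiced[::-1].index(True)
    core = voiced[first:last + 1]

    if len(core) * frame_ms > MAX_UTTERANCE_MS:
        return False  # phrase is too long

    longest_pause = run = 0
    for is_voiced in core:
        run = 0 if is_voiced else run + 1
        longest_pause = max(longest_pause, run)
    return longest_pause * frame_ms <= MAX_PAUSE_MS
```

With these numbers, "Circuit Cellar" spoken with a 200-ms gap between the words passes, and the same phrase with a 600-ms gap fails.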

Next, the module prompts you to repeat the word or phrase. Each time a word is entered, the module creates a template. The two templates for each target word are compared and, if similar enough, are averaged together and stored in memory. If the templates are too different, an error is generated, and the module asks you to repeat the word, starting again with the first template.

Before a new word template is stored, it is compared to the word templates already in memory. If the new template is too close to an existing template, the word is not accepted.
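The two acceptance checks described above can be sketched in a few lines of Python. Everything specific here is an assumption for illustration: templates are modeled as equal-length lists of numbers, the distance measure is a mean absolute difference, and both thresholds are made-up values. The module's actual template format and matching algorithm are not described in this article.

```python
PASS_THRESHOLD = 0.25       # hypothetical: max distance between the two passes
CONFUSION_THRESHOLD = 0.25  # hypothetical: min distance to any stored word


def distance(a, b):
    """Mean absolute difference between two equal-length templates."""
    if len(a) != len(b):
        raise ValueError("templates must have the same length")
    return sum(abs(x - y) for x, y in zip(a, b)) / len(a)


def enroll(first, second, stored):
    """Try to add a word from its two training passes.

    Returns (status, template). status is "repeat" if the passes differ
    too much, "too_similar" if the averaged template is too close to a
    stored word, or "accepted" if it was appended to stored.
    """
    if distance(first, second) > PASS_THRESHOLD:
        return "repeat", None

    # The passes agree, so average them into one template.
    candidate = [(x + y) / 2 for x, y in zip(first, second)]

    for existing in stored:
        if distance(candidate, existing) < CONFUSION_THRESHOLD:
            return "too_similar", None

    stored.append(candidate)
    return "accepted", candidate
```

Keeping the two thresholds separate reflects the two distinct questions being asked: whether you said the same thing twice, and whether the result could be confused with a word already trained.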

The VoiceDirect module applies automatic gain control to the audio amplifier to provide optimum signal strength. It also monitors the background noise and gives a warning if the noise is too high. A steady background noise (like a fan) has less impact on recognition than a fluctuating one (like a radio). As you would expect, the best recognition occurs in low-noise environments.

Once a word is accepted, the module continues training by asking for more words. Training can be interrupted or stopped at any time by not speaking into the microphone at the prompt or by pressing either the Train or Recognize button.

Training is resumed by pressing the Train button. The module automatically starts training new words at the end of the previously trained words. For instance, if you trained six words and then stopped, when you resume training, the module automatically begins training at word seven.

Individual words and phrases cannot be erased or overwritten. However, the entire set of words can be deleted by pressing the Train and Recognize buttons simultaneously for at least 100 ms.
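The storage behavior described in this section (words are only ever appended, training resumes at the next free position, and the only delete is a full erase) amounts to an append-only list. A minimal Python model follows; the class and method names are hypothetical and serve only to illustrate the behavior.

```python
class TrainedWords:
    """Append-only word store: no per-word erase or overwrite."""

    def __init__(self):
        self._templates = []

    @property
    def next_word_number(self):
        """1-based number of the word that training will record next."""
        return len(self._templates) + 1

    def append(self, template):
        """Store a newly accepted word and return its word number."""
        self._templates.append(template)
        return len(self._templates)

    def erase_all(self):
        """Model holding Train and Recognize together: delete every word."""
        self._templates.clear()
```

After six calls to append, next_word_number is 7, which matches the resume example above; after erase_all, it is 1 again.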