|
Paperback - $29.95 |
The Waveform Model has been published by
Universal Publishers. The book can be purchased directly at Universal Publishers following this link. The book is also available at the following: |
|
May 14, 2010
April 16, 2010
Feb. 9, 2010
Waveform Model This book presents a new model of vowel perception and production derived from visual cues identified in waveform displays. In addition to describing waveform displays of vowels beyond previous descriptions, included in the book are descriptions of experimental evidence supporting near 100% vowel identification accuracy across 20 male talkers using the concepts in the model. The book content will be of interest to several academic fields including Cognitive Science, Psychology, Linguistics, Speech and Hearing, Language Acquisition, Neurolinguistics, Phonetics, and areas within Physics and Mathematics. Beyond these academic fields, the new model of vowel perception presented here could possibly be used to improve accuracy and speed within existing speech recognition systems, or it could be used to generate a new speech recognition program. Many speech recognition programs are based on simple statistical programs like Hidden Markov Models that ignore any theoretical basis to speech recognition. The Waveform Model differs from the HMM approaches since it has a theoretical basis rooted in articulation and that has potentially more promise than these simple HMM models that just take overall similarities in waveforms and try to match them to phonemes and words. Furthermore, many of the speech recognition programs use extensive training by a single user (in quiet conditions) in order to attain over 90% accuracy, which is still a relatively poor performance. The Waveform Model requires no training, can be used across talkers, and has accuracy above reported speech recognition performance (specific to vowels). In summary, the Waveform Model is innovative, and new to the literature and research communities. |
|
| Technical Presentations and Publications | ||
|
Stokes, M.A. (2009). The waveform model of vowel perception and production. Boca Raton, FL: Universal-Publishers. Michael A. Stokes (2002), "Talker Identification from Analysis of Raw Complex Waveforms," presented at the 143rd meeting: Acoustical Society of America. Michael A. Stokes (2001), "Male and female vowels identified by visual inspection of raw complex waveforms.," presented at the 141st meeting: Acoustical Society of America.
Michael A. Stokes (1998), "MAS Model of vowel perception," posted
on the internet at: Michael A. Stokes (1996), "Identification of vowels based on visual cues within raw complex waveforms," presented at the 131st meeting: Acoustical Society of America. W. Van Summers, David B. Pisoni, and Michael A. Stokes (1989), "Effects of cognitive workload on speech production," presented at the 117th meeting: Acoustical Society of America. W. Van Summers, David B. Pisoni, Robert H. Bernacki, Robert I. Pedlow, and Michael A. Stokes (1988), "Effects of noise on speech production: Acoustic and Perceptual analysis," Journal of the Acoustical Society of America, 84, 917-928. |
I will be reposting my presentation of talker identification at the Acoustical Society of America in 2002 soon. I noticed several things were missing from my original post. There was 100% accuracy in two identification trials involving 2 vowels and 20 talkers.
Email: waveform.model@yahoo.com
It should be noted, the Waveform Model was formerly referred to as the MAS Model.