Speech and audio signal processing : processing and perception of speech and music /
Ben Gold, Nelson Morgan, Dan Ellis ; with contributions from Hervé Bourlard [and others].
- 2nd ed.
- 1 online resource (xxii, 661 pages) : illustrations
Includes bibliographical references and index.
Front Matter -- Introduction -- Historical Background -- Synthetic Audio: A Brief History -- Speech Analysis and Synthesis Overview -- Brief History of Automatic Speech Recognition -- Speech-Recognition Overview -- Mathematical Background -- Digital Signal Processing -- Digital Filters and Discrete Fourier Transform -- Pattern Classification -- Statistical Pattern Classification -- Acoustics -- Wave Basics -- Acoustic Tube Modeling of Speech Production -- Musical Instrument Acoustics -- Room Acoustics -- Auditory Perception -- Ear Physiology -- Psychoacoustics -- Models of Pitch Perception -- Speech Perception -- Human Speech Recognition -- Speech Features -- The Auditory System as a Filter Bank -- The Cepstrum as a Spectral Analyzer -- Linear Prediction -- Automatic Speech Recognition -- Feature Extraction for ASR -- Linguistic Categories for Speech Recognition -- Deterministic Sequence Recognition for ASR -- Statistical Sequence Recognition -- Statistical Model Training -- Discriminant Acoustic Probability Estimation -- Acoustic Model Training: Further Topics -- Speech Recognition and Understanding -- Synthesis and Coding -- Speech Synthesis -- Pitch Detection -- Vocoders -- Low-Rate Vocoders -- Medium-Rate and High-Rate Vocoders -- Perceptual Audio Coding -- Other Applications -- Some Aspects of Computer Music Synthesis -- Music Signal Analysis -- Music Retrieval -- Source Separation -- Speech Transformations -- Speaker Verification -- Speaker Diarization -- Index.