Browse
Publications
Preprints
About
About UCL Open: Env.
Aims and Scope
Editorial Board
Indexing
APCs
How to cite
Publishing policies
Editorial policy
Peer review policy
Equality, Diversity & Inclusion
About UCL Press
Contact us
For authors
Information for authors
How it works
Benefits of publishing with us
Submit
How to submit
Preparing your manuscript
Article types
Open Data
ORCID
APCs
Contributor agreement
For reviewers
Information for reviewers
Review process
How to peer review
Peer review policy
My ScienceOpen
Sign in
Register
Dashboard
Search
Browse
Publications
Preprints
About
About UCL Open: Env.
Aims and Scope
Editorial Board
Indexing
APCs
How to cite
Publishing policies
Editorial policy
Peer review policy
Equality, Diversity & Inclusion
About UCL Press
Contact us
For authors
Information for authors
How it works
Benefits of publishing with us
Submit
How to submit
Preparing your manuscript
Article types
Open Data
ORCID
APCs
Contributor agreement
For reviewers
Information for reviewers
Review process
How to peer review
Peer review policy
My ScienceOpen
Sign in
Register
Dashboard
Search
7
views
0
references
Top references
cited by
8
Cite as...
0 reviews
Review
0
comments
Comment
0
recommends
+1
Recommend
0
collections
Add to
0
shares
Share
Twitter
Sina Weibo
Facebook
Email
4,323
similar
All similar
Record
: found
Abstract
: not found
Book
: not found
Speech and Audio Signal Processing : Processing and Perception of Speech and Music
other
Author(s):
Ben Gold
,
Nelson Morgan
,
Dan Ellis
Publication date
(Print):
August 15 2011
Publisher:
John Wiley & Sons, Inc.
Read this book at
Publisher
Buy book
Review
Review book
Invite someone to review
Bookmark
Cite as...
There is no author summary for this book yet. Authors can add summaries to their books on ScienceOpen to make them more accessible to a non-specialist audience.
Related collections
International Polymer Processing
Author and book information
Book
ISBN (Electronic):
9781118142882
ISBN (Print):
9780470195369
Publication date (Print):
August 15 2011
DOI:
10.1002/9781118142882
SO-VID:
a5dfdc60-c833-4124-b34f-1edf45890e67
History
Data availability:
Comments
Comment on this book
Sign in to comment
Book chapters
pp. i
Front Matter
pp. 1
Introduction
pp. 7
Historical Background
pp. 9
Synthetic Audio: A Brief History
pp. 21
Speech Analysis and Synthesis Overview
pp. 40
Brief History of Automatic Speech Recognition
pp. 59
Speech-Recognition Overview
pp. 71
Mathematical Background
pp. 73
Digital Signal Processing
pp. 87
Digital Filters and Discrete Fourier Transform
pp. 105
Pattern Classification
pp. 124
Statistical Pattern Classification
pp. 139
Acoustics
pp. 141
Wave Basics
pp. 152
Acoustic Tube Modeling of Speech Production
pp. 158
Musical Instrument Acoustics
pp. 179
Room Acoustics
pp. 191
Auditory Perception
pp. 193
Ear Physiology
pp. 209
Psychoacoustics
pp. 218
Models of Pitch Perception
pp. 232
Speech Perception
pp. 250
Human Speech Recognition
pp. 261
Speech Features
pp. 263
The Auditory System as a Filter Bank
pp. 277
The Cepstrum as a Spectral Analyzer
pp. 286
Linear Prediction
pp. 299
Automatic Speech Recognition
pp. 301
Feature Extraction for ASR
pp. 319
Linguistic Categories for Speech Recognition
pp. 337
Deterministic Sequence Recognition for ASR
pp. 350
Statistical Sequence Recognition
pp. 364
Statistical Model Training
pp. 381
Discriminant Acoustic Probability Estimation
pp. 394
Acoustic Model Training: Further Topics
pp. 416
Speech Recognition and Understanding
pp. 429
Synthesis and Coding
pp. 431
Speech Synthesis
pp. 455
Pitch Detection
pp. 473
Vocoders
pp. 493
Low-Rate Vocoders
pp. 505
Medium-Rate and High-Rate Vocoders
pp. 531
Perceptual Audio Coding
pp. 551
Other Applications
pp. 553
Some Aspects of Computer Music Synthesis
pp. 567
Music Signal Analysis
pp. 581
Music Retrieval
pp. 595
Source Separation
pp. 617
Speech Transformations
pp. 633
Speaker Verification
pp. 644
Speaker Diarization
pp. 655
Index
Similar content
4,323
CUSCO: An Unobtrusive Custom Secure Audio-Visual Recording System for Ambient Assisted Living
Authors:
Pierre Albert
,
Fasih Haider
,
Saturnino Luz
Self-reports of HIV risk factors by patients at a sexually transmitted disease clinic: audio vs written questionnaires.
Authors:
B Boekeloo
,
L Schiavo
,
D Rabin
…
FastAST: Accelerating Audio Spectrogram Transformer via Token Merging and Cross-Model Knowledge Distillation
Authors:
Swarup Ranjan Behera
,
Abhishek Dhiman
,
Karthik Gowda
…
See all similar
Cited by
6
Weakly Supervised Action Labeling in Videos under Ordering Constraints
Authors:
Piotr Bojanowski
,
Rémi Lajugie
,
Francis Bach
…
Scene analysis in the natural environment
Authors:
Michael Lewicki
,
Bruno Olshausen
,
Annemarie Surlykke
…
Audiovisual Speech Source Separation: An overview of key methodologies
Authors:
Jonathon Chambers
,
Syed Mohsen Raza Naqvi
,
Wenwu Wang
…
See all cited by