This book is focused on audio signal processing and recognition. We expect to achieve the following goals:
The book is a practical guide for graduate students, researchers, as well as practitioner, with the following characteristics:
- Cover the basic principles of audio signal processing and recognition.
- Exemplify the use of MATLAB for implementing audio signal processing and recognition.
- Take real-world speech and audio signals as target applications.
The depth of this book is designed for first-year graduate students. However, it is also suitable for upper-division of undergraduates if the lecturer put more emphasis on coding and implementation. Some of the programming contests (such as endpoint detection, SU/V detection, melody recognition, speaker recognition, speech recognition) can be futher developed into a term project or Master/PhD research topic.
- Example-based tutorial: All the chapters include a number of examples, together with formal mathematical analysis and derivation.
- Emphasis on both theory and implementation: All algorithms covered in the text have accompanying MATLAB implementation, such that the uses can have hands-on experience to practice "learning by doing".
- Application oriented: Most of the examples take real-world speech or audio signals to verify the covered algorithms or methods. This enables the users to have a concrete idea of the gap between theory and implementation for real-world applications.
Speech and audio signal processing and recognition involves a fair amount of mathematics. We expect the readers to have taken the following prerequisites: Calculus, linear algebra, and probability.
This book was original written in Chinese. Therefore for some page, we have a link to the old Chinese version. However, it should be noted that the latest version is in English and there is no guarantee for the synchronization between English and Chinese versions.
If you want to cite this book, choose one of the following two formats:
- Jyh-Shing Roger Jang, "Audio Signal Processing and Recognition," available at the links for on-line courses at the author's homepage at http://www.cs.nthu.edu.tw/~jang.
- 張智星,"音訊處理與辨識",網路線上課程,可由作者之網頁 http://www.cs.nthu.edu.tw/~jang連結到此線上教材。
Audio Signal Processing and Recognition (音訊處理與辨識)