For QBSH, we will cover the following topics:
For AFP, we are going to cover the following topics:
In addition to the theoretical aspects, the course will be interleaved with MATLAB code for plotting, sound playback, and animation, to demonstrate the underlying fundamental steps of QBSH and AFP. (An example page of course material and code snippets can be found at http://mirlab.org/jang/books/audioSignalProcessing/ptTimeDomainAcf.asp?title=7-2%20ACF.)
After the course, the audience should have a comprehensive notation of how to create QBSH and AFP prototypes for their own applications. If time and space permit, we will also organize lab sessions (using MATLAB primarily) to let the audience have hands-on experiences of building working QBSH and AFP prototypes for their own applications.
We have been maintaining two online courses on the web, with open course material and code snippets:
We have been maintaining several MATLAB toolboxes that are used in the previous two online courses. These toolboxes are also used for our MIREX submissions. If we have lab sessions in the Summer School, these toolboxes will be used to create working prototypes of QBSH and AFP systems.
Jyh-Shing Roger Jang received the B.S. degree in Electrical Engineering from National Taiwan University in 1984, and the Ph.D. degree from the EECS Department at the University of California at Berkeley in 1992. He studied artificial networks and fuzzy logic with Prof. Lotfi Zadeh, the father of fuzzy logic. As of 2013, Google Scholar shows over 7600 citations for his seminal paper on ANFIS (adaptive neuro-fuzzy inference systems), published in 1993. After obtaining his Ph.D., he joined The MathWorks to coauthor the Fuzzy Logic Toolbox used with MATLAB. Since then, he has cultivated a keen interest in implementing industry-strength software for pattern recognition and computational intelligence. He was with the CS Dept. of National Tsing Hua Univ., Taiwan, from 1995 to 2012. Since August 2012, he has been with CSIE Dept. of National Taiwan Univ., Taiwan. He has published one English book on "Neuro-Fuzzy and Soft Computing" (1997, Prentice Hall), two Chinese books on MATLAB programming, and one Chinese book on JavaScript Programming. He has also been maintaining toolboxes for machine learning and speech/audio processing, and on-line tutorials on ¡§Data Clustering and Pattern Recognition¡¨ and ¡§Audio Signal Processing and Recognition¡¨. Moreover, he has been putting a significant amount of efforts to bridge the gap between academic research and industrial applications, and has achieved excellent records in technology transfer and licensing from the university to the industry. More info about Prof Jang can be found at his homepage at http://mirlab.org/jang.
Prof. Jang¡¦s research interests include machine learning and pattern recognition, with applications to music analysis/retrieval, speech recognition/assessment/synthesis, and image identification/retrieval. He is an active participant of ISMIR (International Society of Music Information Retrieval) and has achieved excellent rankings in several tasks of MIREX (Music Information Retrieval Evaluation eXchange) for years, including query by singing/humming, audio melody extraction, beat tracking, tempo estimation, audio classification, and score following. For speech part, most of his research is focused on the use of innovative machine learning techniques for computed aided pronunciation training of Mandarin, English, Japanese, and Taiwanese.
Prof. Jyh-Shing Roger Jang
Email: jang@mirlab.org
Mobile: +886-966-646464
Postal address:
Department of Computer Science and Information Engineering
National Taiwan University
No. 1, Sec. 4, Roosevelt Road, Taipei, 10617 Taiwan