5-5 Timbre (������)

[chinese][english]

Timbre is an acoustic feature that is defined conceptually. In general, timbre refers to the "content" of a frame of audio signals, which is ideally not affected much by pitch and intensity. Theoretically, for quasi-periodic audio signals, we can use the waveform within a fundamental period as the timbre of the frame. However, it is difficult to analysis the waveform within a fundamental period directly. Instead, we usually use the fast Fourier transform (or FFT) to transform the time-domain waveform into frequency-domain spectrum for further analysis. The amplitude spectrum in the frequency domain simply represent the intensity of the waveform at each frequency band.

¡u­µ¦â¡v¡]Timber¡^¬O¤@­Ó«Ü¼Ò½kªº¦Wµü¡Aªx«ü­µ°Tªº¤º®e¡A¨Ò¦p¡u¤Ñ®Ñ¡v³o¨â­Ó¦rªºµo­µ¡AÁöµM³£¬O²Ä¤@Án¡A¦]¦¹¥¦­Ìªº­µ°ªÀ³¸Ó¬OÆZ±µªñªº¡A¦ý¬O¥Ñ©ó­µ¦âªº¤£¦P¡A§Ú­Ì¥i¥H¤À¿ë³o¨â­Ó­µ¡Cª½Ä±¨Ó¬Ý¡A­µ¦âªº¤£¦P¡A¥Nªí°ò¥»¶g´Áªºªi§Î¤£¦P¡A¦]¦¹§Ú­Ì¥i¥H¨Ï¥Î°ò¥»¶g´Áªºªi§Î¨Ó¥Nªí­µ¦â¡C­Y­n±q°ò¥»¶g´Áªºªi§Î¨Óª½±µ¤ÀªR­µ¦â¡A¬O¤@¥ó«Ü§xÃøªº¨Æ¡C³q±`§Ú­Ìªº§@ªk¡A¬O±N¨C¤@­Ó­µ®Ø¶i¦æÀWÃФÀªR¡]Spectral Analysis¡^¡Aºâ¥X¤@­Ó­µ®Ø°T¸¹¦p¦ó¥i¥H©î¸Ñ¦¨¦b¤£¦PÀW²vªº¤À¶q¡AµM«á¤~¯à¶i¦æ¤ñ¹ï©Î¤ÀªR¡C¦bÀWÃФÀªR®É¡A³Ì±`¥Îªº¤èªk´N¬O¡u§Ö³t³Å¥ß¸­Âà´«¡v¡]Fast Fourier Transform¡^¡A²ºÙ FFT¡A³o¬O¤@­Ó¬Û·í¹ê¥Îªº¤èªk¡A¥i¥H±N¦b®É°ì¡]Time Domain¡^ªº°T¸¹Âà´«¦¨¦bÀW°ì¡]Frequency Domain¡^ªº°T¸¹¡A¨Ã¶i¦Óª¾¹D¨C­ÓÀW²vªº°T¸¹±j«×¡C

If you want to experience real-time FFT demo, type the following command within the MATLAB command window:

­Y­n¬Ý¬Ý FFT ªº¹ê»Ú®i¥Ü¡A¥i¥H¿é¤J¤U¦C«ü¥O¡G

The opened Simulink block system looks like this:

¶}±Òªº Simulink ¨t²Î¦p¤U¡G

When you start running the system and speak to the microphone, you will able to see the time-varying spectrum:

·í§A±Ò°Êµ{¦¡¨Ã¶}©l¹ï³Á§J­·»¡¸Ü®É¡A´N·|¥X²{¤U¦C°ÊºAªº¡uÀWÃйϡv¡]Spectrum¡^¡AÀH®É¶¡¦Ó§e²{«æÁتºÅܤơG

If we use different colors to represent the height of spectrum, we can obtain the spectrogram, as shown next:

­Y±NÀWÃйϡu¥ß¡v°_¨Ó¡A¨Ã¥Î¤£¦PªºÃC¦â¥NªíÀWÃйϪº°ª§C¡A´N¥i¥H±o¨ìÀWÃйï®É¶¡©Ò²£¥Íªº¼v¹³¡AºÙ¬° Spectrogram¡A¦p¤U¡G

Spectrogram represent the time-varying spectrum displayed in a image map. The same utterance will correspond to the same pattern of spectrogram. Some experienced persons can understand the contents of the speech by viewing the spectragram alone. This is call "spectrogram reading" and related contests and information can be found on the Internet. For instance:

Spectrogram ¥Nªí¤F­µ¦âÀH®É¶¡Åܤƪº¸ê®Æ¡A¦]¦¹¦³¨Ç¼F®`ªº¤H¡A¥i¥H¥Ñ Specgrogram ª½±µ¬Ý¥X»y­µªº¤º®e¡A³oºØ§Þ³NºÙ¬° Specgrogram Reading¡A¦³¿³½ìªº¦P¾Ç¡A¥i¥H¦b·j´M¤ÞÀº¤W§ä¨ì«Ü¦h¬ÛÃöªººô­¶¡A¤]¥i¥H¸Õ¸Õ¦Û¤vªº¥\¤O¡C


Audio Signal Processing and Recognition (­µ°T³B²z»P¿ëÃÑ)