Other Related Papers

Other Related Papers

QBSH

Title

Author(s)

Published in

Download

Comments

A Query-by-Singing System based on Dynamic Programming J. S. R. Jang
M. Y. Gao International Workshop on Intelligent Systems Resolutions (the 8th Bellman Continuum) [PDF]

Content-based Music Retrieval Using Linear Scaling and Branch-and-bound Tree Search J. S. R. Jang
H. R. Lee
M. Y. Kao ICME 2001 [PDF]

A Top-down Approach to Melody Match in Pitch Contour for Query by Humming X. Wu
M. Li
J. Liu
J. Yang
Y. Yan ISCSLP 2006 [PDF]

Query by humming of midi and audio using locality sensitive hashing M. Ryynanen
A. Klapuri ICASSP 2008 [PDF]

Alignment

Title

Author(s)

Published in

Download

Comments

A simple and efficient method to align very long speech signals to acoustically imperfect transcriptions G. Bordel
S. Nieto
M. Penagarikano
L. J. Rodriguez-Fuentes
A. Varona Interspeech 2012 [PDF]

Automatic Phoneme Segmentation Using Auditory Attention Features O. Kalinli Interspeech 2012 [PDF]

Automatic Speech Segmentation Using Probabilistic Latent Component S. Ghosh
T.V. Sreenivas Interspeech 2012 [PDF]

Sentence Detection Using Multiple Annotations A. Lee
J. Glass Interspeech 2012 [PDF] MIT的, 使用三大類的features來做

Toward an Optimum Feature Set and HMM Model Parameters for Automatic Phonetic Alignment of Spontaneous Speech M. Karnjanadecha
S. A. Zahorian Interspeech 2012 [PDF]

Word Prominence Detection using Robust yet Simple Prosodic Features T. Mishra
V. K. R. Sridhar
A. Conkie Interspeech 2012 [PDF]

Grapheme to phoneme

Title

Author(s)

Published in

Download

Comments

Joint-Sequence Models for Grapheme-to-Phoneme Conversion M. Bisani
H. Ney Speech Communication 2008 [PDF] Tool download/installation readme here

Adaptation (MAP and MLLR)

Title

Author(s)

Published in

Download

Comments

A Study on Speaker Adaptation of Continuous Density HMM Parameters Chin-Hui Lee
Chih-Heng Lin
Biing-Hwang Juang ICASSP 1990 [PDF]
[PPT] MAP 經典 paper

Speaker Adaptation of HMMs Using Linear Regression C.J. Leggetter
P.C. WoodLand CUED/F-INFENG/TR. 181, June 1994 [PDF]
[PPT-1]
[PPT-2]

Speaker Adaptation using Constrained Estimation of Gaussian Mixtures V. V. Digalakis
D. Rtischev
L. G. Neumeyer IEEE Transactions on Speech and Audio Processing, 3:357-366, 1995 [PDF]

Flexible Speaker Adaptation using Maximum Likelihood Linear Regression C.J. Leggetter
P.C. WoodLand Proc. ARPA Spoken Language Technology Workshop, p104~109, 1995 [PDF]
[PPT] 看完這篇就差不多知道 MLLR 在做啥, 也可以了解 speaker adaptation 在做啥, 但數學細節很多...

Structural MAP Speaker Adaptation Using Hierarchical Priors Koichi Shinoda
Chin-Hui Lee ASRU 1997 Proceedings [PDF]
[PPT]

Structural Maximum A Posteriori Linear Regression for Fast HMM Adaptation Olivier Siohan
Tor Andre Myrvoll
Chin-Hui Lee Computer Speech & Language 2002 [PDF]
[PPT]

MPE (Minimum Phone Error)

Title

Author(s)

Published in

Download

Comments

Minimum Phone Error and I-Smoothing for Improved Discriminative Training D. Povey
P. C. Woodland ICASSP 2002 [PDF]

Discriminative Training for Large Vocabulary Speech Recognition Daniel Povey Ph.D Dissertation, University of Cambridge, July 2004 [PDF] MPE 的原版博士論文, 很長一篇

最小化音素錯誤鑑別式聲學模型學習於中文大詞彙連續語音辨識之初步研究
An Initial Study on Minimum Phone Error Discriminative Learning of Acoustic Models for Mandarin Large Vocabulary Continuous Speech Recognition 郭人瑋 Master Thesis, NTNU, June 2005 [PDF]
也是一篇有關 MPE 的精釆論文, follow Dan Povey 那篇做下來的, 雖然是一篇碩士論文, 但請用看博士論文的心態去看這篇 @@

資料選取方法於鑑別式聲學模型訓練之研究
Training Data Selection for Discriminative Training of Acoustic Models 朱芳輝 Master Thesis, NTNU, February 2008 [PDF] follow 郭人瑋那篇做下來的

Other topics

Title

Author(s)

Published in

Download

Comments

Maximum Mutual Information Estimation of Hidden Markov Model Parameters for Speech Recognition L. R. Bahl
P. F. Brown
P. V. de Souza
R. L. Mercer ICASSP 1986 [PDF] MMI 的經典 paper, 但我沒看過, 不知道可以幹麻 =.=

Speaker-Independent Phone Recognition Using Hidden Markov Models K. F. Lee
H. W. Hon TASSP (IEEE Trans. on Acoustics, Speech, and Signal Processing) 1989 [PDF] 經典 paper, 用 HMM 來做語音辨識的始祖 paper, 必讀

A Tutorial on Hidden Markov Models and Selected Aplications in Speech Recognition L. R. Rabiner Proceedings of the IEEE, 1989 [PDF] 經典HMM tutorial, 必讀

A Tutorial on Support Vector Machines for Pattern Recognition C. J. C. Burges Data Mining and Knowledge Discovery, 1998 [PDF] 網路上找到的 SVM tutorial

Automatic Construction of Regression Class Tree for MLLR via Model-based Hierarchical Clustering Shih-Sian Cheng
Yeong-Yuh Xu
Hsin-Min Wang
Hsin-Chia Fu TASLP, 2006 - Springer [PDF] 利用 BIC 加上 top-down, bottom-up approach 來自動建立MLLR所用的二元樹

Linear Discriminant Feature Extraction for Speech Recognition 李鴻欣, 臺灣師範大學 NeGSST 2008 暑期講習會 [PPT]
[MP3] An excellent presentation on LDA (Linear Discriminant Analysis), some remarks and comparative studies, and variations of LDA.

A Brief Maximum Entropy Tutorial A. Berger Online Tutorial, 1996 [PPT]

back to homepage

last updated: 2012/01/04

Title	Author(s)	Published in	Download	Comments
A Query-by-Singing System based on Dynamic Programming	J. S. R. Jang M. Y. Gao	International Workshop on Intelligent Systems Resolutions (the 8th Bellman Continuum)	[PDF]
Content-based Music Retrieval Using Linear Scaling and Branch-and-bound Tree Search	J. S. R. Jang H. R. Lee M. Y. Kao	ICME 2001	[PDF]
A Top-down Approach to Melody Match in Pitch Contour for Query by Humming	X. Wu M. Li J. Liu J. Yang Y. Yan	ISCSLP 2006	[PDF]
Query by humming of midi and audio using locality sensitive hashing	M. Ryynanen A. Klapuri	ICASSP 2008	[PDF]