Session ThAD: Auditory Modelling and Psychoacoustics; Neural Networks for Speech Processing and Recognition

Chairperson: Phil D. Green, Univ. of Sheffield, UK



A PROBABILISTIC MODEL OF DOUBLE-VOWEL SEGREGATION

Authors: Laurent Varin and Frédéric Berthommier

Institut de la Communication Parlée/INPG Grenoble, FRANCE {varin,bertho}@icp.grenet.fr

Volume 5 pages 2791 - 2794

ABSTRACT

The decomposition principle was first proposed by Varga and Moore [] and applied to Automatic Speech Recognition (ASR) in noise. We show a new adaptation of this principle to model the schema-based streaming process which was inferred from psychoacoustical studies []. We address here the classical problem of double-vowel segregation. The signal decomposition is enabled by an internal, statistical model of vowel spectra. We apply this decomposition model, which is able to reconstruct the spectra of superimposed signals after identification of only the dominant member or of both members of the pair. Three stages are involved. The first is a module performing identification when the input is a mixture of interfering signals; prior identification of the dominant spectrum prevents combinatorial reconstruction. The second step is an evaluation of the mixture coefficient, also based on an internal representation of spectra. Finally, the reconstruction of the spectra is probabilistic, by way of likelihood maximisation, using the labels and the mixture coefficient. This is tested on a large database of synthetic vowels.
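The reconstruction stage described above can be sketched in miniature. The least-squares fit below is an illustrative stand-in for the paper's likelihood maximisation, and all names are hypothetical; note that prior identification of the dominant vowel would restrict the outer loop and avoid the full combinatorial search.

```python
import numpy as np

def segregate(mixture, prototypes, alphas=np.linspace(0.1, 0.9, 9)):
    """Find the vowel pair (i, j) and mixing coefficient alpha whose
    combination alpha*S_i + (1-alpha)*S_j best explains the observed
    mixture spectrum.  A least-squares toy version of the paper's
    probabilistic reconstruction."""
    best = (None, None, None, np.inf)
    n = len(prototypes)
    for i in range(n):            # with a dominant-vowel label, i is fixed
        for j in range(i + 1, n):
            for a in alphas:      # candidate mixture coefficients
                recon = a * prototypes[i] + (1 - a) * prototypes[j]
                err = float(np.sum((mixture - recon) ** 2))
                if err < best[3]:
                    best = (i, j, a, err)
    return best
```

A usage example: mix two prototype spectra at 0.7/0.3 and recover the pair and coefficient.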

A0113.pdf



STIMULUS SIGNAL ESTIMATION FROM AUDITORY-NEURAL TRANSDUCTION INVERSE PROCESSING

Authors: Habibzadeh V. Houshang , Kitazawa Shigeyoshi

Graduate School of Science and Engineering, Shizuoka University 3-5-1 Johoku, Hamamatsu 432, JAPAN E-mail: houshang@cs.inf.shizuoka.ac.jp

Volume 5 pages 2795 - 2798

ABSTRACT

Inverse processing techniques for auditory models would have very useful applications in speech perception and in the evaluation of auditory models. This paper examines how an Inner Hair Cell (IHC) model can be of benefit, as a compression and envelope detection section, in the inverse processing of a cochlear model. Our proposed inversion method combines the inverse of Meddis's auditory neural transduction model with Lyon's cochlear model to estimate the input signal to the inner ear from its auditory nerve firings, with acceptable quality. Since this method uses neural firings or cleft contents as input and regenerates the original acoustic stimulus, it is useful with any system generating auditory neural firings. For example, using this method, we are able to estimate the stimulus signal of the Nucleus Cochlear Implant systems to investigate the transferred speech quality without involving real patients.

A0139.pdf



FDVQ Based Keyword Spotter Which Incorporates A Semi-Supervised Learning for Primary Processing

Authors: Chakib Tadj (1), Pierre Dumouchel (2), Franck Poirier (3)

(1) Ecole de Technologie Superieure 1100 rue Notre Dame Ouest Montreal (Qc) - H3C 1K3 - Canada (2) Centre de Recherche Informatique de Montreal 1801, avenue McGill College, bureau 800 Montreal (Qc) - H3A 2N4 - Canada (3) Institut Universitaire Professionnalise 8, rue Montaigne BP 1104 Vannes - 56014 - France

Volume 5 pages 2799 - 2802

ABSTRACT

In this paper, we present a novel hybrid keyword spotting system that combines supervised and semi-supervised competitive learning algorithms. The first stage is an S-SOM (Semi-Supervised Self-Organizing Map) module which is specifically designed for discrimination between keywords (KWs) and non-keywords (NKWs). The second stage is an FDVQ (Fuzzy Dynamic Vector Quantization) module which discriminates between the KWs detected by the first stage. Experiments on the Switchboard database show an improvement of about 6% in the accuracy of the system compared to our best previous keyword spotter.

A0267.pdf



The initial time span of auditory processing used for speaker attribution of the speech signal

Authors: V. V. Lublinskaja, Ch. Sappok

Pavlov Institute of Physiology, Saint-Petersburg, Tel. +7 812 529 09 58, Fax: +7 812 218 05 01, E-mail: chi@physiology.spb.su; Institute of Slavonic Studies, Ruhr Universität Bochum, Tel. +49 234 700 6664, Fax: +49 234 7094 337, E-mail: sappokc@slf.ruhr-uni-bochum.de

Volume 5 pages 2803 - 2806

ABSTRACT

Research on the temporal organisation of speech perception is focussed mostly on the linguistic categories of the input. What is the role of non-grammatical categories in these processes? What kind of mechanisms integrate both kinds of features within the online process of perception? Individual voice qualities and the position of a sentence within the text were chosen to test the time interval within which decisions as to speaker belongingness are made. The results favour a model with a relatively fixed time span within which a familiar voice, or a deviation from an inherent context expectancy, is detected.

A0270.pdf



Sparse Connection and Pruning in Large Dynamic Artificial Neural Networks

Authors: Nikko Ström

Department of Speech, Music and Hearing KTH (Royal Institute of Technology), Stockholm, Sweden Tel. +46 8 790 75 63, FAX: +46 8 790 78 54, E-mail: nikko@speech.kth.se

Volume 5 pages 2807 - 2810

ABSTRACT

This paper presents new methods for training large neural networks for phoneme probability estimation. A combination of the time-delay architecture and the recurrent network architecture is used to capture the important dynamic information of the speech signal. Motivated by the fact that the number of connections in fully connected recurrent networks grows super-linearly with the number of hidden units, schemes for sparse connection and connection pruning are explored. It is found that sparsely connected networks outperform their fully connected counterparts with an equal or smaller number of connections. The networks are evaluated in a hybrid HMM/ANN system for phoneme recognition on the TIMIT database. The achieved phoneme error rate, 28.3%, for the standard 39-phoneme set on the core test set of the TIMIT database is not far from the lowest reported. All training and simulation software used is made freely available by the author, making reproduction of the results feasible.
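The pruning idea can be sketched generically: remove the fraction of connections with the smallest magnitude and keep a mask of survivors. This is a minimal illustration of magnitude-based pruning, not the paper's exact scheme; the function name and tie-breaking are assumptions.

```python
import numpy as np

def prune_smallest(weights, fraction):
    """Zero out the given fraction of connections with the smallest
    magnitude; return the pruned matrix and a boolean mask of the
    surviving connections."""
    flat = np.abs(weights).ravel()
    k = int(fraction * flat.size)
    if k == 0:
        return weights.copy(), np.ones_like(weights, dtype=bool)
    # k-th smallest magnitude becomes the pruning threshold
    threshold = np.partition(flat, k - 1)[k - 1]
    mask = np.abs(weights) > threshold
    return weights * mask, mask
```

In a training loop this would be applied periodically, with the mask re-applied after each gradient update so pruned connections stay zero.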

A0316.pdf



A MODULAR INITIALIZATION SCHEME FOR BETTER SPEECH RECOGNITION PERFORMANCE USING HYBRID SYSTEMS OF MLPs/HMMs

Authors: Roxana Teodorescu, Dirk Van Compernolle and Ioannis Dologlou

K. U. Leuven - E.S.A.T., Kardinaal Mercierlaan 94, B-3001 Heverlee, Belgium E-mail: Roxana.Teodorescu@esat.kuleuven.ac.be

Volume 5 pages 2811 - 2814

ABSTRACT

This paper proposes a novel modular initialization scheme for Multilayer Perceptrons (MLPs) trained for phoneme classification. Small MLPs are trained to discriminate between one phoneme and all the others. In the next step they are merged, using our novel initialization scheme, into broad classes and trained further. In the last step we merge the broad phonetic MLPs using the same scheme to generate the final phonetic MLP. Experiments on a Dutch-language isolated-word database showed that the scheme gives faster and better estimates of Bayesian a posteriori probabilities compared to random initialization. Moreover, given its modularity, the method offers the possibility of dealing with high-dimensional problems.

A0367.pdf



LATERALIZATION FOR AUDITORY PERCEPTION OF FOREIGN WORDS

Authors: Tatiana V.Chernigovskaya

I.M.Sechenov Institute of Evolutionary Physiology, Russian Academy of Sciences, 194223 St. Petersburg FAX:7 812 552 30 12, E-mail: chern@ief.spb.su

Volume 5 pages 2815 - 2818

ABSTRACT

This paper presents an experimental study of cerebral hemispheric engagement in the auditory recognition of words depending on a set of linguistic factors. Words were native and foreign to the subjects. Listeners were normal right-handed adults with symmetrical hearing, native speakers of Russian; English was acquired as a second language at school. The stimuli were linguistically balanced lists of natural Russian and English words presented monaurally, with white noise as contralateral masking. The data show a strong overall left-hemispheric advantage. The most significant factor for both hemispheres appeared to be 'frequency of usage' (as opposed to 'word length', which characterizes the perception of native words). The second most important factor was 'consonant ratio' for the RH and 'word length' for the LH. 'Part of speech' was shown to be of minimal importance for both hemispheres, with 'stress position' slightly more significant.

A0463.pdf



THE STRUCTURAL WEIGHTED SETS METHOD FOR CONTINUOUS SPEECH AND TEXT RECOGNITION

Authors: Yuri Kosarev, Pavel Jarov, Alexander Osipov

Russian Academy of Sciences Institute for Informatics and Automation St. Petersburg E-mail: kosarev@mail.jias.spb.su

Volume 5 pages 2819 - 2822

ABSTRACT

In known approaches to speech recognition based on Dynamic Programming (DP) or Hidden Markov Modelling (HMM), time sequences of elements (feature vectors, sounds, letters, etc.) are used directly as the objects of evaluation or matching. Both approaches share the same shortcoming: they can be realised only as a recurrent sequential process and cannot be parallelised. In addition, their complexity is relatively high. In the Structural Weighted Sets (SWS) method proposed below, such a sequence is first mapped onto a structure, namely a set of relations between its elements, and recognition is then reduced to matching the corresponding sets. Word matching can thus be realised by finding the intersection of two sets and evaluating its relative weight, which makes parallel processing possible. Simulation results are presented.
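The set-based matching idea can be illustrated with a toy relation: map each sequence to its set of ordered-precedence pairs, then score two words by the relative weight of the intersection. The paper's actual structures and weighting may differ; this sketch just shows how sequence matching reduces to set intersection.

```python
def relation_set(seq):
    """Map a sequence to the set of ordered-precedence relations
    between its elements (a simple stand-in for the SWS structure)."""
    return {(a, b) for i, a in enumerate(seq) for b in seq[i + 1:]}

def sws_score(s1, s2):
    """Relative weight of the intersection of the two relation sets.
    Set intersection has no sequential dependency, so the comparisons
    could be carried out in parallel."""
    r1, r2 = relation_set(s1), relation_set(s2)
    if not r1 or not r2:
        return 0.0
    return len(r1 & r2) / max(len(r1), len(r2))
```

For example, "cat" matched against itself scores 1.0, while the transposition "cta" keeps only the relations ('c','a') and ('c','t') and scores 2/3.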

A0473.pdf



Lateral Inhibitory Networks for Auditory Processing

Authors: C.J. Sumner and D.F. Gillies

Department of Computing, Imperial College of Science Technology and Medicine, 180 Queens Gate, London SW7 2BZ, United Kingdom Tel: +44 171 589 5111 ext 58378, E-mail: cjs2@doc.ic.ac.uk

Volume 5 pages 2823 - 2826

ABSTRACT

A neural-network model is described that produces a rate-place representation from auditory nerve output that is of considerably higher frequency resolution than that from a standard auditory peripheral model. The neural circuits used are called Lateral Inhibitory Networks. They have long been known to be responsible for early spatio-temporal processing in the visual system. Here we investigate the use of such networks for early auditory processing. We describe the analytical basis, problems with various variants of the model, and show some initial results yielded by the research.
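A minimal feed-forward sketch of lateral inhibition over a bank of auditory channels: each channel's output is its own input minus a fraction of its neighbours' inputs, which sharpens peaks in the rate-place profile. The paper also considers other variants; the function name and coefficient here are illustrative.

```python
import numpy as np

def lateral_inhibition(rates, inhibition=0.4):
    """One feed-forward lateral-inhibitory stage over a vector of
    channel firing rates.  Edge channels are handled by replicating
    the border values."""
    padded = np.pad(rates, 1, mode="edge")
    # subtract a weighted sum of the two immediate neighbours
    return rates - inhibition * (padded[:-2] + padded[2:])
```

Applied to a broad rate profile such as [0, 1, 2, 1, 0], the peak-to-flank contrast increases, which is the frequency-resolution enhancement the abstract refers to.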

A0641.pdf



MISSING FUNDAMENTALS: A PROBLEM OF AUDITORY OR MENTAL PROCESSING?

Authors: Henning Reetz

Allgemeine Sprachwissenschaft, University of Konstanz, D-78464 Konstanz, Germany Phone: +49 7531 882928, FAX: +49 7531 883095, E-mail: henning.reetz@uni-konstanz.de

Volume 5 pages 2827 - 2830

ABSTRACT

Subjects were presented with signal pairs forming different musical intervals. Signals were sine tones, complex tones with a fundamental, and complex tones without a fundamental. Subjects had to decide which signal pairs form a specific musical interval. Reaction times indicate that the perception of the 'missing fundamental' is a kind of musical processing and not necessarily a part of normal auditory processing in pitch perception.

A0647.pdf



PREDICTIVE NEURAL NETWORKS APPLIED TO PHONEME RECOGNITION

Authors: F. Freitag, E. Monte, J. Salavedra

Polytechnic University of Catalunya Department of Signal Theory and Communications C/Gran Capità, s/n, E - 08034 Barcelona E-mail: felix@gps.tsc.upc.es Fax: 34-3-4016447 Phone: 34-3-4016435

Volume 5 pages 2831 - 2834

ABSTRACT

In this paper a phoneme recognition system based on predictive neural networks is proposed. Neural networks are used to predict the observation vectors of speech frames. The resulting prediction error is used for phoneme recognition 1) as a distortion measure at the frame level and 2) as a feature, which is statistically modeled by the Rayleigh distribution. Continuous-speech phoneme recognition experiments are performed, and different settings of the system are evaluated.
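The use of prediction error as a distortion measure can be sketched with a linear predictor standing in for the neural network: one predictor is trained per phoneme class, and a frame sequence is scored by how well each class's predictor anticipates the next frame. Class and method names are hypothetical.

```python
import numpy as np

class PhonemePredictor:
    """Per-phoneme linear predictor of the next observation vector,
    a linear stand-in for the paper's predictive neural networks."""

    def __init__(self):
        self.W = None

    def fit(self, frames):
        # learn a map from each frame to its successor
        X, Y = frames[:-1], frames[1:]
        self.W, *_ = np.linalg.lstsq(X, Y, rcond=None)

    def distortion(self, frames):
        """Mean squared prediction error over the sequence, used as
        the frame-level distortion measure for recognition."""
        pred = frames[:-1] @ self.W
        return float(np.mean((frames[1:] - pred) ** 2))
```

Recognition then amounts to picking the class whose predictor yields the smallest distortion on the observed frames.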

A0777.pdf



EMPIRICAL COMPARISON OF TWO MULTILAYER PERCEPTRON-BASED KEYWORD SPEECH RECOGNITION ALGORITHMS

Authors: Suhardi, Klaus Fellbaum

Institute for Telecommunication and Theoretical Electrical Engineering Technical University of Berlin, Germany suhardi@ft.ee.tu-berlin.de Communication Engineering Brandenburg Technical University of Cottbus, Germany fellbaum@kt.tu-cottbus.de

Volume 5 pages 2835 - 2838

ABSTRACT

In this paper, an empirical comparison of two multilayer perceptron (MLP)-based techniques for keyword speech recognition (wordspotting) is described. The techniques are predictive neural model (PNM)-based wordspotting, in which the MLP is applied as a speech pattern predictor to compute a local distance between the acoustic vector and the phone model, and hybrid HMM/MLP-based wordspotting, where the MLP is used as a state (phone) probability estimator given acoustic vectors. The comparison was performed on the same database. According to our experiments, the hybrid HMM/MLP-based technique outperforms the PNM-based technique (by ~6.2%).

A0794.pdf



SEGMENT BOUNDARY ESTIMATION USING RECURRENT NEURAL NETWORKS

Authors: Toshiaki Fukada, Sophie Aveline, Mike Schuster, Yoshinori Sagisaka

ATR Interpreting Telecommunications Research Laboratories 2-2 Hikaridai, Seika-cho, Soraku-gun, Kyoto 619-02 Japan Tel: +81 774 95 1301, FAX: +81 774 95 1308, E-mail: fukada@itl.atr.co.jp

Volume 5 pages 2839 - 2842

ABSTRACT

This paper describes a segment (e.g. phoneme) boundary estimation method based on recurrent neural networks (RNNs). The proposed method only requires acoustic observations to accurately estimate segment boundaries. Experimental results show that the proposed method can estimate segment boundaries significantly better than an HMM-based method. Furthermore, we incorporate the RNN-based segment boundary estimator into HMM-based and segment-based recognition systems. As a result, the segment boundary estimates give useful information for reducing computational complexity and improving recognition performance.

A0797.pdf



INCORPORATION OF HMM OUTPUT CONSTRAINTS IN HYBRID NN/HMM SYSTEMS DURING TRAINING

Authors: Mike Schuster

ATR, Interpreting Telecommunications Research Lab. 2-2 Hikari-dai, Seika-cho, Soraku-gun, Kyoto 619-02, JAPAN gustl@itl.atr.co.jp http://www.itl.atr.co.jp/

Volume 5 pages 2843 - 2846

ABSTRACT

This paper describes a method to incorporate the HMM output constraints in frame-based hybrid NN/HMM systems during training. While usually the NN parameters are adjusted to maximize the cross-entropy between the frame target probabilities and the network predictions, assuming statistically independent outputs in time, the approach described here maximizes the full likelihood of the utterance(s), also using the HMM output constraints, as in conventional HMM systems. This is achieved by maximizing the state occupation probabilities after a forward/backward pass using the scaled likelihoods coming from the network. Making a simplifying approximation for the derivative in the back-propagation through the forward/backward pass, tests show that the proposed method gives consistently higher string (phoneme) recognition rates than the conventional approach, which maximizes cross-entropy at the frame level.
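The forward/backward computation of state occupation probabilities from per-frame network likelihoods can be sketched as follows. This is a generic (unscaled) forward-backward pass over a small HMM, not the authors' training code; for long utterances per-frame rescaling would be needed to avoid underflow.

```python
import numpy as np

def state_occupation(likelihoods, trans, init):
    """Forward-backward pass: given per-frame state likelihoods
    (e.g. scaled network outputs), a transition matrix and an initial
    state distribution, return the state occupation probabilities
    gamma[t, s]."""
    T, S = likelihoods.shape
    alpha = np.zeros((T, S))
    beta = np.ones((T, S))
    alpha[0] = init * likelihoods[0]
    for t in range(1, T):                      # forward recursion
        alpha[t] = (alpha[t - 1] @ trans) * likelihoods[t]
    for t in range(T - 2, -1, -1):             # backward recursion
        beta[t] = trans @ (beta[t + 1] * likelihoods[t + 1])
    gamma = alpha * beta
    return gamma / gamma.sum(axis=1, keepdims=True)
```

Training would then adjust the network so as to increase the occupation probability of each frame's correct state, instead of fitting independent frame targets.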

A0802.pdf



PRINCIPLES OF THE HEARING PERIPHERY FUNCTIONING IN NEW METHODS OF PITCH DETECTION AND SPEECH ENHANCEMENT

Authors: Ludmila Babkina*, Sergey Koval**, Alexander Molchanov*

* Research Institute of Ear, Nose, Throat and Speech Disorders, St. Petersburg, Russia ** Speech Technology Center, St. Petersburg, Russia Tel./fax: +7(812)3279297, E-mail: master@stc.rus.net

Volume 5 pages 2847 - 2850

ABSTRACT

Our research shows that one of the mechanisms of the human auditory system ensuring highly noise-resistant recognition of voiced speech sounds is an electromechanical envelope feedback operating in the structures of the human inner ear. Digital modelling of the peripheral section of the hearing system with a similar multichannel envelope feedback has proved useful for pitch determination of vowels in noisy environments. The proposed model provides robust pitch detection for signal-to-noise ratios down to -12 to -14 dB. In a number of cases this noise robustness is better than that of other existing methods and systems.

A0835.pdf



THE LOCUS OF THE SYLLABLE EFFECT: PRELEXICAL OR LEXICAL?

Authors: Christine Meunier (1), Alain Content (1), (2), Uli H. Frauenfelder (1), Ruth Kearns (3)

(1) Laboratory of Experimental Psycholinguistics, University of Geneva, Switzerland email: meunier@fapse.unige.ch, frauenfe@uni2a.unige.ch (2) Laboratoire de Psychologie Expérimentale, Université Libre de Bruxelles, acontent@ulb.ac.be (3) Medical Research Council, Applied Psychology Unit, Cambridge, ruth.kearns@mrc-apu.cam.ac.uk

Volume 5 pages 2851 - 2854

ABSTRACT

The claim that the syllable constitutes a basic perceptual unit in French is commonly accepted. It is based in part on the syllable effect [1] obtained with words. The present study extends these syllable detection experiments to pseudowords. Four experiments failed to replicate the syllable effect observed on words. Detection responses in pseudowords are made as soon as sufficient information becomes available in the signal. The different pattern of results obtained with words and pseudowords suggests that the syllable effect is post-lexical rather than pre-lexical.

A0879.pdf



ON NOT REMEMBERING DISFLUENCIES

Authors: E. G. Bard and R. J. Lickley

Human Communication Research Centre and Department of Linguistics University of Edinburgh, Edinburgh EH8 9LL, UK Tel. +44 131 650 3951, E-mail: ellen@ling.ed.ac.uk

Volume 5 pages 2855 - 2858

ABSTRACT

Disfluencies - repetitions and reformulations mid-sentence in normal spontaneous speech - are problematic for both psychological and computational models of speech understanding. Much effort is being applied to finding ways of adapting computational systems to detect and delete disfluencies. The input to such systems is usually an accurate transcription. We present results of an experiment in which human listeners are asked to give verbatim transcriptions of disfluent and fluent utterances. These suggest that listeners are seldom able to identify all the words "deleted" in disfluencies. While all types suffer, identification rates for repetitions are even worse than for other types. We attribute the results to difficulties in recalling, or coding for recall, items which cannot be identified with certainty. This inability seems to make human speech recognition more robust than current computational models.

A0981.pdf



Using an Auditory Model and Leaky Autocorrelators to Tune In to Speech

Authors: T. Andringa

Department of Biophysics, University of Groningen, Postbus 72, 9700 AB Groningen, The Netherlands, E-mail: tjeerd@bcn.rug.nl

Volume 5 pages 2859 - 2862

ABSTRACT

This paper introduces a method to estimate the spectrum of voiced speech in noise, based on an estimate of the fundamental frequency. The method uses the output of an auditory model that imitates the mechanics of the basilar membrane. The output of the segments of the model is used as input to a set of leaky autocorrelator units (simple neuron models), each sensitive to a certain periodicity (delay). If a noisy vowel is presented to the system, the units sensitive to the fundamental period of that vowel respond most actively. The activity of the responding autocorrelator units as a function of segment number is a direct measure of the spectrum of the vowel. This technique is very robust and can, like humans, detect the presence of a vowel at an SNR of -10 dB in aperiodic speech noise, and estimate formant frequencies at -3 to -6 dB. With this technique it is possible to split a mixture of sound sources into auditory entities (percepts) on the basis of pitch.
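A toy version of the leaky autocorrelator bank, with illustrative parameter values: each unit leakily integrates the product of the signal with a delayed copy of itself, so the unit whose delay matches the fundamental period accumulates the most activity.

```python
import numpy as np

def leaky_autocorrelators(signal, delays, leak=0.99):
    """Run a bank of leaky autocorrelator units, one per candidate
    delay, over a single channel; return the final activity of each
    unit.  The update is act = leak * act + x(t) * x(t - delay)."""
    act = np.zeros(len(delays))
    for t in range(max(delays), len(signal)):
        for k, d in enumerate(delays):
            act[k] = leak * act[k] + signal[t] * signal[t - d]
    return act
```

For a sine tone with a period of 20 samples, the unit tuned to delay 20 integrates a non-negative product and dominates; in the full model this is computed per basilar-membrane segment, so the winning units' activity across segments traces out the vowel spectrum.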

A0987.pdf
