Prosody and Emotion 3

Home
Full List of Titles
1: ICSLP'98 Proceedings
Keynote Speeches
Text-To-Speech Synthesis 1
Spoken Language Models and Dialog 1
Prosody and Emotion 1
Hidden Markov Model Techniques 1
Speaker and Language Recognition 1
Multimodal Spoken Language Processing 1
Isolated Word Recognition
Robust Speech Processing in Adverse Environments 1
Spoken Language Models and Dialog 2
Articulatory Modelling 1
Talking to Infants, Pets and Lovers
Robust Speech Processing in Adverse Environments 2
Spoken Language Models and Dialog 3
Speech Coding 1
Articulatory Modelling 2
Prosody and Emotion 2
Neural Networks, Fuzzy and Evolutionary Methods 1
Utterance Verification and Word Spotting 1 / Speaker Adaptation 1
Text-To-Speech Synthesis 2
Spoken Language Models and Dialog 4
Human Speech Perception 1
Robust Speech Processing in Adverse Environments 3
Speech and Hearing Disorders 1
Prosody and Emotion 3
Spoken Language Understanding Systems 1
Signal Processing and Speech Analysis 1
Spoken Language Generation and Translation 1
Spoken Language Models and Dialog 5
Segmentation, Labelling and Speech Corpora 1
Multimodal Spoken Language Processing 2
Prosody and Emotion 4
Neural Networks, Fuzzy and Evolutionary Methods 2
Large Vocabulary Continuous Speech Recognition 1
Speaker and Language Recognition 2
Signal Processing and Speech Analysis 2
Prosody and Emotion 5
Robust Speech Processing in Adverse Environments 4
Segmentation, Labelling and Speech Corpora 2
Speech Technology Applications and Human-Machine Interface 1
Large Vocabulary Continuous Speech Recognition 2
Text-To-Speech Synthesis 3
Language Acquisition 1
Acoustic Phonetics 1
Speaker Adaptation 2
Speech Coding 2
Hidden Markov Model Techniques 2
Multilingual Perception and Recognition 1
Large Vocabulary Continuous Speech Recognition 3
Articulatory Modelling 3
Language Acquisition 2
Speaker and Language Recognition 3
Text-To-Speech Synthesis 4
Spoken Language Understanding Systems 4
Human Speech Perception 2
Large Vocabulary Continuous Speech Recognition 4
Spoken Language Understanding Systems 2
Signal Processing and Speech Analysis 3
Human Speech Perception 3
Speaker Adaptation 3
Spoken Language Understanding Systems 3
Multimodal Spoken Language Processing 3
Acoustic Phonetics 2
Large Vocabulary Continuous Speech Recognition 5
Speech Coding 3
Language Acquisition 3 / Multilingual Perception and Recognition 2
Segmentation, Labelling and Speech Corpora 3
Text-To-Speech Synthesis 5
Spoken Language Generation and Translation 2
Human Speech Perception 4
Robust Speech Processing in Adverse Environments 5
Text-To-Speech Synthesis 6
Speech Technology Applications and Human-Machine Interface 2
Prosody and Emotion 6
Hidden Markov Model Techniques 3
Speech and Hearing Disorders 2 / Speech Processing for the Speech and Hearing Impaired 1
Human Speech Production
Segmentation, Labelling and Speech Corpora 4
Speaker and Language Recognition 4
Speech Technology Applications and Human-Machine Interface 3
Utterance Verification and Word Spotting 2
Large Vocabulary Continuous Speech Recognition 6
Neural Networks, Fuzzy and Evolutionary Methods 3
Speech Processing for the Speech-Impaired and Hearing-Impaired 2
Prosody and Emotion 7
2: SST Student Day
SST Student Day - Poster Session 1
SST Student Day - Poster Session 2

Author Index
A B C D E F G H I
J K L M N O P Q R
S T U V W X Y Z

Multimedia Files

What Spreads, And How? Tonal Rightward Spreading on Shanghai Disyllabic Compounds

Authors:

Xiaonong Sean Zhu, ANU (Australia)

Page (NA) Paper number 422

Abstract:

The present paper examines what kinds of Shanghai disyllabic lexical tone sandhi undergoes, especially in what sense and to what extent a disyllabic tone can be claimed to result from rightward spreading of the corresponding citation tone. It will be shown that F0 spreading occurs in the Long tone domains while Contour element spreading mainly in the Short tone domains.

SL980422.PDF (From Author) SL980422.PDF (Rasterized)

TOP


Tonal Complexity as a Dialectal Feature: 25 Different Citation Tones from Four Zhejiang Wu Dialects

Authors:

Sean Zhu, Department of Linguistics (Arts), Australian National University (Australia)
Phil Rose, Department of Linguistics (Arts), Australian National University (Australia)

Page (NA) Paper number 299

Abstract:

The Wu dialects of East-Central China are notorious for their tone sandhi, which is said to be the most complex in the world. This paper demonstrates that tonological complexity in Wu is not confined to tone sandhi, but is manifested also in citation tones. These show a both a large (7 - 8) number of contrasts and a very high percentage of contour and complex tones. Acoustic and auditory data are presented from an ongoing large-scale investigation into the tones and tone sandhi of the Wu dialects of Zhejiang province in East Central China. The citation tones from 4 sites (3 hitherto undescribed) in the little known Central Zhejiang area are described: Pujiang, Tonglu, Shengxian and Tiantai. Mean F0 and duration data are presented for the tones of these dialects. The data demonstrate a high degree of complexity, having no less than 25 Linguistic-tonetically different tones, including 3 different falling tones, and 4 different falling-level tones. The nature of the complexity of these forms is analysed.

SL980299.PDF (From Author) SL980299.PDF (Scanned)

TOP


Emotional Speech Synthesis: From Speech Database to TTS

Authors:

Juan Manuel Montero, Grupo de Tecnología del Habla-Departamento de Ingeniería Electrónica-E.T.S.I. Telecomunicación-Universidad Politécnica de Madrid (Spain)
Juana M. Gutiérrez-Arriola, Grupo de Tecnología del Habla-Departamento de Ingeniería Electrónica-E.T.S.I. Telecomunicación-Universidad Politécnica de Madrid (Spain)
Sira Palazuelos, Laboratorio de Tecnologías de Rehabilitación-Departamento de Ingeniería Electrónica-E.T.S.I. Telecomunicación-Universidad Politécnica de Madrid (Spain)
Emilia Enríquez, Grupo de Tecnología del Habla-Departamento de Lengua Española-Universidad Nacional de Educación a Distancia (Spain)
Santiago Aguilera, Laboratorio de Tecnologías de Rehabilitación-Departamento de Ingeniería Electrónica-E.T.S.I. Telecomunicación-Universidad Politécnica de Madrid (Spain)
José Manuel Pardo, Grupo de Tecnología del Habla-Departamento de Ingeniería Electrónica-E.T.S.I. Telecomunicación-Universidad Politécnica de Madrid (Spain)

Page (NA) Paper number 1037

Abstract:

Modern Speech synthesisers have achieved a high degree of intelligibility, but can not be regarded as natural-sounding devices. In order to decrease the monotony of synthetic speech, the implementation of emotional effects is now being progressively considered. This paper presents a through study of emotional speech in Spanish, and its application to TTS, presenting a prototype system that simulates emotional speech using a commercial synthesiser. The design and recording of a Spanish database will be described and also the analysis of the emotional prosody (by fitting the data to a formal model). Using this collected data, a rule-based simulation of three primary emotions was implemented in the Text-to-Speech system. Finally, the assessment of the synthetic voice through perception experiments will classify the system as capable of producing quality voice with recognisable emotional effects.

SL981037.PDF (From Author) SL981037.PDF (Rasterized)

TOP


Some Acoustic Characteristics Of Emotion

Authors:

Cécile Pereira, Speech, Hearing and Language Research Centre, Macquarie University (Australia)
Catherine Watson, Speech, Hearing and Language Research Centre, Macquarie University (Australia)

Page (NA) Paper number 684

Abstract:

This study presents an acoustic analysis of emotion. The material consisted of two semantically neutral utterances spoken by two actors, one male, one female, portraying three moods: anger, happiness and sadness; and a neutral tone. The duration, fundamental frequency (F0) and an estimate of the sound intensity (RMS) were analysed. The fundamental frequency parameter was the most revealing, showing differences between anger and happiness according to the shape of the contour, and between "cold" anger and "hot" anger on F0 mean. In addition, the study replicates previous findings showing hot anger and happiness having an F0 large range and high mean in contrast to the more subdued emotion of sadness, and the neutral voice.

SL980684.PDF (From Author) SL980684.PDF (Rasterized)

TOP