Authors:
Xiaonong Sean Zhu, ANU (Australia)
Page (NA) Paper number 422
Abstract:
The present paper examines what kinds of Shanghai disyllabic lexical
tone sandhi undergoes, especially in what sense and to what extent
a disyllabic tone can be claimed to result from rightward spreading
of the corresponding citation tone. It will be shown that F0 spreading
occurs in the Long tone domains while Contour element spreading mainly
in the Short tone domains.
Authors:
Sean Zhu, Department of Linguistics (Arts), Australian National University (Australia)
Phil Rose, Department of Linguistics (Arts), Australian National University (Australia)
Page (NA) Paper number 299
Abstract:
The Wu dialects of East-Central China are notorious for their tone
sandhi, which is said to be the most complex in the world. This paper
demonstrates that tonological complexity in Wu is not confined to tone
sandhi, but is manifested also in citation tones. These show a both
a large (7 - 8) number of contrasts and a very high percentage of contour
and complex tones. Acoustic and auditory data are presented from an
ongoing large-scale investigation into the tones and tone sandhi of
the Wu dialects of Zhejiang province in East Central China. The citation
tones from 4 sites (3 hitherto undescribed) in the little known Central
Zhejiang area are described: Pujiang, Tonglu, Shengxian and Tiantai.
Mean F0 and duration data are presented for the tones of these dialects.
The data demonstrate a high degree of complexity, having no less than
25 Linguistic-tonetically different tones, including 3 different falling
tones, and 4 different falling-level tones. The nature of the complexity
of these forms is analysed.
Authors:
Juan Manuel Montero, Grupo de Tecnología del Habla-Departamento de Ingeniería Electrónica-E.T.S.I. Telecomunicación-Universidad Politécnica de Madrid (Spain)
Juana M. Gutiérrez-Arriola, Grupo de Tecnología del Habla-Departamento de Ingeniería Electrónica-E.T.S.I. Telecomunicación-Universidad Politécnica de Madrid (Spain)
Sira Palazuelos, Laboratorio de Tecnologías de Rehabilitación-Departamento de Ingeniería Electrónica-E.T.S.I. Telecomunicación-Universidad Politécnica de Madrid (Spain)
Emilia Enríquez, Grupo de Tecnología del Habla-Departamento de Lengua Española-Universidad Nacional de Educación a Distancia (Spain)
Santiago Aguilera, Laboratorio de Tecnologías de Rehabilitación-Departamento de Ingeniería Electrónica-E.T.S.I. Telecomunicación-Universidad Politécnica de Madrid (Spain)
José Manuel Pardo, Grupo de Tecnología del Habla-Departamento de Ingeniería Electrónica-E.T.S.I. Telecomunicación-Universidad Politécnica de Madrid (Spain)
Page (NA) Paper number 1037
Abstract:
Modern Speech synthesisers have achieved a high degree of intelligibility,
but can not be regarded as natural-sounding devices. In order to decrease
the monotony of synthetic speech, the implementation of emotional effects
is now being progressively considered. This paper presents a through
study of emotional speech in Spanish, and its application to TTS, presenting
a prototype system that simulates emotional speech using a commercial
synthesiser. The design and recording of a Spanish database will be
described and also the analysis of the emotional prosody (by fitting
the data to a formal model). Using this collected data, a rule-based
simulation of three primary emotions was implemented in the Text-to-Speech
system. Finally, the assessment of the synthetic voice through perception
experiments will classify the system as capable of producing quality
voice with recognisable emotional effects.
Authors:
Cécile Pereira, Speech, Hearing and Language Research Centre, Macquarie University (Australia)
Catherine Watson, Speech, Hearing and Language Research Centre, Macquarie University (Australia)
Page (NA) Paper number 684
Abstract:
This study presents an acoustic analysis of emotion. The material consisted
of two semantically neutral utterances spoken by two actors, one male,
one female, portraying three moods: anger, happiness and sadness; and
a neutral tone. The duration, fundamental frequency (F0) and an estimate
of the sound intensity (RMS) were analysed. The fundamental frequency
parameter was the most revealing, showing differences between anger
and happiness according to the shape of the contour, and between "cold"
anger and "hot" anger on F0 mean. In addition, the study replicates
previous findings showing hot anger and happiness having an F0 large
range and high mean in contrast to the more subdued emotion of sadness,
and the neutral voice.
|