Multimedia Files

Home
Full List of Titles
1: ICSLP'98 Proceedings
Keynote Speeches
Text-To-Speech Synthesis 1
Spoken Language Models and Dialog 1
Prosody and Emotion 1
Hidden Markov Model Techniques 1
Speaker and Language Recognition 1
Multimodal Spoken Language Processing 1
Isolated Word Recognition
Robust Speech Processing in Adverse Environments 1
Spoken Language Models and Dialog 2
Articulatory Modelling 1
Talking to Infants, Pets and Lovers
Robust Speech Processing in Adverse Environments 2
Spoken Language Models and Dialog 3
Speech Coding 1
Articulatory Modelling 2
Prosody and Emotion 2
Neural Networks, Fuzzy and Evolutionary Methods 1
Utterance Verification and Word Spotting 1 / Speaker Adaptation 1
Text-To-Speech Synthesis 2
Spoken Language Models and Dialog 4
Human Speech Perception 1
Robust Speech Processing in Adverse Environments 3
Speech and Hearing Disorders 1
Prosody and Emotion 3
Spoken Language Understanding Systems 1
Signal Processing and Speech Analysis 1
Spoken Language Generation and Translation 1
Spoken Language Models and Dialog 5
Segmentation, Labelling and Speech Corpora 1
Multimodal Spoken Language Processing 2
Prosody and Emotion 4
Neural Networks, Fuzzy and Evolutionary Methods 2
Large Vocabulary Continuous Speech Recognition 1
Speaker and Language Recognition 2
Signal Processing and Speech Analysis 2
Prosody and Emotion 5
Robust Speech Processing in Adverse Environments 4
Segmentation, Labelling and Speech Corpora 2
Speech Technology Applications and Human-Machine Interface 1
Large Vocabulary Continuous Speech Recognition 2
Text-To-Speech Synthesis 3
Language Acquisition 1
Acoustic Phonetics 1
Speaker Adaptation 2
Speech Coding 2
Hidden Markov Model Techniques 2
Multilingual Perception and Recognition 1
Large Vocabulary Continuous Speech Recognition 3
Articulatory Modelling 3
Language Acquisition 2
Speaker and Language Recognition 3
Text-To-Speech Synthesis 4
Spoken Language Understanding Systems 4
Human Speech Perception 2
Large Vocabulary Continuous Speech Recognition 4
Spoken Language Understanding Systems 2
Signal Processing and Speech Analysis 3
Human Speech Perception 3
Speaker Adaptation 3
Spoken Language Understanding Systems 3
Multimodal Spoken Language Processing 3
Acoustic Phonetics 2
Large Vocabulary Continuous Speech Recognition 5
Speech Coding 3
Language Acquisition 3 / Multilingual Perception and Recognition 2
Segmentation, Labelling and Speech Corpora 3
Text-To-Speech Synthesis 5
Spoken Language Generation and Translation 2
Human Speech Perception 4
Robust Speech Processing in Adverse Environments 5
Text-To-Speech Synthesis 6
Speech Technology Applications and Human-Machine Interface 2
Prosody and Emotion 6
Hidden Markov Model Techniques 3
Speech and Hearing Disorders 2 / Speech Processing for the Speech and Hearing Impaired 1
Human Speech Production
Segmentation, Labelling and Speech Corpora 4
Speaker and Language Recognition 4
Speech Technology Applications and Human-Machine Interface 3
Utterance Verification and Word Spotting 2
Large Vocabulary Continuous Speech Recognition 6
Neural Networks, Fuzzy and Evolutionary Methods 3
Speech Processing for the Speech-Impaired and Hearing-Impaired 2
Prosody and Emotion 7
2: SST Student Day
SST Student Day - Poster Session 1
SST Student Day - Poster Session 2

Author Index
A B C D E F G H I
J K L M N O P Q R
S T U V W X Y Z

Multimedia Files
0734_01.WAV
(was: 734_1.wav)
View ABSTRACT
source sound
File type: Sound File
Format: Sound File: WAV
Tech. description: 8khz, 16 bit mono adpcm
Creating Application:: cool96
Creating OS: win95
0734_02.PDF
(was: 734_2.jpg)
View ABSTRACT
source spectrogram
File type: Image File
Format: Image : JPEG
Tech. description: None
Creating Application:: lview
Creating OS: win95
0734_03.WAV
(was: 734_3.wav)
View ABSTRACT
target sound
File type: Sound File
Format: Sound File: WAV
Tech. description: 8khz, 16 bit mono adpcm
Creating Application:: cool96
Creating OS: win95
0734_04.PDF
(was: 734_4.jpg)
View ABSTRACT
target spectrogram
File type: Image File
Format: Image : JPEG
Tech. description: None
Creating Application:: lview
Creating OS: win95
0734_05.WAV
(was: 734_5.wav)
View ABSTRACT
noise source hmm transformation
File type: Sound File
Format: Sound File: WAV
Tech. description: 8khz, 16 bit mono adpcm
Creating Application:: cool96
Creating OS: win95
0734_06.PDF
(was: 734_6.jpg)
View ABSTRACT
source noise hmm transformation spectrogram
File type: Image File
Format: Image : JPEG
Tech. description: None
Creating Application:: lview
Creating OS: win95
0734_07.WAV
(was: 734_7.wav)
View ABSTRACT
random background noise hmm transformation
File type: Sound File
Format: Sound File: WAV
Tech. description: 8khz, 16 bit mono adpcm
Creating Application:: cool96
Creating OS: win95
0734_08.PDF
(was: 734_8.jpg)
View ABSTRACT
random background nois hmm transformation spectrogram
File type: Image File
Format: Image : JPEG
Tech. description: None
Creating Application:: lview
Creating OS: win95
0734_09.WAV
(was: 734_9.wav)
View ABSTRACT
original sound
File type: Sound File
Format: Sound File: WAV
Tech. description: None
Creating Application:: cool edit
Creating OS: win95
0734_10.PDF
(was: 734_10.jpg)
View ABSTRACT
original spectrogram
File type: Image File
Format: Image : JPEG
Tech. description: None
Creating Application:: lview
Creating OS: win95
0734_11.WAV
(was: 734_11.wav)
View ABSTRACT
harmonics subtraction sound
File type: Sound File
Format: Sound File: WAV
Tech. description: 8khz, 16 bit mono adpcm
Creating Application:: cool edit
Creating OS: win95
0734_12.PDF
(was: 734_12.jpg)
View ABSTRACT
harmonics subtraction spectrogram
File type: Image File
Format: Image : JPEG
Tech. description: None
Creating Application:: lview
Creating OS: win95
0532_01.PDF
(was: 0532_1.GIF)
View ABSTRACT
MRI image
File type: Image File
Format: Image : GIF
Tech. description: 514x510 pixels, 150 dpi, 113k
Creating Application:: Photoshop
Creating OS: MacOs 8.1
0532_02.PDF
(was: 0532_2.GIF)
View ABSTRACT
MRI image
File type: Image File
Format: Image : GIF
Tech. description: 277x228 pixels, 72 dpi, 27k
Creating Application:: Photoshop
Creating OS: MacOs 8.1
0532_03.PDF
(was: 0532_3.GIF)
View ABSTRACT
3-D reconstruction
File type: Image File
Format: Image : GIF
Tech. description: 627x385 pixels, 72 dpi, 8 bits/pixel
Creating Application:: Photoshop
Creating OS: MacOs 8.1
0425_01.WAV
(was: 0425_01.WAV)
View ABSTRACT
Speech samples synthesized with phoneme categorization are included in the CD-ROM [SOUND 0425\_01.WAV] [SOUND 0425\_02.WAV] [SOUND 0425\_03.WAV].
File type: Sound File
Format: Sound File: WAV
Tech. description: None
Creating Application:: Unknown
Creating OS: Unknown
0425_02.WAV
(was: 0425_02.WAV)
View ABSTRACT
Speech samples synthesized with phoneme categorization are included in the CD-ROM [SOUND 0425\_01.WAV] [SOUND 0425\_02.WAV] [SOUND 0425\_03.WAV].
File type: Sound File
Format: Sound File: WAV
Tech. description: None
Creating Application:: Unknown
Creating OS: Unknown
0425_03.WAV
(was: 0425_03.WAV)
View ABSTRACT
Speech samples synthesized with phoneme categorization are included in the CD-ROM [SOUND 0425\_01.WAV] [SOUND 0425\_02.WAV] [SOUND 0425\_03.WAV].
File type: Sound File
Format: Sound File: WAV
Tech. description: None
Creating Application:: Unknown
Creating OS: Unknown
0777_01.WAV
(was: 0777_01.wav)
View ABSTRACT
synthesized from original spectral parameters
File type: Sound File
Format: Sound File: WAV
Tech. description: 11.025kHz(up sampled from 10kHz), signed short(16bit), mono, linear
Creating Application:: sox-10
Creating OS: SunOS 4.1.4
0777_02.WAV
(was: 0777_02.wav)
View ABSTRACT
coded speech using speaker dependent models
File type: Sound File
Format: Sound File: WAV
Tech. description: 11.025kHz(up sampled from 10kHz), signed short(16bit), mono, linear
Creating Application:: sox-10
Creating OS: SunOS 4.1.4
0777_03.WAV
(was: 0777_03.wav)
View ABSTRACT
coded speech using speaker independent models without adaptation
File type: Sound File
Format: Sound File: WAV
Tech. description: 11.025kHz(up sampled from 10kHz), signed short(16bit), mono, linear
Creating Application:: sox-10
Creating OS: SunOS 4.1.4
0777_04.WAV
(was: 0777_04.wav)
View ABSTRACT
coded speech using adapted models without quantization of transfer vectors
File type: Sound File
Format: Sound File: WAV
Tech. description: 11.025kHz(up sampled from 10kHz), signed short(16bit), mono, linear
Creating Application:: sox-10
Creating OS: SunOS 4.1.4
0777_05.WAV
(was: 0777_05.wav)
View ABSTRACT
coded speech using adapted models with quantization of transfer vectors
File type: Sound File
Format: Sound File: WAV
Tech. description: 11.025kHz(up sampled from 10kHz), signed short(16bit), mono, linear
Creating Application:: sox-10
Creating OS: SunOS 4.1.4
0997_01.WAV
(was: 0997.wav)
View ABSTRACT
Typical utterances of sentence 1) uttered with paralinguistic information types A,D,F,I,N, and S, plus an utterance of sentence 3) type N.
File type: Sound File
Format: Sound File: WAV
Tech. description: 11025Hz-16bit samplling, mono
Creating Application:: Creative SoundStudio
Creating OS: Win95
0997_02.PDF
(was: 0997.gif)
View ABSTRACT
Instructions given to the speakers at the time of recording and also to the subjects of perception test.
File type: Image File
Format: Image : GIF
Tech. description: None
Creating Application:: Imagetool on Solaris
Creating OS: Solaris 2.6
0549_01.PDF
(was: 0549_01.GIF)
View ABSTRACT
Consonant confusion matrix for hidden Markov modelling experiment using mapping of acoustic parameters onto phonetic features
File type: Image File
Format: Image : GIF
Tech. description: 480 x 270, 24 bits per pixel
Creating Application:: xv
Creating OS: UNIX, sun4\_solaris 2.6
0549_02.PDF
(was: 0549_02.GIF)
View ABSTRACT
Consonant confusion matrix for hidden Markov modelling experiment using acoustic parameters as input directly
File type: Image File
Format: Image : GIF
Tech. description: 480 x 270, 24 bits per pixel
Creating Application:: xv
Creating OS: UNIX, sun4\_solaris 2.6
0804_01.WAV
(was: 0804_01.WAV.gz)
View ABSTRACT
Example sound file.
File type: Sound File
Format: NIST/Sphere
Tech. description: Sampling rate: 16 kHz, Bits-per-sample: 16, Encoding: Linear PCM
Creating Application:: Unknown
Creating OS: unix
0804_02.WAV
(was: 0804_02.WAV.gz)
View ABSTRACT
Example sound file.
File type: Sound File
Format: NIST/Sphere
Tech. description: Sampling rate: 16 kHz, Bits-per-sample: 16, Encoding: Linear PCM
Creating Application:: Unknown
Creating OS: unix
0804_03.WAV
(was: 0804_03.WAV.gz)
View ABSTRACT
Example sound file.
File type: Sound File
Format: NIST/Sphere
Tech. description: Sampling rate: 16 kHz, Bits-per-sample: 16, Encoding: Linear PCM
Creating Application:: Unknown
Creating OS: unix
0804_04.WAV
(was: 0804_04.WAV.gz)
View ABSTRACT
Example sound file.
File type: Sound File
Format: NIST/Sphere
Tech. description: Sampling rate: 16 kHz, Bits-per-sample: 16, Encoding: Linear PCM
Creating Application:: Unknown
Creating OS: unix
0804_05.WAV
(was: 0804_05.WAV.gz)
View ABSTRACT
Example sound file.
File type: Sound File
Format: NIST/Sphere
Tech. description: Sampling rate: 16 kHz, Bits-per-sample: 16, Encoding: Linear PCM
Creating Application:: Unknown
Creating OS: unix
0804_06.WAV
(was: 0804_06.WAV.gz)
View ABSTRACT
Example sound file.
File type: Sound File
Format: NIST/Sphere
Tech. description: Sampling rate: 16 kHz, Bits-per-sample: 16, Encoding: Linear PCM
Creating Application:: Unknown
Creating OS: unix
1151_01.WAV
(was: 1151_01.WAV)
View ABSTRACT
1st of 3 example waveforms from section 2
File type: Sound File
Format: Sound File: WAV
Tech. description: 16000 Hz, 16 bits/sample, mono, PCM
Creating Application:: Unknown
Creating OS: Unknown
1151_02.WAV
(was: 1151_02.WAV)
View ABSTRACT
2nd of 3 example waveforms from section 2
File type: Sound File
Format: Sound File: WAV
Tech. description: 16000 Hz, 16 bits/sample, mono, PCM
Creating Application:: Unknown
Creating OS: Unknown
1151_03.WAV
(was: 1151_03.WAV)
View ABSTRACT
3rd of 3 example waveforms from section 2
File type: Sound File
Format: Sound File: WAV
Tech. description: 16000 Hz, 16 bits/sample, mono, PCM
Creating Application:: Unknown
Creating OS: Unknown
1151_04.WAV
(was: 1151_04.WAV)
View ABSTRACT
1st of 2 example waveforms from section 6
File type: Sound File
Format: Sound File: WAV
Tech. description: 16000 Hz, 16 bits/sample, mono, PCM
Creating Application:: Unknown
Creating OS: Unknown
1151_05.WAV
(was: 1151_05.WAV)
View ABSTRACT
2nd of 2 example waveforms from section 6
File type: Sound File
Format: Sound File: WAV
Tech. description: 16000 Hz, 16 bits/sample, mono, PCM
Creating Application:: Unknown
Creating OS: Unknown
0355_01.WAV
(was: sound355_01.wav)
View ABSTRACT
Synthesized French sentence: 'Le petit canard apprend ŕ nager' (The little duck learns to swim).
File type: Sound File
Format: Sound File: WAV
Tech. description: Sound file: 16000Hz, 16 bits/sample, mono, pcm,pc windows wav format
Creating Application:: Unknown
Creating OS: Unknown
0190_01.PDF
(was: 0190_01.JPG)
View ABSTRACT
spectrogram in JPEG Fig.6(a)
File type: Image File
Format: JPEG
Tech. description: 1280x411 24bits
Creating Application:: Unknown
Creating OS: linux 2.0.34
0190_02.WAV
(was: 0190_02.WAV)
View ABSTRACT
sound file Fig.6(1)
File type: Sound File
Format: WAV
Tech. description: (44.1k, 16bit, mono)
Creating Application:: LHa for UNIX V 1.14c
Creating OS: linux 2.0.34
0190_03.PDF
(was: 0190_03.JPG)
View ABSTRACT
spectrogram in JPEG Fig.6(b)
File type: Image File
Format: JPEG
Tech. description: 1280x411 24bits
Creating Application:: LHa for UNIX V 1.14c
Creating OS: linux 2.0.34
0190_04.WAV
(was: 0190_04.WAV)
View ABSTRACT
sound file Fig.6(b)
File type: Sound File
Format: WAV
Tech. description: (44.1k, 16bit, mono)
Creating Application:: LHa for UNIX V 1.14c
Creating OS: linux 2.0.34
0190_05.PDF
(was: 0190_05.JPG)
View ABSTRACT
spectrogram in JPEG Fig.6(c)
File type: Archive File
Format: Archive File
Tech. description: 1280x411 24bits
Creating Application:: LHa for UNIX V 1.14c
Creating OS: linux 2.0.34
0190_06.WAV
(was: 0190_06.WAV)
View ABSTRACT
sound file Fig.6(c)
File type: Sound File
Format: WAV
Tech. description: (44.1k, 16bit, mono)
Creating Application:: LHa for UNIX V 1.14c
Creating OS: linux 2.0.34
0692_XTR.ZIP
(was: WinMSF.exe)
View ABSTRACT
MSF player, this program can read and display multimedia speech file with lip-synchronized animated face
File type: Executable Program File
Format: Executable Program File: MS-Windows 32-bit
Tech. description: None
Creating Application:: Unknown
Creating OS: Unknown
0692_XTR.ZIP
(was: Man_1106.img)
View ABSTRACT
Image data library of man's facial images
File type: Image File
Format: OTHER
Tech. description: Resolution 141 by 141, 8bit per pixel
Creating Application:: Unknown
Creating OS: MS-Windows95
0692_XTR.ZIP
(was: Man.pal)
View ABSTRACT
palette data for images
File type: OTHER
Format: OTHER
Tech. description: 256 color palette used for demo images
Creating Application:: Unknown
Creating OS: MS-Windows95
0692_XTR.ZIP
(was: changwon.msf)
View ABSTRACT
msf file for Korean 'Changwon Univeristy'/chang won dae hak kyo/
File type: OTHER
Format: OTHER
Tech. description: msf multimedia file, 141 by 141, sound: sampling rate 16KHz, 16bit, mono
Creating Application:: Unknown
Creating OS: MS-Windows95
0692_XTR.ZIP
(was: Spring1.msf)
View ABSTRACT
msf file for Korean lyric song
File type: OTHER
Format: OTHER
Tech. description: msf multimedia file, 141 by 141, sound: sampling rate 16KHz, 16bit, mono
Creating Application:: Unknown
Creating OS: MS-Windows95
0166_01.WAV
(was: 0166.WAV)
View ABSTRACT
0166.WAV is the synthetic sentence ``When a sailor in a small craft faces the might of the vast Atlantic Ocean today, he takes the same risks as generations took before him.''.
File type: Sound File
Format: Sound File: WAV
Tech. description: 16kHz, 16 bits per sample, mono, PCM.
Creating Application:: Unknown
Creating OS: Unknown
1078_01.WAV
(was: 1078_01.wav)
View ABSTRACT
The speech synthesized with poor prosody due to wrong word segmentation.
File type: Sound File
Format: Sound File: WAV
Tech. description: 11025 samples per second, 8 bits per sample, mono, u-Law encoded
Creating Application:: Unknown
Creating OS: Windows 95/NT
1078_02.WAV
(was: 1078_02.wav)
View ABSTRACT
The speech synthesized is based on composite word approach, which obviously produces more correct and natural prosody.
File type: Sound File
Format: Sound File: WAV
Tech. description: 11025 samples per second, 8 bits per sample, mono, u-Law encoded
Creating Application:: Unknown
Creating OS: Windows 95/NT
0627_01.WAV
(was: 0627_01.WAV)
View ABSTRACT
Included with postscript file in 0627.zip
File type: Sound File
Format: Sound File: WAV
Tech. description: Sampling rate 10 kHz, mono, little-endian, not encoded
Creating Application:: Unknown
Creating OS: Windows NT
0627_02.WAV
(was: 0627_02.WAV)
View ABSTRACT
Included with postscript file in 0627.zip
File type: Sound File
Format: Sound File: WAV
Tech. description: Sampling rate 10 kHz, mono, little-endian, not encoded
Creating Application:: Unknown
Creating OS: Windows NT 4.0
0898_01.WAV
(was: 0898_01.wav)
View ABSTRACT
The original token can be heard here.
File type: Sound File
Format: Sound File: WAV
Tech. description: Unknown
Creating Application:: Unknown
Creating OS: Unknown
0898_02.WAV
(was: 0898_02.wav)
View ABSTRACT
The minimum phase reconstruction can be heard here.
File type: Sound File
Format: Sound File: WAV
Tech. description: Unknown
Creating Application:: Unknown
Creating OS: Unknown
0899_01.WAV
(was: 0899_1.wav)
View ABSTRACT
Speech file of the original sentence "Where are you?"
File type: Sound File
Format: Sound File: WAV
Tech. description: 16 KHz, 16 bits, mono, signed linear encoding.
Creating Application:: sox
Creating OS: Linux
0899_02.WAV
(was: 0899_2.wav)
View ABSTRACT
Speech file of the mimic result of the sentence "Where are you?" using our imploved codebook.
File type: Sound File
Format: Sound File: WAV
Tech. description: 16 KHz, 16 bits, mono, signed linear encoding.
Creating Application:: sox
Creating OS: Linux
0899_03.WAV
(was: 0899_3.wav)
View ABSTRACT
Speech file of the mimic result of the sentence "Where are you?" using our old codebook.
File type: Sound File
Format: Sound File: WAV
Tech. description: 16 KHz, 16 bits, mono, signed linear encoding.
Creating Application:: sox
Creating OS: Linux
0899_04.PDF
(was: 0899.gif)
View ABSTRACT
Spectrogram of the sentence "Where are you?" using our old codebook.
File type: Image File
Format: Image : GIF
Tech. description: None
Creating Application:: XV
Creating OS: Linux.
0158_01.MOV
(was: 0158.MOV)
View ABSTRACT
This design of the manifestative behavior was implemented, and the dialogue for this implementation is in [MOVIE 0158.MOV] on CD-ROM.
File type: Video File
Format: Quicktime
Tech. description: Unknown
Creating Application:: Unknown
Creating OS: Unknown
1002_01.PDF
(was: 1002_01.gif)
View ABSTRACT
A screen dump of the user interface.
File type: Image File
Format: GIF
Tech. description: Unknown
Creating Application:: Unknown
Creating OS: Unknown
0448_01.WAV
(was: 0448)
View ABSTRACT
wav sound file
File type: Sound File
Format: Sound File: WAV
Tech. description: 16 bit, mono, 8000 Hz, wav file
Creating Application:: Unknown
Creating OS: Unknown
0448_02.WAV
(was: 0448)
View ABSTRACT
wav sound file
File type: Sound File
Format: Sound File: WAV
Tech. description: 16 bit, mono, 8000 Hz, wav file
Creating Application:: Unknown
Creating OS: Unknown
0448_03.WAV
(was: 0448)
View ABSTRACT
wav sound file
File type: Sound File
Format: Sound File: WAV
Tech. description: 16 bit, mono, 8000 Hz, wav file
Creating Application:: Unknown
Creating OS: Unknown
0448_04.WAV
(was: 0448)
View ABSTRACT
wav sound file
File type: Sound File
Format: Sound File: WAV
Tech. description: 16 bit, mono, 8000 Hz, wav file
Creating Application:: Unknown
Creating OS: Unknown
0448_05.WAV
(was: 0448)
View ABSTRACT
wav sound file
File type: Sound File
Format: Sound File: WAV
Tech. description: 16 bit, mono, 8000 Hz, wav file
Creating Application:: Unknown
Creating OS: Unknown
0648_01.WAV
(was: 0648_01.wav)
View ABSTRACT
audio file demonstrating quality of synthesis method
File type: Sound File
Format: Sound File: WAV
Tech. description: sample rate = 11025, mono, linear encoding, 16 bits per sample
Creating Application:: Unknown
Creating OS: Windows95
0648_02.WAV
(was: 0648_02.wav)
View ABSTRACT
audio file demonstrating quality of synthesis method
File type: Sound File
Format: Sound File: WAV
Tech. description: sample rate = 11025, mono, linear encoding, 16 bits per sample
Creating Application:: Unknown
Creating OS: Windows95
1053_01.WAV
(was: 1053_01.WAV)
View ABSTRACT
Italian word /lavan'daja/ ('washerwoman) pronouced by a male speaker in clean condition.
File type: Sound File
Format: Sound File: WAV
Tech. description: 16kHz, 16bits-per-sample, mono, Windows PCM
Creating Application:: Windows Sound Utilities
Creating OS: Windows 95/NT
1053_03.WAV
(was: 1053_03.WAV)
View ABSTRACT
Italian word /lavan'daja/ ('washerwoman) reconstructed, by the correlogram subtraction technique described in the paper, from the corresponding noisy signal (0dB SNR).
File type: Sound File
Format: Sound File: WAV
Tech. description: 16kHz, 16bits-per-sample, mono, Windows PCM
Creating Application:: Windows Sound Utilities
Creating OS: Windows 95/NT
1053_02.WAV
(was: 1053_02.WAV)
View ABSTRACT
Italian word /lavan'daja/ ('washerwoman) pronouced by a male speaker in a noisy condition (0dB SNR).
File type: Sound File
Format: Sound File: WAV
Tech. description: 16kHz, 16bits-per-sample, mono, Windows PCM
Creating Application:: Windows Sound Utilities
Creating OS: Windows 95/NT
0418_01.WAV
(was: 0418_4bA.wav)
View ABSTRACT
The performance by mean value of coherence in simulation: Case 1
File type: Sound File
Format: Sound File: WAV
Tech. description: 44.1kHz, 16bit, mono
Creating Application:: Unknown
Creating OS: Unix 2.0.32
0418_02.WAV
(was: 0418_4bB.wav)
View ABSTRACT
The performance by mean value of coherence in simulation: Case 2
File type: Sound File
Format: Sound File: WAV
Tech. description: 44.1kHz, 16bit, mono
Creating Application:: Unknown
Creating OS: Unix 2.0.32
0418_03.WAV
(was: 0418_5bA.wav)
View ABSTRACT
The performance as a mean value of coherence in experiment: Case 1
File type: Sound File
Format: Sound File: WAV
Tech. description: 44.1kHz, 16bit, mono
Creating Application:: Unknown
Creating OS: Unix 2.0.32
0418_04.WAV
(was: 0418_5bB.wav)
View ABSTRACT
The performance as a mean value of coherence in experiment: Case 2
File type: Sound File
Format: Sound File: WAV
Tech. description: 44.1kHz, 16bit, mono
Creating Application:: Unknown
Creating OS: Unix 2.0.32
0514_01.WAV
(was: 0514_01.WAV)
View ABSTRACT
Speech synthesis example.
File type: Sound File
Format: OTHER
Tech. description: Sampling rate: 16 kHz, bits-per-sample: 16, Mono., Encoding: Linear PCM,
Creating Application:: Unknown
Creating OS: unix
0514_02.WAV
(was: 0514_02.WAV)
View ABSTRACT
Speech synthesis example.
File type: Sound File
Format: OTHER
Tech. description: Sampling rate: 16 kHz, Bits-per-sample: 16, Mono., Encoding: Linear PCM
Creating Application:: Unknown
Creating OS: unix
0024_01.WAV
(was: 0024_01.wav)
View ABSTRACT
Since CHATR produces speech in the recognisable voice of a known person, it offers the potential to extend that person's apparent abilities into the realm of multi-linguality. By offering this ability to the voice of a young child, we are perhaps meeting Furui's expectations [3]. [SOUND 0024.01.WAV][SOUND 0024.02.WAV]
File type: Sound File
Format: Sound File: WAV
Tech. description: Unknown
Creating Application:: Unknown
Creating OS: Unknown
0024_02.WAV
(was: 0024_02.wav)
View ABSTRACT
Since CHATR produces speech in the recognisable voice of a known person, it offers the potential to extend that person's apparent abilities into the realm of multi-linguality. By offering this ability to the voice of a young child, we are perhaps meeting Furui's expectations [3]. [SOUND 0024.01.WAV][SOUND 0024.02.WAV]
File type: Sound File
Format: Sound File: WAV
Tech. description: Unknown
Creating Application:: Unknown
Creating OS: Unknown
0024_03.WAV
(was: 0024_03.wav)
View ABSTRACT
By mapping from the phone sequence predicted for synthesis in one language to the phone-set used to label the speech of another, we can produce foreign-language speech using the voice of any speaker. In these examples we use the voice of a small Japanese child to speak in English ([SOUND 0024.03.WAV][SOUND 0024.04.WAV] greeting) and Korean ([SOUND 0024.05.WAV] [SOUND 0024.06.WAV] explaining the technical processing within CHATR).
File type: Sound File
Format: Sound File: WAV
Tech. description: Unknown
Creating Application:: Unknown
Creating OS: Unknown
0024_04.WAV
(was: 0024_04.wav)
View ABSTRACT
By mapping from the phone sequence predicted for synthesis in one language to the phone-set used to label the speech of another, we can produce foreign-language speech using the voice of any speaker. In these examples we use the voice of a small Japanese child to speak in English ([SOUND 0024.03.WAV][SOUND 0024.04.WAV] greeting) and Korean ([SOUND 0024.05.WAV] [SOUND 0024.06.WAV] explaining the technical processing within CHATR).
File type: Sound File
Format: Sound File: WAV
Tech. description: Unknown
Creating Application:: Unknown
Creating OS: Unknown
0024_05.WAV
(was: 0024_05.wav)
View ABSTRACT
By mapping from the phone sequence predicted for synthesis in one language to the phone-set used to label the speech of another, we can produce foreign-language speech using the voice of any speaker. In these examples we use the voice of a small Japanese child to speak in English ([SOUND 0024.03.WAV][SOUND 0024.04.WAV] greeting) and Korean ([SOUND 0024.05.WAV] [SOUND 0024.06.WAV] explaining the technical processing within CHATR).
File type: Sound File
Format: Sound File: WAV
Tech. description: Unknown
Creating Application:: Unknown
Creating OS: Unknown
0024_06.WAV
(was: 0024_06.wav)
View ABSTRACT
By mapping from the phone sequence predicted for synthesis in one language to the phone-set used to label the speech of another, we can produce foreign-language speech using the voice of any speaker. In these examples we use the voice of a small Japanese child to speak in English ([SOUND 0024.03.WAV][SOUND 0024.04.WAV] greeting) and Korean ([SOUND 0024.05.WAV] [SOUND 0024.06.WAV] explaining the technical processing within CHATR).
File type: Sound File
Format: Sound File: WAV
Tech. description: Unknown
Creating Application:: Unknown
Creating OS: Unknown
0024_07.PDF
(was: 0024_01.GIF)
View ABSTRACT
Section 5.2: To reduce the `accent', we adopt the following two-stage process: ([IMAGE 0024\_01.GIF] schematic).
File type: Image File
Format: GIF
Tech. description: Unknown
Creating Application:: Unknown
Creating OS: Unknown
0409_01.WAV
(was: 0409_01.wav)
View ABSTRACT
The instruction messages (fourteen in total) of each set are the synthesized speech whose prosodic parameters are manually determined to express good mood, bad mood, and neutral mood, respectively (good mood [SOUND 0409\_1.WAV], bad mood [SOUND 0409\_2.WAV], neutral mood [SOUND 0409\_3.WAV]).
File type: Sound File
Format: Sound File: WAV
Tech. description: 11025, 16, mono, PCM
Creating Application:: Sound Forge4.0
Creating OS: Windows95
0409_02.WAV
(was: 0409_02.wav)
View ABSTRACT
The instruction messages (fourteen in total) of each set are the synthesized speech whose prosodic parameters are manually determined to express good mood, bad mood, and neutral mood, respectively (good mood [SOUND 0409\_1.WAV], bad mood [SOUND 0409\_2.WAV], neutral mood [SOUND 0409\_3.WAV]).
File type: Sound File
Format: Sound File: WAV
Tech. description: 11025, 16, mono, PCM
Creating Application:: Sound Forge4.0
Creating OS: Windows95
0409_03.WAV
(was: 0409_03.wav)
View ABSTRACT
The instruction messages (fourteen in total) of each set are the synthesized speech whose prosodic parameters are manually determined to express good mood, bad mood, and neutral mood, respectively (good mood [SOUND 0409\_1.WAV], bad mood [SOUND 0409\_2.WAV], neutral mood [SOUND 0409\_3.WAV]).
File type: Sound File
Format: Sound File: WAV
Tech. description: 11025, 16, mono, PCM
Creating Application:: Sound Forge4.0
Creating OS: Windows95
0969_01.PDF
(was: 0969.jpg)
View ABSTRACT
The file 0969.jpg contains a view of the poster presented at the conference, and referred to in the paper. 24 vowel schemes are displayed chronologically with relations among them indicated. A 3-D bar diagram of the 'distances' calculated between them is also included. Updated versions of this file will be made available at http://www.suntiger.ee.up.ac.za/hendrik/icslp5
File type: Image File
Format: Image : JPEG
Tech. description: None
Creating Application:: Corel Draw 8
Creating OS: MS Windows 98
0842_01.PDF
(was: 0842.jpg)
View ABSTRACT
In order to elicit speech, a comic strip of four frames with no dialogues was chosen from Shuangxiangpao, a very famous comic series in Taiwan [IMAGE 0842.JPG]. Subjects were seated in a sound-treated room and were told to study the comic strip and retell the story afterwards. Recordings were made individually with SONY TCM-5000EV recorder and SONY ECM-G3M super-directional microphone. Transcriptions were done afterwards in terms of intonation units (IU) following the discourse analysis tradition.
File type: Image File
Format: JPEG
Tech. description: Unknown
Creating Application:: Unknown
Creating OS: Unknown
0589_01.MPG
(was: 0589_01.MPG)
View ABSTRACT
The manually cued sentence "The old castle passed from the duke to the king."
File type: Video File
Format: Video File: MPEG
Tech. description: 30 frames/second, 320 x 240 frame size
Creating Application:: mpeg\_encode
Creating OS: linux
0589_02.MPG
(was: 0589_02.MPG)
View ABSTRACT
Automatically cued (discrete cues) sentence "The loss and two wins were fair games."
File type: Video File
Format: Video File: MPEG
Tech. description: 30 frames/second, 320 x 240 frame size
Creating Application:: mpeg\_encode
Creating OS: linux
0589_03.MPG
(was: 0589_03.MPG)
View ABSTRACT
Automatically cued (dynamic cues) sentence "The kite may fly on this windy day."
File type: Video File
Format: Video File: MPEG
Tech. description: 30 frames/second, 320 x 240 frame size
Creating Application:: mpeg\_encode
Creating OS: linux
0592_01.PDF
(was: S0592_01.BMP)
View ABSTRACT
Linear model of speech production.
File type: Image File
Format: OTHER
Tech. description: 524x204, 24 bits per pixel
Creating Application:: MS Paint
Creating OS: Windows 95
0592_02.PDF
(was: S0592_02.BMP)
View ABSTRACT
Fundamental frequencies for an esophageal speaker and a normal speaker, sayin the Spanish word "martes"
File type: Image File
Format: OTHER
Tech. description: 514x177, 24 bits per pixel
Creating Application:: MS Paint
Creating OS: Windows 95
0592_03.PDF
(was: S0592_03.BMP)
View ABSTRACT
Central frequencies of the 3 first formants for the Spanish vowels, F1, F2, and F3. The values are similar for both alaryngeal and laryngeal speakers.
File type: Image File
Format: OTHER
Tech. description: 524x394, 24 bits per pixel
Creating Application:: MS Paint
Creating OS: Windows 95
0592_04.PDF
(was: S0592_04.BMP)
View ABSTRACT
Spectrograms of the word martes a) said by an esophageal speaker, and b) re-synthesized according to the method proposed.
File type: Image File
Format: OTHER
Tech. description: 703x284, 24 bits per pixel
Creating Application:: MS Paint
Creating OS: Windows 95
0592_01.WAV
(was: S0592_01.WAV)
View ABSTRACT
The Spanish word "lunes" said by an esophageal speaker
File type: Sound File
Format: Sound File: WAV
Tech. description: Samplig rate: 22050, Bits-per-sample: 16, Mono
Creating Application:: Soundo'LE by Creative
Creating OS: Windows 95
0592_02.WAV
(was: S0592_02.WAV)
View ABSTRACT
The Spanish word "martes" said by an esophageal speaker
File type: Sound File
Format: Sound File: WAV
Tech. description: Samplig rate: 22050, Bits-per-sample: 16, Mono
Creating Application:: Soundo'LE by Creative
Creating OS: Windows 95
0592_03.WAV
(was: S0592_03.WAV)
View ABSTRACT
The Spanish word "miércoles" said by an esophageal speaker
File type: Sound File
Format: Sound File: WAV
Tech. description: Samplig rate: 22050, Bits-per-sample: 16, Mono
Creating Application:: Soundo'LE by Creative
Creating OS: Windows 95
0592_04.WAV
(was: S0592_04.WAV)
View ABSTRACT
The Spanish word "jueves" said by an esophageal speaker
File type: Sound File
Format: Sound File: WAV
Tech. description: Samplig rate: 22050, Bits-per-sample: 16, Mono
Creating Application:: Soundo'LE by Creative
Creating OS: Windows 95
0592_05.WAV
(was: S0592_05.WAV)
View ABSTRACT
The Spanish word "viernes" said by an esophageal speaker
File type: Sound File
Format: Sound File: WAV
Tech. description: Samplig rate: 22050, Bits-per-sample: 16, Mono
Creating Application:: Soundo'LE by Creative
Creating OS: Windows 95
0592_06.WAV
(was: S0592_06.WAV)
View ABSTRACT
The Spanish word "sábado" said by an esophageal speaker
File type: Sound File
Format: Sound File: WAV
Tech. description: Samplig rate: 22050, Bits-per-sample: 16, Mono
Creating Application:: Soundo'LE by Creative
Creating OS: Windows 95
0592_07.WAV
(was: S0592_07.WAV)
View ABSTRACT
The Spanish sentence "y domingo" said by an esophageal speaker
File type: Sound File
Format: Sound File: WAV
Tech. description: Samplig rate: 22050, Bits-per-sample: 16, Mono
Creating Application:: Soundo'LE by Creative
Creating OS: Windows 95
0592_08.WAV
(was: S0592_08.WAV)
View ABSTRACT
The Spanish word "lunes" re-synthesized from a esophageal speaker recording
File type: Sound File
Format: Sound File: WAV
Tech. description: Samplig rate: 22050, Bits-per-sample: 16, Mono
Creating Application:: Soundo'LE by Creative
Creating OS: Windows 95
0592_09.WAV
(was: S0592_09.WAV)
View ABSTRACT
The Spanish word "martes" re-synthesized from a esophageal speaker recording
File type: Sound File
Format: Sound File: WAV
Tech. description: Samplig rate: 22050, Bits-per-sample: 16, Mono
Creating Application:: Soundo'LE by Creative
Creating OS: Windows 95
0592_10.WAV
(was: S0592_10.WAV)
View ABSTRACT
The Spanish word "miércoles" re-synthesized from a esophageal speaker recording
File type: Sound File
Format: Sound File: WAV
Tech. description: Samplig rate: 22050, Bits-per-sample: 16, Mono
Creating Application:: Soundo'LE by Creative
Creating OS: Windows 95
0592_11.WAV
(was: S0592_11.WAV)
View ABSTRACT
The Spanish word "jueves" re-synthesized from a esophageal speaker recording
File type: Sound File
Format: Sound File: WAV
Tech. description: Samplig rate: 22050, Bits-per-sample: 16, Mono
Creating Application:: Soundo'LE by Creative
Creating OS: Windows 95
0592_12.WAV
(was: S0592_12.WAV)
View ABSTRACT
The Spanish word "viernes" re-synthesized from a esophageal speaker recording
File type: Sound File
Format: Sound File: WAV
Tech. description: Samplig rate: 22050, Bits-per-sample: 16, Mono
Creating Application:: Soundo'LE by Creative
Creating OS: Windows 95
0592_13.WAV
(was: S0592_13.WAV)
View ABSTRACT
The Spanish word "sábado" re-synthesized from a esophageal speaker recording
File type: Sound File
Format: Sound File: WAV
Tech. description: Samplig rate: 22050, Bits-per-sample: 16, Mono
Creating Application:: Soundo'LE by Creative
Creating OS: Windows 95
0592_14.WAV
(was: S0592_14.WAV)
View ABSTRACT
The Spanish sentence "y domingo" re-synthesized from a esophageal speaker recording
File type: Sound File
Format: Sound File: WAV
Tech. description: Samplig rate: 22050, Bits-per-sample: 16, Mono
Creating Application:: Soundo'LE by Creative
Creating OS: Windows 95
0592_15.WAV
(was: S0592_15.WAV)
View ABSTRACT
The Spanish word "martes" said by an esophageal speaker
File type: Sound File
Format: Sound File: WAV
Tech. description: Samplig rate: 22050, Bits-per-sample: 16, Mono
Creating Application:: Soundo'LE by Creative
Creating OS: Windows 95
0592_16.WAV
(was: S0592_16.WAV)
View ABSTRACT
The Spanish word "martes" re-synthesized from a esophageal speaker recording
File type: Sound File
Format: Sound File: WAV
Tech. description: Samplig rate: 22050, Bits-per-sample: 16, Mono
Creating Application:: Soundo'LE by Creative
Creating OS: Windows 95