ICSLP'98 Sessions and Titles

Sessions and Titles
Home Full List of Titles 1: ICSLP'98 Proceedings Keynote Speeches Text-To-Speech Synthesis 1 Spoken Language Models and Dialog 1 Prosody and Emotion 1 Hidden Markov Model Techniques 1 Speaker and Language Recognition 1 Multimodal Spoken Language Processing 1 Isolated Word Recognition Robust Speech Processing in Adverse Environments 1 Spoken Language Models and Dialog 2 Articulatory Modelling 1 Talking to Infants, Pets and Lovers Robust Speech Processing in Adverse Environments 2 Spoken Language Models and Dialog 3 Speech Coding 1 Articulatory Modelling 2 Prosody and Emotion 2 Neural Networks, Fuzzy and Evolutionary Methods 1 Utterance Verification and Word Spotting 1 / Speaker Adaptation 1 Text-To-Speech Synthesis 2 Spoken Language Models and Dialog 4 Human Speech Perception 1 Robust Speech Processing in Adverse Environments 3 Speech and Hearing Disorders 1 Prosody and Emotion 3 Spoken Language Understanding Systems 1 Signal Processing and Speech Analysis 1 Spoken Language Generation and Translation 1 Spoken Language Models and Dialog 5 Segmentation, Labelling and Speech Corpora 1 Multimodal Spoken Language Processing 2 Prosody and Emotion 4 Neural Networks, Fuzzy and Evolutionary Methods 2 Large Vocabulary Continuous Speech Recognition 1 Speaker and Language Recognition 2 Signal Processing and Speech Analysis 2 Prosody and Emotion 5 Robust Speech Processing in Adverse Environments 4 Segmentation, Labelling and Speech Corpora 2 Speech Technology Applications and Human-Machine Interface 1 Large Vocabulary Continuous Speech Recognition 2 Text-To-Speech Synthesis 3 Language Acquisition 1 Acoustic Phonetics 1 Speaker Adaptation 2 Speech Coding 2 Hidden Markov Model Techniques 2 Multilingual Perception and Recognition 1 Large Vocabulary Continuous Speech Recognition 3 Articulatory Modelling 3 Language Acquisition 2 Speaker and Language Recognition 3 Text-To-Speech Synthesis 4 Spoken Language Understanding Systems 4 Human Speech Perception 2 Large Vocabulary Continuous Speech Recognition 4 Spoken Language Understanding Systems 2 Signal Processing and Speech Analysis 3 Human Speech Perception 3 Speaker Adaptation 3 Spoken Language Understanding Systems 3 Multimodal Spoken Language Processing 3 Acoustic Phonetics 2 Large Vocabulary Continuous Speech Recognition 5 Speech Coding 3 Language Acquisition 3 / Multilingual Perception and Recognition 2 Segmentation, Labelling and Speech Corpora 3 Text-To-Speech Synthesis 5 Spoken Language Generation and Translation 2 Human Speech Perception 4 Robust Speech Processing in Adverse Environments 5 Text-To-Speech Synthesis 6 Speech Technology Applications and Human-Machine Interface 2 Prosody and Emotion 6 Hidden Markov Model Techniques 3 Speech and Hearing Disorders 2 / Speech Processing for the Speech and Hearing Impaired 1 Human Speech Production Segmentation, Labelling and Speech Corpora 4 Speaker and Language Recognition 4 Speech Technology Applications and Human-Machine Interface 3 Utterance Verification and Word Spotting 2 Large Vocabulary Continuous Speech Recognition 6 Neural Networks, Fuzzy and Evolutionary Methods 3 Speech Processing for the Speech-Impaired and Hearing-Impaired 2 Prosody and Emotion 7 2: SST Student Day SST Student Day - Poster Session 1 SST Student Day - Poster Session 2 Author Index A B C D E F G H I J K L M N O P Q R S T U V W X Y Z Multimedia Files	Keynote Speeches Cochlear Implants In The Second And Third Millennia The Use of Linguistic Hierarchies in Speech Understanding Text-To-Speech Synthesis 1 Unsupervised Training of Phone Duration and Energy Models for Text-to-Speech Synthesis Improved Duration Modeling of English Phonemes Using a Root Sinusoidal Transformation Efficient Adaptation of TTS Duration Model to New Speakers Duration Modeling For HMM-Based Speech Synthesis Spoken Language Models and Dialog 1 An Educational Dialogue System with a User Controllable Dialogue Manager End-User Driven Dialogue System Design: The Reward Experience The Design of a Multi-Domain Mandarin Chinese Spoken Dialogue System An Integrated Dialogue System for the Automation of Call Centre Services Prosody and Emotion 1 Tones of a Tridialectal: Acoustic and Perceptual Data on Ten Linguistic Tonetic Contrasts Between Lao, Nyo and Standard Thai Tone Sandhi Between Complex Tones in a Seven-Tone Southern Thai Dialect The Acoustic And Perceptual Features Of Tone In The Tibeto-Burman Language Ao Naga The Differential Status of Semivowels in the Acoustic Phonetic Realisation of Tone Hidden Markov Model Techniques 1 Nonreciprocal Data Sharing in Estimating HMM Parameters Data-Driven Extensions to HMM Statistical Dependencies Use of High-Level Linguistic Constraints for Constructing Feature-Based Phonological Model in Speech Recognition Speaker and Language Recognition 1 Sub-Band Based Speaker Verification Using Dynamic Recombination Weights Measuring the Dynamic Encoding of Speaker Identity and Dialect in Prosodic Parameters German Regional Variants - A Problem for Automatic Speech Recognition? Improving Accent Identification Through Knowledge Of English Syllable Structure Multi-Dimensional Scaling of Listener Responses to Complex Auditory Stimuli Same Talker, Different Language The Impact of Regional Variety Upon Specific Word Categories in Spontaneous German Speech Pre-Processing Against Intentional Imposture In Speaker Recognition A Comparison of Two Unsupervised Approaches to Accent Identification The Influence of Accents in Australian English Vowels and their Relation to Articulatory Tract Parameters Automatic Language Recognition Using High-Order HMMs Speaker Recognition Using Residual Signal Of Linear and Nonlinear Prediction Models An Implementation and Evaluation of an On-Line Speaker Verification System for Field Trials Speaker Verification on the Polycost Database Using Frequency Filtered Spectral Energies A High-Performance Text-Independent Speaker Identification System Based on BCDM Representation Of Voice Quality Features Associated With Talker Individuality Candidate Selection Based on Significance Testing and its Use in Normalisation and Scoring Japanese Forensic Phonetics: Non-Contemporaneous Within-Speaker Variation In Natural And Read-Out Speech Statistical Modeling of Pronunciation and Production Variations for Speech Recognition Dialect Maps and Dialect Research; Useful Tools for Automatic Speech Recognition? Text Independent Speaker Recognition Using Micro-Prosody Speaker Verification Using Fundamental Frequency On Optimum Normalization Method Used for Speaker Verification Recurrent Substrings and Data Fusion for Language Recognition Text-Independent Speaker Recognition Using Multiple Information Sources Discriminative Training Of GMM Using a Modified EM Algorithm for Speaker Recognition Language Identification Incorporating Lexical Information A VQ Based Speaker Recognition System Based in Histogram Distances. Text Independent and for Noisy Environments Spanish Dialects: Phonetic Transcription Acoustic Analysis of Japanese English Prosody: Comparison Between Fukushima Dialect Speakers and Tokyo Dialect Speakers in Declarative Sentences and Yes-No Questions A Context-Dependent Approach for Speaker Verification Using Sequential Decision Quantitative Influence of Speech Variability Factors for Automatic Speaker Verification in Forensic Tasks Creating Hidden Markov Models for Fast Speech Speaker Identification using Relaxation Labeling A Novel Technique for the Combination of Utterance and Speaker Verification Systems in a Text-Dependent Speaker Verification Task A Forensic Phonetic Investigation into Non-contemporaneous Variation in the F-pattern of Similar-sounding Speakers. Human vs. Machine Speaker Identification with Telephone Speech A Comparison of Fusion Techniques in Mel-cepstral Based Speaker Identification On the Influence of Hyperarticulated Speech on Recognition Performance Text-Independent Speaker Identification and Verification Using the TIMIT Database Incorporating Linguistic Knowledge Into Automatic Dialect Identification of Spanish A Novel Text-Independent Speaker Verification Method Using the Global Speaker Model Multimodal Spoken Language Processing 1 A Fast Method of Producing Talking Head Mouth Shapes from Real Speech The Efficiency of Multimodal Interaction: a Case Study Audio and Audio-visual Perception of Consonants Disturbed by White Noise and 'Cocktail Party' Overview of the Maya Spoken Language System Automatic Recognition of Spontaneous Speech Dialogues Using an Animated Talking Character in a Web-Based City Guide Demonstrator Influence of Facial Views on the McGurk Effect in Auditory Noise The Intellimedia Workbench - a Generic Environment for Multimodal Systems STAMP: A Suite Of Tools For Analyzing Multimodal System Processing Cultural Similarities and Differences in the Recognition of Audio-Visual Speech Stimuli A Multimodal-Input Multimedia-Output Guidance System: MMGS HMM-based Visual Speech Recognition Using Intensity and Location Normalization A Hierarchy Probability-Based Visual Features Extraction Method for Speechreading Integration Of Talking Heads And Text-To-Speech Synthesizers For Visual TTS Isolated Word Recognition Improving Accuracy of Telephony-based, Speaker-Independent Speech Recognition Rejection in Speech Recognition Systems with Limited Training A Four Layer Sharing HMM System For Very Large Vocabulary Isolated Word Recognition A Comparative Study Of Hybrid Modelling Techniques For Improved Telephone Speech Recognition Smoothing and Tying for Korean Flexible Vocabulary Isolated Word Recognition Recent Work on a Preselection Module for a Flexible Large Vocabulary Speech Recognition System in Telephone Environment A Study of Noise Robustness for Speaker Independent Speech Recognition Method Using Phoneme Similarity Vector Classification of Taiwanese Tones Based on Pitch and Energy Movements Phoneme-Based Recognition for the Norwegian SpeechDat(II) Database Robust Feature Extraction for Alphabet Recognition Recognition of Connected Digit Speech in Japanese Collected over the Telephone Network Improving the Speaker-Dependency of Subword-Unit-Based Isolated Word Recognition Speaker Independent Speech Recognition Method using Constrained Time Alignment near Phoneme Discriminative Frame A Nonstationary Autoregressive HMM With Gain Adaptation For Speech Recognition A Large-Vocabulary Taiwanese (MIN-NAN) Multi-Syllabic Word Recognition System Based Upon Right-Context-Dependent Phones with State Clustering by Acoustic Decision Tree Speech Recognition Based on the Distance Calculation Between Intermediate Phonetic Code Sequences in Symbolic Domain High Accuracy Chinese Speech Recognition Approach with Chinese Input Technology for Telecommunication Use Robust Speech Processing in Adverse Environments 1 Robust Speech Recognition using HMM's with Toeplitz State Covariance matrices Modeling of Output Probability Distribution to Improve Small Vocabulary Speech Recognition in Adverse Environments Robust and Compact Multilingual Word Recognizers Using Features Extracted from a Phoneme Similarity Front-End An Effect of Adaptive Beamforming on Hands-Free Speech Recognition Based on 3-D Viterbi Search Coherence-based Subband Decomposition for Robust Speech and Speaker Recognition in Noisy and Reverberant Rooms A Minimax Search Algorithm for CDHMM based Robust Continuous Speech Recognition Spoken Language Models and Dialog 2 An Event Driven Model for Dialogue Systems Automatic Classification of Dialogue Contexts for Dialogue Predictions Automatic Identification of Command Boundaries in a Conversational Natural Language User Interface The Predictive Power of Game Structure in Dialogue Act Recognition: Experimental Results Using Maximum Entropy Estimation A Schema Based Approach To Dialog Control Expanding A Time-Sensitive Conversational Architecture For Turn-Taking To Handle Content-Driven Interruption Articulatory Modelling 1 A Three-Dimensional Linear Articulatory Model Based on MRI Data On Loops and Articulatory Biomechanics Magnetic Resonance Measurements of the Velum Port Opening Cantilever-Type Force-Sensor-Mounted Palatal Plate for Measuring Palatolingual Contact Stress and Pattern During Speech Phonation Determination of the Vocal Tract Spectrum from the Articulatory Movements Based on the Search of an Articulatory-Acoustic Database An MRI Study On The Relationship Between Oral Cavity Shape And Larynx Position Talking to Infants, Pets and Lovers Acoustic And Affective Qualities Of IDS In English Acoustic Qualities Of IDS And ADS In Thai Pragmatic Characteristics of Infant Directed Speech Are You My Little Pussy-Cat? Acoustic, Phonetic And Affective Qualities Of Infant- And Pet-Directed Speech Special Speech Registers: Talking To Australian And Thai Infants, And To Pets Robust Speech Processing in Adverse Environments 2 Performance Improvements Through Combining Phone- And Syllable-Scale Information In Automatic Speech Recognition Predictive Adaptation and Compensation for Robust Speech Recognition Influence of the Speaking Style and the Noise Spectral Tilt on the Lombard Reflex and Automatic Speech Recognition Data-driven PMC and Bayesian Learning Integration for Fast Model Adaptation in Noisy Conditions Improving The Noise And Spectral Robustness Of An Isolated-Word Recognizer Using An Auditory-Model Front End A Model for Speech Reverberation and Intelligibility Restoring Filters Spoken Language Models and Dialog 3 On Different Functions of Repetitive Utterances Prosody-Based Detection of the Context of Backchannel Responses Robust Interpretation for Spoken Dialogue Systems System-User Interaction and Response Strategy in Spoken Dialogue System Organizing Self-Motivated Dialogue with Autonomous Creatures Fly with the EAGLES: Evaluation of the "ACCeSS" Spoken Language Dialogue System Speech Coding 1 A Very Low Bit Rate Speech Coder Using HMM With Speaker Adaptation ITU-T G.729 Extension At 6.4 kbps Adaptive Transformation for Segmented Parametric Speech Coding Speech Enhancement Using STC-Based Bandwidth Extension Performance And Optimization Of The SEEVOC Algorithm Articulatory Modelling 2 Acoustic-Articulatory Evaluation of the Upper Vowel-Formant Region and its Presumed Speaker-Specific Potency Control of Larynx Height in Vowel Production Analyzing the Effect of Secondary Excitations of the Vocal Tract on Vocal Intensity in Different Loudness Conditions An Analysis of Modal Coupling Effects During the Glottal Cycle: Formant Synthesizers from Time-Domain Finite-Difference Simulations Laryngoscopic Analysis of Pharyngeal Articulations and Larynx-Height Voice Quality Settings Effects of Shapes of Radiational Aperture on Radiation Characteristics Prosody and Emotion 2 De-accentuation: Linguistic Environments and Prosodic Realizations Towards an Automatic Classification of Emotions in Speech Can We Hear Smile? The Automatic Marking of Prominence in Spontaneous Speech Using Duration and Part of Speech Information On A Pitch Alteration Technique in Excited Cepstral Spectrum for High Quality TTS Dovetailing of Acoustics and Prosody in Spontaneous Speech Recognition A Computational Memory and Processing Model for Prosody Convergence Of Fundamental Frequencies In Conversation: If It Happens, Does It Matter? Analysis and Interpretation of Fundamental Frequency Contours of British English in Terms of a Command-Response Model Common Patterns In Word Level Prosody Prosodic Structure in Japanese Spontaneous Speech An Acoustic-Phonetic Description Of Word Tone In Kagoshima Japanese Representing Prosodic Words Using Statistical Models of Moraic Transition of Fundamental Frequency Contours of Japanese Disambiguation of Korean Utterances Using Automatic Intonation Recognition Multi-Level Rhythm Control for Speech Synthesis Using Hybrid Data Driven and Rule-Based Approaches EGG Model of Ditoneme in Mandarin Temporal Organization of Speech for Normal and Fast Rates A Syllable-based Generalization of Japanese Accentuation Non-Adjacent Segmental Effects in Tonal Realization of Accentual Phrase in Seoul Korean Improvement on Connected Numbers Recognition Using Prosodic Information Phonetic Investigation of Boundary Pitch Movements in Japanese Phonetic and Phonological Characteristics of Paralinguistic Information in Spoken Japanese ToBI Accent Type Recognition The Influence of Syllable Structure on the Timing of Intonational Events in German New Prosodic Control Rules For Expressive Synthetic Speech The Use of F0 Reliability Function for Prosodic Command Analysis on F0 Contour Generation Model Analysis of Effects of Lexical Accent, Syntax, and Global Speech Rate upon the Local Speech Rate On the Effects of Speech Rate upon Parameters of the Command-Response Model for the Fundamental Frequency Contours of Speech The Maximum-Based Description of F0 Contours and its Application to English Perceived Prominence and Acoustic Parameters in American English Generating Emotional Speech with a Concatenative Synthesizer A Perceptive Measure of Pure Prosody Linguistic Functions with Reiterant Sentences Prosodic Parameters in Emotional Speech Automatic Detection of Prominence (as Defined by Listeners' Judgements) in Read Aloud Dutch Sentences A Schema for Illocutionary Act Identification With Prosodic Feature An Algorithm for Choosing Japanese Acknowledgments using Prosodic Cues and Context A Study of Tones and Tempo in Continuous Mandarin Digit Strings and their Application in Telephone Quality Speech Recognition Simulated Emotions: an Acoustic Study of Voice and Perturbation Measures A Robust Tone Recognition Method of Chinese Based on Sub-syllabic F0 Contours The Microprosodics of Tone Sandhi in Shanghai Disyllabic Compounds Jitter And Shimmer Differences Between Pathological Voices Of School Children Neural Networks, Fuzzy and Evolutionary Methods 1 A Comparison of Thai Speech Recognition Systems Using Hidden Markov Model, Neural Network, and Fuzzy-Neural Network Phoneme Recognition with Statistical Modeling of the Prediction Error of Neural Networks Neural Network Based Pronunciation Modeling With Applications To Speech Recognition A Comparative Study of OCON and MLP Architectures for Phoneme Recognition Evaluation and Integration of Neural-Network Training Techniques for Continuous Digit Recognition Hierarchical Neural Networks (HNN) for Chinese Continuous Speech Recognition Neural Network Motivation for Segmental Distribution Combining Connectionist Multi-Band and Full-Band Probability Streams for Speech Recognition Of Natural Numbers Initial Speech Recognition Results Using The Multinet Architecture Selection of the Optimal Structure of the Continuous HMM Using the Genetic Algorithm A Proposed Decision Rule For Speaker Recognition Based On Fuzzy C-Means Clustering Fuzzy Gaussian Mixture Models For Speaker Recognition A New Strategy of Fuzzy-Neural Network for Thai Numeral Speech Recognition Thai Polysyllabic Word Recognition Using Fuzzy-Neural Network Utterance Verification and Word Spotting 1 / Speaker Adaptation 1 Word Verification Using Confidence Measures in Speech Recognition Improving Posterior Based Confidence Measures in Hybrid HMM/ANN Speech Recognition Systems Two-Pass Utterance Verification Algorithm for Long Natural Numbers Recognition A*-Admissible Key-Phrase Spotting With Sub-Syllable Level Utterance Verification Speaker-Independent Upfront Dialect Adaptation in a Large Vocabulary Continuous Speech Recognizer Word-Based Acoustic Confidence Measures for Large-Vocabulary Speech Recognition Improved Utterance Rejection Using Length Dependent Thresholds Bayesian Constrained Frequency Warping HMMS for Speaker Normalisation An Evaluation of Keyword Spotting Performance Utilizing False Alarm Rejection Based on Prosodic Information Predictive Speaker Adaptation and Its Prior Training Powerful Syllabic Fillers for General-Task Keyword-Spotting and Unlimited-Vocabulary Continuous-Speech Recognition Confidence Scoring for Speech Understanding Systems Phonological Rules for Enhancing Acoustic Enrollment of Unknown Words Recognition-Based Word Counting for Reliable Barge-in and Early Endpoint Detection in Continuous Speech Recognition Linear Discriminant - A New Criterion For Speaker Normalization Confidence Measures Derived from an Acceptor HMM Telephone Speech Multi-Keyword Spotting Using Fuzzy Search Algorithm and Prosodic Verification Topic Recognition for News Speech Based on Keyword Spotting Text-To-Speech Synthesis 2 Prosody Prediction for Speech Synthesis using Transformational Rule-based Learning Representing the Environments for Phonological Processes in an Accent-Independent Lexicon for Synthesis of English Efficient Lexical Retrieval for English Text-to-Speech Synthesis Spoken Language Models and Dialog 4 SQEL: A Multilingual and Multifunctional Dialogue System Semi-Automated Incremental Prototyping of Spoken Dialog Systems Beyond Structured Dialogues: Factoring out Grounding Human Speech Perception 1 Heads And Tails in Word Perception: Evidence For `Early-to-Late' Processing in Listening and Reading Evidence for Early Effects of Sentence Context on Word Segmentation Assimilation and Anticipation in Word Perception Lexical Activation by Assimilated and Reduced Tokens Robust Speech Processing in Adverse Environments 3 Linear and Nonlinear Speech Feature Analysis for Stress Classification Speech Feature Modeling for Robust Stressed Speech Recognition Combining Articulatory and Acoustic Information for Speech Recognition in Noisy and Reverberant Environments Improving Speaker Identification Performance in Reverberant Conditions using Lip Information Speech and Hearing Disorders 1 Adults With a Severe-to-Profound Hearing Impairment. Investigating the Effects of Linguistic Context on Speech Perception Speech Perception in Dyslexia: Measurements From Birth Onwards An Acoustic Analysis of Vowel Production Across Tasks in a Case of Non-fluent Progressive Aphasia Speech Technology in Clinical Environments Prosody and Emotion 3 What Spreads, And How? Tonal Rightward Spreading on Shanghai Disyllabic Compounds Tonal Complexity as a Dialectal Feature: 25 Different Citation Tones from Four Zhejiang Wu Dialects Emotional Speech Synthesis: From Speech Database to TTS Some Acoustic Characteristics Of Emotion Spoken Language Understanding Systems 1 GALAXY-II: A Reference Architecture for Conversational System Development Improvements in Speech Understanding Accuracy Through the Integration of Hierarchical Linguistic, Prosodic, And Phonological Constraints in the Jupiter Domain Towards Robust Methods for Spoken Document Retrieval Signal Processing and Speech Analysis 1 Maximum a Posteriori Pitch Tracking Vowel Separation Using the Reassigned Amplitude-Modulation Spectrum Feature Decorrelation Methods in Speech Recognition. A Comparative Study Multi-Resolution for Speech Analysis Dynamic features in Children's Vowels Effectiveness of Phase-Corrected Rasta for Continuous Speech Recognition Techniques For Capturing Temporal Variations In Speech Signals With Fixed-Rate Processing Automatic Detection of Landmark for Nasal Consonants from Speech Waveform Plug and Play Software for Designing High-Level Speech Processing Systems Creating Speaker Independent HMM Models for Restricted Database Using STRAIGHT-TEMPO Morphing Restoration Of Hyperbaric Speech By Correction Of The Formants And The Pitch Voice Conversion Based on Parameter Transformation Noise Robust Two-Stream Auditory Feature Extraction Method for Speech Recognition Heterogeneous Measurements and Multiple Classifiers for Speech Recognition Joint Recognition and Segmentation Using Phonetically Derived Features and a Hybrid Phoneme Model TRAPS - Classifiers Of Temporal Patterns Robust Measurement of Fundamental Frequency and Degree of Voicing Micropower Electro-Magnetic Sensors for Speech Characterization, Recognition, Verification, and other applications Robust Entropy-based Endpoint Detection for Speech Recognition in Noisy Environments Statistical Integration of Temporal Filter Banks for Robust Speech Recognition Using Linear Discriminant Analysis (LDA) Feature-Based Approach to Speech Recognition Periodicity Emphasis of Voice Wave using Nonlinear IIR Digital Filters and Its Applications Speech Recognition Via Phonetically Featured Syllables Do Phonetic Features Help to Improve Consonant Identification in ASR? Perceptual and Acoustic Properties of Phonemes in Continuous Speech for Different Speaking Rate On Robust Sequential Estimator Based on T-Distribution with Forgetting Factor for Speech Analysis Discriminant Wavelet Basis Construction for Speech Recognition An Efficient Mel-LPC Analysis Method for Speech Recognition Discriminative Weighting of Multi-Resolution Sub-Band Cepstral Features for Speech Recognition Separation of Singing and Piano Sounds Modeling of Variations in Cepstral Coefficients Caused by F0 Changes and its Application to Speech Processing A Detection Framework for Locating Phonetic Events On Frequency Averaging For Spectral Analysis In Speech Recognition Wavelet Transform Domain Blind Equalization and Its Application to Speech Analysis A Novel Method of Formant Analysis and Glottal Inverse Filtering Vector Quantizer Acceleration for an Automatic Speech Recognition Application Local Speech Rate as a Combination of Syllable and Phone Rate Recovering Gestures From Speech Signals: A Preliminary Study for Nasal Vowels Extended Linear Discriminant Analysis (ELDA) for Speech Recognition Speech, Silence, Music and Noise Classification of TV Broadcast Material The Relation Between Vocal Tract Shape And Formant Frequencies Can Be Described By Means Of A System Of Coupled Differential Equations Improving Speech Recognizer by Broader Acoustic-Phonetic Group Classification Separation of Speech Source and Filter by Time-Domain Deconvolution On the Application of the AM-FM Model for the Recovery of Missing Frequency Bands of Telephone Speech Estimation of Voice Source and Vocal Tract Parameters Using Combined Subspace-Based and Amplitude Spectrum-Based Algorithm The Distance Measure For Line Spectrum Pairs Applied to Speech Recognition Spoken Language Generation and Translation 1 The Modeling and Realization of Natural Speech Generation System "Ko Tok Ples Ensin bilong Tok Pisin" or The TP-CLE: A First Report From a Pilot Speech-to-Speech Translation Project From Swedish to Tok Pisin An Iterative, DP-Based Search Algorithm For Statistical Machine Translation Information Extraction and Text Generation of News Reports for a Swedish-English Bilingual Spoken Dialogue System Utterance Generation for Transaction Dialogues Example-Based Error Recovery Method For Speech Translation: Repairing Sub-Trees According to the Semantic Distance Context Sensitive Generation of Descriptions An Interlingua Based on Domain Actions for Machine Translation of Task-Oriented Dialogues Generating Pitch Accents in a Concept-to-Speech System Using a Knowledge Base Making the Most of Multiplicity: a Multi-Parser Multi-Strategy Architecture for the Robust Processing of Spoken Language Natural-Sounding Speech Synthesis Using Variable-Length Units Spoken Language Models and Dialog 5 A Robust Dialogue Model for Spoken Dialogue Processing The REWARD Service Creation Environment. An Overview An Analysis of the Timing of Turn-Taking in a Corpus of Goal-Oriented Dialogue The Provision of Corrective Feedback in a Spoken Dialogue CALL System Evaluation of Dialog Strategies for a Tourist Information Retrieval System Designing a Multimodal Dialogue System for Information Retrieval The Research Project of Man-Computer Dialogue System in Chinese Interfaces for Speech Recognition Systems: the Impact of Vocabulary Constraints and Syntax on Performance Pacing Spoken Directions to Suit the Listener A Spoken Dialogue System Utilizing Spatial Information From Novice to Expert: The Effect of Tutorials on User Expertise with Spoken Dialogue Systems Emergent Computational Dialogue Management Architecture For Task-Oriented Spoken Dialogue Systems An Analysis of Dialogues with Our Dialogue System Through a WWW page Modelling Spoken Dialogues With State Transition Diagrams: Experiences With The CSLU Toolkit Situated Dialogue Coordination For Spoken Dialogue Systems Robust Spoken Dialogue Systems for Consumer Products: a Concrete Application A German Dialogue System for Scheduling Dates and Meetings by Naturally Spoken Continuous Speech Spoken Dialogue System Using Corpus-Based Hidden Markov Model A Realistic Wizard of Oz Simulation of a Multimodal Spoken Language System A Syllable-Based Chinese Spoken Dialogue System for Telephone Directory Services Primarily Trained with A Corpus How Disagreement Expressions are Used in Cooperative Tasks Segmentation, Labelling and Speech Corpora 1 Acoustic Indicators Of Topic Segmentation IViE - A Comparative Transcription system for Intonational Variation in English Automatic Segmental and Prosodic Labeling of Mandarin Speech Database Automatic Labelling of German Prosody Multimodal Spoken Language Processing 2 Speech Driven 3-D Face Point Trajectory Synthesis Algorithm Speech-to-Lip Movement Synthesis Based on the EM Algorithm Using Audio-Visual HMMs Learning Words from Natural Audio-Visual Input Using the Multi-Stream Approach for Continuous Audio-Visual Speech Recognition: Experiments on the M2VTS Database Prosody and Emotion 4 Intonative Structure as a Determinant of Word Order Variation in Dutch Verbal Endgroups Experiments on the Meaning of Two Pitch Accent Types: The 'Pointed Hat' Versus the Accent-lending Fall in Dutch Phonetic and Phonological Markers of Contrastive Focus in Korean Reconciling Two Competing Views on Contrastiveness Neural Networks, Fuzzy and Evolutionary Methods 2 Modular Neural Networks for Low-Complex Phoneme Recognition Global Optimisation of Neural Network Models Via Sequential Sampling-Importance Resampling Efficient Computation of MMI Neural Networks for Large Vocabulary Speech Recognition Systems Modular Connectionist Systems for Identifying Complex Arabic Phonetic Features Large Vocabulary Continuous Speech Recognition 1 Real-Time Recognition of Broadcast News Automatic Recognition of Korean Broadcast News Speech Telephone-Based Conversational Speech Recognition in the JUPITER Domain Japanese Large-Vocabulary Continuous Speech Recognition System Based on Microsoft Whisper Partitioning And Transcription Of Broadcast News Data Speaker and Language Recognition 2 Speaker Detection in Broadcast Speech Databases Multilateral Techniques for Speaker Recognition Real Time Speaker Indexing Based on Subspace Method - Application to TV News Articles and Debate SHEEP, GOATS, LAMBS and WOLVES: A Statistical Analysis of Speaker Performance in the NIST 1998 Speaker Recognition Evaluation Progress in Speaker Recognition at Dragon Systems A Comparative Study Of Speaker Verification Systems Using The Polycost Database Signal Processing and Speech Analysis 2 Improving Pitch Estimation with Short Duration Speech Samples An Instantaneous-Frequency-Based Pitch Extraction Method for High-Quality Speech Transformation: Revised TEMPO in the STRAIGHT-Suite Speaker-Independent Speech Recognition Using Micro Segment Spectrum Integration On Robust Speech Analysis Based On Time-Varying Complex AR Model Spectral Basis Functions from Discriminant Analysis Prosody and Emotion 5 The Tilt Intonation Model Analysis of Occurrence of Pauses and Their Durations in Japanese Text Reading A Statistical Study of Pitch Target Points in Five Languages Fully Automatic Prosody Generator For Text-to-Speech Automatic Prosodic Labeling of 6 Languages Automatic Utterance Type Detection Using Suprasegmental Features Robust Speech Processing in Adverse Environments 4 Spectral Sequence Compensation Based on Continuity of Spectral Sequence Robust Features for Speech Recognition Systems Interfacing of CASA and Partial Recognition Based on a Multistream Technique AN RNN-Based Compensation Method for Mandarin Telephone Speech Recognition Robust Speech Recognition Using Discriminative Stream Weighting and Parameter Interpolation Acoustic Backing-Off in the Local Distance Computation for Robust Automatic Speech Recognition Noise Model Selection For Robust Speech Recognition A Novel Iterative Signal Enhancement Algorithm for Noise Reduction in Speech Missing Data Reconstruction for Robust Automatic Speech Recognition in the Framework of Hybrid HMM/ANN Systems Recognition from GSM Digital Speech Conversational Speech Systems For On-Board Car Navigation And Assistance A Signal Processing System for Having the Sound "Pop-Out" in Noise Thanks to the Image of the Speaker's Lips: New Advances Using Multi-Layer Perceptrons Robust Speech Activity Detection in the Presence of Noise Robust Automatic Speech Recognition by the Application of a Temporal-Correlation-Based Recurrent Multilayer Neural Network to the Mel-Based Cepstral Coefficients Speech Recognition from GSM Codec Parameters Improved Parallel Model Combination Based on Better Domain Transformation for Speech Recognition Under Noisy Environments Robust Speech/Non-Speech Detection in Adverse Conditions Based on Noise and Speech Statistics Speech Recognition In Car Noise Environments Using Multiple Models According To Noise Masking Levels Spectral Noise Subtraction With Recursive Gain Curves A Novel Robust Speech Recognition Algorithm Based on Multi-Models and Integrated Decision Method On the Interaction Between Time and Frequency Filtering of Speech Parameters for Robust Speech Recognition Inference Of Missing Spectrographic Features For Robust Speech Recognition SNR-Dependent Flooring and Noise Overestimation for Joint Application of Spectral Subtraction and Model Combination Improved Robust Speech Recognition Considering Signal Correlation Approximated by Taylor Series Speech Recognition in Noisy Environment Using Weighted Projection-Based Likelihood Measure Evaluation of Model Adaptation by HMM Decomposition on Telephone Speech Recognition Comparative Experiments to Evaluate a Voiced-Unvoiced-Based Pre-Processing Approach to Robust Automatic Speech Recognition in Low-SNR Environments Signal Extraction From Noisy Signal Based on Auditory Scene Analysis Frequency Domain Binaural Model as the Front End of Speech Recognition System A Study on the Recognition of Low Bit-Rate Encoded Speech Weighted Parallel Model Combination for Noisy Speech Recognition Favourable and Unfavourable Short Duration Segments of Speech in Noise Segmentation, Labelling and Speech Corpora 2 An Efficient Labeling Tool for the QuickSig Speech Database Collection and Detailed Transcription of a Speech Database for Development of Language Learning Technologies Resegmentation of SWITCHBOARD Automatic Generation of Visual Scenarios for Spoken Corpora Acquisition Automatic Detection of Semantic Boundaries Based on Acoustic and Lexical Knowledge A New Fast Algorithm for Automatic Segmentation of Continuous Speech Acoustic Nature and Perceptual Testing of Corpora of Emotional Speech Korean Prosodic Break Index Labelling by a New Mixed Method of LDA and VQ MOOSE: Management Of Otago Speech Environment Phonetic Alignment: Speech Synthesis Based vs. Hybrid HMM/ANN Customisation And Quality Assessment Of Spoken Language Description A Silence/Noise/Music/Speech Splitting Algorithm Audio-Visual Segmentation for Content-Based Retrieval Same News is Good News: Automatically Collecting Reoccurring Radio News Stories An Annotation System for Melodic Aspects of German Spontaneous Speech Additional Use of Phoneme Duration Hypotheses in Automatic Speech Segmentation Towards a Minimal Standard for Dialogue Transcripts: a New SGML Architecture for the HCRC Map Task Corpus Speech Technology Applications and Human-Machine Interface 1 Steps Toward The Integration Of Speaker Recognition In Real-World Telecom Applications A Bimodal Korean Address Entry/Retrieval System Usability Evaluation of IVR Systems With DTMF and ASR SALSA Version 1.0: A Speech-Based Web Browser for Hong Kong English A Language for Creating Speech Applications The Use of Automatic Speech Recognition to Reduce the Interference Between Concurrent Tasks of Driving and Phoning Interactive Listening to Structured Speech Content on the Internet MSF Format For The Representation Of Speech Synchronized Moving Image Effects of Using Speech in Timetable Information Systems for WWW The Interactive Systems Labs View4You Video Indexing System SEMOLE: A Robust Framework For Gathering Information From The World Wide Web Enhancing a WIMP Based Interface With Speech, Gaze Tracking and Agents Now You Hear It, Now You Don't: Empirical Studies of Audio Browsing Behavior Behavior A Voice Verifier for Face/Voice Based Person Verification System On The Use Of Automatic Speech Recognition For TV Captioning An Undergraduate Course on Speech Recognition Based on the CSLU Toolkit Real Time Voice Alteration Based on Linear Prediction Evaluation and Implementation of a Voice-Activated Dialing System with Utterance Verification Towards a Mandarin Voice Memo System Large Vocabulary Continuous Speech Recognition 2 Grammatical Word Graph Re-Generation for Spontaneous Speech Recognition Compression Algorithm Of Trigram Language Models Based On Maximum Likelihood Estimation Morphological Modeling of Word Classes for Language Models A Comparative Study Between Polyclass and Multiclass Language Models Log-Linear Interpolation Of Language Models The Applicability of Adaptive Language Modelling for the Broadcast News Task Text-To-Speech Synthesis 3 The IBM Trainable Speech Synthesis System ProSynth: An Integrated Prosodic Approach to Device-Independent, Natural-Sounding Speech Synthesis Total Quality Evaluation of Speech Synthesis Systems Comparative Evaluation of Synthetic Prosody with the PURR Method SABLE: A Standard For TTS Markup Prosodic vs. Segmental Contributions to Naturalness in a Diphone Synthesizer Language Acquisition 1 Non-Native Productions Of Japanese Single Stops That Are Too Long For One Mora Unit The Process Of Generation And Development Of Second Language Japanese Accentuation Perceptual Properties of Russians with Japanese Fricatives Assessment of Dutch Pronunciation by Means of Automatic Speech Recognition Technology Phonetic-Level Mispronunciation Detection in Non-Native Swedish Speech Computer-Based Second Language Production Training By Using Spectrographic Representation And HMM-Based Speech Recognition Scores Acoustic Phonetics 1 Assimilation of Place in Japanese and Dutch Prosodic Constraint on V-to-V Coarticulation in Japanese Postvocalic /r/-deletion in Standard Dutch: How Experimental Phonology Can Profit From ASR Technology More Evidence For The Perceptual Basis Of Sound Change? Suprasegmental Effects In The Development Of Distinctive Nasalization Speech Production Of Vowel Sequences Using A Physiological Articulatory Model Speaker Adaptation 2 Eigenvoices for Speaker Adaptation Speaker Clustering Using Direct Maximisation of the MLLR-Adapted Likelihood Incremental On-Line Speaker Adaptation in Adverse Conditions Cluster Adaptive Training for Speech Recognition Speech Coding 2 Towards a Unified Model for Low Bit-Rate Speech Coding Using a Recognition-Synthesis Approach On the Significance of Temporal Masking in Speech Coding Waveform Interpolation Coding With Pitch-Spaced Subbands An Improved Decomposition Method For WI Using IIR Wavelet Filter Banks Hidden Markov Model Techniques 2 Real-Time Probabilistic Segmentation for Segment-Based Speech Recognition Toward Markov Random Field Modeling of Speech Hidden Markov Models for Trajectory Modeling Multilingual Perception and Recognition 1 Bilingual and Dialectal Adaptation and Retraining Language Independent and Language Adaptive Large Vocabulary Speech Recognition A Method for Measuring the Intelligibility and Nonnativeness of Phone Quality in Foreign Language Pronunciation Training Large Vocabulary Continuous Speech Recognition 3 The BBN Single-Phonetic-Tree Fast-Match Algorithm An Efficient Two-pass Search Algorithm Using Word Trellis Index Nozomi -- a Fast, Memory-Efficient Stack Decoder For LVCSR Reducing the OOV Rate in Broadcast News Speech Recognition Using Automatically-Derived Acoustic Sub-word Units in Large Vocabulary Speech Recognition Fabricating Conversational Speech Data with Acoustic Models: a Program to Examine Model-Data Mismatch Articulatory Modelling 3 An Electropalatographic, Kinematic, and Acoustic Analysis of Supralaryngeal Correlates of Word-Level Prominence Contrasts in English Consistencies and Inconsistencies Between EPG and Locus Equation Data on Coarticulation Synergy Between Jaw And Lips/Tongue Movements : Consequences In Articulatory Modelling Modelling Tongue Configuration in German Vowel Production Optopalatograph: Real-time Feedback of Tongue Movement in 3D Effects of Contrastive Focal Accent on Linguopalatal Articulation and Coarticulation in the French [kskl] Cluster Language Acquisition 2 Spoken Word Identification by Native and Nonnative Speakers of English: Effects of Training, Modality, Context and Phonetic Environment The Effect Of Background Knowledge On First And Second Language Comprehension Difficulty Comparison of Cross-language Coarticulation: English, Japanese and Japanese-accented English Plasticity Of Non-Native Phonetic Perception And Production: A Training Study The Relation Between Perceptual and Production Categories in Acquisition The Development of Perceptual Cue-Weighting in Children Aged 6 to 12 Speaker and Language Recognition 3 Robust Speaker Verification Insensitive to Session-dependent Utterance Variation and Handset-dependent Distortion A Comparative Evaluation of Variance Flooring Techniques in HMM-based Speaker Verification Text-Independent Speaker Verification Using Automatically Labelled Acoustic Segments A Fast Decoding Algorithm Based on Sequential Detection of the Changes in Distribution Speaker Verification With Ensemble Classifiers Based On Linear Speech Transforms Speaker Recognition Based On Discriminative Projection Models Text-To-Speech Synthesis 4 A Mixed-Excitation Frequency Domain Model for Time-Scale Pitch-Scale Modification of Speech Analytic Generation of Synthesis Units by Closed Loop Training for Totally Speaker Driven Text to Speech System (TOS Drive TTS) Modeling the Microprosody of Pitch and Loudness for Speech Synthesis with Neural Networks Spectral Smoothing for Concatenative Speech Synthesis MIMIC : A Voice-Adaptive Phonetic-Tree Speech Synthesiser Automatic Generation Of Korean Pronunciation Variants By Multistage Applications Of Phonological Rules Techniques for Accurate Automatic Annotation of Speech Waveforms Optimized Stopping Criteria for Tree-Based Unit Selection in Concatenative Synthesis Automatic Transcription of Intonation Using an Identified Prosodic Alphabet Frequency Analysis of Phonetic Units for Concatenative Synthesis in Catalan Investigating the Syntactic Characteristics of English Tone Units The UPC Text-to-Speech System for Spanish and Catalan The New Version of the ROMVOX Text-to-Speech Synthesis System Based on a Hybrid Time Domain-LPC Synthesis Technique An F0 Contour Control Model for Totally Speaker Driven Text to Speech System On the Relationship of Speech Rates with Prosodic Units in Dialogue Speech On the Reduction of Concatenation Artefacts in Diphone Synthesis Error Analysis and Confidence Measure of Chinese Word Segmentation Energy Contour Generation for a Sentence Using a Neural Network Learning Method A Computational Algorithm For F0 Contour Generation In Korean Developed With Prosodically Labeled Databases Using K-ToBI System Rapid-Deployment Text-to-Speech in the DIPLOMAT System Formant Diphone Parameter Extraction Utilising a Labelled Single-Speaker Database A New Synthetic Speech/Sound Control Language A Study on the Natural-Sounding Japanese Phonetic Word Synthesis by Using the VCV-Balanced Word Database That Consists of the Words Uttered Forcibly in Two Types of Pitch Accent Letter to Sound Rules for Accented Lexicon Compression A Name Announcement Algorithm with Memory Size and Computational Power Constraints How a French TTS System can Describe Loanwords Improvements in Slovene Text-to-Speech Synthesis Automatic Rule Generation for Linguistic Features Analysis Using Inductive Learning Technique: Linguistic Features Analysis in TOS Drive TTS System Segmental Duration Control Based on an Articulatory Model Text Analysis for the Bell Labs French Text-to-Speech System Modeling Vowel Duration for Japanese Text-to-Speech Synthesis Towards A Chinese Text-To-Speech System With Higher Naturalness Spoken Language Understanding Systems 4 Grammar Fragment Acquisition using Syntactic and Semantic Clustering Non-Expert Access to Unification Based Speech Understanding Natural Language Call Routing: A Robust, Self-Organizing Approach Automatic Grammar Induction from Semantic Parsing BTH: An Efficient Parsing Algorithm for Word-Spotting Syntax Coordination: Interaction of Discourse and Extrapositions Hierarchical Tag-Graph Search for Spontaneous Speech Understanding in Spoken Dialog Systems Extraction of the Dialog Act and the Topic From Utterances in a Spoken Dialog System Fast Computation of Maximum Entropy / Minimum Divergence Feature Gain Stochastic Language Models for Speech Recognition and Understanding Linguistically Engineered Tools for Speech Recognition Error Analysis Estimating Entropy of a Language from Optimal Word Insertion Penalty A Linguistic Analysis of Repair Signals in Co-operative Spoken Dialogues A Hierarchical Language Model for CSR Spoken Language Understanding Within Dialogs Using a Graphical Model of Task Structure Keyword Extraction of Radio News using Domain Identification based on Categories of an Encyclopedia Human Speech Perception 2 Fundamental Frequency Fluctuation in Continuous Vowel Utterance and its Perception Estimation of Mental Lexicon Size with Word Familiarity Database Vowel Quality in Spontaneous Speech: What Makes a Good Vowel? Cooperation and Competition of Burst and Formant Transitions for the Perception and Identification of French Stops The Effect of Modifying Formant Amplitudes on the Perception of French Vowels Generated by Copy Synthesis Segmental and Tonal Processing in Cantonese Phonological Similarity Effects in Cantonese Spoken-Word Processing On The Learnability Of The Voicing Contrast For Initial Stops Acoustic and Perceptual Characteristic of Italian Stop Consonants Acoustic Cues for the Auditory Identification of the Spanish Fricative /f/ Recognition of Vowels in Fricative Context. Voicing Affects Perceived Manner of Articulation. Enhancement Techniques to Improve the Intelligibility of Consonants in Noise : Speaker and Listener Effects Boundaries of Perception of Long Tones in Taiwanese Speech Effects of Phonetic Quality and Duration on Perceptual Acceptability of Temporal Changes in Speech Dynamic vs. Static Spectral Detail in the Perception of Gated Stops Phonological Units In Speech Segmentation And Phonological Awareness How Far Do Speakers Back Up in Repairs? A Quantitatve Model Don't Blame It (All) On The Pause: Further ERP Evidence For A Prosody-Induced Garden-Path In Running Speech The Role of Stress for Lexical Selection in Dutch The Perception of Stressed Syllables in Finnish The Perception Of The Morae With Devocalized Vowels In Japanese Language. Large Vocabulary Continuous Speech Recognition 4 High Resolution Decision Tree based Acoustic Modeling beyond CART Unsupervised Training of a Speech Recognizer Using TV Broadcasts A New Method to Achieve Fast Acoustic Matching for Speech Recognition Improved Parameter Tying for Efficient Acoustic Model Evaluation in Large Vocabulary Continuous Speech Recognition A New Look at HMM Parameter Tying for Large Vocabulary Speech Recognition Factor Analysis Invariant to Linear Transformations of Data Spoken Language Understanding Systems 2 Automatic Ambiguity Detection Empowering Knowledge Based Speech Understanding through Statistics Concept-Driven Speech Understanding Incorporated with a Statistic Language Model On The Limitations of Stochastic Conceptual Finite-State Language Models For Speech Understanding Towards Speech Understanding Across Multiple Languages Automatic Detection of Sentence Boundaries and Disfluencies Based on Recognized Words Signal Processing and Speech Analysis 3 Determination of Articulatory Positions from Speech Acoustics by Applying Dynamic Articulatory Constraints Recognizing Emotions in Speech Using Short-term and Long-term Features PeriphEar : A Nonlinear Active Model of the Auditory Periphery The Voicing Feature for Stop Consonants: Acoustic Phonetic Analyses and Automatic Speech Recognition Experiments Wavelet-Based Energy Binning Cepstral Features for Automatic Speech Recognition Articulatory Analysis using a Codebook for Articulatory based Low Bit-Rate Speech Coding Human Speech Perception 3 Categorical Perception: Important Phenomenon or Lasting Myth? Categorical Perception of Vowels Suprasegmental Cues for the Segmentation of Identical Vowel Sequences in Japanese Perception Of Concurrent Approximant-Vowel Syllables Perceived Swedish Vowel Quantity: Effects of Postvocalic Consonant Duration Speaker Adaptation 3 On-line Hierarchical Transformation of Hidden Markov Models for Speaker Adaptation High-Speed Speaker Adaptation Using Phoneme Dependent Tree-Structured Speaker Clustering The Use of Confidence Measures in Unsupervised Adaptation of Speech Recognizers Speaker Normalization with All-Pass Transforms Toward On-Line Learning of Chinese Continuous Speech Recognition System The CHAM Model of Hyperarticulate Adaptation During Human-Computer Error Resolution Spoken Language Understanding Systems 3 Language Modeling for Content Extraction in Human-Computer Dialogues A Language Model Combining Trigrams and Stochastic Context-Free Grammars Online Adaptation of Language Models in Spoken Dialogue Systems Language Model Adaptation for Spoken Language Systems Detecting Topic Shifts Using a Cache Memory A Discourse Coding Scheme for Conversational Spanish Multimodal Spoken Language Processing 3 Referential Features and Linguistic Indirection in Multimodal Language Multimodal Language Processing Implementation of Coordinative Nodding Behavior on Spoken Dialogue Systems Use of Non-Verbal Information in Communication Between Human and Robot What You See is (Almost) What You Hear: Design Principles For User Interfaces For Accessing Speech Archives Acoustic Phonetics 2 Regional Variation in the Vowels of Female Adolescents from Sydney A Kinematic Analysis Of New Zealand And Australian English Vowel Spaces Syllable-Onset Acoustic Properties Associated with Syllable-Coda Voicing Articulatory, Acoustic and Perceptual Aspects of Fricative-Stop Coarticulation Efficiency As An Organizing Principle Of Natural Speech Within-Speaker Variability Due to Speaking Manners Large Vocabulary Continuous Speech Recognition 5 A Thesaurus-Based Statistical Language Model for Broadcast News Transcription Effect of Task Complexity on Search Strategies for the Motorola Lexicus Continuous Speech Recognition System New Features For Confidence Annotation Multi-Span Statistical Language Modeling for Large Vocabulary Speech Recognition Maximum-Likelihood Updates Of HMM Duration Parameters For Discriminative Continuous Speech Recognition Towards Better Integration of Semantic Predictors in Statistical Language Modeling An Asymmetric Stochastic Language Model Based on Multi-Tagged Words Product-Code Vector Quantization of Cepstral Parameters for Speech Recognition Over the WWW Context Dependent Tree Based Transforms For Phonetic Speech Recognition Interfacing Acoustic Models with Natural Language Processing Systems Hierarchical Cluster Language Modeling With Statistical Rule Extraction For Rescoring N-Best Hypotheses During Speech Decoding Dealing With Out-of-Vocabulary Words and Speech Disfluencies in an N-Gram Based Speech Understanding System Source-Extended Language Model for Large Vocabulary Continuous Speech Recognition Time Dependent Language Model For Broadcast News Transcription And Its Post-Correction Exploiting Transitions and Focussing on Linguistic Properties for ASR A Unified Framework for Sublexical and Linguistic Modelling Supporting Flexible Vocabulary Speech Understanding A Method for Modeling Liaison in a Speech Recognition System for French On Variable Sampling Frequencies in Speech Recognition Pronunciation Modeling for Large Vocabulary Conversational Speech Recognition Time Shift Invariant Speech Recognition The Demiphone Versus the Triphone in a Decision-tree State-Tying Framework Word Clustering for A Word Bi-gram Model A Large Vocabulary Continuous Speech Recognition Hybrid System for the Portuguese Language Speech Recognition Performance on a new Voicemail Transcription Task Grammatical and Statistical Word Prediction System for Spanish Integrated in an Aid for People with Disabilities Segmentation Using a Maximum Entropy Approach Recognition Performance of a Large-Scale Dependency Grammar Language Model A Bootstrap Technique for Building Domain-Dependent Language Models Estimation of the Probability Distributions of Stochastic Context-Free Grammars From the k-Best Derivations Robust HMM Estimation with Gaussian Merging-Splitting and Tied-Transform HMMs Nonlinear Interpolation of Topic Models for Language Model Adaptation Performance Evaluation of Word Phrase and Noun Category Language Models For Broadcast News Speech Recognition Robust Automatic Continuous-Speech Recognition Based on a Voiced-Unvoiced Decision Double Tree Beam Search Using Hierarchical Subword Units Text Segmentation and Topic Tracking on Broadcast News Via a Hidden Markov Model Approach Multi-Phone Strings as Subword Units for Speech Recognition Phonetic Modification of the Syllable /tu/ in Two Spontaneous American English Dialogues Efficient Lattice Representation and Generation Modeling Pronunciation Variation for a Dutch CSR: Testing Three Methods Comparison of Language Modelling Techniques for Russian and English Optimized POS-Based Language Models for Large Vocabulary Speech Recognition Reducing Peak Search Effort Using Two-Tier Pruning Using Untranscribed Training Data to Improve Performance Telephone Band LVCSR for Hearing-Impaired Users Using X-Gram For Efficient Speech Recognition Speech Coding 3 A New Linear Predictive Method for Compression of Speech Signals Hierarchical Temporal Decomposition: A Novel Approach To Efficient Compression Of Spectral Characteristics Of Speech Speech Intelligibility Testing for New Technologies Efficient Quantization Of LSF Parameters Based on Temporal Decomposition A Sinusoidal Harmonic Vocoder at 1.2 kbps Using Auditory Perceptual Characteristics A 16 Kbit/s Wideband CELP Coder Using MEL-Generalized Cepstral Analysis and its Subjective Evaluation Comparison Of Spectral Estimation Techniques For Low Bit-Rate Speech Coding Low Bit Rate Coding for Speech and Audio Using Mel Linear Predictive Coding (MLPC) Analysis Comparison Study on VQ Codevector Index Assignment Using Linguistic Knowledge To Improve The Design Of Low-Bit Rate LSF Quantisation Transform Coding of LSF Parameters Using Wavelets Source Controlled Variable Bit-Rate Speech Coder Based On Waveform Interpolation Improving Speaker Recognisability In Phonetic Vocoders Language Acquisition 3 / Multilingual Perception and Recognition 2 Speech Perception and Spoken Language in Children with Impaired Hearing Quantitative Assessment of Second Language Learners' Fluency: an Automatic Approach Cross-Language Merged Speech Units And Their Descriptive Phonetic Correlates Crosslinguistic Disfluency Modelling: A Comparative Analysis of Swedish and American English Human--Human and Human--Machine Dialogues Calibration Of Machine Scores For Pronunciation Grading Phonetic-Distance-Based Hypothesis Driven Lexical Adaptation For Transcribing Multlingual Broadcast News Automatic Pronunciation Error Detection and Guidance for Foreign Language Learning Lexical Access for Large-Vocabulary Speech Recognition The Effect of Fundamental Frequency on Mandarin Speech Recognition The Perception Of Nativeness: Variable Speakers And Flexible Listeners Voice Dictation in the Secondary School Classroom The Importance of the First Syllable in English Spoken Word Recognition by Adult Japanese Speakers Spoken L2 Teaching with Contrastive Visual and Auditory Feedback The Role Of Phonological, Morphological, And Orthographic Knowledge In The Intuitive Syllabification Of Dutch Words: A Longitudinal Approach The Acquisition of Japanese Compound Accent Rule The Acquisition of Putonghua Phonology Enhancing Speech Processing of Japanese Learners of English Utilizing Time-Scale Expansion With Constant Pitch A Bootstrap Training Approach for Language Model Classifiers Voice Onset Time Patterns in 7-, 9- and 11-Year Old Children Some Developmental Patterns in the Speech of 6-, 8- and 10-Year Old Children: an Acoustic Phonetic Study Language Development After Extreme Childhood Deprivation: A Case Study Phonological Elements As A Basis For Language-Independent ASR A Phonetic and Acoustic Study of Babbling in an Italian Child Rescoring Multiple Pronunciations Generated from Spelled Words Segmentation, Labelling and Speech Corpora 3 A Recursive Algorithm for the Forced Alignment of Very Long Audio Segments The Selection of Pronunciation Variants: Comparing the Performance of Man and Machine Acoustic Confidence Measures for Segmenting Broadcast News A Duration-Based Confidence Measure for Automatic Segmentation of Noise Corrupted Speech Segmentation and Classification of Broadcast News Audio Speaker Recruitment Methods And Speaker Coverage - Experiences From A Large Multilingual Speech Database Collection Text-To-Speech Synthesis 5 A Phonologically Motivated Method of Selecting Non-Uniform Units A Synthesis Method Based on Concatenation of Demisyllables and a Residual Excited Vocal Tract Model Exploration of Acoustic Correlates in Speaker Selection for Concatenative Synthesis A Perceptual Evaluation of Distance Measures for Concatenative Speech Synthesis HMM-Based Smoothing For Concatenative Speech Synthesis A Nonlinear Unit Selection Strategy for Concatenative Speech Synthesis Based on Syllable Level Features Spoken Language Generation and Translation 2 A Generic Algorithm for Generating Spoken Monologues On the Use of Automatically Generated Discourse-Level Information in a Concept-to-Speech Synthesis System Learning Phrase-Based Head Transduction Models for Translation of Spoken Utterances Probabilistic Dialogue Act Extraction for Concept Based Multilingual Translation Systems Fast Decoding For Statistical Machine Translation A Japanese-to-English Speech Translation System: ATR-MATRIX Human Speech Perception 4 Orthografik Inkoncistensy Ephekts in Foneme Detektion? The Effect of Orthographic Knowledge on the Segmentation of Speech Spotting (Different Types of) Words in (Different Types of) Context Correlation Between Consonantal VC Transitions And Degree Of Perceptual Confusion Of Place Contrast In Hindi Perception Of Tonal Rises And Falls For Accentuation And Phrasing In Swedish Speech Intelligibility Derived From Exceedingly Sparse Spectral Information Robust Speech Processing in Adverse Environments 5 Auditory Modeling Techniques For Robust Pitch Extraction And Noise Reduction Wavelet Transform-based Speech Enhancement A Practical Perceptual Frequency Autoregressive HMM Enhancement System An Effective Quality Evaluation Protocol For Speech Enhancement Algorithms An Adaptive Beamforming Microphone Array System Using A Blind Deconvolution Speech Enhancement Using Critical Band Spectral Subtraction Text-To-Speech Synthesis 6 How To Handle "Foreign" Sounds in Swedish Text-to-Speech Conversion: Approaching the 'Xenophone' Problem Multi-lingual Concatenative Speech Synthesis On The Use Of F0 Features In Automatic Segmentation For Speech Synthesis A Linguistic and Prosodic Database for Data-Driven Japanese TTS Synthesis Text-to-Speech Voice Adaptation from Sparse Training Data Describing Intonation with a Parametric Model Speech Technology Applications and Human-Machine Interface 2 Development of CAI System Employing Synthesized Speech Responses Using Combined Decisions and Confidence Measures for Name Recognition in Automatic Directory Assistance Systems VPQ: A Spoken Language Interface to Large Scale Directory Information SCAN - Speech Content Based Audio Navigator: A System Overview Controlling a HIFI With a Continuous Speech Understanding System User Evaluation Of The Mask Kiosk Prosody and Emotion 6 A Contrastive Study of Lexical Stress Placement in Singapore English and British English Integrated Recognition of Words and Phrase Boundaries Phrase Accents Revisited: Comparative Evidence From Standard and Cypriot Greek Phonetic Invariance and Phonological Stability: Lithuanian Pitch Accents A HMM-Based Recognition System for Perceptive Relevant Pitch Movements of Spontaneous German Speech Towards a Reversible Symbolic Coding of Intonation Hidden Markov Model Techniques 3 A Statistical Phonemic Segment Model for Speech Recognition Based on Automatic Phonemic Segmentation Improved Feature Decorrelation for HMM-based Speech Recognition Efficient High-Order Hidden Markov Modelling A Time-Synchronous, Tree-based Search Strategy in the Acoustic Fast Match of an Asynchronous Speech Recognition System Effective Structural Adaptation of LVCSR Systems to Unseen Domains Using Hierarchical Connectionist Acoustic Models Support Vector Machines for Speech Recognition Natural Number Recognition Using Discriminatively Trained Inter-Word Context Dependent Hidden Markov Models Information Theoretic Approaches to Model Selection Continuous Speech Recognition Using Segmental Unit Input HMMs with a Mixture of Probability Density Functions and Context Dependency Gaussian Density Tree Structure in a Multi-Gaussian HMM-Based Speech Recognition System Generalized Phone Modeling Based on Piecewise Linear Segment Lattice A Flexible Method of Creating HMM Using Block-Diagonalization of Covariance Matrices HMM Topology Selection For Accurate Acoustic And Duration Modeling Context-Dependent Duration Modelling for Continuous Speech Recognition Training of Context-Dependent Subspace Distribution Clustering Hidden Markov Model Unsupervised Training of HMMs With Variable Number of Mixture Components Per State Acoustic Observation Context Modeling in Segment Based Speech Recognition Capturing Discriminative Information Using Multiple Modeling Techniques Suprasegmental Duration Modelling with Elastic Constraints in Automatic Speech Recognition An Adaptive Gradient-Search Based Algorithm for Discriminative Training of HMM's Task Adaptation of Sub-Lexical Unit Models Using the Minimum Confusibility Criterion on Task Independent Databases Stochastic Calculus, Non-Linear Filtering, and the Internal Model Principle: Implications for Articulatory Speech Recognition The Use of Meta-HMM in Multistream HMM Training for Automatic Speech Recognition Enhanced ASR By Acoustic Feature Filtering Soft State-Tying for HMM-based Speech Recognition Estimation Of Models For Non-Native Speech In Computer-Assisted Language Learning Based On Linear Model Combination Duration Modeling Using Cumulative Duration Probability and Speaking Rate Compensation Probabilistic Modeling with Bayesian Networks for Automatic Speech Recognition Speech and Hearing Disorders 2 / Speech Processing for the Speech and Hearing Impaired 1 SIVHA, Visual Speech Synthesis System Using Automatic Speech Recognition and its Possible Effects on the Voice The Importance Of F0 or Voice Pitch for Perception of Tonal Language: Simulations With Cochlear Implant Speech Processing Strategies Assessing High-Level Language In Individuals With Multiple Sclerosis: A Pilot Study Design Of Cochlear Implant Device For Transmitting Voice Pitch Information In Speech Sound Of Asian Languages Abnormal Volume-Duration Relationship in Parkinsonian Speech Analysis Of Disordered Speech Signal Using Wavelet Transform Multi-Channel Pulsation Strategy For Electric Stimulation Of Cochlea Synthetic Faces as a Lipreading Support Predicting Language Scores From The Speech Perception Scores Of Hearing-Impaired Children Content-Independent Duration Model on Categories of Voice and Unvoice Segments Dynamical Spectrogram, an Aid for the Deaf Evidence of Dual-Route Phonetic Encoding From Apraxia of Speech: Implications for Phonetic Encoding Models Speech Communication Profiles Across The Adult Lifespan: Persons Without Self-Identified Hearing Impairment Human Speech Production Time as a Factor in the Acoustic Variation of Schwa On The Structure Of Vowel Space: A Genealogy Of General Phonetic Concepts The Relationship Between Intensity and Subglottal Pressure with Controlled Pitch Segmentation Of The Airway From The Surrounding Tissues On Magnetic Resonance Images: A Comparative Study Recovering Vocal Tract Shapes from MFCC Parameters Quantification of Pharyngeal Articulations using Measurements from Laryngoscopic Images Variance and Invariance in Speech Rate as a Reflection of Conceptual Planning Correspondence Between the Glottal Gesture Overlap Pattern and Vowel Devoicing in Japanese Evaluation of Japanese Manners of Generating Word Accent of English Based on a Stressed Syllable Detection Technique Independence Of Consonantal Voicing And Vocoid F0 Perturbation In English And Japanese Reduction of English Function Words in Switchboard Duration Compensation in Non-Adjacent Consonant and Temporal Regularity Relationship Between Lip Shapes And Acoustical Characteristics During Speech A Model to Represent Propagation and Radiation of Higher-Order Modes for 3-D Vocal-Tract Configuration FEM Analysis of Aspirated Air Flow in Three-Dimensional Vocal Tract During Fricative Consonant Phonation Trajectory Formation of Articulatory Movements for a Given Sequence of Phonemes Contextual Effects on Voicing Profiles of German and Mandarin Consonants Reconstructing the Tongue Surface from Six Cross-Sectional Contours: Ultrasound Data Articulability of Two Consecutive Morae in Japanese Speech Production: Evidence from Sound Exchange Errors in Spontaneous Speech Coarticulation and Degrees of Freedom in the Elaboration of a New Articulatory Plant: GENTIANE A Pressure Sensitive Palatography: Application of New Pressure Sensitive Sheet for Measuring Tongue-Palatal Contact Pressure Dual-Route Phonetic Encoding: Some Acoustic Evidence Fast and Slow Speech Rate: A Characterisation for French Segmentation, Labelling and Speech Corpora 4 A Multilingual Prosodic Database The CSLU Speaker Recognition Corpus How Effective Is Unsupervised Data Collection For Children's Speech Recognition? An Algorithm for Automatic Generation of Mandarin Phonetic Balanced Corpus Towards a Formal Framework for Linguistic Annotations Forming Generic Models Of Speech For Uniform Database Access Speaker and Language Recognition 4 On The Convergence Of Gaussian Mixture Models: Improvements Through Vector Quantization Modeling Dynamic Prosodic Variation for Speaker Verification Blind Clustering of Speech Utterances Based on Speaker and Language Characteristics Spoken Language Identification Using The SpeechDat Corpus Automatic Language Identification with Perceptually Guided Training and Recurrent Neural Networks On the Importance of Components of the Modulation Spectrum for Speaker Verification Speech Technology Applications and Human-Machine Interface 3 Is Speech The Right Thing For Your Application? A PC-Based Tool for Helping in Diagnosis of Pathologic Voice Web-Based Educational Tools for Speech Technology Universal Speech Tools: The CSLU Toolkit Creating a Mexican Spanish Version of the CSLU Toolkit A Voice User Interface Demonstration System for Mexican Spanish Utterance Verification and Word Spotting 2 Context Dependent Anti Subword Modeling for Utterance Verification Combination of Confidence Measures in Isolated Word Recognition Confidence Measures for HMM-based Speech Recognition Vocabulary-Independent Word Confidence Measure Using Subword Features A New Confidence Measure Based on Rank-Ordering Subphone Scores Speaking-Style Dependent Lexicalized Filler Model for Key-Phrase Detection and Verification Large Vocabulary Continuous Speech Recognition 6 Sharable Software Repository for Japanese Large Vocabulary Continuous Speech Recognition The Design of the Newspaper-Based Japanese Large Vocabulary Continuous Speech Recognition Corpus Indexing and Classification of TV News Articles Based on Speech Dictation Using Word Bigram Parametric Trajectory Mixtures for LVCSR Neural Networks, Fuzzy and Evolutionary Methods 3 Fuzzy-Integration Based Normalization for Speaker Verification Improving The Generalization Performance Of The MCE/GPD Learning Acoustic Speech Recognition Model by Neural Net Equation with Competition and Cooperation Improved Surname Pronunciations Using Decision Trees Speech Processing for the Speech-Impaired and Hearing-Impaired 2 A Speechreading Aid Based on Phonetic ASR Training Speech through Visual Feedback Patterns Word Sequence Pair Spotting for Synchronization of Speech and Text in Production of Closed-Caption TV Programs for the Hearing Impaired Volume Regulation in Parkinsonian Speech Prosody and Emotion 7 On the Amount and Domain of Focal Lengthening in Swedish Differential Lengthening Of Syllabic Constituents In French: The Effect Of Accent Type And Speaking Style Prosodic Analysis of Fillers and Self-Repair in Japanese Speech A Synthesis-Oriented Model Of Phrasal Pitch Movements In Standard Chinese SST Student Day - Poster Session 1 Non-Linear Probability Estimation Method Used in HMM for Modeling Frame Correlation Patterns Of Linguopalatal Contact During Japanese Vowel Devoicing Speech Separation Based on the GMM PDF Estimation Growth Transform of A Sum of Rational Functions and Its Application in Estimating HMM Parameters Two Automatic Approaches for Analyzing Connected Speech Processes in Dutch The Use Of Broad Phonetic Class Models In Speaker Recognition Analysis and Treatment of Esophageal Speech for the Enhancement of its Comprehension High Quality Text-to-Speech System in Spanish for Handicapped People Factors Affecting Speech Retrieval Perception Of Words With Vowel Reduction SST Student Day - Poster Session 2 Automated Captioning of Television Programs: Development and Analysis of a Soundtrack Corpus On the Influence of the Delta Coefficients in a HMM-based Speech Recognition System Speech Recognition Using the Probabilistic Neural Network A Language Modeling Based on a Hierarchical Approach: M_n^v Temporal Variables in Lectures in the Japanese Language Building a Statistical Model of the Vowel Space for Phoneticians Computer-Mediated Input And The Acquisition Of L2 Vowels Speech Analysis By Subspace Methods Of Spectral Line Estimation Pausing in Swedish Spontaneous Speech Prosody And Voice Quality In The Expression Of Emotions Acoustic Analysis of /l/ in Glossectomees