Home
Full List of Titles
1: ICSLP'98 Proceedings
Keynote Speeches
Text-To-Speech Synthesis 1
Spoken Language Models and Dialog 1
Prosody and Emotion 1
Hidden Markov Model Techniques 1
Speaker and Language Recognition 1
Multimodal Spoken Language Processing 1
Isolated Word Recognition
Robust Speech Processing in Adverse Environments 1
Spoken Language Models and Dialog 2
Articulatory Modelling 1
Talking to Infants, Pets and Lovers
Robust Speech Processing in Adverse Environments 2
Spoken Language Models and Dialog 3
Speech Coding 1
Articulatory Modelling 2
Prosody and Emotion 2
Neural Networks, Fuzzy and Evolutionary Methods 1
Utterance Verification and Word Spotting 1 / Speaker Adaptation 1
Text-To-Speech Synthesis 2
Spoken Language Models and Dialog 4
Human Speech Perception 1
Robust Speech Processing in Adverse Environments 3
Speech and Hearing Disorders 1
Prosody and Emotion 3
Spoken Language Understanding Systems 1
Signal Processing and Speech Analysis 1
Spoken Language Generation and Translation 1
Spoken Language Models and Dialog 5
Segmentation, Labelling and Speech Corpora 1
Multimodal Spoken Language Processing 2
Prosody and Emotion 4
Neural Networks, Fuzzy and Evolutionary Methods 2
Large Vocabulary Continuous Speech Recognition 1
Speaker and Language Recognition 2
Signal Processing and Speech Analysis 2
Prosody and Emotion 5
Robust Speech Processing in Adverse Environments 4
Segmentation, Labelling and Speech Corpora 2
Speech Technology Applications and Human-Machine Interface 1
Large Vocabulary Continuous Speech Recognition 2
Text-To-Speech Synthesis 3
Language Acquisition 1
Acoustic Phonetics 1
Speaker Adaptation 2
Speech Coding 2
Hidden Markov Model Techniques 2
Multilingual Perception and Recognition 1
Large Vocabulary Continuous Speech Recognition 3
Articulatory Modelling 3
Language Acquisition 2
Speaker and Language Recognition 3
Text-To-Speech Synthesis 4
Spoken Language Understanding Systems 4
Human Speech Perception 2
Large Vocabulary Continuous Speech Recognition 4
Spoken Language Understanding Systems 2
Signal Processing and Speech Analysis 3
Human Speech Perception 3
Speaker Adaptation 3
Spoken Language Understanding Systems 3
Multimodal Spoken Language Processing 3
Acoustic Phonetics 2
Large Vocabulary Continuous Speech Recognition 5
Speech Coding 3
Language Acquisition 3 / Multilingual Perception and Recognition 2
Segmentation, Labelling and Speech Corpora 3
Text-To-Speech Synthesis 5
Spoken Language Generation and Translation 2
Human Speech Perception 4
Robust Speech Processing in Adverse Environments 5
Text-To-Speech Synthesis 6
Speech Technology Applications and Human-Machine Interface 2
Prosody and Emotion 6
Hidden Markov Model Techniques 3
Speech and Hearing Disorders 2 / Speech Processing for the Speech and Hearing Impaired 1
Human Speech Production
Segmentation, Labelling and Speech Corpora 4
Speaker and Language Recognition 4
Speech Technology Applications and Human-Machine Interface 3
Utterance Verification and Word Spotting 2
Large Vocabulary Continuous Speech Recognition 6
Neural Networks, Fuzzy and Evolutionary Methods 3
Speech Processing for the Speech-Impaired and Hearing-Impaired 2
Prosody and Emotion 7
2: SST Student Day
SST Student Day - Poster Session 1
SST Student Day - Poster Session 2
Author Index
A
B
C
D
E
F
G
H
I
J
K
L
M
N
O
P
Q
R
S
T
U
V
W
X
Y
Z
Multimedia Files
|
- Keynote Speeches
- Cochlear Implants In The Second And Third Millennia
- The Use of Linguistic Hierarchies in Speech Understanding
- Text-To-Speech Synthesis 1
- Unsupervised Training of Phone Duration and Energy Models for Text-to-Speech Synthesis
- Improved Duration Modeling of English Phonemes Using a Root Sinusoidal Transformation
- Efficient Adaptation of TTS Duration Model to New Speakers
- Duration Modeling For HMM-Based Speech Synthesis
- Spoken Language Models and Dialog 1
- An Educational Dialogue System with a User Controllable Dialogue Manager
- End-User Driven Dialogue System Design: The Reward Experience
- The Design of a Multi-Domain Mandarin Chinese Spoken Dialogue System
- An Integrated Dialogue System for the Automation of Call Centre Services
- Prosody and Emotion 1
- Tones of a Tridialectal: Acoustic and Perceptual Data on Ten Linguistic Tonetic Contrasts Between Lao, Nyo and Standard Thai
- Tone Sandhi Between Complex Tones in a Seven-Tone Southern Thai Dialect
- The Acoustic And Perceptual Features Of Tone In The Tibeto-Burman Language Ao Naga
- The Differential Status of Semivowels in the Acoustic Phonetic Realisation of Tone
- Hidden Markov Model Techniques 1
- Nonreciprocal Data Sharing in Estimating HMM Parameters
- Data-Driven Extensions to HMM Statistical Dependencies
- Use of High-Level Linguistic Constraints for Constructing Feature-Based Phonological Model in Speech Recognition
- Speaker and Language Recognition 1
- Sub-Band Based Speaker Verification Using Dynamic Recombination Weights
- Measuring the Dynamic Encoding of Speaker Identity and Dialect in Prosodic Parameters
- German Regional Variants - A Problem for Automatic Speech Recognition?
- Improving Accent Identification Through Knowledge Of English Syllable Structure
- Multi-Dimensional Scaling of Listener Responses to Complex Auditory Stimuli
- Same Talker, Different Language
- The Impact of Regional Variety Upon Specific Word Categories in Spontaneous German
- Speech Pre-Processing Against Intentional Imposture In Speaker Recognition
- A Comparison of Two Unsupervised Approaches to Accent Identification
- The Influence of Accents in Australian English Vowels and their Relation to Articulatory Tract Parameters
- Automatic Language Recognition Using High-Order HMMs
- Speaker Recognition Using Residual Signal Of Linear and Nonlinear Prediction Models
- An Implementation and Evaluation of an On-Line Speaker Verification System for Field Trials
- Speaker Verification on the Polycost Database Using Frequency Filtered Spectral Energies
- A High-Performance Text-Independent Speaker Identification System Based on BCDM
- Representation Of Voice Quality Features Associated With Talker Individuality
- Candidate Selection Based on Significance Testing and its Use in Normalisation and Scoring
- Japanese Forensic Phonetics: Non-Contemporaneous Within-Speaker Variation In Natural And Read-Out Speech
- Statistical Modeling of Pronunciation and Production Variations for Speech Recognition
- Dialect Maps and Dialect Research; Useful Tools for Automatic Speech Recognition?
- Text Independent Speaker Recognition Using Micro-Prosody
- Speaker Verification Using Fundamental Frequency
- On Optimum Normalization Method Used for Speaker Verification
- Recurrent Substrings and Data Fusion for Language Recognition
- Text-Independent Speaker Recognition Using Multiple Information Sources
- Discriminative Training Of GMM Using a Modified EM Algorithm for Speaker Recognition
- Language Identification Incorporating Lexical Information
- A VQ Based Speaker Recognition System Based in Histogram Distances. Text Independent and for Noisy Environments
- Spanish Dialects: Phonetic Transcription
- Acoustic Analysis of Japanese English Prosody: Comparison Between Fukushima Dialect Speakers and Tokyo Dialect Speakers in Declarative Sentences and Yes-No Questions
- A Context-Dependent Approach for Speaker Verification Using Sequential Decision
- Quantitative Influence of Speech Variability Factors for Automatic Speaker Verification in Forensic Tasks
- Creating Hidden Markov Models for Fast Speech
- Speaker Identification using Relaxation Labeling
- A Novel Technique for the Combination of Utterance and Speaker Verification Systems in a Text-Dependent Speaker Verification Task
- A Forensic Phonetic Investigation into Non-contemporaneous Variation in the F-pattern of Similar-sounding Speakers.
- Human vs. Machine Speaker Identification with Telephone Speech
- A Comparison of Fusion Techniques in Mel-cepstral Based Speaker Identification
- On the Influence of Hyperarticulated Speech on Recognition Performance
- Text-Independent Speaker Identification and Verification Using the TIMIT Database
- Incorporating Linguistic Knowledge Into Automatic Dialect Identification of Spanish
- A Novel Text-Independent Speaker Verification Method Using the Global Speaker Model
- Multimodal Spoken Language Processing 1
- A Fast Method of Producing Talking Head Mouth Shapes from Real Speech
- The Efficiency of Multimodal Interaction: a Case Study
- Audio and Audio-visual Perception of Consonants Disturbed by White Noise and 'Cocktail Party'
- Overview of the Maya Spoken Language System
- Automatic Recognition of Spontaneous Speech Dialogues
- Using an Animated Talking Character in a Web-Based City Guide Demonstrator
- Influence of Facial Views on the McGurk Effect in Auditory Noise
- The Intellimedia Workbench - a Generic Environment for Multimodal Systems
- STAMP: A Suite Of Tools For Analyzing Multimodal System Processing
- Cultural Similarities and Differences in the Recognition of Audio-Visual Speech Stimuli
- A Multimodal-Input Multimedia-Output Guidance System: MMGS
- HMM-based Visual Speech Recognition Using Intensity and Location Normalization
- A Hierarchy Probability-Based Visual Features Extraction Method for Speechreading
- Integration Of Talking Heads And Text-To-Speech Synthesizers For Visual TTS
- Isolated Word Recognition
- Improving Accuracy of Telephony-based, Speaker-Independent Speech Recognition
- Rejection in Speech Recognition Systems with Limited Training
- A Four Layer Sharing HMM System For Very Large Vocabulary Isolated Word Recognition
- A Comparative Study Of Hybrid Modelling Techniques For Improved Telephone Speech Recognition
- Smoothing and Tying for Korean Flexible Vocabulary Isolated Word Recognition
- Recent Work on a Preselection Module for a Flexible Large Vocabulary Speech Recognition System in Telephone Environment
- A Study of Noise Robustness for Speaker Independent Speech Recognition Method Using Phoneme Similarity Vector
- Classification of Taiwanese Tones Based on Pitch and Energy Movements
- Phoneme-Based Recognition for the Norwegian SpeechDat(II) Database
- Robust Feature Extraction for Alphabet Recognition
- Recognition of Connected Digit Speech in Japanese Collected over the Telephone Network
- Improving the Speaker-Dependency of Subword-Unit-Based Isolated Word Recognition
- Speaker Independent Speech Recognition Method using Constrained Time Alignment near Phoneme Discriminative Frame
- A Nonstationary Autoregressive HMM With Gain Adaptation For Speech Recognition
- A Large-Vocabulary Taiwanese (MIN-NAN) Multi-Syllabic Word Recognition System Based Upon Right-Context-Dependent Phones with State Clustering by Acoustic Decision Tree
- Speech Recognition Based on the Distance Calculation Between Intermediate Phonetic Code Sequences in Symbolic Domain
- High Accuracy Chinese Speech Recognition Approach with Chinese Input Technology for Telecommunication Use
- Robust Speech Processing in Adverse Environments 1
- Robust Speech Recognition using HMM's with Toeplitz State Covariance matrices
- Modeling of Output Probability Distribution to Improve Small Vocabulary Speech Recognition in Adverse Environments
- Robust and Compact Multilingual Word Recognizers Using Features Extracted from a Phoneme Similarity Front-End
- An Effect of Adaptive Beamforming on Hands-Free Speech Recognition Based on 3-D Viterbi Search
- Coherence-based Subband Decomposition for Robust Speech and Speaker Recognition in Noisy and Reverberant Rooms
- A Minimax Search Algorithm for CDHMM based Robust Continuous Speech Recognition
- Spoken Language Models and Dialog 2
- An Event Driven Model for Dialogue Systems
- Automatic Classification of Dialogue Contexts for Dialogue Predictions
- Automatic Identification of Command Boundaries in a Conversational Natural Language User Interface
- The Predictive Power of Game Structure in Dialogue Act Recognition: Experimental Results Using Maximum Entropy Estimation
- A Schema Based Approach To Dialog Control
- Expanding A Time-Sensitive Conversational Architecture For Turn-Taking To Handle Content-Driven Interruption
- Articulatory Modelling 1
- A Three-Dimensional Linear Articulatory Model Based on MRI Data
- On Loops and Articulatory Biomechanics
- Magnetic Resonance Measurements of the Velum Port Opening
- Cantilever-Type Force-Sensor-Mounted Palatal Plate for Measuring Palatolingual Contact Stress and Pattern During Speech Phonation
- Determination of the Vocal Tract Spectrum from the Articulatory Movements Based on the Search of an Articulatory-Acoustic Database
- An MRI Study On The Relationship Between Oral Cavity Shape And Larynx Position
- Talking to Infants, Pets and Lovers
- Acoustic And Affective Qualities Of IDS In English
- Acoustic Qualities Of IDS And ADS In Thai
- Pragmatic Characteristics of Infant Directed Speech
- Are You My Little Pussy-Cat? Acoustic, Phonetic And Affective Qualities Of Infant- And Pet-Directed Speech
- Special Speech Registers: Talking To Australian And Thai Infants, And To Pets
- Robust Speech Processing in Adverse Environments 2
- Performance Improvements Through Combining Phone- And Syllable-Scale Information In Automatic Speech Recognition
- Predictive Adaptation and Compensation for Robust Speech Recognition
- Influence of the Speaking Style and the Noise Spectral Tilt on the Lombard Reflex and Automatic Speech Recognition
- Data-driven PMC and Bayesian Learning Integration for Fast Model Adaptation in Noisy Conditions
- Improving The Noise And Spectral Robustness Of An Isolated-Word Recognizer Using An Auditory-Model Front End
- A Model for Speech Reverberation and Intelligibility Restoring Filters
- Spoken Language Models and Dialog 3
- On Different Functions of Repetitive Utterances
- Prosody-Based Detection of the Context of Backchannel Responses
- Robust Interpretation for Spoken Dialogue Systems
- System-User Interaction and Response Strategy in Spoken Dialogue System
- Organizing Self-Motivated Dialogue with Autonomous Creatures
- Fly with the EAGLES: Evaluation of the "ACCeSS" Spoken Language Dialogue System
- Speech Coding 1
- A Very Low Bit Rate Speech Coder Using HMM With Speaker Adaptation
- ITU-T G.729 Extension At 6.4 kbps
- Adaptive Transformation for Segmented Parametric Speech Coding
- Speech Enhancement Using STC-Based Bandwidth Extension
- Performance And Optimization Of The SEEVOC Algorithm
- Articulatory Modelling 2
- Acoustic-Articulatory Evaluation of the Upper Vowel-Formant Region and its Presumed Speaker-Specific Potency
- Control of Larynx Height in Vowel Production
- Analyzing the Effect of Secondary Excitations of the Vocal Tract on Vocal Intensity in Different Loudness Conditions
- An Analysis of Modal Coupling Effects During the Glottal Cycle: Formant Synthesizers from Time-Domain Finite-Difference Simulations
- Laryngoscopic Analysis of Pharyngeal Articulations and Larynx-Height Voice Quality Settings
- Effects of Shapes of Radiational Aperture on Radiation Characteristics
- Prosody and Emotion 2
- De-accentuation: Linguistic Environments and Prosodic Realizations
- Towards an Automatic Classification of Emotions in Speech
- Can We Hear Smile?
- The Automatic Marking of Prominence in Spontaneous Speech Using Duration and Part of Speech Information
- On A Pitch Alteration Technique in Excited Cepstral Spectrum for High Quality TTS
- Dovetailing of Acoustics and Prosody in Spontaneous Speech Recognition
- A Computational Memory and Processing Model for Prosody
- Convergence Of Fundamental Frequencies In Conversation: If It Happens, Does It Matter?
- Analysis and Interpretation of Fundamental Frequency Contours of British English in Terms of a Command-Response Model
- Common Patterns In Word Level Prosody
- Prosodic Structure in Japanese Spontaneous Speech
- An Acoustic-Phonetic Description Of Word Tone In Kagoshima Japanese
- Representing Prosodic Words Using Statistical Models of Moraic Transition of Fundamental Frequency Contours of Japanese
- Disambiguation of Korean Utterances Using Automatic Intonation Recognition
- Multi-Level Rhythm Control for Speech Synthesis Using Hybrid Data Driven and Rule-Based Approaches
- EGG Model of Ditoneme in Mandarin
- Temporal Organization of Speech for Normal and Fast Rates
- A Syllable-based Generalization of Japanese Accentuation
- Non-Adjacent Segmental Effects in Tonal Realization of Accentual Phrase in Seoul Korean
- Improvement on Connected Numbers Recognition Using Prosodic Information
- Phonetic Investigation of Boundary Pitch Movements in Japanese
- Phonetic and Phonological Characteristics of Paralinguistic Information in Spoken Japanese
- ToBI Accent Type Recognition
- The Influence of Syllable Structure on the Timing of Intonational Events in German
- New Prosodic Control Rules For Expressive Synthetic Speech
- The Use of F0 Reliability Function for Prosodic Command Analysis on F0 Contour Generation Model
- Analysis of Effects of Lexical Accent, Syntax, and Global Speech Rate upon the Local Speech Rate
- On the Effects of Speech Rate upon Parameters of the Command-Response Model for the Fundamental Frequency Contours of Speech
- The Maximum-Based Description of F0 Contours and its Application to English
- Perceived Prominence and Acoustic Parameters in American English
- Generating Emotional Speech with a Concatenative Synthesizer
- A Perceptive Measure of Pure Prosody Linguistic Functions with Reiterant Sentences
- Prosodic Parameters in Emotional Speech
- Automatic Detection of Prominence (as Defined by Listeners' Judgements) in Read Aloud Dutch Sentences
- A Schema for Illocutionary Act Identification With Prosodic Feature
- An Algorithm for Choosing Japanese Acknowledgments using Prosodic Cues and Context
- A Study of Tones and Tempo in Continuous Mandarin Digit Strings and their Application in Telephone Quality Speech Recognition
- Simulated Emotions: an Acoustic Study of Voice and Perturbation Measures
- A Robust Tone Recognition Method of Chinese Based on Sub-syllabic F0 Contours
- The Microprosodics of Tone Sandhi in Shanghai Disyllabic Compounds
- Jitter And Shimmer Differences Between Pathological Voices Of School Children
- Neural Networks, Fuzzy and Evolutionary Methods 1
- A Comparison of Thai Speech Recognition Systems Using Hidden Markov Model, Neural Network, and Fuzzy-Neural Network
- Phoneme Recognition with Statistical Modeling of the Prediction Error of Neural Networks
- Neural Network Based Pronunciation Modeling With Applications To Speech Recognition
- A Comparative Study of OCON and MLP Architectures for Phoneme Recognition
- Evaluation and Integration of Neural-Network Training Techniques for Continuous Digit Recognition
- Hierarchical Neural Networks (HNN) for Chinese Continuous Speech Recognition
- Neural Network Motivation for Segmental Distribution
- Combining Connectionist Multi-Band and Full-Band Probability Streams for Speech Recognition Of Natural Numbers
- Initial Speech Recognition Results Using The Multinet Architecture
- Selection of the Optimal Structure of the Continuous HMM Using the Genetic Algorithm
- A Proposed Decision Rule For Speaker Recognition Based On Fuzzy C-Means Clustering
- Fuzzy Gaussian Mixture Models For Speaker Recognition
- A New Strategy of Fuzzy-Neural Network for Thai Numeral Speech Recognition
- Thai Polysyllabic Word Recognition Using Fuzzy-Neural Network
- Utterance Verification and Word Spotting 1 / Speaker Adaptation 1
- Word Verification Using Confidence Measures in Speech Recognition
- Improving Posterior Based Confidence Measures in Hybrid HMM/ANN Speech Recognition Systems
- Two-Pass Utterance Verification Algorithm for Long Natural Numbers Recognition
- A*-Admissible Key-Phrase Spotting With Sub-Syllable Level Utterance Verification
- Speaker-Independent Upfront Dialect Adaptation in a Large Vocabulary Continuous Speech Recognizer
- Word-Based Acoustic Confidence Measures for Large-Vocabulary Speech Recognition
- Improved Utterance Rejection Using Length Dependent Thresholds
- Bayesian Constrained Frequency Warping HMMS for Speaker Normalisation
- An Evaluation of Keyword Spotting Performance Utilizing False Alarm Rejection Based on Prosodic Information
- Predictive Speaker Adaptation and Its Prior Training
- Powerful Syllabic Fillers for General-Task Keyword-Spotting and Unlimited-Vocabulary Continuous-Speech Recognition
- Confidence Scoring for Speech Understanding Systems
- Phonological Rules for Enhancing Acoustic Enrollment of Unknown Words
- Recognition-Based Word Counting for Reliable Barge-in and Early Endpoint Detection in Continuous Speech Recognition
- Linear Discriminant - A New Criterion For Speaker Normalization
- Confidence Measures Derived from an Acceptor HMM
- Telephone Speech Multi-Keyword Spotting Using Fuzzy Search Algorithm and Prosodic Verification
- Topic Recognition for News Speech Based on Keyword Spotting
- Text-To-Speech Synthesis 2
- Prosody Prediction for Speech Synthesis using Transformational Rule-based Learning
- Representing the Environments for Phonological Processes in an Accent-Independent Lexicon for Synthesis of English
- Efficient Lexical Retrieval for English Text-to-Speech Synthesis
- Spoken Language Models and Dialog 4
- SQEL: A Multilingual and Multifunctional Dialogue System
- Semi-Automated Incremental Prototyping of Spoken Dialog Systems
- Beyond Structured Dialogues: Factoring out Grounding
- Human Speech Perception 1
- Heads And Tails in Word Perception: Evidence For `Early-to-Late' Processing in Listening and Reading
- Evidence for Early Effects of Sentence Context on Word Segmentation
- Assimilation and Anticipation in Word Perception
- Lexical Activation by Assimilated and Reduced Tokens
- Robust Speech Processing in Adverse Environments 3
- Linear and Nonlinear Speech Feature Analysis for Stress Classification
- Speech Feature Modeling for Robust Stressed Speech Recognition
- Combining Articulatory and Acoustic Information for Speech Recognition in Noisy and Reverberant Environments
- Improving Speaker Identification Performance in Reverberant Conditions using Lip Information
- Speech and Hearing Disorders 1
- Adults With a Severe-to-Profound Hearing Impairment. Investigating the Effects of Linguistic Context on Speech Perception
- Speech Perception in Dyslexia: Measurements From Birth Onwards
- An Acoustic Analysis of Vowel Production Across Tasks in a Case of Non-fluent Progressive Aphasia
- Speech Technology in Clinical Environments
- Prosody and Emotion 3
- What Spreads, And How? Tonal Rightward Spreading on Shanghai Disyllabic Compounds
- Tonal Complexity as a Dialectal Feature: 25 Different Citation Tones from Four Zhejiang Wu Dialects
- Emotional Speech Synthesis: From Speech Database to TTS
- Some Acoustic Characteristics Of Emotion
- Spoken Language Understanding Systems 1
- GALAXY-II: A Reference Architecture for Conversational System Development
- Improvements in Speech Understanding Accuracy Through the Integration of Hierarchical Linguistic, Prosodic, And Phonological Constraints in the Jupiter Domain
- Towards Robust Methods for Spoken Document Retrieval
- Signal Processing and Speech Analysis 1
- Maximum a Posteriori Pitch Tracking
- Vowel Separation Using the Reassigned Amplitude-Modulation Spectrum
- Feature Decorrelation Methods in Speech Recognition. A Comparative Study
- Multi-Resolution for Speech Analysis
- Dynamic features in Children's Vowels
- Effectiveness of Phase-Corrected Rasta for Continuous Speech Recognition
- Techniques For Capturing Temporal Variations In Speech Signals With Fixed-Rate Processing
- Automatic Detection of Landmark for Nasal Consonants from Speech Waveform
- Plug and Play Software for Designing High-Level Speech Processing Systems
- Creating Speaker Independent HMM Models for Restricted Database Using STRAIGHT-TEMPO Morphing
- Restoration Of Hyperbaric Speech By Correction Of The Formants And The Pitch
- Voice Conversion Based on Parameter Transformation
- Noise Robust Two-Stream Auditory Feature Extraction Method for Speech Recognition
- Heterogeneous Measurements and Multiple Classifiers for Speech Recognition
- Joint Recognition and Segmentation Using Phonetically Derived Features and a Hybrid Phoneme Model
- TRAPS - Classifiers Of Temporal Patterns
- Robust Measurement of Fundamental Frequency and Degree of Voicing
- Micropower Electro-Magnetic Sensors for Speech Characterization, Recognition, Verification, and other applications
- Robust Entropy-based Endpoint Detection for Speech Recognition in Noisy Environments
- Statistical Integration of Temporal Filter Banks for Robust Speech Recognition Using Linear Discriminant Analysis (LDA)
- Feature-Based Approach to Speech Recognition
- Periodicity Emphasis of Voice Wave using Nonlinear IIR Digital Filters and Its Applications
- Speech Recognition Via Phonetically Featured Syllables
- Do Phonetic Features Help to Improve Consonant Identification in ASR?
- Perceptual and Acoustic Properties of Phonemes in Continuous Speech for Different Speaking Rate
- On Robust Sequential Estimator Based on T-Distribution with Forgetting Factor for Speech Analysis
- Discriminant Wavelet Basis Construction for Speech Recognition
- An Efficient Mel-LPC Analysis Method for Speech Recognition
- Discriminative Weighting of Multi-Resolution Sub-Band Cepstral Features for Speech Recognition
- Separation of Singing and Piano Sounds
- Modeling of Variations in Cepstral Coefficients Caused by F0 Changes and its Application to Speech Processing
- A Detection Framework for Locating Phonetic Events
- On Frequency Averaging For Spectral Analysis In Speech Recognition
- Wavelet Transform Domain Blind Equalization and Its Application to Speech Analysis
- A Novel Method of Formant Analysis and Glottal Inverse Filtering
- Vector Quantizer Acceleration for an Automatic Speech Recognition Application
- Local Speech Rate as a Combination of Syllable and Phone Rate
- Recovering Gestures From Speech Signals: A Preliminary Study for Nasal Vowels
- Extended Linear Discriminant Analysis (ELDA) for Speech Recognition
- Speech, Silence, Music and Noise Classification of TV Broadcast Material
- The Relation Between Vocal Tract Shape And Formant Frequencies Can Be Described By Means Of A System Of Coupled Differential Equations
- Improving Speech Recognizer by Broader Acoustic-Phonetic Group Classification
- Separation of Speech Source and Filter by Time-Domain Deconvolution
- On the Application of the AM-FM Model for the Recovery of Missing Frequency Bands of Telephone Speech
- Estimation of Voice Source and Vocal Tract Parameters Using Combined Subspace-Based and Amplitude Spectrum-Based Algorithm
- The Distance Measure For Line Spectrum Pairs Applied to Speech Recognition
- Spoken Language Generation and Translation 1
- The Modeling and Realization of Natural Speech Generation System
- "Ko Tok Ples Ensin bilong Tok Pisin" or The TP-CLE: A First Report From a Pilot Speech-to-Speech Translation Project From Swedish to Tok Pisin
- An Iterative, DP-Based Search Algorithm For Statistical Machine Translation
- Information Extraction and Text Generation of News Reports for a Swedish-English Bilingual Spoken Dialogue System
- Utterance Generation for Transaction Dialogues
- Example-Based Error Recovery Method For Speech Translation: Repairing Sub-Trees According to the Semantic Distance
- Context Sensitive Generation of Descriptions
- An Interlingua Based on Domain Actions for Machine Translation of Task-Oriented Dialogues
- Generating Pitch Accents in a Concept-to-Speech System Using a Knowledge Base
- Making the Most of Multiplicity: a Multi-Parser Multi-Strategy Architecture for the Robust Processing of Spoken Language
- Natural-Sounding Speech Synthesis Using Variable-Length Units
- Spoken Language Models and Dialog 5
- A Robust Dialogue Model for Spoken Dialogue Processing
- The REWARD Service Creation Environment. An Overview
- An Analysis of the Timing of Turn-Taking in a Corpus of Goal-Oriented Dialogue
- The Provision of Corrective Feedback in a Spoken Dialogue CALL System
- Evaluation of Dialog Strategies for a Tourist Information Retrieval System
- Designing a Multimodal Dialogue System for Information Retrieval
- The Research Project of Man-Computer Dialogue System in Chinese
- Interfaces for Speech Recognition Systems: the Impact of Vocabulary Constraints and Syntax on Performance
- Pacing Spoken Directions to Suit the Listener
- A Spoken Dialogue System Utilizing Spatial Information
- From Novice to Expert: The Effect of Tutorials on User Expertise with Spoken Dialogue Systems
- Emergent Computational Dialogue Management Architecture For Task-Oriented Spoken Dialogue Systems
- An Analysis of Dialogues with Our Dialogue System Through a WWW page
- Modelling Spoken Dialogues With State Transition Diagrams: Experiences With The CSLU Toolkit
- Situated Dialogue Coordination For Spoken Dialogue Systems
- Robust Spoken Dialogue Systems for Consumer Products: a Concrete Application
- A German Dialogue System for Scheduling Dates and Meetings by Naturally Spoken Continuous Speech
- Spoken Dialogue System Using Corpus-Based Hidden Markov Model
- A Realistic Wizard of Oz Simulation of a Multimodal Spoken Language System
- A Syllable-Based Chinese Spoken Dialogue System for Telephone Directory Services Primarily Trained with A Corpus
- How Disagreement Expressions are Used in Cooperative Tasks
- Segmentation, Labelling and Speech Corpora 1
- Acoustic Indicators Of Topic Segmentation
- IViE - A Comparative Transcription system for Intonational Variation in English
- Automatic Segmental and Prosodic Labeling of Mandarin Speech Database
- Automatic Labelling of German Prosody
- Multimodal Spoken Language Processing 2
- Speech Driven 3-D Face Point Trajectory Synthesis Algorithm
- Speech-to-Lip Movement Synthesis Based on the EM Algorithm Using Audio-Visual HMMs
- Learning Words from Natural Audio-Visual Input
- Using the Multi-Stream Approach for Continuous Audio-Visual Speech Recognition: Experiments on the M2VTS Database
- Prosody and Emotion 4
- Intonative Structure as a Determinant of Word Order Variation in Dutch Verbal Endgroups
- Experiments on the Meaning of Two Pitch Accent Types: The 'Pointed Hat' Versus the Accent-lending Fall in Dutch
- Phonetic and Phonological Markers of Contrastive Focus in Korean
- Reconciling Two Competing Views on Contrastiveness
- Neural Networks, Fuzzy and Evolutionary Methods 2
- Modular Neural Networks for Low-Complex Phoneme Recognition
- Global Optimisation of Neural Network Models Via Sequential Sampling-Importance Resampling
- Efficient Computation of MMI Neural Networks for Large Vocabulary Speech Recognition Systems
- Modular Connectionist Systems for Identifying Complex Arabic Phonetic Features
- Large Vocabulary Continuous Speech Recognition 1
- Real-Time Recognition of Broadcast News
- Automatic Recognition of Korean Broadcast News Speech
- Telephone-Based Conversational Speech Recognition in the JUPITER Domain
- Japanese Large-Vocabulary Continuous Speech Recognition System Based on Microsoft Whisper
- Partitioning And Transcription Of Broadcast News Data
- Speaker and Language Recognition 2
- Speaker Detection in Broadcast Speech Databases
- Multilateral Techniques for Speaker Recognition
- Real Time Speaker Indexing Based on Subspace Method - Application to TV News Articles and Debate
- SHEEP, GOATS, LAMBS and WOLVES: A Statistical Analysis of Speaker Performance in the NIST 1998 Speaker Recognition Evaluation
- Progress in Speaker Recognition at Dragon Systems
- A Comparative Study Of Speaker Verification Systems Using The Polycost Database
- Signal Processing and Speech Analysis 2
- Improving Pitch Estimation with Short Duration Speech Samples
- An Instantaneous-Frequency-Based Pitch Extraction Method for High-Quality Speech Transformation: Revised TEMPO in the STRAIGHT-Suite
- Speaker-Independent Speech Recognition Using Micro Segment Spectrum Integration
- On Robust Speech Analysis Based On Time-Varying Complex AR Model
- Spectral Basis Functions from Discriminant Analysis
- Prosody and Emotion 5
- The Tilt Intonation Model
- Analysis of Occurrence of Pauses and Their Durations in Japanese Text Reading
- A Statistical Study of Pitch Target Points in Five Languages
- Fully Automatic Prosody Generator For Text-to-Speech
- Automatic Prosodic Labeling of 6 Languages
- Automatic Utterance Type Detection Using Suprasegmental Features
- Robust Speech Processing in Adverse Environments 4
- Spectral Sequence Compensation Based on Continuity of Spectral Sequence
- Robust Features for Speech Recognition Systems
- Interfacing of CASA and Partial Recognition Based on a Multistream Technique
- AN RNN-Based Compensation Method for Mandarin Telephone Speech Recognition
- Robust Speech Recognition Using Discriminative Stream Weighting and Parameter Interpolation
- Acoustic Backing-Off in the Local Distance Computation for Robust Automatic Speech Recognition
- Noise Model Selection For Robust Speech Recognition
- A Novel Iterative Signal Enhancement Algorithm for Noise Reduction in Speech
- Missing Data Reconstruction for Robust Automatic Speech Recognition in the Framework of Hybrid HMM/ANN Systems
- Recognition from GSM Digital Speech
- Conversational Speech Systems For On-Board Car Navigation And Assistance
- A Signal Processing System for Having the Sound "Pop-Out" in Noise Thanks to the Image of the Speaker's Lips: New Advances Using Multi-Layer Perceptrons
- Robust Speech Activity Detection in the Presence of Noise
- Robust Automatic Speech Recognition by the Application of a Temporal-Correlation-Based Recurrent Multilayer Neural Network to the Mel-Based Cepstral Coefficients
- Speech Recognition from GSM Codec Parameters
- Improved Parallel Model Combination Based on Better Domain Transformation for Speech Recognition Under Noisy Environments
- Robust Speech/Non-Speech Detection in Adverse Conditions Based on Noise and Speech Statistics
- Speech Recognition In Car Noise Environments Using Multiple Models According To Noise Masking Levels
- Spectral Noise Subtraction With Recursive Gain Curves
- A Novel Robust Speech Recognition Algorithm Based on Multi-Models and Integrated Decision Method
- On the Interaction Between Time and Frequency Filtering of Speech Parameters for Robust Speech Recognition
- Inference Of Missing Spectrographic Features For Robust Speech Recognition
- SNR-Dependent Flooring and Noise Overestimation for Joint Application of Spectral Subtraction and Model Combination
- Improved Robust Speech Recognition Considering Signal Correlation Approximated by Taylor Series
- Speech Recognition in Noisy Environment Using Weighted Projection-Based Likelihood Measure
- Evaluation of Model Adaptation by HMM Decomposition on Telephone Speech Recognition
- Comparative Experiments to Evaluate a Voiced-Unvoiced-Based Pre-Processing Approach to Robust Automatic Speech Recognition in Low-SNR Environments
- Signal Extraction From Noisy Signal Based on Auditory Scene Analysis
- Frequency Domain Binaural Model as the Front End of Speech Recognition System
- A Study on the Recognition of Low Bit-Rate Encoded Speech
- Weighted Parallel Model Combination for Noisy Speech Recognition
- Favourable and Unfavourable Short Duration Segments of Speech in Noise
- Segmentation, Labelling and Speech Corpora 2
- An Efficient Labeling Tool for the QuickSig Speech Database
- Collection and Detailed Transcription of a Speech Database for Development of Language Learning Technologies
- Resegmentation of SWITCHBOARD
- Automatic Generation of Visual Scenarios for Spoken Corpora Acquisition
- Automatic Detection of Semantic Boundaries Based on Acoustic and Lexical Knowledge
- A New Fast Algorithm for Automatic Segmentation of Continuous Speech
- Acoustic Nature and Perceptual Testing of Corpora of Emotional Speech
- Korean Prosodic Break Index Labelling by a New Mixed Method of LDA and VQ
- MOOSE: Management Of Otago Speech Environment
- Phonetic Alignment: Speech Synthesis Based vs. Hybrid HMM/ANN
- Customisation And Quality Assessment Of Spoken Language Description
- A Silence/Noise/Music/Speech Splitting Algorithm
- Audio-Visual Segmentation for Content-Based Retrieval
- Same News is Good News: Automatically Collecting Reoccurring Radio News Stories
- An Annotation System for Melodic Aspects of German Spontaneous Speech
- Additional Use of Phoneme Duration Hypotheses in Automatic Speech Segmentation
- Towards a Minimal Standard for Dialogue Transcripts: a New SGML Architecture for the HCRC Map Task Corpus
- Speech Technology Applications and Human-Machine Interface 1
- Steps Toward The Integration Of Speaker Recognition In Real-World Telecom Applications
- A Bimodal Korean Address Entry/Retrieval System
- Usability Evaluation of IVR Systems With DTMF and ASR
- SALSA Version 1.0: A Speech-Based Web Browser for Hong Kong English
- A Language for Creating Speech Applications
- The Use of Automatic Speech Recognition to Reduce the Interference Between Concurrent Tasks of Driving and Phoning
- Interactive Listening to Structured Speech Content on the Internet
- MSF Format For The Representation Of Speech Synchronized Moving Image
- Effects of Using Speech in Timetable Information Systems for WWW
- The Interactive Systems Labs View4You Video Indexing System
- SEMOLE: A Robust Framework For Gathering Information From The World Wide Web
- Enhancing a WIMP Based Interface With Speech, Gaze Tracking and Agents
- Now You Hear It, Now You Don't: Empirical Studies of Audio Browsing Behavior Behavior
- A Voice Verifier for Face/Voice Based Person Verification System
- On The Use Of Automatic Speech Recognition For TV Captioning
- An Undergraduate Course on Speech Recognition Based on the CSLU Toolkit
- Real Time Voice Alteration Based on Linear Prediction
- Evaluation and Implementation of a Voice-Activated Dialing System with Utterance Verification
- Towards a Mandarin Voice Memo System
- Large Vocabulary Continuous Speech Recognition 2
- Grammatical Word Graph Re-Generation for Spontaneous Speech Recognition
- Compression Algorithm Of Trigram Language Models Based On Maximum Likelihood Estimation
- Morphological Modeling of Word Classes for Language Models
- A Comparative Study Between Polyclass and Multiclass Language Models
- Log-Linear Interpolation Of Language Models
- The Applicability of Adaptive Language Modelling for the Broadcast News Task
- Text-To-Speech Synthesis 3
- The IBM Trainable Speech Synthesis System
- ProSynth: An Integrated Prosodic Approach to Device-Independent, Natural-Sounding Speech Synthesis
- Total Quality Evaluation of Speech Synthesis Systems
- Comparative Evaluation of Synthetic Prosody with the PURR Method
- SABLE: A Standard For TTS Markup
- Prosodic vs. Segmental Contributions to Naturalness in a Diphone Synthesizer
- Language Acquisition 1
- Non-Native Productions Of Japanese Single Stops That Are Too Long For One Mora Unit
- The Process Of Generation And Development Of Second Language Japanese Accentuation
- Perceptual Properties of Russians with Japanese Fricatives
- Assessment of Dutch Pronunciation by Means of Automatic Speech Recognition Technology
- Phonetic-Level Mispronunciation Detection in Non-Native Swedish Speech
- Computer-Based Second Language Production Training By Using Spectrographic Representation And HMM-Based Speech Recognition Scores
- Acoustic Phonetics 1
- Assimilation of Place in Japanese and Dutch
- Prosodic Constraint on V-to-V Coarticulation in Japanese
- Postvocalic /r/-deletion in Standard Dutch: How Experimental Phonology Can Profit From ASR Technology
- More Evidence For The Perceptual Basis Of Sound Change? Suprasegmental Effects In The Development Of Distinctive Nasalization
- Speech Production Of Vowel Sequences Using A Physiological Articulatory Model
- Speaker Adaptation 2
- Eigenvoices for Speaker Adaptation
- Speaker Clustering Using Direct Maximisation of the MLLR-Adapted Likelihood
- Incremental On-Line Speaker Adaptation in Adverse Conditions
- Cluster Adaptive Training for Speech Recognition
- Speech Coding 2
- Towards a Unified Model for Low Bit-Rate Speech Coding Using a Recognition-Synthesis Approach
- On the Significance of Temporal Masking in Speech Coding
- Waveform Interpolation Coding With Pitch-Spaced Subbands
- An Improved Decomposition Method For WI Using IIR Wavelet Filter Banks
- Hidden Markov Model Techniques 2
- Real-Time Probabilistic Segmentation for Segment-Based Speech Recognition
- Toward Markov Random Field Modeling of Speech
- Hidden Markov Models for Trajectory Modeling
- Multilingual Perception and Recognition 1
- Bilingual and Dialectal Adaptation and Retraining
- Language Independent and Language Adaptive Large Vocabulary Speech Recognition
- A Method for Measuring the Intelligibility and Nonnativeness of Phone Quality in Foreign Language Pronunciation Training
- Large Vocabulary Continuous Speech Recognition 3
- The BBN Single-Phonetic-Tree Fast-Match Algorithm
- An Efficient Two-pass Search Algorithm Using Word Trellis Index
- Nozomi -- a Fast, Memory-Efficient Stack Decoder For LVCSR
- Reducing the OOV Rate in Broadcast News Speech Recognition
- Using Automatically-Derived Acoustic Sub-word Units in Large Vocabulary Speech Recognition
- Fabricating Conversational Speech Data with Acoustic Models: a Program to Examine Model-Data Mismatch
- Articulatory Modelling 3
- An Electropalatographic, Kinematic, and Acoustic Analysis of Supralaryngeal Correlates of Word-Level Prominence Contrasts in English
- Consistencies and Inconsistencies Between EPG and Locus Equation Data on Coarticulation
- Synergy Between Jaw And Lips/Tongue Movements : Consequences In Articulatory Modelling
- Modelling Tongue Configuration in German Vowel Production
- Optopalatograph: Real-time Feedback of Tongue Movement in 3D
- Effects of Contrastive Focal Accent on Linguopalatal Articulation and Coarticulation in the French [kskl] Cluster
- Language Acquisition 2
- Spoken Word Identification by Native and Nonnative Speakers of English: Effects of Training, Modality, Context and Phonetic Environment
- The Effect Of Background Knowledge On First And Second Language Comprehension Difficulty
- Comparison of Cross-language Coarticulation: English, Japanese and Japanese-accented English
- Plasticity Of Non-Native Phonetic Perception And Production: A Training Study
- The Relation Between Perceptual and Production Categories in Acquisition
- The Development of Perceptual Cue-Weighting in Children Aged 6 to 12
- Speaker and Language Recognition 3
- Robust Speaker Verification Insensitive to Session-dependent Utterance Variation and Handset-dependent Distortion
- A Comparative Evaluation of Variance Flooring Techniques in HMM-based Speaker Verification
- Text-Independent Speaker Verification Using Automatically Labelled Acoustic Segments
- A Fast Decoding Algorithm Based on Sequential Detection of the Changes in Distribution
- Speaker Verification With Ensemble Classifiers Based On Linear Speech Transforms
- Speaker Recognition Based On Discriminative Projection Models
- Text-To-Speech Synthesis 4
- A Mixed-Excitation Frequency Domain Model for Time-Scale Pitch-Scale Modification of Speech
- Analytic Generation of Synthesis Units by Closed Loop Training for Totally Speaker Driven Text to Speech System (TOS Drive TTS)
- Modeling the Microprosody of Pitch and Loudness for Speech Synthesis with Neural Networks
- Spectral Smoothing for Concatenative Speech Synthesis
- MIMIC : A Voice-Adaptive Phonetic-Tree Speech Synthesiser
- Automatic Generation Of Korean Pronunciation Variants By Multistage Applications Of Phonological Rules
- Techniques for Accurate Automatic Annotation of Speech Waveforms
- Optimized Stopping Criteria for Tree-Based Unit Selection in Concatenative Synthesis
- Automatic Transcription of Intonation Using an Identified Prosodic Alphabet
- Frequency Analysis of Phonetic Units for Concatenative Synthesis in Catalan
- Investigating the Syntactic Characteristics of English Tone Units
- The UPC Text-to-Speech System for Spanish and Catalan
- The New Version of the ROMVOX Text-to-Speech Synthesis System Based on a Hybrid Time Domain-LPC Synthesis Technique
- An F0 Contour Control Model for Totally Speaker Driven Text to Speech System
- On the Relationship of Speech Rates with Prosodic Units in Dialogue Speech
- On the Reduction of Concatenation Artefacts in Diphone Synthesis
- Error Analysis and Confidence Measure of Chinese Word Segmentation
- Energy Contour Generation for a Sentence Using a Neural Network Learning Method
- A Computational Algorithm For F0 Contour Generation In Korean Developed With Prosodically Labeled Databases Using K-ToBI System
- Rapid-Deployment Text-to-Speech in the DIPLOMAT System
- Formant Diphone Parameter Extraction Utilising a Labelled Single-Speaker Database
- A New Synthetic Speech/Sound Control Language
- A Study on the Natural-Sounding Japanese Phonetic Word Synthesis by Using the VCV-Balanced Word Database That Consists of the Words Uttered Forcibly in Two Types of Pitch Accent
- Letter to Sound Rules for Accented Lexicon Compression
- A Name Announcement Algorithm with Memory Size and Computational Power Constraints
- How a French TTS System can Describe Loanwords
- Improvements in Slovene Text-to-Speech Synthesis
- Automatic Rule Generation for Linguistic Features Analysis Using Inductive Learning Technique: Linguistic Features Analysis in TOS Drive TTS System
- Segmental Duration Control Based on an Articulatory Model
- Text Analysis for the Bell Labs French Text-to-Speech System
- Modeling Vowel Duration for Japanese Text-to-Speech Synthesis
- Towards A Chinese Text-To-Speech System With Higher Naturalness
- Spoken Language Understanding Systems 4
- Grammar Fragment Acquisition using Syntactic and Semantic Clustering
- Non-Expert Access to Unification Based Speech Understanding
- Natural Language Call Routing: A Robust, Self-Organizing Approach
- Automatic Grammar Induction from Semantic Parsing
- BTH: An Efficient Parsing Algorithm for Word-Spotting
- Syntax Coordination: Interaction of Discourse and Extrapositions
- Hierarchical Tag-Graph Search for Spontaneous Speech Understanding in Spoken Dialog Systems
- Extraction of the Dialog Act and the Topic From Utterances in a Spoken Dialog System
- Fast Computation of Maximum Entropy / Minimum Divergence Feature Gain
- Stochastic Language Models for Speech Recognition and Understanding
- Linguistically Engineered Tools for Speech Recognition Error Analysis
- Estimating Entropy of a Language from Optimal Word Insertion Penalty
- A Linguistic Analysis of Repair Signals in Co-operative Spoken Dialogues
- A Hierarchical Language Model for CSR
- Spoken Language Understanding Within Dialogs Using a Graphical Model of Task Structure
- Keyword Extraction of Radio News using Domain Identification based on Categories of an Encyclopedia
- Human Speech Perception 2
- Fundamental Frequency Fluctuation in Continuous Vowel Utterance and its Perception
- Estimation of Mental Lexicon Size with Word Familiarity Database
- Vowel Quality in Spontaneous Speech: What Makes a Good Vowel?
- Cooperation and Competition of Burst and Formant Transitions for the Perception and Identification of French Stops
- The Effect of Modifying Formant Amplitudes on the Perception of French Vowels Generated by Copy Synthesis
- Segmental and Tonal Processing in Cantonese
- Phonological Similarity Effects in Cantonese Spoken-Word Processing
- On The Learnability Of The Voicing Contrast For Initial Stops
- Acoustic and Perceptual Characteristic of Italian Stop Consonants
- Acoustic Cues for the Auditory Identification of the Spanish Fricative /f/
- Recognition of Vowels in Fricative Context.
- Voicing Affects Perceived Manner of Articulation.
- Enhancement Techniques to Improve the Intelligibility of Consonants in Noise : Speaker and Listener Effects
- Boundaries of Perception of Long Tones in Taiwanese Speech
- Effects of Phonetic Quality and Duration on Perceptual Acceptability of Temporal Changes in Speech
- Dynamic vs. Static Spectral Detail in the Perception of Gated Stops
- Phonological Units In Speech Segmentation And Phonological Awareness
- How Far Do Speakers Back Up in Repairs? A Quantitatve Model
- Don't Blame It (All) On The Pause: Further ERP Evidence For A Prosody-Induced Garden-Path In Running Speech
- The Role of Stress for Lexical Selection in Dutch
- The Perception of Stressed Syllables in Finnish
- The Perception Of The Morae With Devocalized Vowels In Japanese Language.
- Large Vocabulary Continuous Speech Recognition 4
- High Resolution Decision Tree based Acoustic Modeling beyond CART
- Unsupervised Training of a Speech Recognizer Using TV Broadcasts
- A New Method to Achieve Fast Acoustic Matching for Speech Recognition
- Improved Parameter Tying for Efficient Acoustic Model Evaluation in Large Vocabulary Continuous Speech Recognition
- A New Look at HMM Parameter Tying for Large Vocabulary Speech Recognition
- Factor Analysis Invariant to Linear Transformations of Data
- Spoken Language Understanding Systems 2
- Automatic Ambiguity Detection
- Empowering Knowledge Based Speech Understanding through Statistics
- Concept-Driven Speech Understanding Incorporated with a Statistic Language Model
- On The Limitations of Stochastic Conceptual Finite-State Language Models For Speech Understanding
- Towards Speech Understanding Across Multiple Languages
- Automatic Detection of Sentence Boundaries and Disfluencies Based on Recognized Words
- Signal Processing and Speech Analysis 3
- Determination of Articulatory Positions from Speech Acoustics by Applying Dynamic Articulatory Constraints
- Recognizing Emotions in Speech Using Short-term and Long-term Features
- PeriphEar : A Nonlinear Active Model of the Auditory Periphery
- The Voicing Feature for Stop Consonants: Acoustic Phonetic Analyses and Automatic Speech Recognition Experiments
- Wavelet-Based Energy Binning Cepstral Features for Automatic Speech Recognition
- Articulatory Analysis using a Codebook for Articulatory based Low Bit-Rate Speech Coding
- Human Speech Perception 3
- Categorical Perception: Important Phenomenon or Lasting Myth?
- Categorical Perception of Vowels
- Suprasegmental Cues for the Segmentation of Identical Vowel Sequences in Japanese
- Perception Of Concurrent Approximant-Vowel Syllables
- Perceived Swedish Vowel Quantity: Effects of Postvocalic Consonant Duration
- Speaker Adaptation 3
- On-line Hierarchical Transformation of Hidden Markov Models for Speaker Adaptation
- High-Speed Speaker Adaptation Using Phoneme Dependent Tree-Structured Speaker Clustering
- The Use of Confidence Measures in Unsupervised Adaptation of Speech Recognizers
- Speaker Normalization with All-Pass Transforms
- Toward On-Line Learning of Chinese Continuous Speech Recognition System
- The CHAM Model of Hyperarticulate Adaptation During Human-Computer Error Resolution
- Spoken Language Understanding Systems 3
- Language Modeling for Content Extraction in Human-Computer Dialogues
- A Language Model Combining Trigrams and Stochastic Context-Free Grammars
- Online Adaptation of Language Models in Spoken Dialogue Systems
- Language Model Adaptation for Spoken Language Systems
- Detecting Topic Shifts Using a Cache Memory
- A Discourse Coding Scheme for Conversational Spanish
- Multimodal Spoken Language Processing 3
- Referential Features and Linguistic Indirection in Multimodal Language
- Multimodal Language Processing
- Implementation of Coordinative Nodding Behavior on Spoken Dialogue Systems
- Use of Non-Verbal Information in Communication Between Human and Robot
- What You See is (Almost) What You Hear: Design Principles For User Interfaces For Accessing Speech Archives
- Acoustic Phonetics 2
- Regional Variation in the Vowels of Female Adolescents from Sydney
- A Kinematic Analysis Of New Zealand And Australian English Vowel Spaces
- Syllable-Onset Acoustic Properties Associated with Syllable-Coda Voicing
- Articulatory, Acoustic and Perceptual Aspects of Fricative-Stop Coarticulation
- Efficiency As An Organizing Principle Of Natural Speech
- Within-Speaker Variability Due to Speaking Manners
- Large Vocabulary Continuous Speech Recognition 5
- A Thesaurus-Based Statistical Language Model for Broadcast News Transcription
- Effect of Task Complexity on Search Strategies for the Motorola Lexicus Continuous Speech Recognition System
- New Features For Confidence Annotation
- Multi-Span Statistical Language Modeling for Large Vocabulary Speech Recognition
- Maximum-Likelihood Updates Of HMM Duration Parameters For Discriminative Continuous Speech Recognition
- Towards Better Integration of Semantic Predictors in Statistical Language Modeling
- An Asymmetric Stochastic Language Model Based on Multi-Tagged Words
- Product-Code Vector Quantization of Cepstral Parameters for Speech Recognition Over the WWW
- Context Dependent Tree Based Transforms For Phonetic Speech Recognition
- Interfacing Acoustic Models with Natural Language Processing Systems
- Hierarchical Cluster Language Modeling With Statistical Rule Extraction For Rescoring N-Best Hypotheses During Speech Decoding
- Dealing With Out-of-Vocabulary Words and Speech Disfluencies in an N-Gram Based Speech Understanding System
- Source-Extended Language Model for Large Vocabulary Continuous Speech Recognition
- Time Dependent Language Model For Broadcast News Transcription And Its Post-Correction
- Exploiting Transitions and Focussing on Linguistic Properties for ASR
- A Unified Framework for Sublexical and Linguistic Modelling Supporting Flexible Vocabulary Speech Understanding
- A Method for Modeling Liaison in a Speech Recognition System for French
- On Variable Sampling Frequencies in Speech Recognition
- Pronunciation Modeling for Large Vocabulary Conversational Speech Recognition
- Time Shift Invariant Speech Recognition
- The Demiphone Versus the Triphone in a Decision-tree State-Tying Framework
- Word Clustering for A Word Bi-gram Model
- A Large Vocabulary Continuous Speech Recognition Hybrid System for the Portuguese Language
- Speech Recognition Performance on a new Voicemail Transcription Task
- Grammatical and Statistical Word Prediction System for Spanish Integrated in an Aid for People with Disabilities
- Segmentation Using a Maximum Entropy Approach
- Recognition Performance of a Large-Scale Dependency Grammar Language Model
- A Bootstrap Technique for Building Domain-Dependent Language Models
- Estimation of the Probability Distributions of Stochastic Context-Free Grammars From the k-Best Derivations
- Robust HMM Estimation with Gaussian Merging-Splitting and Tied-Transform HMMs
- Nonlinear Interpolation of Topic Models for Language Model Adaptation
- Performance Evaluation of Word Phrase and Noun Category Language Models For Broadcast News Speech Recognition
- Robust Automatic Continuous-Speech Recognition Based on a Voiced-Unvoiced Decision
- Double Tree Beam Search Using Hierarchical Subword Units
- Text Segmentation and Topic Tracking on Broadcast News Via a Hidden Markov Model Approach
- Multi-Phone Strings as Subword Units for Speech Recognition
- Phonetic Modification of the Syllable /tu/ in Two Spontaneous American English Dialogues
- Efficient Lattice Representation and Generation
- Modeling Pronunciation Variation for a Dutch CSR: Testing Three Methods
- Comparison of Language Modelling Techniques for Russian and English
- Optimized POS-Based Language Models for Large Vocabulary Speech Recognition
- Reducing Peak Search Effort Using Two-Tier Pruning
- Using Untranscribed Training Data to Improve Performance
- Telephone Band LVCSR for Hearing-Impaired Users
- Using X-Gram For Efficient Speech Recognition
- Speech Coding 3
- A New Linear Predictive Method for Compression of Speech Signals
- Hierarchical Temporal Decomposition: A Novel Approach To Efficient Compression Of Spectral Characteristics Of Speech
- Speech Intelligibility Testing for New Technologies
- Efficient Quantization Of LSF Parameters Based on Temporal Decomposition
- A Sinusoidal Harmonic Vocoder at 1.2 kbps Using Auditory Perceptual Characteristics
- A 16 Kbit/s Wideband CELP Coder Using MEL-Generalized Cepstral Analysis and its Subjective Evaluation
- Comparison Of Spectral Estimation Techniques For Low Bit-Rate Speech Coding
- Low Bit Rate Coding for Speech and Audio Using Mel Linear Predictive Coding (MLPC) Analysis
- Comparison Study on VQ Codevector Index Assignment
- Using Linguistic Knowledge To Improve The Design Of Low-Bit Rate LSF Quantisation
- Transform Coding of LSF Parameters Using Wavelets
- Source Controlled Variable Bit-Rate Speech Coder Based On Waveform Interpolation
- Improving Speaker Recognisability In Phonetic Vocoders
- Language Acquisition 3 / Multilingual Perception and Recognition 2
- Speech Perception and Spoken Language in Children with Impaired Hearing
- Quantitative Assessment of Second Language Learners' Fluency: an Automatic Approach
- Cross-Language Merged Speech Units And Their Descriptive Phonetic Correlates
- Crosslinguistic Disfluency Modelling: A Comparative Analysis of Swedish and American English Human--Human and Human--Machine Dialogues
- Calibration Of Machine Scores For Pronunciation Grading
- Phonetic-Distance-Based Hypothesis Driven Lexical Adaptation For Transcribing Multlingual Broadcast News
- Automatic Pronunciation Error Detection and Guidance for Foreign Language Learning
- Lexical Access for Large-Vocabulary Speech Recognition
- The Effect of Fundamental Frequency on Mandarin Speech Recognition
- The Perception Of Nativeness: Variable Speakers And Flexible Listeners
- Voice Dictation in the Secondary School Classroom
- The Importance of the First Syllable in English Spoken Word Recognition by Adult Japanese Speakers
- Spoken L2 Teaching with Contrastive Visual and Auditory Feedback
- The Role Of Phonological, Morphological, And Orthographic Knowledge In The Intuitive Syllabification Of Dutch Words: A Longitudinal Approach
- The Acquisition of Japanese Compound Accent Rule
- The Acquisition of Putonghua Phonology
- Enhancing Speech Processing of Japanese Learners of English Utilizing Time-Scale Expansion With Constant Pitch
- A Bootstrap Training Approach for Language Model Classifiers
- Voice Onset Time Patterns in 7-, 9- and 11-Year Old Children
- Some Developmental Patterns in the Speech of 6-, 8- and 10-Year Old Children: an Acoustic Phonetic Study
- Language Development After Extreme Childhood Deprivation: A Case Study
- Phonological Elements As A Basis For Language-Independent ASR
- A Phonetic and Acoustic Study of Babbling in an Italian Child
- Rescoring Multiple Pronunciations Generated from Spelled Words
- Segmentation, Labelling and Speech Corpora 3
- A Recursive Algorithm for the Forced Alignment of Very Long Audio Segments
- The Selection of Pronunciation Variants: Comparing the Performance of Man and Machine
- Acoustic Confidence Measures for Segmenting Broadcast News
- A Duration-Based Confidence Measure for Automatic Segmentation of Noise Corrupted Speech
- Segmentation and Classification of Broadcast News Audio
- Speaker Recruitment Methods And Speaker Coverage - Experiences From A Large Multilingual Speech Database Collection
- Text-To-Speech Synthesis 5
- A Phonologically Motivated Method of Selecting Non-Uniform Units
- A Synthesis Method Based on Concatenation of Demisyllables and a Residual Excited Vocal Tract Model
- Exploration of Acoustic Correlates in Speaker Selection for Concatenative Synthesis
- A Perceptual Evaluation of Distance Measures for Concatenative Speech Synthesis
- HMM-Based Smoothing For Concatenative Speech Synthesis
- A Nonlinear Unit Selection Strategy for Concatenative Speech Synthesis Based on Syllable Level Features
- Spoken Language Generation and Translation 2
- A Generic Algorithm for Generating Spoken Monologues
- On the Use of Automatically Generated Discourse-Level Information in a Concept-to-Speech Synthesis System
- Learning Phrase-Based Head Transduction Models for Translation of Spoken Utterances
- Probabilistic Dialogue Act Extraction for Concept Based Multilingual Translation Systems
- Fast Decoding For Statistical Machine Translation
- A Japanese-to-English Speech Translation System: ATR-MATRIX
- Human Speech Perception 4
- Orthografik Inkoncistensy Ephekts in Foneme Detektion?
- The Effect of Orthographic Knowledge on the Segmentation of Speech
- Spotting (Different Types of) Words in (Different Types of) Context
- Correlation Between Consonantal VC Transitions And Degree Of Perceptual Confusion Of Place Contrast In Hindi
- Perception Of Tonal Rises And Falls For Accentuation And Phrasing In Swedish
- Speech Intelligibility Derived From Exceedingly Sparse Spectral Information
- Robust Speech Processing in Adverse Environments 5
- Auditory Modeling Techniques For Robust Pitch Extraction And Noise Reduction
- Wavelet Transform-based Speech Enhancement
- A Practical Perceptual Frequency Autoregressive HMM Enhancement System
- An Effective Quality Evaluation Protocol For Speech Enhancement Algorithms
- An Adaptive Beamforming Microphone Array System Using A Blind Deconvolution
- Speech Enhancement Using Critical Band Spectral Subtraction
- Text-To-Speech Synthesis 6
- How To Handle "Foreign" Sounds in Swedish Text-to-Speech Conversion: Approaching the 'Xenophone' Problem
- Multi-lingual Concatenative Speech Synthesis
- On The Use Of F0 Features In Automatic Segmentation For Speech Synthesis
- A Linguistic and Prosodic Database for Data-Driven Japanese TTS Synthesis
- Text-to-Speech Voice Adaptation from Sparse Training Data
- Describing Intonation with a Parametric Model
- Speech Technology Applications and Human-Machine Interface 2
- Development of CAI System Employing Synthesized Speech Responses
- Using Combined Decisions and Confidence Measures for Name Recognition in Automatic Directory Assistance Systems
- VPQ: A Spoken Language Interface to Large Scale Directory Information
- SCAN - Speech Content Based Audio Navigator: A System Overview
- Controlling a HIFI With a Continuous Speech Understanding System
- User Evaluation Of The Mask Kiosk
- Prosody and Emotion 6
- A Contrastive Study of Lexical Stress Placement in Singapore English and British English
- Integrated Recognition of Words and Phrase Boundaries
- Phrase Accents Revisited: Comparative Evidence From Standard and Cypriot Greek
- Phonetic Invariance and Phonological Stability: Lithuanian Pitch Accents
- A HMM-Based Recognition System for Perceptive Relevant Pitch Movements of Spontaneous German Speech
- Towards a Reversible Symbolic Coding of Intonation
- Hidden Markov Model Techniques 3
- A Statistical Phonemic Segment Model for Speech Recognition Based on Automatic Phonemic Segmentation
- Improved Feature Decorrelation for HMM-based Speech Recognition
- Efficient High-Order Hidden Markov Modelling
- A Time-Synchronous, Tree-based Search Strategy in the Acoustic Fast Match of an Asynchronous Speech Recognition System
- Effective Structural Adaptation of LVCSR Systems to Unseen Domains Using Hierarchical Connectionist Acoustic Models
- Support Vector Machines for Speech Recognition
- Natural Number Recognition Using Discriminatively Trained Inter-Word Context Dependent Hidden Markov Models
- Information Theoretic Approaches to Model Selection
- Continuous Speech Recognition Using Segmental Unit Input HMMs with a Mixture of Probability Density Functions and Context Dependency
- Gaussian Density Tree Structure in a Multi-Gaussian HMM-Based Speech Recognition System
- Generalized Phone Modeling Based on Piecewise Linear Segment Lattice
- A Flexible Method of Creating HMM Using Block-Diagonalization of Covariance Matrices
- HMM Topology Selection For Accurate Acoustic And Duration Modeling
- Context-Dependent Duration Modelling for Continuous Speech Recognition
- Training of Context-Dependent Subspace Distribution Clustering Hidden Markov Model
- Unsupervised Training of HMMs With Variable Number of Mixture Components Per State
- Acoustic Observation Context Modeling in Segment Based Speech Recognition
- Capturing Discriminative Information Using Multiple Modeling Techniques
- Suprasegmental Duration Modelling with Elastic Constraints in Automatic Speech Recognition
- An Adaptive Gradient-Search Based Algorithm for Discriminative Training of HMM's
- Task Adaptation of Sub-Lexical Unit Models Using the Minimum Confusibility Criterion on Task Independent Databases
- Stochastic Calculus, Non-Linear Filtering, and the Internal Model Principle: Implications for Articulatory Speech Recognition
- The Use of Meta-HMM in Multistream HMM Training for Automatic Speech Recognition
- Enhanced ASR By Acoustic Feature Filtering
- Soft State-Tying for HMM-based Speech Recognition
- Estimation Of Models For Non-Native Speech In Computer-Assisted Language Learning Based On Linear Model Combination
- Duration Modeling Using Cumulative Duration Probability and Speaking Rate Compensation
- Probabilistic Modeling with Bayesian Networks for Automatic Speech Recognition
- Speech and Hearing Disorders 2 / Speech Processing for the Speech and Hearing Impaired 1
- SIVHA, Visual Speech Synthesis System
- Using Automatic Speech Recognition and its Possible Effects on the Voice
- The Importance Of F0 or Voice Pitch for Perception of Tonal Language: Simulations With Cochlear Implant Speech Processing Strategies
- Assessing High-Level Language In Individuals With Multiple Sclerosis: A Pilot Study
- Design Of Cochlear Implant Device For Transmitting Voice Pitch Information In Speech Sound Of Asian Languages
- Abnormal Volume-Duration Relationship in Parkinsonian Speech
- Analysis Of Disordered Speech Signal Using Wavelet Transform
- Multi-Channel Pulsation Strategy For Electric Stimulation Of Cochlea
- Synthetic Faces as a Lipreading Support
- Predicting Language Scores From The Speech Perception Scores Of Hearing-Impaired Children
- Content-Independent Duration Model on Categories of Voice and Unvoice Segments
- Dynamical Spectrogram, an Aid for the Deaf
- Evidence of Dual-Route Phonetic Encoding From Apraxia of Speech: Implications for Phonetic Encoding Models
- Speech Communication Profiles Across The Adult Lifespan: Persons Without Self-Identified Hearing Impairment
- Human Speech Production
- Time as a Factor in the Acoustic Variation of Schwa
- On The Structure Of Vowel Space: A Genealogy Of General Phonetic Concepts
- The Relationship Between Intensity and Subglottal Pressure with Controlled Pitch
- Segmentation Of The Airway From The Surrounding Tissues On Magnetic Resonance Images: A Comparative Study
- Recovering Vocal Tract Shapes from MFCC Parameters
- Quantification of Pharyngeal Articulations using Measurements from Laryngoscopic Images
- Variance and Invariance in Speech Rate as a Reflection of Conceptual Planning
- Correspondence Between the Glottal Gesture Overlap Pattern and Vowel Devoicing in Japanese
- Evaluation of Japanese Manners of Generating Word Accent of English Based on a Stressed Syllable Detection Technique
- Independence Of Consonantal Voicing And Vocoid F0 Perturbation In English And Japanese
- Reduction of English Function Words in Switchboard
- Duration Compensation in Non-Adjacent Consonant and Temporal Regularity
- Relationship Between Lip Shapes And Acoustical Characteristics During Speech
- A Model to Represent Propagation and Radiation of Higher-Order Modes for 3-D Vocal-Tract Configuration
- FEM Analysis of Aspirated Air Flow in Three-Dimensional Vocal Tract During Fricative Consonant Phonation
- Trajectory Formation of Articulatory Movements for a Given Sequence of Phonemes
- Contextual Effects on Voicing Profiles of German and Mandarin Consonants
- Reconstructing the Tongue Surface from Six Cross-Sectional Contours: Ultrasound Data
- Articulability of Two Consecutive Morae in Japanese Speech Production: Evidence from Sound Exchange Errors in Spontaneous Speech
- Coarticulation and Degrees of Freedom in the Elaboration of a New Articulatory Plant: GENTIANE
- A Pressure Sensitive Palatography: Application of New Pressure Sensitive Sheet for Measuring Tongue-Palatal Contact Pressure
- Dual-Route Phonetic Encoding: Some Acoustic Evidence
- Fast and Slow Speech Rate: A Characterisation for French
- Segmentation, Labelling and Speech Corpora 4
- A Multilingual Prosodic Database
- The CSLU Speaker Recognition Corpus
- How Effective Is Unsupervised Data Collection For Children's Speech Recognition?
- An Algorithm for Automatic Generation of Mandarin Phonetic Balanced Corpus
- Towards a Formal Framework for Linguistic Annotations
- Forming Generic Models Of Speech For Uniform Database Access
- Speaker and Language Recognition 4
- On The Convergence Of Gaussian Mixture Models: Improvements Through Vector Quantization
- Modeling Dynamic Prosodic Variation for Speaker Verification
- Blind Clustering of Speech Utterances Based on Speaker and Language Characteristics
- Spoken Language Identification Using The SpeechDat Corpus
- Automatic Language Identification with Perceptually Guided Training and Recurrent Neural Networks
- On the Importance of Components of the Modulation Spectrum for Speaker Verification
- Speech Technology Applications and Human-Machine Interface 3
- Is Speech The Right Thing For Your Application?
- A PC-Based Tool for Helping in Diagnosis of Pathologic Voice
- Web-Based Educational Tools for Speech Technology
- Universal Speech Tools: The CSLU Toolkit
- Creating a Mexican Spanish Version of the CSLU Toolkit
- A Voice User Interface Demonstration System for Mexican Spanish
- Utterance Verification and Word Spotting 2
- Context Dependent Anti Subword Modeling for Utterance Verification
- Combination of Confidence Measures in Isolated Word Recognition
- Confidence Measures for HMM-based Speech Recognition
- Vocabulary-Independent Word Confidence Measure Using Subword Features
- A New Confidence Measure Based on Rank-Ordering Subphone Scores
- Speaking-Style Dependent Lexicalized Filler Model for Key-Phrase Detection and Verification
- Large Vocabulary Continuous Speech Recognition 6
- Sharable Software Repository for Japanese Large Vocabulary Continuous Speech Recognition
- The Design of the Newspaper-Based Japanese Large Vocabulary Continuous Speech Recognition Corpus
- Indexing and Classification of TV News Articles Based on Speech Dictation Using Word Bigram
- Parametric Trajectory Mixtures for LVCSR
- Neural Networks, Fuzzy and Evolutionary Methods 3
- Fuzzy-Integration Based Normalization for Speaker Verification
- Improving The Generalization Performance Of The MCE/GPD Learning
- Acoustic Speech Recognition Model by Neural Net Equation with Competition and Cooperation
- Improved Surname Pronunciations Using Decision Trees
- Speech Processing for the Speech-Impaired and Hearing-Impaired 2
- A Speechreading Aid Based on Phonetic ASR
- Training Speech through Visual Feedback Patterns
- Word Sequence Pair Spotting for Synchronization of Speech and Text in Production of Closed-Caption TV Programs for the Hearing Impaired
- Volume Regulation in Parkinsonian Speech
- Prosody and Emotion 7
- On the Amount and Domain of Focal Lengthening in Swedish
- Differential Lengthening Of Syllabic Constituents In French: The Effect Of Accent Type And Speaking Style
- Prosodic Analysis of Fillers and Self-Repair in Japanese Speech
- A Synthesis-Oriented Model Of Phrasal Pitch Movements In Standard Chinese
- SST Student Day - Poster Session 1
- Non-Linear Probability Estimation Method Used in HMM for Modeling Frame Correlation
- Patterns Of Linguopalatal Contact During Japanese Vowel Devoicing
- Speech Separation Based on the GMM PDF Estimation
- Growth Transform of A Sum of Rational Functions and Its Application in Estimating HMM Parameters
- Two Automatic Approaches for Analyzing Connected Speech Processes in Dutch
- The Use Of Broad Phonetic Class Models In Speaker Recognition
- Analysis and Treatment of Esophageal Speech for the Enhancement of its Comprehension
- High Quality Text-to-Speech System in Spanish for Handicapped People
- Factors Affecting Speech Retrieval
- Perception Of Words With Vowel Reduction
- SST Student Day - Poster Session 2
- Automated Captioning of Television Programs: Development and Analysis of a Soundtrack Corpus
- On the Influence of the Delta Coefficients in a HMM-based Speech Recognition System
- Speech Recognition Using the Probabilistic Neural Network
- A Language Modeling Based on a Hierarchical Approach: M_n^v
- Temporal Variables in Lectures in the Japanese Language
- Building a Statistical Model of the Vowel Space for Phoneticians
- Computer-Mediated Input And The Acquisition Of L2 Vowels
- Speech Analysis By Subspace Methods Of Spectral Line Estimation
- Pausing in Swedish Spontaneous Speech
- Prosody And Voice Quality In The Expression Of Emotions
- Acoustic Analysis of /l/ in Glossectomees
|