INTERSPEECH 2004 - ICSLP, 8th International Conference on Spoken Language Processing, Jeju Island, Korea, October 4-8, 2004. ISCA 2004 Chin-Hui Lee:
From decoding-driven to detection-based paradigms for automatic speech recognition.
In search of a universal phonetic alphabet - theory and application of an organic visible speech-.
From X-ray or MRU data to sounds through articulatory synthesis: towards an integrated view of the speech communication process.
Sreeram Balakrishnan, Karthik Visweswariah, Vaibhava Goel:
Stochastic gradient adaptation of front-end parameters.
Antoine Raux, Rita Singh:
Maximum - likelihod adaptation of semi-continuous HMMs by latent variable decomposition of state distributions.
Chao Huang, Tao Chen, Eric Chang:
Transformation and combination of hiden Markov models for speaker selection training.
Brian Kan-Wing Mak, Roger Wend-Huu Hsiao:
Improving eigenspace-based MLLR adaptation by kernel PCA.
Nikos Chatzichrisafis, Vassilios Digalakis, Vassilios Diakoloukas, Costas Harizakis:
Rapid acoustic model development using Gaussian mixture clustering and language adaptation.
Karthik Visweswariah, Ramesh A. Gopinath:
Adaptation of front end parameters in a speech recognizer.
Diego Giuliani, Matteo Gerosa, Fabio Brugnara:
Speaker normalization through constrained MLLR based transforms.
Xiangyu Mu, Shuwu Zhang, Bo Xu:
Multi-layer structure MLLR adaptation algorithm with subspace regression classes and tying.
Georg Stemmer, Stefan Steidl, Christian Hacker, Elmar Nöth:
Adaptation in the pronunciation space for non-native speech recognition.
Xuechuan Wang, Douglas D. O'Shaughnessy:
Robust ASR model adaptation by feature-based statistical data mapping.
Zhaobing Han, Shuwu Zhang, Bo Xu:
A novel target-driven generalized JMAP adaptation algorithm.
Brian Mak, Simon Ka-Lung Ho, James T. Kwok:
Speedup of kernel eigenvoice speaker adaptation by embedded kernel PCA.
Hyung Bae Jeon, Dong Kook Kim:
Maximum a posteriori eigenvoice speaker adaptation for Korean connected digit recognition.
Wei Wang, Stephen A. Zahorian:
Vocal tract normalization based on spectral warping.
Koji Tanaka, Fuji Ren, Shingo Kuroiwa, Satoru Tsuge:
Acoustic model adaptation for coded speech using synthetic speech.
Motoyuki Suzuki, Hirokazu Ogasawara, Akinori Ito, Yuichi Ohkawa, Shozo Makino:
Speaker adaptation method for CALL system using bilingual speakers' utterances.
Acoustic model adaptation based on coarse/fine training of transfer vectors and its application to a speaker adaptation task.
Wei-Ho Tsai, Shih-Sian Cheng, Hsin-Min Wang:
Speaker clustering of speech utterances using a voice characteristic reference space.
Young Kuk Kim, Hwa Jeon Song, Hyung Soon Kim:
Performance improvement of connected digit recognition using unsupervised fast speaker adaptation.
Hyung Soon Kim, Hwa Jeon Song:
Simultaneous estimation of weights of eigenvoices and bias compensation vector for rapid speaker adaptation.
Speaker dependent model order selection of spectral envelopes.
Enrico Bocchieri, Michael Riley, Murat Saraclar:
Methods for task adaptation of acoustic models with limited transcribed in-domain data.
Atsushi Fujii, Tetsuya Ishikawa, Katsunobu Itou, Tomoyosi Akiba:
Unsupervised topic adaptation for lecture speech retrieval.
Haibin Liu, Zhenyang Wu:
Mean and covariance adaptation based on minimum classification error linear regression for continuous density HMMs.
Goshu Nagino, Makoto Shozakai:
Design of ready-made acoustic model library by two-dimensional visualization of acoustic space.
Jean-Luc Gauvain, Abdelkhalek Messaoudi, Holger Schwenk:
Language recognition using phone latices.
ACCDIST: a metric for comparing speakers' accents.
Michael Levit, Allen L. Gorin, Patrick Haffner, Hiyan Alshawi, Elmar Nöth:
Aspects of named entity processing.
Josep Maria Crego, José B. Mariño, Adrià de Gispert:
Finite-state-based and phrase-based statistical machine translation.
Tanja Schultz, Szu-Chen Stan Jou, Stephan Vogel, Shirin Saleem:
Using word latice information for a tighter coupling in speech translation systems.
Teruhisa Misu, Tatsuya Kawahara, Kazunori Komatani:
Confirmation strategy for document retrieval systems with spoken dialog interface.
Shi-wook Lee, Kazuyo Tanaka, Yoshiaki Itoh:
Multilayer subword units for open-vocabulary spoken document retrieval.
Yoshiaki Itoh, Kazuyo Tanaka, Shi-wook Lee:
An efficient partial matching algorithm toward speech retrieval by speech.
Celestin Sedogbo, Sébastien Herry, Bruno Gas, Jean-Luc Zarader:
Language detection by neural discrimination.
Ricardo de Córdoba, Javier Ferreiros, Valentín Sama, Javier Macías Guarasa, Luis Fernando D'Haro, Fernando Fernández-Martínez:
Language identification techniques based on full recognition in an air traffic control task.
John H. L. Hansen, Umit H. Yapanel, Rongqing Huang, Ayako Ikeno:
Dialect analysis and modeling for automatic classification.
Emmanuel Ferragne, François Pellegrino:
Rhythm in read british English: interdialect variability.
Pascale Fung, Yi Liu, Yongsheng Yang, Yihai Shen, Dekai Wu:
A grammar-based Chinese to English speech translation system for portable devices.
Cost-sensitive call classification.
Mikko Kurimo, Ville T. Turunen, Inger Ekman:
An evaluation of a spoken document retrieval baseline system in finish.
Hui Jiang, Pengfei Liu, Imed Zitouni:
Discriminative training of naive Bayes classifiers for natural language call routing.
Nicolas Moreau, Hyoung-Gook Kim, Thomas Sikora:
Phonetic confusion based document expansion for spoken document retrieval.
Euisok Chung, Soojong Lim, Yi-Gyu Hwang, Myung-Gil Jang:
Hybrid named entity recognition for question-answering system.
Jitendra Ajmera, Iain McCowan, Hervé Bourlard:
An online audio indexing system.
Eric Sanders, Febe de Wet:
Histogram normalisation and the recognition of names and ontology words in the MUMIS project.
Rui Amaral, Isabel Trancoso:
Improving the topic indexation and segmentation modules of a media watch system.
Melissa Barkat-Defradas, Rym Hamdi, Emmanuel Ferragne, François Pellegrino:
Speech timing and rhythmic structure in arabic dialects: a comparison of two approaches.
Hsin-Min Wang, Shih-Sian Cheng:
METRIC-SEQDAC: a hybrid approach for audio segmentation.
Jen-Wei Kuo, Yao-Min Huang, Berlin Chen, Hsin-Min Wang:
Statistical Chinese spoken document retrieval using latent topical information.
Masahiko Matsushita, Hiromitsu Nishizaki, Seiichi Nakagawa, Takehito Utsuro:
Keyword recognition and extraction by multiple-LVCSRs with 60, 000 words in speech-driven WEB retrieval task.
Ruiqiang Zhang, Gen-ichiro Kikui, Hirofumi Yamamoto, Frank K. Soong, Taro Watanabe, Eiichiro Sumita, Wai Kit Lo:
Improved spoken language translation using n-best speech recognition hypotheses.
Kakeung Wong, Man-Hung Siu:
Automatic language identification using discrete hidden Markov model.
Bowen Zhou, Daniel Déchelotte, Yuqing Gao:
Two-way speech-to-speech translation on handheld devices.
HLT modules scalability within the NESPOLE! project.
Correlation between VOT and F0 in the perception of Korean stops and affricates.
Aude Noiray, Lucie Ménard, Marie-Agnès Cathiard, Christian Abry, Christophe Savariaux:
The development of anticipatory labial coarticulation in French: a pionering study.
Melvyn John Hunt:
Speech recognition, sylabification and statistical phonetics.
Data-driven approaches for automatic detection of syllable boundaries.
Anne Cutler, Dennis Norris, Núria Sebastián-Gallés:
Phonemic repertoire and similarity within the vocabulary.
Sameer Maskey, Alan W. Black, Laura Tomokiya:
Boostrapping phonetic lexicons for new languages.
Mirjam Broersma, K. Marieke Kolkman:
Lexical representation of non-native phonemes.
Jong-Pyo Lee, Tae-Yeoub Jang:
A comparative study on the production of inter-stress intervals of English speech by English native speakers and Korean speakers.
Emi Zuiki Murano, Mihoko Teshigawara:
Articulatory correlates of voice qualities of god guys and bad guys in Japanese anime: an MRI study.
Effects of phonetic contexts on the duration of phonetic segments in fluent read speech.
A study on nasal coda los in continuous speech.
An improved pair-wise variability index for comparing the timing characteristics of speech.
An acoustic study of speech rhythm in taiwan English.
Language specific phonetic rules: evidence from domain-initial strengthening.
Spectral characteristics of the release bursts in Korean alveolar stops.
Rob van Son, Olga Bolotova, Louis C. W. Pols, Mietta Lennes:
Frequency effects on vowel reduction in three typologically different languages (dutch, finish, Russian).
Julia Abresch, Stefan Breuer:
Assessment of non-native phones in anglicisms by German listeners.
Phonology of exceptions for for Korean grapheme-to-phoneme conversion.
Shigeyoshi Kitazawa, Shinya Kiriyama:
Acoustic and prosodic analysis of Japanese vowel-vowel hiatus with laryngeal effect.
A cross-linguistic acoustic comparison of unreleased word-final stops: Korean and Thai.
Taehong Cho, Elizabeth K. Johnson:
Acoustic correlates of phrase-internal lexical boundaries in dutch.
Taehong Cho, James M. McQueen:
Phonotactics vs. phonetic cues in native and non-native listening: dutch and Korean listeners' perception of dutch and English.
Svetlana Kaminskaia, François Poiré:
Comparing intonation of two varieties of French using normalized F0 values.
Mira Oh, Kee-Ho Kim:
Phonetic realization of the suffix-suppressed accentual phrase in Korean.
H. Timothy Bunnell, James B. Polikoff, Jane McNicholas:
Spectral moment vs. bark cepstral analysis of children's word-initial voiceles stops.
Pronunciation assessment based upon the compatibility between a learner's pronunciation structure and the target language's lexical structure.
Spread of high tone in akita Japanese.
Juan Ignacio Godino-Llorente, María Victoria Rodellar Biarge, Pedro Gómez Vilda, Francisco Díaz Pérez, Agustín Álvarez Marquina, Rafael Martínez-Olalla:
Biomechanical parameter fingerprint in the mucosal wave power spectral density.
Cheolwoo Jo, Soo-Geon Wang, Byung-Gon Yang, Hyung-Soon Kim, Tao Li:
Classification of pathological voice including severely noisy cases.
Qiang Fu, Peter Murphy:
A robust glottal source model estimation technique.
Hiroki Mori, Yasunori Kobayashi, Hideki Kasuya, Hajime Hirose, Noriko Kobayashi:
F0 and formant frequency distribution of dysarthric speech - a comparative study.
Hideki Kawahara, Yumi Hirachi, Masanori Morise, Hideki Banno:
Procedure "senza vibrato": a key component for morphing singing.
Claudia Manfredi, Giorgio Peretti, Laura Magnoni, Fabrizio Dori, Ernesto Iadanza:
Thyroplastic medialisation in unilateral vocal fold paralysis: assessing voice quality recovering.
Gernot Kubin, Martin Hagmüller:
Voice enhancement of male speakers with laryngeal neoplasm.
Jong Min Choi, Myung-Whun Sung, Kwang Suk Park, Jeong-Hun Hah:
A comparison of the perturbation analysis between PRAAT and computerize speech lab.
Ji Ming, Baochun Hou:
Evaluation of universal compensation on Aurora 2 and 3 and beyond.
Hugo Van hamme:
PROSPECT features and their application to missing data techniques for robust speech recognition.
Hugo Van hamme, Patrick Wambacq, Veronique Stouten:
Accounting for the uncertainty of speech estimates in the context of model-based feature enhancement.
Hans-Günter Hirsch, Harald Finster:
Applying the Aurora feature extraction schemes to a phoneme based recognition task.
Zhipeng Zhang, Tomoyuki Ohya, Sadaoki Furui:
Evaluation of tree-structured piecewise linear transformation-based noise adaptation on AURORA2 database.
Tor André Myrvoll, Satoshi Nakamura:
Online minimum mean square error filtering of noisy cepstral coefficients using a sequential EM algorithm.
Akira Sasou, Kazuyo Tanaka, Satoshi Nakamura, Futoshi Asano:
HMM-based feature compensation method: an evaluation using the AURORA2.
Xuechuan Wang, Douglas D. O'Shaughnessy:
Noise adaptation for robust AURORA 2 noisy digit recognition using statistical data mapping.
Benjamin J. Shannon, Kuldip K. Paliwal:
MFCC computation from magnitude spectrum of higher lag autocorrelation coefficients for robust speech recognition.
Muhammad Ghulam, Takashi Fukuda, Junsei Horikawa, Tsuneo Nitta:
A noise-robust feature extraction method based on pitch-synchronous ZCPA for ASR.
José C. Segura, Ángel de la Torre, Javier Ramírez, Antonio J. Rubio, M. Carmen Benítez:
Including uncertainty of speech observations in robust speech recognition.
Takeshi Yamada, Jiro Okada, Nobuhiko Kitawaki:
Integration of n-best recognition results obtained by multiple noise reduction algorithms.
Panji Setiawan, Sorel Stan, Tim Fingscheidt:
Revisiting some model-based and data-driven denoising algorithms in Aurora 2 context.
Guo-Hong Ding, Bo Xu:
Exploring high-performance speech recognition in noisy environments using high-order taylor series expansion.
Wing-Hei Au, Man-Hung Siu:
A robust training algorithm based on neighborhood information.
Siu Wa Lee, Pak-Chung Ching:
In-phase feature induction: an effective compensation technique for robust speech recognition.
Jeff Siu-Kei Au-Yeung, Man-Hung Siu:
Improved performance of Aurora 4 using HTK and unsupervised MLLR adaptation.
Shang-nien Tsai, Lin-Shan Lee:
A new feature extraction front-end for robust speech recognition using progressive histogram equalization and multi-eigenvector temporal filtering.
Christian Fügen, Hartwig Holzapfel, Alex Waibel:
Tight coupling of speech recognition and dialog management - dialog-context dependent grammar weighting for speech recognition.
Akinobu Lee, Keisuke Nakamura, Ryuichi Nisimura, Hiroshi Saruwatari, Kiyohiro Shikano:
Noise robust real world spoken dialogue system using GMM based rejection of unintended inputs.
Hironori Oshikawa, Norihide Kitaoka, Seiichi Nakagawa:
Speech interface for name input based on combination of recognition methods using syllable-based n-gram and word dictionary.
Imed Zitouni, Minkyu Lee, Hui Jiang:
Constrained minimization technique for topic identification using discriminative training and support vector machines.
Jason D. Williams, Steve J. Young:
Characterizing task-oriented dialog using a simulated ASR chanel.
Takashi Konashi, Motoyuki Suzuki, Akinori Ito, Shozo Makino:
A spoken dialog system based on automatic grammar generation and template-based weighting for autonomous mobile robots.
Akinori Ito, Takanobu Oba, Takashi Konashi, Motoyuki Suzuki, Shozo Makino:
Noise adaptive spoken dialog system based on selection of multiple dialog strategies.
Mikko Hartikainen, Markku Turunen, Jaakko Hakulinen, Esa-Pekka Salonen, J. Adam Funk:
Flexible dialogue management using distributed and dynamic dialogue control.
Contextual revision in information seeking conversation systems.
Ian M. O'Neill, Philip Hanna, Xingkun Liu, Michael F. McTear:
Cross domain dialogue modelling: an object-based approach.
Hirohiko Sagawa, Teruko Mitamura, Eric Nyberg:
A comparison of confirmation styles for error handling in a speech dialog system.
Fan Yang, Peter A. Heeman:
Using computer simulation to compare two models of mixed-initiative.
Fan Yang, Peter A. Heeman, Kristy Hollingshead:
Towards understanding mixed-initiative in task-oriented dialogues.
Peter Wolf, Joseph Woelfel, Jan C. van Gemert, Bhiksha Raj, David Wong:
Spokenquery: an alternate approach to chosing items with speech.
Shona Douglas, Deepak Agarwal, Tirso Alonso, Robert M. Bell, Mazin G. Rahim, Deborah F. Swayne, Chris Volinsky:
Mining customer care dialogs for "daily news".
Jens Edlund, Gabriel Skantze, Rolf Carlson:
Higgins - a spoken dialogue system for investigating error handling techniques.
Fuliang Weng, Lawrence Cavedon, Badri Raghunathan, Danilo Mirkovic, Hua Cheng, Hauke Schmidt, Harry Bratt, Rohit Mishra, Stanley Peters, Sandra Upson, Elizabeth Shriberg, Carsten Bergmann, Lin Zhao:
A conversational dialogue system for cognitively overloaded users.
Gerhard Hanrieder, Stefan W. Hamerich:
Modeling generic dialog applications for embedded systems.
Matthew N. Stuttle, Jason D. Williams, Steve J. Young:
A framework for dialogue data collection with a simulated ASR channel.
A multi-layer conversation management approach for information seeking applications.
Thomas K. Harris, Roni Rosenfeld:
A universal speech interface for appliances.