


default search action
Speech Communication, Volume 55
Volume 55, Number 1, January 2013
- Matthew P. Black, Athanasios Katsamanis

, Brian R. Baucom
, Chi-Chun Lee
, Adam C. Lammert, Andrew Christensen, Panayiotis G. Georgiou
, Shrikanth S. Narayanan:
Toward automating a human behavioral coding system for married couples' interactions using speech acoustic features. 1-21 - Robin Hofe, Stephen R. Ell, Michael J. Fagan, James M. Gilbert

, Phil D. Green
, Roger K. Moore
, Sergey I. Rybchenko:
Small-vocabulary speech recognition using a silent speech interface based on magnetic sensing. 22-32 - Paul Boersma, Katerina Chládková

:
Detecting categorical perception in continuous discrimination data. 33-39 - Wei Qiu, Bing-Zhao Li

, Xue-Wen Li:
Speech recovery based on the linear canonical transform. 40-50 - Harish Arsikere, Gary K. F. Leung, Steven M. Lulich, Abeer Alwan:

Automatic estimation of the first three subglottal resonances from adults' speech signals with application to speaker height estimation. 51-70 - Yue Ming, Qiuqi Ruan, Guodong Gao:

A Mandarin edutainment system integrated virtual learning environments. 71-83 - Fangju Wang, Kyle Swegles:

Modeling user behavior online for disambiguating user input in a spoken dialogue system. 84-98 - Alexander Sepúlveda

, Rodrigo Capobianco Guido
, Germán Castellanos-Domínguez
:
Estimation of relevant time-frequency features using Kendall coefficient for articulator position inference. 99-110 - Can Yagli, M. A. Tugtekin Turan

, Engin Erzin
:
Artificial bandwidth extension of spectral envelope along a Viterbi path. 111-118 - Xing Fan, John H. L. Hansen:

Acoustic analysis and feature transformation from neutral to whisper for speaker identification within whispered speech audio streams. 119-134 - Guillaume Gibert, Yvonne Leung

, Catherine J. Stevens
:
Control of speech-related facial movements of an avatar from video. 135-146 - Adam C. Lammert, Louis Goldstein, Shrikanth S. Narayanan, Khalil Iskarous:

Statistical methods for estimation of direct and differential kinematics of the vocal tract. 147-161 - Anoop Deoras, Tomás Mikolov, Stefan Kombrink, Kenneth Church

:
Approximate inference: A sampling based modeling technique to capture complex dependencies in a language model. 162-177 - Roman Cmejla

, Jan Rusz
, Petr Bergl, Jan Vokral
:
Bayesian changepoint detection for the automatic assessment of fluency and articulatory disorders. 178-189 - Xiang Zuo, Taisuke Sumii, Naoto Iwahashi, Mikio Nakano

, Kotaro Funakoshi, Natsuki Oka:
Correcting phoneme recognition errors in learning word pronunciation through speech interaction. 190-203
Volume 55, Number 2, February 2013
- Hadi Veisi

, Hossein Sameti:
Speech enhancement using hidden Markov models in Mel-frequency domain. 205-220 - Daniel Bolaños, Ronald A. Cole, Wayne H. Ward, Gerald A. Tindal, Paula J. Schwanenflugel, Melanie R. Kuhn:

Automatic assessment of expressive oral reading. 221-236 - Md. Jahangir Alam, Tomi Kinnunen, Patrick Kenny, Pierre Ouellet, Douglas D. O'Shaughnessy:

Multitaper MFCC and PLP features for speaker verification using i-vectors. 237-251 - Martin Wöllmer, Björn W. Schuller

, Gerhard Rigoll:
Keyword spotting exploiting Long Short-Term Memory. 252-265 - Cheng-Yu Yeh, Shun-Chieh Chang, Shaw-Hwa Hwang:

A consistency analysis on an acoustic module for Mandarin text-to-speech. 266-277 - Gilles Degottex

, Pierre Lanchantin, Axel Röbel, Xavier Rodet:
Mixed source model and its adapted vocal tract filter estimate for voice transformation and synthesis. 278-294 - John Kane, Christer Gobl

:
Evaluation of glottal closure instant detection in a range of voice qualities. 295-314 - Christopher Dromey, Gwi-Ok Jang, Kristi Hollis:

Assessing correlations between lingual movements and formants. 315-328 - Tomoharu Iwata, Shinji Watanabe

:
Influence relation estimation based on lexical entrainment in conversation. 329-339 - Yongwon Jeong:

Unified framework for basis-based speaker adaptation based on sample covariance matrix of variable dimension. 340-346 - Takashi Nose

, Takao Kobayashi:
An intuitive style control technique in HMM-based expressive speech synthesis using subjective style intensity and multiple-regression global variance model. 347-357 - Pei Chee Yong, Sven Nordholm

, Hai Huyen Dam:
Optimization and evaluation of sigmoid function with a priori SNR estimate for real-time speech enhancement. 358-376 - Tasuku Oonishi, Koji Iwano

, Sadaoki Furui:
A noise-robust speech recognition approach incorporating normalized speech/non-speech likelihood into hypothesis scores. 377-386
Volume 55, Number 3, March 2013
- Peng Dai

, Ing Yann Soon:
An improved model of masking effects for robust speech recognition system. 387-396 - John Kane, Christer Gobl

:
Automating manual user strategies for precise voice source analysis. 397-414 - Seong-Jun Hahm, Shinji Watanabe

, Atsunori Ogawa, Masakiyo Fujimoto, Takaaki Hori, Atsushi Nakamura:
Prior-shared feature and model space speaker adaptation by consistently employing map estimation. 415-431 - Tao Xu, Wenwu Wang, Wei Dai:

Sparse coding with adaptive dictionary learning for underdetermined blind speech separation. 432-450 - Daniel Neiberg, Giampiero Salvi

, Joakim Gustafson:
Semi-supervised methods for exploring the acoustics of simple productive feedback. 451-469 - Seiichi Nakagawa, Keisuke Iwami, Yasuhisa Fujii, Kazumasa Yamamoto

:
A robust/fast spoken term detection method based on a syllable n-gram index with a distance metric. 470-485 - Stephen Winters, Mary Grantham O'Brien:

Perceived accentedness and intelligibility: The relative contributions of F0 and duration. 486-507
Volume 55, Number 4, May 2013
- Yi Zhang, Yunxin Zhao:

Real and imaginary modulation spectral subtraction for speech enhancement. 509-522 - Taher S. Mirzahasanloo

, Nasser Kehtarnavaz, Vanishree Gopalakrishna, Philipos C. Loizou:
Environment-adaptive speech enhancement for bilateral cochlear implants using a single processor. 523-534 - Hai Huyen Dam, Dedi Rimantho, Sven Nordholm

:
Second-order blind signal separation with optimal step size. 535-543 - Kyoung Won Nam, Yoon Sang Ji, Jonghee Han, Sangmin Lee, Dongwook Kim, Sung Hwa Hong, Dong Pyo Jang, In-Young Kim:

Clinical evaluation of the performance of a blind source separation algorithm combining beamforming and independent component analysis in hearing aid use. 544-552 - Chloe Gonseth, Anne Vilain, Coriandre Vilain:

An experimental study of speech/gesture interactions and distance encoding. 553-571 - Martin Cooke, Catherine Mayo, Cassia Valentini-Botinhao, Yannis Stylianou, Bastian Sauert, Yan Tang

:
Evaluating the intelligibility benefit of speech modifications in known noise conditions. 572-585
Volume 55, Number 5, June 2013
- Hilman Ferdinandus Pardede

, Koji Iwano
, Koichi Shinoda
:
Feature normalization based on non-extensive statistics for speech recognition. 587-599 - Karen Lander

, Cheryl M. Capek
:
Investigating the impact of lip visibility and talking style on speechreading performance. 600-605 - Ranniery Maia, Masami Akamine, Mark J. F. Gales:

Complex cepstrum for statistical parametric speech synthesis. 606-618 - Bingyin Xia, Changchun Bao:

Compressed domain speech enhancement method based on ITU-T G.722.2. 619-640 - Khaled Daqrouq

, Khalooq Y. Al Azzawi:
Arabic vowels recognition based on wavelet average framing linear prediction coding and neural network. 641-652 - Mahnoosh Mehrabani, John H. L. Hansen:

Singing speaker clustering based on subspace learning in the GMM mean supervector space. 653-666 - Byron D. Erath

, Matías Zanartu
, Kelley C. Stewart, Michael W. Plesniak
, David E. Sommer, Sean D. Peterson
:
A review of lumped-element models of voiced speech. 667-690 - Christos Koniaris, Giampiero Salvi

, Olov Engwall
:
On mispronunciation analysis of individual foreign speakers using auditory periphery models. 691-706 - Jia Min Karen Kua, Julien Epps

, Eliathamby Ambikairajah
:
i-Vector with sparse representation classification for speaker verification. 707-720 - Emina Kurtic, Guy J. Brown

, Bill Wells:
Resources for turn competition in overlapping talk. 721-743
Volume 55, Number 6, July 2013
- K. Sreenivasa Rao, Anil Kumar Vuppala:

Non-uniform time scale modification using instants of significant excitation and vowel onset points. 745-756 - Siow Yong Low, Duc-Son Pham

, Svetha Venkatesh
:
Compressive speech enhancement. 757-768 - John H. L. Hansen, Jun-Won Suh, Matthew R. Leonard:

In-set/out-of-set speaker recognition in sustained acoustic scenarios using sparse data. 769-781 - Bayya Yegnanarayana, Dhananjaya N. Gowda:

Spectro-temporal analysis of speech signals using zero-time windowing and group delay function. 782-795 - Cuiling Zhang, Geoffrey Stewart Morrison

, Ewald Enzinger
, Felipe Ochoa:
Effects of telephone transmission on the performance of formant-trajectory-based forensic voice comparison - Female voices. 796-813
Volume 55, Numbers 7-8, September 2013
- João Felipe Santos

, Stefano Cosentino, Oldooz Hazrati, Philipos C. Loizou, Tiago H. Falk
:
Objective speech intelligibility measurement for cochlear implant users in complex listening environments. 815-824 - Syaheerah Lebai Lutfi

, Fernando Fernández Martínez
, Juan Manuel Lucas-Cuesta, Lorena Lopez-Lebon, Juan Manuel Montero
:
A satisfaction-based model for affect recognition from conversational features in spoken dialog systems. 825-840 - Lee Ngee Tan, Abeer Alwan:

Multi-band summary correlogram-based pitch detection for noisy speech. 841-856 - Wesley Mattheyses, Lukas Latacz, Werner Verhelst:

Comprehensive many-to-many phoneme-to-viseme mapping and its application for concatenative visual speech synthesis. 857-876 - Wesley Mattheyses, Lukas Latacz, Werner Verhelst:

Erratum to "Comprehensive many-to-many phoneme-to-viseme mapping and its application for concatenative visual speech synthesis" [Speech Communication 55/7-8 (2013) 857-876]. 877
Volume 55, Number 9, October 2013
- Ben Milner:

Enhancing speech at very low signal-to-noise ratios using non-acoustic reference signals. 879-892 - Xueru Zhang, Kris Demuynck, Hugo Van hamme

:
Rapid speaker adaptation in latent speaker space with non-negative matrix factorization. 893-908 - Heikki Rasilo, Okko Räsänen

, Unto K. Laine:
Feedback and imitation by a caregiver guides a virtual infant to learn native phonemes and the skill of speech inversion. 909-931 - Xulei Bao, Jie Zhu:

An Improved Method for Late-Reverberant Suppression Based on Statistical Model. 932-940
Volume 55, Number 10, November - December 2013
- Ryo Yokoyama

, Yu Nasu, Koji Iwano
, Koichi Shinoda
:
Detection of overlapped speech using lapel microphones in meeting. 941-949 - Wen-Lin Zhang, Dan Qu, Wei-Qiang Zhang

, Bi-Cheng Li:
Rapid speaker adaptation using compressive sensing. 950-963 - Rongshan Yu:

Speech enhancement based on soft audible noise masking and noise power estimation. 964-974 - Mohamed Djendi, Pascal Scalart, André Gilloire:

Analysis of two-sensors forward BSS structure with post-filters in the presence of coherent and incoherent noise. 975-987 - Yangyang Shi, Pascal Wiggers, Catholijn M. Jonker:

Classifying the socio-situational settings of transcripts of spoken discourses. 988-1002 - Sunhyun Yook, Kyoung Won Nam, Heepyung Kim, See Youn Kwon, Dongwook Kim, Sangmin Lee, Sung Hwa Hong, Dong Pyo Jang, In-Young Kim:

Modified segmental signal-to-noise ratio reflecting spectral masking effect for evaluating the performance of hearing aid algorithms. 1003-1010 - Fei Chen

, Lena L. N. Wong
, Yi Hu:
A Hilbert-fine-structure-derived physical metric for predicting the intelligibility of noise-distorted and noise-suppressed speech. 1011-1020 - Edward Ozimek, Jedrzej Kocinski

, Dariusz Kutzner, Aleksander Sek, Andrzej Wicher
:
Speech intelligibility for different spatial configurations of target speech and competing noise source in a horizontal and median plane. 1021-1032 - Petr Cerva

, Jan Silovský, Jindrich Zdánský, Jan Nouza, Ladislav Seps:
Speaker-adaptive speech recognition using speaker diarization for improved transcription of large spoken archives. 1033-1046 - Sibiri Tiémounou, Régine Le Bouquin-Jeannès

, Vincent Barriac:
On the identification of relevant degradation indicators in super wideband listening quality assessment models. 1047-1063 - Greg Short, Keikichi Hirose, Nobuaki Minematsu:

Japanese lexical accent recognition for a CALL system by deriving classification equations with perceptual experiments. 1064-1080 - Yi Zhang, Yunxin Zhao:

Modulation domain blind speech separation in noisy environments. 1081-1099

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.


Google
Google Scholar
Semantic Scholar
Internet Archive Scholar
CiteSeerX
ORCID














