- Rene Grzeszick, Axel Plinge, Gernot A. Fink:
Bag-of-Features Methods for Acoustic Event Detection and Classification. IEEE ACM Trans. Audio Speech Lang. Process. 25(6): 1242-1252 (2017) - Sina Hafezi, Alastair H. Moore, Patrick A. Naylor:
Augmented Intensity Vectors for Direction of Arrival Estimation in the Spherical Harmonic Domain. IEEE ACM Trans. Audio Speech Lang. Process. 25(10): 1956-1968 (2017) - Brian Hamilton, Stefan Bilbao:
FDTD Methods for 3-D Room Acoustics Simulation With High-Order Accuracy in Space and Time. IEEE ACM Trans. Audio Speech Lang. Process. 25(11): 2112-2124 (2017) - Yoonchang Han, Jae-Hun Kim, Kyogu Lee:
Deep Convolutional Neural Networks for Predominant Instrument Recognition in Polyphonic Music. IEEE ACM Trans. Audio Speech Lang. Process. 25(1): 208-221 (2017) - Mark A. Hasegawa-Johnson, Preethi Jyothi, Daniel McCloy, Majid Mirbagheri, Giovanni M. Di Liberto, Amit Das, Bradley Ekin, Chunxi Liu, Vimal Manohar, Hao Tang, Edmund C. Lalor, Nancy F. Chen, Paul Hager, Tyler Kekona, Rose Sloan, Adrian K. C. Lee:
ASR for Under-Resourced Languages From Probabilistic Transcription. IEEE ACM Trans. Audio Speech Lang. Process. 25(1): 46-59 (2017) - Yuta Hatano, Chuang Shi, Yoshinobu Kajikawa:
Compensation for Nonlinear Distortion of the Frequency Modulation-Based Parametric Array Loudspeaker. IEEE ACM Trans. Audio Speech Lang. Process. 25(8): 1709-1717 (2017) - Tomoki Hayashi, Shinji Watanabe, Tomoki Toda, Takaaki Hori, Jonathan Le Roux, Kazuya Takeda:
Duration-Controlled LSTM for Polyphonic Sound Event Detection. IEEE ACM Trans. Audio Speech Lang. Process. 25(11): 2059-2070 (2017) - Qi He, Feng Bao, Changchun Bao:
Multiplicative Update of Auto-Regressive Gains for Codebook-Based Speech Enhancement. IEEE ACM Trans. Audio Speech Lang. Process. 25(3): 457-468 (2017) - Takuya Higuchi, Nobutaka Ito, Shoko Araki, Takuya Yoshioka, Marc Delcroix, Tomohiro Nakatani:
Online MVDR Beamformer Based on Complex Gaussian Mixture Model With Spatial Prior for Noise Robust ASR. IEEE ACM Trans. Audio Speech Lang. Process. 25(4): 780-793 (2017) - Gongping Huang, Jacob Benesty, Jingdong Chen:
On the Design of Frequency-Invariant Beampatterns With Uniform Circular Microphone Arrays. IEEE ACM Trans. Audio Speech Lang. Process. 25(5): 1140-1153 (2017) - Zhen Huang, Sabato Marco Siniscalchi, Chin-Hui Lee:
Bayesian Unsupervised Batch and Online Speaker Adaptation of Activation Function Parameters in Deep Models for Automatic Speech Recognition. IEEE ACM Trans. Audio Speech Lang. Process. 25(1): 60-71 (2017) - Yi-Chin Huang, Chung-Hsien Wu, Yan-You Chen, Ming-Ge Shie, Jhing-Fa Wang:
Personalized Spontaneous Speech Synthesis Using a Small-Sized Unsegmented Semispontaneous Speech. IEEE ACM Trans. Audio Speech Lang. Process. 25(5): 1048-1060 (2017) - Qinghua Huang, Lin Zhang, Yong Fang:
Two-Stage Decoupled DOA Estimation Based on Real Spherical Harmonics for Spherical Arrays. IEEE ACM Trans. Audio Speech Lang. Process. 25(11): 2045-2058 (2017) - Keisuke Imoto, Nobutaka Ono:
Spatial Cepstrum as a Spatial Feature Using a Distributed Microphone Array for Acoustic Scene Analysis. IEEE ACM Trans. Audio Speech Lang. Process. 25(6): 1335-1343 (2017) - Abu Shafin Mohammad Mahdee Jameel, Shaikh Anowarul Fattah, Rajib Goswami, Wei-Ping Zhu, M. Omair Ahmad:
Noise Robust Formant Frequency Estimation Method Based on Spectral Model of Repeated Autocorrelation of Speech. IEEE ACM Trans. Audio Speech Lang. Process. 25(6): 1357-1370 (2017) - Matthias Janke, Lorenz Diener:
EMG-to-Speech: Direct Generation of Speech From Facial Electromyographic Signals. IEEE ACM Trans. Audio Speech Lang. Process. 25(12): 2375-2385 (2017) - Killian Janod, Mohamed Morchid, Richard Dufour, Georges Linarès, Renato De Mori:
Denoised Bottleneck Features From Deep Autoencoders for Telephone Conversation Analysis. IEEE ACM Trans. Audio Speech Lang. Process. 25(9): 1809-1820 (2017) - Byeongho Jo, Jung-Woo Choi:
Spherical Harmonic Smoothing for Localizing Coherent Sound Sources. IEEE ACM Trans. Audio Speech Lang. Process. 25(10): 1969-1984 (2017) - Emma Jokinen, Ulpu Remes, Paavo Alku:
Intelligibility Enhancement of Telephone Speech Using Gaussian Process Regression for Normal-to-Lombard Spectral Tilt Conversion. IEEE ACM Trans. Audio Speech Lang. Process. 25(10): 1985-1996 (2017) - Naoyuki Kanda, Xugang Lu, Hisashi Kawai:
Maximum-a-Posteriori-Based Decoding for End-to-End Acoustic Models. IEEE ACM Trans. Audio Speech Lang. Process. 25(5): 1023-1034 (2017) - Penny Karanasou, Chunyang Wu, Mark J. F. Gales, Philip C. Woodland:
I-Vectors and Structured Neural Networks for Rapid Adaptation of Acoustic Models. IEEE ACM Trans. Audio Speech Lang. Process. 25(4): 818-828 (2017) - Lakshmish Kaushik, Abhijeet Sangwan, John H. L. Hansen:
Automatic Sentiment Detection in Naturalistic Audio. IEEE ACM Trans. Audio Speech Lang. Process. 25(8): 1668-1679 (2017) - Seyran Khademi, Richard C. Hendriks, W. Bastiaan Kleijn:
Intelligibility Enhancement Based on Mutual Information. IEEE ACM Trans. Audio Speech Lang. Process. 25(8): 1694-1708 (2017) - Hanieh Khalilian, Ivan V. Bajic, Rodney G. Vaughan:
A Simulation Study of a Three-Dimensional Sound Field Reproduction System for Immersive Communication. IEEE ACM Trans. Audio Speech Lang. Process. 25(5): 980-995 (2017) - Myung Jong Kim, Beiming Cao, Ted Mau, Jun Wang:
Speaker-Independent Silent Speech Recognition From Flesh-Point Articulatory Movements Using an LSTM Neural Network. IEEE ACM Trans. Audio Speech Lang. Process. 25(12): 2323-2336 (2017) - Jung-Hee Kim, Jin Kim, Jae Hyeon Jeon, Sang Won Nam:
Delayless Individual-Weighting-Factors Sign Subband Adaptive Filter With Band-Dependent Variable Step-Sizes. IEEE ACM Trans. Audio Speech Lang. Process. 25(7): 1526-1534 (2017) - Ina Kodrasi, Simon Doclo:
Signal-Dependent Penalty Functions for Robust Acoustic Multi-Channel Equalization. IEEE ACM Trans. Audio Speech Lang. Process. 25(7): 1512-1525 (2017) - Yuma Koizumi, Kenta Niwa, Yusuke Hioka, Kazunori Kobayashi, Hitoshi Ohmuro:
Informative Acoustic Feature Selection to Maximize Mutual Information for Collecting Target Sources. IEEE ACM Trans. Audio Speech Lang. Process. 25(4): 768-779 (2017) - Morten Kolbæk, Zheng-Hua Tan, Jesper Jensen:
Speech Intelligibility Potential of General and Specialized Deep Neural Network Based Speech Enhancement Systems. IEEE ACM Trans. Audio Speech Lang. Process. 25(1): 149-163 (2017) - Morten Kolbaek, Dong Yu, Zheng-Hua Tan, Jesper Jensen:
Multitalker Speech Separation With Utterance-Level Permutation Invariant Training of Deep Recurrent Neural Networks. IEEE ACM Trans. Audio Speech Lang. Process. 25(10): 1901-1913 (2017)