default search action
28th EUSIPCO 2020: Amsterdam, Netherlands
- 28th European Signal Processing Conference, EUSIPCO 2020, Amsterdam, Netherlands, January 18-21, 2021. IEEE 2020, ISBN 978-9-0827-9705-3
- Mohammad MohammadAmini, Driss Matrouf:
Data augmentation versus noise compensation for x-vector speaker recognition systems in noisy environments. 1-5 - Ziqiang Shi, Liu Liu, Rujie Liu:
Hodge and Podge: Hybrid Supervised Sound Event Detection with Multi-Hot MixMatch and Composition Consistence Training. 1-5 - Dan Oneata, Lucian Georgescu, Horia Cucu, Dragos Burileanu, Corneliu Burileanu:
Revisiting SincNet: An Evaluation of Feature and Network Hyperparameters for Speaker Recognition. 1-5 - Kuldeep Khoria, Madhu R. Kamble, Hemant A. Patil:
Teager Energy Cepstral Coefficients for Classification of Normal vs. Whisper Speech. 1-5 - Changzeng Fu, Chaoran Liu, Carlos Toshinori Ishi, Hiroshi Ishiguro:
An End-to-end Multitask Learning Model to Improve Speech Emotion Recognition. 1-5 - Gagan Rath, Fabien Racapé, Fabrice Urban, Fabrice Le Léannec, Franck Galpin, Karam Naser:
A general framework for directional intra prediction with varying angle for video coding. 1-4 - Mickael Rouvier, Richard Dufour, Pierre-Michel Bousquet:
Review of different robust x-vector extractors for speaker verification. 1-5 - Duowei Tang, Peter Kuppens, Luc Geurts, Toon van Waterschoot:
Adieu recurrence? End-to-end speech emotion recognition using a context stacking dilated convolutional network. 1-5 - Mattes Ohlenbusch, Aike Ahrens, Christian Rollwage, Jörg Bitzer:
Robust Drone Detection for Acoustic Monitoring Applications. 6-10 - Alessandro Ilic Mezza, Emanuël Anco Peter Habets, Meinard Müller, Augusto Sarti:
Unsupervised Domain Adaptation for Acoustic Scene Classification Using Band-Wise Statistics Matching. 11-15 - Karim Guirguis, Christoph Schorn, Andre Guntoro, Sherif Abdulatif, Bin Yang:
SELD-TCN: Sound Event Localization & Detection via Temporal Convolutional Networks. 16-20 - Dhanunjaya Varma Devalraju, Padmanabhan Rajan, Aroor Dinesh Dileep:
Learning to Separate: Soundscape Classification using Foreground and Background. 21-25 - Federico Colangelo, Federica Battisti, Alessandro Neri:
Progressive Training Of Convolutional Neural Networks For Acoustic Events Classification. 26-30 - Daniel Krause, Archontis Politis, Konrad Kowalczyk:
Feature Overview for Joint Modeling of Sound Event Detection and Localization Using a Microphone Array. 31-35 - Saori Takeyama, Tatsuya Komatsu, Koichi Miyazaki, Masahito Togami, Shunsuke Ono:
Robust Acoustic Scene Classification to Multiple Devices Using Maximum Classifier Discrepancy and Knowledge Distillation. 36-40 - Tatsuya Komatsu, Masahito Togami, Tsubasa Takahashi:
Sound Event Localization and Detection Using Convolutional Recurrent Neural Networks and Gated Linear Units. 41-45 - Pietro Stinco, Giovanni De Magistris, Alessandra Tesei, Kevin D. LePage:
Automatic Object Classification with Active Sonar using Unsupervised Anomaly Detection. 46-50 - Yuancheng Luo, Wontak Kim:
Fast Source-Room-Receiver Acoustics Modeling. 51-55 - Jaroslav Cmejla, Tomás Kounovský, Sharon Gannot, Zbynek Koldovský, Pinchas Tandeitnik:
MIRaGe: Multichannel Database of Room Impulse Responses Measured on High-Resolution Cube-Shaped Grid. 56-60 - Vincenzo Zaccà, Pablo Martínez-Nuevo, Martin Bo Møller, Jorge Martínez, Richard Heusdens:
Inferring the location of reflecting surfaces exploiting loudspeaker directivity. 61-65 - Luca Villa, Mirco Pezzoli, Fabio Antonacci, Augusto Sarti:
A Methodology for the Estimation of Propagation Speed of Longitudinal Waves in Tone Wood. 66-70 - Pierre-Amaury Grumiaux, Srdan Kitic, Laurent Girin, Alexandre Guérin:
High-Resolution Speaker Counting in Reverberant Rooms Using CRNN with Ambisonics Features. 71-75 - Suradej Duangpummet, Phrimphissa Kraikhun, Chatrin Phunruangsakao, Jessada Karnjana, Masashi Unoki, Waree Kongprawechnon:
Speech Privacy Protection based on Optimal Controlling Estimated Speech Transmission Index in Noisy Reverberant Environments. 76-80 - David S. Johnson, Sascha Grollmisch:
Techniques Improving the Robustness of Deep Learning Models for Industrial Sound Analysis. 81-85 - Hwiyong Choi, Haesang Yang, Seungjun Lee, Woojae Seong:
Type/position classification of inter-floor noise in residential buildings with a single microphone via supervised learning. 86-90 - Ayush Triapthi, Rupayan Chakraborty, Sunil Kumar Kopparapu:
Dementia Classification using Acoustic Descriptors Derived from Subsampled Signals. 91-95 - Albertus C. den Brinker, M. Coman, Okke Ouweltjes, Michael Crooks, Susannah Thackray-Nocera, Alyn H. Morice:
Performance Requirements for Cough Classifiers in Real-World Applications. 96-100 - Mirali Purohit, Mihir Parmar, Maitreya Patel, Harshit Malaviya, Hemant A. Patil:
Weak Speech Supervision: A case study of Dysarthria Severity Classification. 101-105 - William Vickers, B. Milner, Artjoms Gorpincenko, R. Lee:
Methods to Improve the Robustness of Right Whale Detection using CNNs in Changing Conditions. 106-110 - Nikonas Simou, Nikolaos Stefanakis, Panagiotis Zervas:
A Universal System for Cough Detection in Domestic Acoustic Environments. 111-115 - Amlu Anna Joshy, Rajeev Rajan:
Automated Dysarthria Severity Classification Using Deep Learning Frameworks. 116-120 - Kuan-Lin Chen, Ching Hua Lee, Bhaskar D. Rao, Harinath Garudadri:
Jointly Leveraging Decorrelation and Sparsity for Improved Feedback Cancellation in Hearing Aids. 121-125 - Peter Steiner, Simon Stone, Peter Birkholz, Azarakhsh Jalalvand:
Multipitch tracking in music signals using Echo State Networks. 126-130 - Andres Ferraro, Dmitry Bogdanov, Xavier Serra, Jay Ho Jeon, Jason Yoon:
How Low Can You Go? Reducing Frequency and Time Resolution in Current CNN Architectures for Music Auto-tagging. 131-135 - Juan Sebastián Gómez Cañón, Estefanía Cano, Perfecto Herrera, Emilia Gómez:
Transfer learning from speech to music: towards language-sensitive emotion recognition models. 136-140 - Ruchit Agrawal, Simon Dixon:
Learning Frame Similarity using Siamese networks for Audio-to-Score Alignment. 141-145 - Lech Kolonko, Jörg Velten, Anton Kummert:
Automatic Differentiating Wave Digital Filters with Multiple Nonlinearities. 146-150 - Alessandro Proverbio, Alberto Bernardini, Augusto Sarti:
Toward the Wave Digital Real-Time Emulation of Audio Circuits with Multiple Nonlinearities. 151-155 - Agelos Kratimenos, Kleanthis Avramidis, Christos Garoufis, Athanasia Zlatintsi, Petros Maragos:
Augmentation Methods on Monophonic Audio for Instrument Classification in Polyphonic Music. 156-160 - Javier Nistal, Stefan Lattner, Gaël Richard:
Comparing Representations for Audio Synthesis Using Generative Adversarial Networks. 161-165 - Xianghui Xie, Jared Houghtaling, Katrien Foubert, Toon van Waterschoot:
Computational Approach to Track Beats in Improvisational Music Performance. 166-170 - Mina Mounir, Peter Karsmakers, Toon van Waterschoot:
CNN-based Note Onset Detection using Synthetic Data Augmentation. 171-175 - Sofia-Eirini Kotti, Richard Heusdens, Richard C. Hendriks:
Clock-Offset and Microphone Gain Mismatch Invariant Beamforming. 176-180 - Amos Schreibman, Anna Barnov, Alex Gendelman, Eli Tzirkel:
RTF Based LCMV Beamformer with Multiple Reference Microphones. 181-185 - Federico Borra, Mirco Pezzoli, Luca Comanducci, Alberto Bernardini, Fabio Antonacci, Stefano Tubaro, Augusto Sarti:
A Fast Ray Space Transform for Wave Field Processing using Acoustic Arrays. 186-190 - Thomas Dietzen, Marc Moonen, Toon van Waterschoot:
Instantaneous PSD Estimation for Speech Enhancement based on Generalized Principal Components. 191-195 - Tobias Gburrek, Joerg Schmalenstroeer, Andreas Brendel, Walter Kellermann, Reinhold Haeb-Umbach:
Deep Neural Network based Distance Estimation for Geometry Calibration in Acoustic Sensor Networks. 196-200 - Patrick Meyer, Samy Elshamy, Jan Franzen, Tim Fingscheidt:
Multichannel Acoustic Echo Cancellation Applied to Microphone Leakage Reduction in Meetings. 201-205 - Santiago Ruiz, Toon van Waterschoot, Marc Moonen:
Distributed combined acoustic echo cancellation and noise reduction using GEVD-based distributed adaptive node specific signal estimation with prior knowledge. 206-210 - Jacob Benesty, Constantin Paleologu, Claudia Cristina Oprea, Silviu Ciochina:
An Iterative Multichannel Wiener Filter Based on a Kronecker Product Decomposition. 211-215 - Amélie Bosca, Alexandre Guérin, Lauréline Perotin, Srdan Kitic:
Dilated U-net based approach for multichannel speech enhancement from First-Order Ambisonics recordings. 216-220 - Vincent W. Neo, Christine Evers, Patrick A. Naylor:
Speech Dereverberation Performance of a Polynomial-EVD Subspace Approach. 221-225 - Juan Manuel Vera-Diaz, Daniel Pizarro, Javier Macías Guarasa:
Towards Domain Independence in CNN-based Acoustic Localization using Deep Cross Correlations. 226-230 - Christopher Schymura, Tsubasa Ochiai, Marc Delcroix, Keisuke Kinoshita, Tomohiro Nakatani, Shoko Araki, Dorothea Kolossa:
Exploiting Attention-based Sequence-to-Sequence Architectures for Sound Event Localization. 231-235 - Maria Juhlin, Andreas Jakobsson:
Optimal Microphone Placement for Localizing Tonal Sound Sources. 236-240 - Milan Courcoux-Caro, Charles Vanwynsberghe, Cédric Herzet, Alexandre Baussard:
Sequential Sensor Placement using Bayesian Compressed Sensing for Source Localization. 241-245 - Yonggang Hu, Thushara D. Abhayapala, Prasanga N. Samarasinghe, Sharon Gannot:
Decoupled Direction-of-Arrival Estimations Using Relative Harmonic Coefficients. 246-250 - Hui Chen, Tarig Ballal, Tareq Y. Al-Naffouri:
Phase-difference-based 3-D Source Localization Using a Compact Receiver Configuration. 251-255 - Frank Sanabria-Macias, Marta Marrón Romera, Javier Macías Guarasa:
3D Audiovisual Speaker Tracking with Distributed Sensors Configuration. 256-260 - Emad M. Grais, Fei Zhao, Mark D. Plumbley:
Multi-Band Multi-Resolution Fully Convolutional Neural Networks for Singing Voice Separation. 261-265 - Masahito Togami:
Deep Multi-channel Speech Source Separation with Time-frequency Masking for Spatially Filtered Microphone Input Signal. 266-270 - Yuyang Huang, Ping Chu, Bin Liao:
Blind Separation of Convolutive Speech Mixtures Based on Local Sparsity and K-means. 271-275 - Yaron Laufer, Sharon Gannot:
A Bayesian Hierarchical Model for Blind Audio Source Separation. 276-280 - Michel Olvera, Emmanuel Vincent, Romain Serizel, Gilles Gasso:
Foreground-Background Ambient Sound Scene Separation. 281-285 - Mariem Bouafif Mansali, Tom Bäckström, Zied Lachiri:
Evaluation of Zero Frequency Filtering based Method for Multi-pitch Streaming of Concurrent Speech Signals. 286-290 - Dexin Li, Gongping Huang, Yanqiang Lei, Jingdong Chen, Jacob Benesty:
Robust Source Separation with Differential Microphone Arrays and Independent Low-Rank Matrix Analysis. 291-295 - Shoichiro Takeda, Kenta Niwa, Shinya Shimizu:
Differentiable Max-Directivity Beamforming Normalization for Independent Vector Analysis. 296-300 - Taishi Nakashima, Robin Scheibler, Yukoh Wakabayashi, Nobutaka Ono:
Faster independent low-rank matrix analysis with pairwise updates of demixing vectors. 301-305 - Kazuyoshi Yoshii, Kouhei Sekiguchi, Yoshiaki Bando, Mathieu Fontaine, Aditya Arie Nugraha:
Fast Multichannel Correlated Tensor Factorization for Blind Source Separation. 306-310 - Yosuke Higuchi, Naohiro Tawara, Atsunori Ogawa, Tomoharu Iwata, Tetsunori Kobayashi, Tetsuji Ogawa:
Noise-robust Attention Learning for End-to-End Speech Recognition. 311-315 - Shi-Yan Weng, Tien-Hong Lo, Berlin Chen:
An Effective Contextual Language Modeling Framework for Speech Summarization with Augmented Features. 316-320 - Cong-Thanh Do, Shucong Zhang, Thomas Hain:
Selective Adaptation of End-to-End Speech Recognition using Hybrid CTC/Attention Architecture for Noise Robustness. 321-325 - Katerina Papadimitriou, Gerasimos Potamianos:
A Fully Convolutional Sequence Learning Approach for Cued Speech Recognition from Videos. 326-330 - Iván López-Espejo, Zheng-Hua Tan, Jesper Jensen:
Exploring Filterbank Learning for Keyword Spotting. 331-335 - Gonzalo D. Sad, Juan Carlos Gómez:
Audio-Visual Speech Classification based on Absent Class Detection. 336-340 - Wentao Yu, Steffen Zeiler, Dorothea Kolossa:
Multimodal Integration for Large-Vocabulary Audio-Visual Speech Recognition. 341-345 - Sunit Sivasankaran, Emmanuel Vincent, Dominique Fohr:
Analyzing the impact of speaker localization errors on speech separation for automatic speech recognition. 346-350 - Simon W. McKnight, Aidan O. T. Hogg, Patrick A. Naylor:
Analysis of Phonetic Dependence of Segmentation Errors in Speaker Diarization. 381-385 - Gauri P. Prajapati, Madhu R. Kamble, Hemant A. Patil:
Energy Separation Based Features for Replay Spoof Detection for Voice Assistant. 386-390 - Roee Levy Leshem, Raja Giryes:
Taco-VC: A Single Speaker Tacotron based Voice Conversion with Limited Data. 391-395 - Kazuhiro Kobayashi, Tomoki Toda:
Implementation of low-latency electrolaryngeal speech enhancement based on multi-task CLDNN. 396-400 - Frederik Bous, Luc Ardaillon, Axel Roebel:
Semi-supervised learning of glottal pulse positions in a neural analysis-synthesis framework. 401-405 - Rafael Ferro, Nicolas Obin, Axel Roebel:
CycleGAN Voice Conversion of Spectral Envelopes using Adversarial Weights. 406-410 - Maitreya Patel, Mirali Purohit, Jui Shah, Hemant A. Patil:
CinC-GAN for Effective F0 prediction for Whisper-to-Normal Speech Conversion. 411-415 - João Silva, Marco A. Oliveira, Aníbal J. S. Ferreira:
Flexible parametric implantation of voicing in whispered speech under scarce training data. 416-420 - Sisi Shi, Andrew Busch, Kuldip K. Paliwal, Thomas Fickenscher:
On The Use of Discrete Cosine Transform Polarity Spectrum in Speech Enhancement. 421-425 - Konstantin Schmidt, Bernd Edler:
Blind Bandwidth Extension of Speech based on LPCNet. 426-430 - Yang Xian, Yang Sun, Wenwu Wang, Syed Mohsen Naqvi:
Multi-Scale Residual Convolutional Encoder Decoder with Bidirectional Long Short-Term Memory for Single Channel Speech Enhancement. 431-435 - Bong-Ki Lee:
DNN Classification Model-based Speech Enhancement Using Mask Selection Technique. 436-440 - Takuya Hasumi, Tetsunori Kobayashi, Tetsuji Ogawa:
Investigation of Network Architecture for Single-Channel End-to-End Denoising. 441-445 - Dushyant Sharma, Lucia Berger, Carl Quillen, Patrick A. Naylor:
Non-Intrusive Estimation of Speech Signal Parameters using a Frame-based Machine Learning Approach. 446-450 - Sherif Abdulatif, Karim Armanious, Karim Guirguis, Jayasankar T. Sajeev, Bin Yang:
AeGAN: Time-Frequency Speech Denoising via Generative Adversarial Networks. 451-455 - Moe Takada, Shogo Seki, Patrick Lumban Tobing, Tomoki Toda:
Semi-Supervised Enhancement and Suppression of Self-Produced Speech Using Correspondence between Air- and Body-Conducted Signals. 456-460 - Filip Wen-Fwu Tsai, Alireza M. Javid, Saikat Chatterjee:
Design of a Non-negative Neural Network to Improve on NMF. 461-465 - Niccolò Nicodemo, Gaurav Naithani, Konstantinos Drossos, Tuomas Virtanen, Roberto Saletti:
Memory Requirement Reduction of Deep Neural Networks for Field Programmable Gate Arrays Using Low-Bit Quantization of Parameters. 466-470 - Pierre Mahé, Stéphane Ragot, Sylvain Marchand, Jérôme Daniel:
Ambisonic Coding with Spatial Image Correction. 471-475 - Thomas Joubaud, Grégory Pallone:
Electroacoustic method for the calibration of a heterogeneous distributed speaker system. 476-480 - Robbe Van Rompaey, Marc Moonen:
Distributed Adaptive Acoustic Contrast Control for Node-specific Sound Zoning in a Wireless Acoustic Sensor and Actuator Network. 481-485 - Huanyu Zuo, Prasanga N. Samarasinghe, Thushara D. Abhayapala:
Intensity Based Soundfield Reproduction over Multiple Sweet Spots Using an Irregular Loudspeaker Array. 486-490 - Rebecca C. Felsheim, Andreas Brendel, Patrick A. Naylor, Walter Kellermann:
Head Orientation Estimation from Multiple Microphone Arrays. 491-495 - Giovanni Pepe, Leonardo Gabrielli, Stefano Squartini, Luca Cattani, Carlo Tripodi:
Gravitational Search Algorithm for IIR Filter-Based Audio Equalization. 496-500 - Aziz Berkay Yesilyurt, Fatih Kamisli:
End-to-end Learned Image Compression with Conditional Latent Space Modeling for Entropy Coding. 501-505 - Paulo Eusébio, João Ascenso, Fernando Pereira:
Optimizing an Image Coding Framework with Deep Learning-based Pre- and Post-Processing. 506-510 - Ferdinand Jost, Pascal Peter, Joachim Weickert:
Compressing Piecewise Smooth Images with the Mumford-Shah Cartoon Model. 511-515 - Melpomeni Dimopoulou, Marc Antonini:
Image storage in DNA using Vector Quantization. 516-520 - Ionut Schiopu, Hongyue Huang, Adrian Munteanu:
A Study of Deep-Learning-based Prediction Methods for Lossless Coding. 521-525 - Emanuele Palma, Federica Battisti, Marco Carli, Pekka Astola, Ioan Tabus:
Subjective Quality Evaluation of Light Field Data Under Coding Distortions. 526-530 - Iago Storch, Guilherme Corrêa, Bruno Zatt, Luciano Agostini, Daniel Palomino:
ESA360 - Early SKIP Mode Decision Algorithm for Fast ERP 360 Video Coding. 535-539 - Guilherme Corrêa, Pargles Dall'Oglio, Daniel Palomino, Luciano Agostini:
Fast Block Size Decision for HEVC Encoders with On-the-Fly Trained Classifiers. 540-544 - Jayasingam Adhuran, Anil Fernando, Gosala Kulupana, Saverio G. Blasi:
Affine Intra-prediction for Versatile Video Coding. 545-549 - Matheus Lindino, Thiago Bubolz, Bruno Zatt, Daniel Palomino, Guilherme Corrêa:
Low-Complexity HEVC Transrating Based on Prediction Unit Mode Inheritance. 550-554 - Alex Borges, Daniel Palomino, Bruno Zatt, Marcelo Schiavon Porto, Guilherme Corrêa:
Fast VP9-to-AV1 Transcoding based on Block Partitioning Inheritance. 555-559 - Daiane Freitas, Rafael da Silva, Ícaro Siqueira, Cláudio Machado Diniz, Ricardo A. L. Reis, Mateus Grellert:
Hardware Architecture for the Regular Interpolation Filter of the AV1 Video Coding Standard. 560-564