WASPAA 2019: New Paltz, NY, USA
- 2019 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, WASPAA 2019, New Paltz, NY, USA, October 20-23, 2019. IEEE 2019, ISBN 978-1-7281-1123-0
- Bongjun Kim, Bryan Pardo:
Sound Event Detection Using Point-Labeled Data. 1-5
- Yuma Koizumi, Shoichiro Saito, Masataka Yamaguchi, Shin Murata, Noboru Harada:
Batch Uniformization for Minimizing Maximum Anomaly Score of DNN-Based Anomaly Detection in Sounds. 6-10
- Helen L. Bear, Toni Heittola, Annamaria Mesaros, Emmanouil Benetos, Tuomas Virtanen:
City Classification from Multiple Real-World Sound Scenes. 11-15
- Eduardo Fonseca, Frederic Font, Xavier Serra:
Model-Agnostic Approaches To Handling Noisy Labels When Training Sound Event Classifiers. 16-20
- Mina Mounir, Peter Karsmakers, Toon van Waterschoot:
Annotations Time Shift: A Key Parameter in Evaluating Musical Note Onset Detection Algorithms. 21-25
- Ryo Nishikimi, Eita Nakamura, Masataka Goto, Kazuyoshi Yoshii:
End-To-End Melody Note Transcription Based on a Beat-Synchronous Attention Mechanism. 26-30
- Timothy Roberts, Kuldip K. Paliwal:
Time-Scale Modification Using Fuzzy Epoch-Synchronous Overlap-Add (FESOLA). 31-34
- Stefan Lattner, Maarten Grachten:
High-Level Control of Drum Track Generation Using Learned Patterns of Rhythmic Interaction. 35-39
- Carlos Lordelo, Emmanouil Benetos, Simon Dixon, Sven Ahlbäck:
Investigating Kernel Shapes and Skip Connections for Deep Learning-Based Harmonic-Percussive Separation. 40-44
- Ethan Manilow, Gordon Wichern, Prem Seetharaman, Jonathan Le Roux:
Cutting Music Source Separation Some Slakh: A Dataset to Study the Impact of Training Data Quality and Quantity. 45-49
- Grehisama Das, Elliot Kermit Canfield-Dafilou, Jonathan S. Abel:
On The Behavior of Delay Network Reverberator Modes. 50-54
- Juho Liski, Jussi Rämö, Vesa Välimäki:
Graphic Equalizer Design with Symmetric Biquad Filters. 55-59
- Henning F. Schepker, Simon Doclo:
Active Feedback Suppression for Hearing Devices Exploiting Multiple Loudspeakers. 60-64
- Jens Ahrens:
Perceptual Evaluation of Binaural Auralization of Data Obtained from the Spatial Decomposition Method. 65-69
- Zamir Ben-Hur, David Lou Alon, Ravish Mehra, Boaz Rafaely:
Sparse Representation of HRTFs by Ear Alignment. 70-74
- Muhammad Shahnawaz, Craig T. Jin, Joan Alexis Glaunès, Augusto Sarti, Anthony I. Tew:
Morphological Weighting Improves Individualized Prediction of HRTF Directivity Patterns. 75-79
- Giorgia Cantisani, Slim Essid, Gaël Richard:
EEG-Based Decoding of Auditory Attention to a Target Instrument in Polyphonic Music. 80-84
- Hannes Gamper, Chandan K. A. Reddy, Ross Cutler, Ivan J. Tashev, Johannes Gehrke:
Intrusive and Non-Intrusive Perceptual Speech Quality Assessment Using a Convolutional Neural Network. 85-89
- Matteo Torcoli:
An Improved Measure of Musical Noise Based on Spectral Kurtosis. 90-94
- Thorsten Kastner, Jürgen Herre:
An Efficient Model for Estimating Subjective Quality of Separated Audio Source Signals. 95-99
- Xuan Dong, Donald S. Williamson:
A Classification-Aided Framework for Non-Intrusive Speech Quality Assessment. 100-104
- Chuyao Feng, Eva van Leer, David V. Anderson:
Identification of Voice Quality Variation Using I-Vectors. 105-109
- Takuma Okamoto:
3D Localized Sound Zone Generation with a Planar Omni-Directional Loudspeaker Array. 110-114
- Ryan M. Corey, Andrew C. Singer:
Motion-Tolerant Beamforming with Deformable Microphone Arrays. 115-119
- Jesper Rindom Jensen, Usama Saqib, Sharon Gannot:
An EM Method for Multichannel TOA and DOA Estimation of Acoustic Echoes. 120-124
- Vincent W. Neo, Christine Evers, Patrick A. Naylor:
Speech Enhancement Using Polynomial Eigenvalue Decomposition. 125-129
- Kouei Yamaoka, Robin Scheibler, Nobutaka Ono, Yukoh Wakabayashi:
Sub-Sample Time Delay Estimation via Auxiliary-Function-Based Iterative Updates. 130-134
- Huiyuan Sun, Thushara D. Abhayapala, Prasanga N. Samarasinghe:
Active Noise Control Over 3D Space with Multiple Circular Arrays. 135-139
- Lachlan Birnie, Thushara D. Abhayapala, Prasanga N. Samarasinghe, Vladimir Tourbabin:
Sound Field Translation Methods for Binaural Reproduction. 140-144
- Maximilian Schäfer, Rudolf Rabenstein, Sebastian J. Schlecht:
Feedback Structures for a Transfer Function Model of a Circular Vibrating Membrane. 145-149
- Sebastian J. Schlecht, Emanuël A. P. Habets:
Dense Reverberation with Delay Feedback Matrices. 150-154
- Jacob Møller Hjerrild, Silvin Willemsen, Mads Græsbøll Christensen:
Physical Models For Fast Estimation Of Guitar String, Fret And Plucking Position. 155-159
- Tomoyasu Nakano, Kazuyoshi Yoshii, Yiming Wu, Ryo Nishikimi, Kin Wah Edward Lin, Masataka Goto:
Joint Singing Pitch Estimation and Voice Separation Based on a Neural Harmonic Structure Renderer. 160-164
- Matthew Maciejewski, Gregory Sell, Yusuke Fujita, Leibny Paola García-Perera, Shinji Watanabe, Sanjeev Khudanpur:
Analysis of Robustness of Deep Single-Channel Speech Separation Using Corpora Constructed From Multiple Domains. 165-169
- Shrikant Venkataramani, Efthymios Tzinis, Paris Smaragdis:
A Style Transfer Approach to Source Separation. 170-174
- Ilya Kavalerov, Scott Wisdom, Hakan Erdogan, Brian Patton, Kevin W. Wilson, Jonathan Le Roux, John R. Hershey:
Universal Sound Separation. 175-179
- Jonah Casebeer, Michael Colomb, Paris Smaragdis:
Deep Tensor Factorization for Spatially-Aware Scene Decomposition. 180-184
- Robin Scheibler, Nobutaka Ono:
Independent Vector Analysis with More Microphones Than Sources. 185-189
- Michael Günther, Haitham Afifi, Andreas Brendel, Holger Karl, Walter Kellermann:
Sparse Adaptation of Distributed Blind Source Separation in Acoustic Sensor Networks. 190-194
- Aidan O. T. Hogg, Christine Evers, Patrick A. Naylor:
Multiple Hypothesis Tracking for Overlapping Speaker Segmentation. 195-199
- Wolfgang Mack, Emanuël A. P. Habets:
Declipping Speech Using Deep Filtering. 200-204
- Archit Gupta, Brendan Shillingford, Yannis M. Assael, Thomas C. Walters:
Speech Bandwidth Extension with Wavenet. 205-208
- Xianyun Wang, Changchun Bao, Rui Cheng:
IRM with Phase Parameterization for Speech Enhancement. 209-213
- Michael Chinen, W. Bastiaan Kleijn, Felicia S. C. Lim, Jan Skoglund:
Generative Speech Enhancement Based on Cloned Networks. 214-218
- Samy Elshamy, Tim Fingscheidt:
Improvement of Speech Residuals for Speech Enhancement. 219-223
- Tomohiro Nakatani, Keisuke Kinoshita, Rintaro Ikeshita, Hiroshi Sawada, Shoko Araki:
Simultaneous Denoising, Dereverberation, and Source Separation Using a Unified Convolutional Beamformer. 224-228
- Ziyue Zhao, Samy Elshamy, Tim Fingscheidt:
A Perceptual Weighting Filter Loss for DNN Training In Speech Enhancement. 229-233
- Aswin Shanmugam Subramanian, Xiaofei Wang, Murali Karthick Baskar, Shinji Watanabe, Toru Taniguchi, Dung T. Tran, Yuya Fujita:
Speech Enhancement Using End-to-End Speech Recognition Objectives. 234-238
- Maximilian Strake, Bruno Defraene, Kristoff Fluyt, Wouter Tirry, Tim Fingscheidt:
Separated Noise Suppression and Speech Restoration: LSTM-Based Speech Enhancement in Two Stages. 239-243
- Masahito Togami, Tatsuya Komatsu:
Fast Convergence Algorithm for State-Space Model Based Speech Dereverberation by Multi-Channel Non-Negative Matrix Factorization. 244-248
- Ritwik Giri, Umut Isik, Arvindh Krishnaswamy:
Attention Wave-U-Net for Speech Enhancement. 249-253
- Shuyu Gong, Zhewei Wang, Tao Sun, Yuanhang Zhang, Charles D. Smith, Li Xu, Jundong Liu:
Dilated FCN: Listening Longer to Hear Better. 254-258
- Konstantinos Drossos, Paul Magron, Tuomas Virtanen:
Unsupervised Adversarial Domain Adaptation Based on The Wasserstein Distance For Acoustic Scene Classification. 259-263
- Huang Xie, Tuomas Virtanen:
Zero-Shot Audio Classification Based On Class Label Embeddings. 264-267
- Sanjeel Parekh, Alexey Ozerov, Slim Essid, Ngoc Q. K. Duong, Patrick Pérez, Gaël Richard:
Identify, Locate and Separate: Audio-Visual Object Extraction in Large Video Collections Using Weak Supervision. 268-272
- Kilian Schulze-Forster, Clement S. J. Doire, Gaël Richard, Roland Badeau:
Weakly Informed Audio Source Separation. 273-277
- Mark Cartwright, Jason Cramer, Justin Salamon, Juan Pablo Bello:
Tricycle: Audio Representation Learning from Sensor Network Data Using Self-Supervision. 278-282
- Renana Opochinsky, Bracha Laufer-Goldshtein, Sharon Gannot, Gal Chechik:
Deep Ranking-Based Sound Source Localization. 283-287
- Rintaro Ikeshita, Nobutaka Ito, Tomohiro Nakatani, Hiroshi Sawada:
Independent Low-Rank Matrix Analysis with Decorrelation Learning. 288-292
- Toru Taniguchi, Aswin Shanmugam Subramanian, Xiaofei Wang, Dung T. Tran, Yuya Fujita, Shinji Watanabe:
Generalized Weighted-Prediction-Error Dereverberation with Varying Source Priors For Reverberant Speech Recognition. 293-297
- Xiaofei Li, Radu Horaud:
Multichannel Speech Enhancement Based On Time-Frequency Masking Using Subband Long Short-Term Memory. 298-302
- Soumi Maiti, Michael I. Mandel:
Parametric Resynthesis With Neural Vocoders. 303-307
- Zhepei Wang, Y. Cem Sübakan, Efthymios Tzinis, Paris Smaragdis, Laurent Charlin:
Continual Learning of New Sound Classes Using Generative Replay. 308-312
- Yuma Koizumi, Shoichiro Saito, Hisashi Uematsu, Noboru Harada, Keisuke Imoto:
ToyADMOS: A Dataset of Miniature-Machine Operating Sounds for Anomalous Sound Detection. 313-317
- Léo Cances, Patrice Guyot, Thomas Pellegrini:
Evaluation of Post-Processing Algorithms for Polyphonic Sound Event Detection. 318-322
- Arjun Pankajakshan, Helen L. Bear, Emmanouil Benetos:
Polyphonic Sound Event and Sound Activity Detection: A Multi-Task Approach. 323-327
- Marc C. Green, Sharath Adavanne, Damian T. Murphy, Tuomas Virtanen:
Acoustic Scene Classification Using Higher-Order Ambisonic Features. 328-332
- Annamaria Mesaros, Sharath Adavanne, Archontis Politis, Toni Heittola, Tuomas Virtanen:
Joint Measurement of Localization and Detection of Sound Events. 333-337
- Noriyuki Tonami, Keisuke Imoto, Masahiro Niitsuma, Ryosuke Yamanishi, Yoichi Yamashita:
Joint Analysis of Acoustic Events and Scenes Based on Multitask Learning. 338-342
- Lauréline Perotin, Alexandre Défossez, Emmanuel Vincent, Romain Serizel, Alexandre Guérin:
Regression Versus Classification for Neural Network Based Audio Source Localization. 343-347
- Yonggang Hu, Prasanga N. Samarasinghe, Thushara D. Abhayapala:
Sound Source Localization Using Relative Harmonic Coefficients in Modal Domain. 348-352
- Sebastian Braun, Ivan Tashev:
Acoustic Localization Using Spatial Probability in Noisy and Reverberant Environments. 353-357
- Duowei Tang, Maja Taseska, Toon van Waterschoot:
Supervised Contrastive Embeddings for Binaural Source Localization. 358-362
- Stefan Kühl, Alexander Bohlender, Matthias Schrammen, Peter Jax:
Improved Change Prediction for Combined Beamforming and Echo Cancellation with Application to a Generalized Sidelobe Canceler. 363-367
- Masahiro Nakanishi, Natsuki Ueno, Shoichi Koyama, Hiroshi Saruwatari:
Two-Dimensional Sound Field Recording With Multiple Circular Microphone Arrays Considering Multiple Scattering. 368-372
- Nico Gößling, Wiebke Middelberg, Simon Doclo:
RTF-Steered Binaural MVDR Beamforming Incorporating Multiple External Microphones. 373-377
- Federico Borra, Steven Krenn, Israel Dejene Gebru, Dejan Markovic:
1st-Order Microphone Array System for Large Area Sound Field Recording and Reconstruction: Discussion and Preliminary Results. 378-382
- Vladimir Tourbabin, Jacob Donley, Boaz Rafaely, Ravish Mehra:
Direction of Arrival Estimation In Highly Reverberant Environments Using Soft Time-Frequency Mask. 383-387
- Kenta Imaizumi, Kimitaka Tsutsumi, Atsushi Nakadaira, Yoichi Haneda:
Analytical Method of 2.5D Exterior Sound Field Synthesis By Using Multipole Loudspeaker Array. 388-392
- Zonglong Bai, Jesper Rindom Jensen, Jinwei Sun, Mads Græsbøll Christensen:
A Sparse Bayesian Learning Based RIR Reconstruction Method for Acoustic TOA and DOA Estimation. 393-397