Mark D. Plumbley
Person information
- affiliation: University of Surrey, Guildford, UK
- affiliation (2002 - 2014): Queen Mary University of London, UK
- affiliation (1991 - 2001): King's College London, UK
2020 – today
- 2024
- [j69]Francesco Renna, Alex Gaudio, Sandra da Silva Mattos, Mark D. Plumbley, Miguel Tavares Coimbra:
Separation of the Aortic and Pulmonary Components of the Second Heart Sound via Alternating Optimization. IEEE Access 12: 34632-34643 (2024) - [j68]Yizhou Tan, Haojun Ai, Shengchen Li, Mark D. Plumbley:
Acoustic Scene Classification Across Cities and Devices via Feature Disentanglement. IEEE ACM Trans. Audio Speech Lang. Process. 32: 1286-1297 (2024) - [j67]Haohe Liu, Yi Yuan, Xubo Liu, Xinhao Mei, Qiuqiang Kong, Qiao Tian, Yuping Wang, Wenwu Wang, Yuxuan Wang, Mark D. Plumbley:
AudioLDM 2: Learning Holistic Audio Generation With Self-Supervised Pretraining. IEEE ACM Trans. Audio Speech Lang. Process. 32: 2871-2883 (2024) - [j66]Xinhao Mei, Xubo Liu, Jianyuan Sun, Mark D. Plumbley, Wenwu Wang:
Towards Generating Diverse Audio Captions via Adversarial Training. IEEE ACM Trans. Audio Speech Lang. Process. 32: 3311-3323 (2024) - [j65]Xinhao Mei, Chutong Meng, Haohe Liu, Qiuqiang Kong, Tom Ko, Chengqi Zhao, Mark D. Plumbley, Yuexian Zou, Wenwu Wang:
WavCaps: A ChatGPT-Assisted Weakly-Labelled Audio Captioning Dataset for Audio-Language Multimodal Research. IEEE ACM Trans. Audio Speech Lang. Process. 32: 3339-3354 (2024) - [j64]Sara Atito Ali Ahmed, Muhammad Awais, Wenwu Wang, Mark D. Plumbley, Josef Kittler:
ASiT: Local-Global Audio Spectrogram Vision Transformer for Event Classification. IEEE ACM Trans. Audio Speech Lang. Process. 32: 3684-3693 (2024) - [j63]Jinbo Hu, Yin Cao, Ming Wu, Qiuqiang Kong, Feiran Yang, Mark D. Plumbley, Jun Yang:
Selective-Memory Meta-Learning With Environment Representations for Sound Event Localization and Detection. IEEE ACM Trans. Audio Speech Lang. Process. 32: 4313-4327 (2024) - [c200]Haohe Liu, Xubo Liu, Qiuqiang Kong, Wenwu Wang, Mark D. Plumbley:
Learning Temporal Resolution in Spectrogram for Audio Classification. AAAI 2024: 13873-13881 - [c199]Thomas Deacon, Mark D. Plumbley:
Working with AI Sound: Exploring the Future of Workplace AI Sound Technologies. CHIWORK 2024: 2:1-2:21 - [c198]Junqi Zhao, Xubo Liu, Jinzheng Zhao, Yi Yuan, Qiuqiang Kong, Mark D. Plumbley, Wenwu Wang:
Universal Sound Separation with Self-Supervised Audio Masked Autoencoder. EUSIPCO 2024: 1-5 - [c197]Yi Yuan, Haohe Liu, Xubo Liu, Qiushi Huang, Mark D. Plumbley, Wenwu Wang:
Retrieval-Augmented Text-to-Audio Generation. ICASSP 2024: 581-585 - [c196]Haohe Liu, Ke Chen, Qiao Tian, Wenwu Wang, Mark D. Plumbley:
AudioSR: Versatile Audio Super-Resolution at Scale. ICASSP 2024: 1076-1080 - [c195]Xuenan Xu, Arshdeep Singh, Mengyue Wu, Wenwu Wang, Mark D. Plumbley:
Investigating Passive Filter Pruning for Efficient CNN-Transformer Audio Captioning. MLSP 2024: 1-6 - [c194]Yi Yuan, Zhuo Chen, Xubo Liu, Haohe Liu, Xuenan Xu, Dongya Jia, Yuanzhe Chen, Mark D. Plumbley, Wenwu Wang:
T-CLAP: Temporal-Enhanced Contrastive Language-Audio Pretraining. MLSP 2024: 1-6 - [d6]Jisheng Bai, Mou Wang, Yafei Jia, Siwei Huang, Han Yin, Yutong Du, Dongzhe Zhang, Haohe Liu, Mark D. Plumbley, Woon-Seng Gan, Susanto Rahardja, Bin Xiang, Jianfeng Chen:
IEEE ICME 2024 Grand Challenge: Semi-supervised Acoustic Scene Classification under Domain Shift Development Dataset. Zenodo, 2024 - [d5]Jisheng Bai, Mou Wang, Yafei Jia, Siwei Huang, Han Yin, Yutong Du, Dongzhe Zhang, Haohe Liu, Mark D. Plumbley, Woon-Seng Gan, Susanto Rahardja, Bin Xiang, Jianfeng Chen:
IEEE ICME 2024 Grand Challenge: Semi-supervised Acoustic Scene Classification under Domain Shift Evaluation Dataset. Zenodo, 2024 - [i107]Jisheng Bai, Mou Wang, Haohe Liu, Han Yin, Yafei Jia, Siwei Huang, Yutong Du, Dongzhe Zhang, Dongyuan Shi, Woon-Seng Gan, Mark D. Plumbley, Susanto Rahardja, Bin Xiang, Jianfeng Chen:
Description on IEEE ICME 2024 Grand Challenge: Semi-supervised Acoustic Scene Classification under Domain Shift. CoRR abs/2402.02694 (2024) - [i106]Yi Yuan, Zhuo Chen, Xubo Liu, Haohe Liu, Xuenan Xu, Dongya Jia, Yuanzhe Chen, Mark D. Plumbley, Wenwu Wang:
T-CLAP: Temporal-Enhanced Contrastive Language-Audio Pretraining. CoRR abs/2404.17806 (2024) - [i105]Haohe Liu, Xuenan Xu, Yi Yuan, Mengyue Wu, Wenwu Wang, Mark D. Plumbley:
SemantiCodec: An Ultra Low Bitrate Semantic Audio Codec for General Sound. CoRR abs/2405.00233 (2024) - [i104]Yi Yuan, Dongya Jia, Xiaobin Zhuang, Yuanzhe Chen, Zhengxi Liu, Zhuo Chen, Yuping Wang, Yuxuan Wang, Xubo Liu, Mark D. Plumbley, Wenwu Wang:
Improving Audio Generation with Visual Enhanced Caption. CoRR abs/2407.04416 (2024) - [i103]Junqi Zhao, Xubo Liu, Jinzheng Zhao, Yi Yuan, Qiuqiang Kong, Mark D. Plumbley, Wenwu Wang:
Universal Sound Separation with Self-Supervised Audio Masked Autoencoder. CoRR abs/2407.11745 (2024) - [i102]Xuenan Xu, Haohe Liu, Mengyue Wu, Wenwu Wang, Mark D. Plumbley:
Efficient Audio Captioning with Encoder-Level Knowledge Distillation. CoRR abs/2407.14329 (2024) - [i101]Rhys Burchett-Vass, Arshdeep Singh, Gabriel Bibbó, Mark D. Plumbley:
Integrating IP Broadcasting with Audio Tags: Workflow and Challenges. CoRR abs/2407.15423 (2024) - [i100]Yizhou Tan, Yanru Wu, Yuanbo Hou, Xin Xu, Hui Bu, Shengchen Li, Dick Botteldooren, Mark D. Plumbley:
Exploring Differences between Human Perception and Model Inference in Audio Event Recognition. CoRR abs/2409.06580 (2024) - [i99]Yi Yuan, Xubo Liu, Haohe Liu, Mark D. Plumbley, Wenwu Wang:
FlowSep: Language-Queried Sound Separation with Rectified Flow Matching. CoRR abs/2409.07614 (2024) - [i98]Gabriel Bibbó, Thomas Deacon, Arshdeep Singh, Mark D. Plumbley:
The Sounds of Home: A Speech-Removed Residential Audio Dataset for Sound Event Detection. CoRR abs/2409.11262 (2024) - [i97]Annamaria Mesaros, Romain Serizel, Toni Heittola, Tuomas Virtanen, Mark D. Plumbley:
A decade of DCASE: Achievements, practices, evaluations and future challenges. CoRR abs/2410.04951 (2024) - [i96]Jinbo Hu, Yin Cao, Ming Wu, Fang Kang, Feiran Yang, Wenwu Wang, Mark D. Plumbley, Jun Yang:
PSELDNets: Pre-trained Neural Networks on Large-scale Synthetic Datasets for Sound Event Localization and Detection. CoRR abs/2411.06399 (2024) - 2023
- [j62]Zihang Song, Han Zhang, Sean Fuller, Andrew Lambert, Zhinong Ying, Petri Mähönen, Yonina C. Eldar, Shuguang Cui, Mark D. Plumbley, Clive Parini, Arumugam Nallanathan, Yue Gao:
Numerical evaluation on sub-Nyquist spectrum reconstruction methods. Frontiers Comput. Sci. 17(6): 176504 (2023) - [c193]Yi Yuan, Haohe Liu, Jinhua Liang, Xubo Liu, Mark D. Plumbley, Wenwu Wang:
Leveraging Pre-Trained AudioLDM for Sound Generation: A Benchmark Study. EUSIPCO 2023: 765-769 - [c192]Xubo Liu, Haohe Liu, Qiuqiang Kong, Xinhao Mei, Mark D. Plumbley, Wenwu Wang:
Simple Pooling Front-Ends for Efficient Audio Classification. ICASSP 2023: 1-5 - [c191]Arshdeep Singh, Mark D. Plumbley:
Efficient Similarity-Based Passive Filter Pruning for Compressing CNNs. ICASSP 2023: 1-5 - [c190]Haohe Liu, Zehua Chen, Yi Yuan, Xinhao Mei, Xubo Liu, Danilo P. Mandic, Wenwu Wang, Mark D. Plumbley:
AudioLDM: Text-to-Audio Generation with Latent Diffusion Models. ICML 2023: 21450-21474 - [c189]Jinhua Liang, Xubo Liu, Haohe Liu, Huy Phan, Emmanouil Benetos, Mark D. Plumbley, Wenwu Wang:
Adapting Language-Audio Models as Few-Shot Audio Learners. INTERSPEECH 2023: 276-280 - [c188]Xubo Liu, Qiushi Huang, Xinhao Mei, Haohe Liu, Qiuqiang Kong, Jianyuan Sun, Shengchen Li, Tom Ko, Yu Zhang, H. Lilian Tang, Mark D. Plumbley, Volkan Kiliç, Wenwu Wang:
Visually-Aware Audio Captioning With Adaptive Audio-Visual Attention. INTERSPEECH 2023: 2838-2842 - [c187]Haohe Liu, Qiuqiang Kong, Xubo Liu, Xinhao Mei, Wenwu Wang, Mark D. Plumbley:
Ontology-aware Learning and Evaluation for Audio Tagging. INTERSPEECH 2023: 3799-3803 - [c186]Jianyuan Sun, Xubo Liu, Xinhao Mei, Volkan Kiliç, Mark D. Plumbley, Wenwu Wang:
Dual Transformer Decoder based Features Fusion Network for Automated Audio Captioning. INTERSPEECH 2023: 4164-4168 - [c185]James A. King, Arshdeep Singh, Mark D. Plumbley:
Compressing Audio CNNs with Graph Centrality Based Filter Pruning. WASPAA 2023: 1-5 - [i95]Haohe Liu, Zehua Chen, Yi Yuan, Xinhao Mei, Xubo Liu, Danilo P. Mandic, Wenwu Wang, Mark D. Plumbley:
AudioLDM: Text-to-Audio Generation with Latent Diffusion Models. CoRR abs/2301.12503 (2023) - [i94]Yi Yuan, Haohe Liu, Jinhua Liang, Xubo Liu, Mark D. Plumbley, Wenwu Wang:
Leveraging Pre-trained AudioLDM for Text to Sound Generation: A Benchmark Study. CoRR abs/2303.03857 (2023) - [i93]Xinhao Mei, Chutong Meng, Haohe Liu, Qiuqiang Kong, Tom Ko, Chengqi Zhao, Mark D. Plumbley, Yuexian Zou, Wenwu Wang:
WavCaps: A ChatGPT-Assisted Weakly-Labelled Audio Captioning Dataset for Audio-Language Multimodal Research. CoRR abs/2303.17395 (2023) - [i92]Arshdeep Singh, Mark D. Plumbley:
Efficient CNNs via Passive Filter Pruning. CoRR abs/2304.02319 (2023) - [i91]James A. King, Arshdeep Singh, Mark D. Plumbley:
Compressing audio CNNs with graph centrality based filter pruning. CoRR abs/2305.03391 (2023) - [i90]Qiuqiang Kong, Ke Chen, Haohe Liu, Xingjian Du, Taylor Berg-Kirkpatrick, Shlomo Dubnov, Mark D. Plumbley:
Universal Source Separation with Weakly Labelled Data. CoRR abs/2305.07447 (2023) - [i89]Yi Yuan, Haohe Liu, Xubo Liu, Xiyuan Kang, Mark D. Plumbley, Wenwu Wang:
Latent Diffusion Model Based Foley Sound Generation System For DCASE Challenge 2023 Task 7. CoRR abs/2305.15905 (2023) - [i88]Jinhua Liang, Xubo Liu, Haohe Liu, Huy Phan, Emmanouil Benetos, Mark D. Plumbley, Wenwu Wang:
Adapting Language-Audio Models as Few-Shot Audio Learners. CoRR abs/2305.17719 (2023) - [i87]Arshdeep Singh, Haohe Liu, Mark D. Plumbley:
E-PANNs: Sound Recognition Using Efficient Pre-trained Audio Neural Networks. CoRR abs/2305.18665 (2023) - [i86]Jianyuan Sun, Xubo Liu, Xinhao Mei, Volkan Kiliç, Mark D. Plumbley, Wenwu Wang:
Dual Transformer Decoder based Features Fusion Network for Automated Audio Captioning. CoRR abs/2305.18753 (2023) - [i85]Gabriel Bibbó, Arshdeep Singh, Mark D. Plumbley:
Audio Tagging on an Embedded Hardware Platform. CoRR abs/2306.09106 (2023) - [i84]Yi Yuan, Haohe Liu, Xubo Liu, Xiyuan Kang, Peipei Wu, Mark D. Plumbley, Wenwu Wang:
Text-Driven Foley Sound Generation With Latent Diffusion Model. CoRR abs/2306.10359 (2023) - [i83]Xubo Liu, Zhongkai Zhu, Haohe Liu, Yi Yuan, Meng Cui, Qiushi Huang, Jinhua Liang, Yin Cao, Qiuqiang Kong, Mark D. Plumbley, Wenwu Wang:
WavJourney: Compositional Audio Creation with Large Language Models. CoRR abs/2307.14335 (2023) - [i82]Xubo Liu, Qiuqiang Kong, Yan Zhao, Haohe Liu, Yi Yuan, Yuzhuo Liu, Rui Xia, Yuxuan Wang, Mark D. Plumbley, Wenwu Wang:
Separate Anything You Describe. CoRR abs/2308.05037 (2023) - [i81]Haohe Liu, Qiao Tian, Yi Yuan, Xubo Liu, Xinhao Mei, Qiuqiang Kong, Yuping Wang, Wenwu Wang, Yuxuan Wang, Mark D. Plumbley:
AudioLDM 2: Learning Holistic Audio Generation with Self-supervised Pretraining. CoRR abs/2308.05734 (2023) - [i80]Jinbo Hu, Yin Cao, Ming Wu, Feiran Yang, Ziying Yu, Wenwu Wang, Mark D. Plumbley, Jun Yang:
META-SELD: Meta-Learning for Fast Adaptation to the new environment in Sound Event Localization and Detection. CoRR abs/2308.08847 (2023) - [i79]Haohe Liu, Ke Chen, Qiao Tian, Wenwu Wang, Mark D. Plumbley:
AudioSR: Versatile Audio Super-resolution at Scale. CoRR abs/2309.07314 (2023) - [i78]Yi Yuan, Haohe Liu, Xubo Liu, Qiushi Huang, Mark D. Plumbley, Wenwu Wang:
Retrieval-Augmented Text-to-Audio Generation. CoRR abs/2309.08051 (2023) - [i77]Jinbo Hu, Yin Cao, Ming Wu, Qiuqiang Kong, Feiran Yang, Mark D. Plumbley, Jun Yang:
Selective-Memory Meta-Learning with Environment Representations for Sound Event Localization and Detection. CoRR abs/2312.16422 (2023) - 2022
- [j61]Xinhao Mei, Xubo Liu, Mark D. Plumbley, Wenwu Wang:
Automated audio captioning: an overview of recent progress and new challenges. EURASIP J. Audio Speech Music. Process. 2022(1): 26 (2022) - [c184]Jinbo Hu, Yin Cao, Ming Wu, Qiuqiang Kong, Feiran Yang, Mark D. Plumbley, Jun Yang:
Sound Event Localization and Detection for Real Spatial Sound Scenes: Event-Independent Network and Data Augmentation Chains. DCASE 2022 - [c183]Haohe Liu, Xubo Liu, Xinhao Mei, Qiuqiang Kong, Wenwu Wang, Mark D. Plumbley:
Segment-Level Metric Learning for Few-Shot Bioacoustic Event Detection. DCASE 2022 - [c182]Arshdeep Singh, Mark D. Plumbley:
Low-Complexity CNNs for Acoustic Scene Classification. DCASE 2022 - [c181]Yang Xiao, Xubo Liu, James A. King, Arshdeep Singh, Eng Siong Chng, Mark D. Plumbley, Wenwu Wang:
Continual Learning for On-Device Environmental Sound Classification. DCASE 2022 - [c180]Jianyuan Sun, Xubo Liu, Xinhao Mei, Jinzheng Zhao, Mark D. Plumbley, Volkan Kiliç, Wenwu Wang:
Deep Neural Decision Forest for Acoustic Scene Classification. EUSIPCO 2022: 772-776 - [c179]Xubo Liu, Xinhao Mei, Qiushi Huang, Jianyuan Sun, Jinzheng Zhao, Haohe Liu, Mark D. Plumbley, Volkan Kiliç, Wenwu Wang:
Leveraging Pre-trained BERT for Audio Captioning. EUSIPCO 2022: 1145-1149 - [c178]Xinhao Mei, Xubo Liu, Jianyuan Sun, Mark D. Plumbley, Wenwu Wang:
Diverse Audio Captioning Via Adversarial Training. ICASSP 2022: 8882-8886 - [c177]Jinbo Hu, Yin Cao, Ming Wu, Qiuqiang Kong, Feiran Yang, Mark D. Plumbley, Jun Yang:
A Track-Wise Ensemble Event Independent Network for Polyphonic Sound Event Localization and Detection. ICASSP 2022: 9196-9200 - [c176]Xubo Liu, Haohe Liu, Qiuqiang Kong, Xinhao Mei, Jinzheng Zhao, Qiushi Huang, Mark D. Plumbley, Wenwu Wang:
Separate What You Describe: Language-Queried Audio Source Separation. INTERSPEECH 2022: 1801-1805 - [c175]Arshdeep Singh, Mark D. Plumbley:
A Passive Similarity based CNN Filter Pruning for Efficient Acoustic Scene Classification. INTERSPEECH 2022: 2433-2437 - [c174]Xinhao Mei, Xubo Liu, Jianyuan Sun, Mark D. Plumbley, Wenwu Wang:
On Metric Learning for Audio-Text Cross-Modal Retrieval. INTERSPEECH 2022: 4142-4146 - [c173]Meng Cui, Xubo Liu, Jinzheng Zhao, Jianyuan Sun, Guoping Lian, Tao Chen, Mark D. Plumbley, Daoliang Li, Wenwu Wang:
Fish Feeding Intensity Assessment in Aquaculture: A New Audio Dataset AFFIA3K and a Deep Learning Algorithm. MLSP 2022: 1-6 - [i76]Xubo Liu, Xinhao Mei, Qiushi Huang, Jianyuan Sun, Jinzheng Zhao, Haohe Liu, Mark D. Plumbley, Volkan Kiliç, Wenwu Wang:
Leveraging Pre-trained BERT for Audio Captioning. CoRR abs/2203.02838 (2022) - [i75]Jianyuan Sun, Xubo Liu, Xinhao Mei, Jinzheng Zhao, Mark D. Plumbley, Volkan Kiliç, Wenwu Wang:
Deep Neural Decision Forest for Acoustic Scene Classification. CoRR abs/2203.03436 (2022) - [i74]Jinbo Hu, Yin Cao, Ming Wu, Qiuqiang Kong, Feiran Yang, Mark D. Plumbley, Jun Yang:
A Track-Wise Ensemble Event Independent Network for Polyphonic Sound Event Localization and Detection. CoRR abs/2203.10228 (2022) - [i73]Xubo Liu, Haohe Liu, Qiuqiang Kong, Xinhao Mei, Jinzheng Zhao, Qiushi Huang, Mark D. Plumbley, Wenwu Wang:
Separate What You Describe: Language-Queried Audio Source Separation. CoRR abs/2203.15147 (2022) - [i72]Xinhao Mei, Xubo Liu, Jianyuan Sun, Mark D. Plumbley, Wenwu Wang:
On Metric Learning for Audio-Text Cross-Modal Retrieval. CoRR abs/2203.15537 (2022) - [i71]Arshdeep Singh, Mark D. Plumbley:
A Passive Similarity based CNN Filter Pruning for Efficient Acoustic Scene Classification. CoRR abs/2203.15751 (2022) - [i70]Xinhao Mei, Xubo Liu, Mark D. Plumbley, Wenwu Wang:
Automated Audio Captioning: an Overview of Recent Progress and New Challenges. CoRR abs/2205.05949 (2022) - [i69]Yang Xiao, Xubo Liu, James A. King, Arshdeep Singh, Eng Siong Chng, Mark D. Plumbley, Wenwu Wang:
Continual Learning For On-Device Environmental Sound Classification. CoRR abs/2207.07429 (2022) - [i68]Haohe Liu, Xubo Liu, Xinhao Mei, Qiuqiang Kong, Wenwu Wang, Mark D. Plumbley:
Segment-level Metric Learning for Few-shot Bioacoustic Event Detection. CoRR abs/2207.07773 (2022) - [i67]Haohe Liu, Xubo Liu, Xinhao Mei, Qiuqiang Kong, Wenwu Wang, Mark D. Plumbley:
Surrey System for DCASE 2022 Task 5: Few-shot Bioacoustic Event Detection with Segment-level Metric Learning. CoRR abs/2207.10547 (2022) - [i66]Arshdeep Singh, Mark D. Plumbley:
Low-complexity CNNs for Acoustic Scene Classification. CoRR abs/2207.11529 (2022) - [i65]Arshdeep Singh, James A. King, Xubo Liu, Wenwu Wang, Mark D. Plumbley:
Low-complexity CNNs for Acoustic Scene Classification. CoRR abs/2208.01555 (2022) - [i64]Jinbo Hu, Yin Cao, Ming Wu, Qiuqiang Kong, Feiran Yang, Mark D. Plumbley, Jun Yang:
Sound Event Localization and Detection for Real Spatial Sound Scenes: Event-Independent Network and Data Augmentation Chains. CoRR abs/2209.01802 (2022) - [i63]Xubo Liu, Haohe Liu, Qiuqiang Kong, Xinhao Mei, Mark D. Plumbley, Wenwu Wang:
Simple Pooling Front-ends For Efficient Audio Classification. CoRR abs/2210.00943 (2022) - [i62]Haohe Liu, Xubo Liu, Qiuqiang Kong, Wenwu Wang, Mark D. Plumbley:
Learning the Spectrogram Temporal Resolution for Audio Classification. CoRR abs/2210.01719 (2022) - [i61]Jianyuan Sun, Xubo Liu, Xinhao Mei, Mark D. Plumbley, Volkan Kiliç, Wenwu Wang:
Automated Audio Captioning via Fusion of Low- and High- Dimensional Features. CoRR abs/2210.05037 (2022) - [i60]Xubo Liu, Qiushi Huang, Xinhao Mei, Haohe Liu, Qiuqiang Kong, Jianyuan Sun, Shengchen Li, Tom Ko, Yu Zhang, H. Lilian Tang, Mark D. Plumbley, Volkan Kiliç, Wenwu Wang:
Visually-Aware Audio Captioning With Adaptive Audio-Visual Attention. CoRR abs/2210.16428 (2022) - [i59]Arshdeep Singh, Mark D. Plumbley:
Efficient Similarity-based Passive Filter Pruning for Compressing CNNs. CoRR abs/2210.17416 (2022) - [i58]Haohe Liu, Qiuqiang Kong, Xubo Liu, Xinhao Mei, Wenwu Wang, Mark D. Plumbley:
Ontology-aware Learning and Evaluation for Audio Tagging. CoRR abs/2211.12195 (2022) - [i57]Sara Atito, Muhammad Awais, Wenwu Wang, Mark D. Plumbley, Josef Kittler:
ASiT: Audio Spectrogram vIsion Transformer for General Audio Representation. CoRR abs/2211.13189 (2022) - [i56]Xinhao Mei, Xubo Liu, Jianyuan Sun, Mark D. Plumbley, Wenwu Wang:
Towards Generating Diverse Audio Captions via Adversarial Training. CoRR abs/2212.02033 (2022) - 2021
- [j60]Yue Gao, Zihang Song, Han Zhang, Sean Fuller, Andrew Lambert, Zhinong Ying, Petri Mähönen, Yonina C. Eldar, Shuguang Cui, Mark D. Plumbley, Clive Parini, Arumugam Nallanathan:
Sub-Nyquist spectrum sensing and learning challenge. Frontiers Comput. Sci. 15(4): 154504 (2021) - [j59]Bin Li, Lucas Rencker, Jing Dong, Yuhui Luo, Mark D. Plumbley, Wenwu Wang:
Sparse Analysis Model Based Dictionary Learning for Signal Declipping. IEEE J. Sel. Top. Signal Process. 15(1): 25-36 (2021) - [j58]Annamaria Mesaros, Toni Heittola, Tuomas Virtanen, Mark D. Plumbley:
Sound Event Detection: A tutorial. IEEE Signal Process. Mag. 38(5): 67-83 (2021) - [j57]Jie Jiang, Qiuqiang Kong, Mark D. Plumbley, Nigel Gilbert, Mark Hoogendoorn, Diederik M. Roijers:
Deep Learning-Based Energy Disaggregation and On/Off Detection of Household Appliances. ACM Trans. Knowl. Discov. Data 15(3): 50:1-50:21 (2021) - [j56]Zhao Ren, Qiuqiang Kong, Jing Han, Mark D. Plumbley, Björn W. Schuller:
CAA-Net: Conditional Atrous CNNs With Attention for Explainable Device-Robust Acoustic Scene Classification. IEEE Trans. Multim. 23: 4131-4142 (2021) - [c172]Francesco Renna, Mark D. Plumbley, Miguel T. Coimbra:
Source Separation of the Second Heart Sound via Alternating Optimization. CinC 2021: 1-4 - [c171]Andres Fernandez, Mark D. Plumbley:
Using UMAP to Inspect Audio Data for Unsupervised Anomaly Detection Under Domain-Shift Conditions. DCASE 2021: 165-169 - [c170]Xubo Liu, Qiushi Huang, Xinhao Mei, Tom Ko, H. Lilian Tang, Mark D. Plumbley, Wenwu Wang:
CL4AC: A Contrastive Loss for Audio Captioning. DCASE 2021: 196-200 - [c169]Turab Iqbal, Yin Cao, Andrew Bailey, Mark D. Plumbley, Wenwu Wang:
ARCA23K: An Audio Dataset for Investigating Open-Set Label Noise. DCASE 2021: 201-205 - [c168]Xinhao Mei, Qiushi Huang, Xubo Liu, Gengyun Chen, Jingqian Wu, Yusong Wu, Jinzheng Zhao, Shengchen Li, Tom Ko, H. Lilian Tang, Xi Shao, Mark D. Plumbley, Wenwu Wang:
An Encoder-Decoder Based Audio Captioning System with Transfer and Reinforcement Learning. DCASE 2021: 206-210 - [c167]Xinhao Mei, Xubo Liu, Qiushi Huang, Mark D. Plumbley, Wenwu Wang:
Audio Captioning Transformer. DCASE 2021: 211-215 - [c166]Lam Pham, Chris Baume, Qiuqiang Kong, Tassadaq Hussain, Wenwu Wang, Mark D. Plumbley:
An Audio-Based Deep Learning Framework For BBC Television Programme Classification. EUSIPCO 2021: 56-60 - [c165]Andrew Bailey, Mark D. Plumbley:
Gender Bias in Depression Detection Using Audio Features. EUSIPCO 2021: 596-600 - [c164]Yin Cao, Turab Iqbal, Qiuqiang Kong, Fengyan An, Wenwu Wang, Mark D. Plumbley:
An Improved Event-Independent Network for Polyphonic Sound Event Localization and Detection. ICASSP 2021: 885-889 - [c163]Jingshu Zhang, Mark D. Plumbley, Wenwu Wang:
Weighted Magnitude-Phase Loss for Speech Dereverberation. ICASSP 2021: 5794-5798 - [c162]Xubo Liu, Turab Iqbal, Jinzheng Zhao, Qiushi Huang, Mark D. Plumbley, Wenwu Wang:
Conditional Sound Generation Using Neural Discrete Time-Frequency Representation Learning. MLSP 2021: 1-6 - [d4]Turab Iqbal, Yin Cao, Andrew Bailey, Mark D. Plumbley, Wenwu Wang:
ARCA23K. Zenodo, 2021 - [d3]