Rongzhi Gu


Journal Articles

2024
[j6]
  persistent URL:
  - https://dblp.org/rec/journals/taslp/GuL24
Rongzhi Gu, Yi Luo:
ReZero: Region-Customizable Sound Extraction. IEEE ACM Trans. Audio Speech Lang. Process. 32: 2576-2589 (2024)
[j5]
  persistent URL:
  - https://dblp.org/rec/journals/tismir/UhlichFHTWRCMLLYGSSHSM24
Stefan Uhlich, Giorgio Fabbro, Masato Hirano, Shusuke Takahashi, Gordon Wichern, Jonathan Le Roux, Dipam Chakraborty, Sharada Mohanty, Kai Li, Yi Luo, Jianwei Yu, Rongzhi Gu, Roman A. Solovyev, Alexander L. Stempkovskiy, Tatiana Habruseva, Mikhail Sukhovei, Yuki Mitsufuji:
The Sound Demixing Challenge 2023 - Cinematic Demixing Track. Trans. Int. Soc. Music. Inf. Retr. 7(1): 44-62 (2024)
2023
[j4]
  persistent URL:
  - https://dblp.org/rec/journals/taslp/GuZZY23
Rongzhi Gu, Shi-Xiong Zhang, Yuexian Zou, Dong Yu:
Towards Unified All-Neural Beamforming for Time and Frequency Domain Speech Separation. IEEE ACM Trans. Audio Speech Lang. Process. 31: 849-862 (2023)
2021
[j3]
  persistent URL:
  - https://dblp.org/rec/journals/spl/GuZZY21
Rongzhi Gu, Shi-Xiong Zhang, Yuexian Zou, Dong Yu:
Complex Neural Spatial Filter: Enhancing Multi-Channel Target Speech Separation in Complex Domain. IEEE Signal Process. Lett. 28: 1370-1374 (2021)
2020
[j2]
  persistent URL:
  - https://dblp.org/rec/journals/jstsp/GuZXCZY20
Rongzhi Gu, Shi-Xiong Zhang, Yong Xu, Lianwu Chen, Yuexian Zou, Dong Yu:
Multi-Modal Multi-Channel Target Speech Separation. IEEE J. Sel. Top. Signal Process. 14(3): 530-541 (2020)
2017
[j1]
  persistent URL:
  - https://dblp.org/rec/journals/cem/SuGHC17
Xin Su, Rongzhi Gu, Guangjie Han, Dongmin Choi:
Interaction Data Detection System to Upgrade Brick and Mortar Shops: Metrics Allow Offline Shops to Compete with Online Retailers. IEEE Consumer Electron. Mag. 6(4): 57-63 (2017)

Conference and Workshop Papers

2024
[c27]
  persistent URL:
  - https://dblp.org/rec/conf/aaai/XuCYHWZLLG24
Yaoxun Xu, Hangting Chen, Jianwei Yu, Qiaochu Huang, Zhiyong Wu, Shi-Xiong Zhang, Guangzhi Li, Yi Luo, Rongzhi Gu:
SECap: Speech Emotion Captioning with Large Language Model. AAAI 2024: 19323-19331
[c26]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/0004G24
Yi Luo, Rongzhi Gu:
Improving Music Source Separation with SIMO Stereo Band-Split RNN. ICASSP 2024: 426-430
[c25]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/LuoG24a
Yi Luo, Rongzhi Gu:
Fast Random Approximation of Multi-Channel Room Impulse Response. ICASSP Workshops 2024: 449-454
[c24]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/FanGLP24
Jingjie Fan, Rongzhi Gu, Yi Luo, Cong Pang:
A Unified Geometry-Aware Source Localization and Separation Framework for Ad-Hoc Microphone Array. ICASSP Workshops 2024: 725-729
2023
[c23]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/PengSGPMBC23
Junyi Peng, Themos Stafylakis, Rongzhi Gu, Oldrich Plchot, Ladislav Mosner, Lukás Burget, Jan Cernocký:
Parameter-Efficient Transfer Learning of Pre-Trained Transformer Models for Speaker Verification Using Adapters. ICASSP 2023: 1-5
[c22]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/YuCLGLW23
Jianwei Yu, Hangting Chen, Yi Luo, Rongzhi Gu, Weihua Li, Chao Weng:
TSpeech-AI System Description to the 5th Deep Noise Suppression (DNS) Challenge. ICASSP 2023: 1-2
[c21]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/YuC0GW23
Jianwei Yu, Hangting Chen, Yi Luo, Rongzhi Gu, Chao Weng:
High Fidelity Speech Enhancement with Band-split RNN. INTERSPEECH 2023: 2483-2487
[c20]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ChenY0GLLW23
Hangting Chen, Jianwei Yu, Yi Luo, Rongzhi Gu, Weihua Li, Zhuocheng Lu, Chao Weng:
Ultra Dual-Path Compression For Joint Echo Cancellation And Noise Suppression. INTERSPEECH 2023: 2523-2527
2022
[c19]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/XuGZ22
Xinmeng Xu, Rongzhi Gu, Yuexian Zou:
Improving Dual-Microphone Speech Enhancement by Learning Cross-Channel Features with Multi-Head Attention. ICASSP 2022: 6492-6496
[c18]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/WangGZGWZ22
Li Wang, Rongzhi Gu, Weiji Zhuang, Peng Gao, Yujun Wang, Yuexian Zou:
Learning Decoupling Features Through Orthogonality Regularization. ICASSP 2022: 7562-7566
[c17]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/PengGMPBC22
Junyi Peng, Rongzhi Gu, Ladislav Mosner, Oldrich Plchot, Lukás Burget, Jan Cernocký:
Learnable Sparse Filterbank for Speaker Verification. INTERSPEECH 2022: 5110-5114
[c16]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ZhaoGYTZ22
Zifeng Zhao, Rongzhi Gu, Dongchao Yang, Jinchuan Tian, Yuexian Zou:
Speaker-Aware Mixture of Mixtures Training for Weakly Supervised Speaker Extraction. INTERSPEECH 2022: 5318-5322
[c15]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ZhaoYGZZ22
Zifeng Zhao, Dongchao Yang, Rongzhi Gu, Haoran Zhang, Yuexian Zou:
Target Confusion in End-to-end Speaker Extraction: Analysis and Approaches. INTERSPEECH 2022: 5333-5337
2021
[c14]
  persistent URL:
  - https://dblp.org/rec/conf/asru/GuZYY21
Rongzhi Gu, Shi-Xiong Zhang, Meng Yu, Dong Yu:
3D Spatial Features for Multi-Channel Target Speech Separation. ASRU 2021: 996-1002
[c13]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/PengQWG0BC21
Junyi Peng, Xiaoyang Qu, Jianzong Wang, Rongzhi Gu, Jing Xiao, Lukás Burget, Jan Cernocký:
ICSpk: Interpretable Complex Speaker Embedding Extractor from Raw Waveform. Interspeech 2021: 511-515
[c12]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/PengQGWXBC21
Junyi Peng, Xiaoyang Qu, Rongzhi Gu, Jianzong Wang, Jing Xiao, Lukás Burget, Jan Cernocký:
Effective Phase Encoding for End-To-End Speaker Verification. Interspeech 2021: 2366-2370
[c11]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/WangGCZ21
Li Wang, Rongzhi Gu, Nuo Chen, Yuexian Zou:
Text Anchor Based Metric Learning for Small-Footprint Keyword Spotting. Interspeech 2021: 4219-4223
2020
[c10]
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/PengGZZ20
Junyi Peng, Rongzhi Gu, Haoran Zhang, Yuexian Zou:
Context-adaptive Gaussian Attention for Text-independent Speaker Verification. APSIPA 2020: 595-599
[c9]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/GuZCXYSZY20
Rongzhi Gu, Shi-Xiong Zhang, Lianwu Chen, Yong Xu, Meng Yu, Dan Su, Yuexian Zou, Dong Yu:
Enhancing End-to-End Multi-Channel Speech Separation Via Spatial Feature Learning. ICASSP 2020: 7319-7323
[c8]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/PengGZ20
Junyi Peng, Rongzhi Gu, Yuexian Zou:
Deep Speaker Embedding with Long Short Term Centroid Learning for Text-Independent Speaker Verification. INTERSPEECH 2020: 3246-3250
[c7]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/YuWGZCX00YLM20
Jianwei Yu, Bo Wu, Rongzhi Gu, Shi-Xiong Zhang, Lianwu Chen, Yong Xu, Meng Yu, Dan Su, Dong Yu, Xunying Liu, Helen Meng:
Audio-Visual Multi-Channel Recognition of Overlapped Speech. INTERSPEECH 2020: 3496-3500
2019
[c6]
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/PengGZW19
Junyi Peng, Rongzhi Gu, Yuexian Zou, Wenwu Wang:
Speaker-discriminative Embedding Learning via Affinity Matrix for Short Utterance Speaker Verification. APSIPA 2019: 314-319
[c5]
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/GuPZ019
Rongzhi Gu, Junyi Peng, Yuexian Zou, Dong Yu:
Alleviate Cross-chunk Permutation through Chunk-level Speaker Embedding for Blind Speech Separation. APSIPA 2019: 325-331
[c4]
  persistent URL:
  - https://dblp.org/rec/conf/asru/PengGZ19
Junyi Peng, Rongzhi Gu, Yuexian Zou:
Logistic Similarity Metric Learning via Affinity Matrix for Text-Independent Speaker Verification. ASRU 2019: 704-709
[c3]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/GuCZZXYSZ019
Rongzhi Gu, Lianwu Chen, Shi-Xiong Zhang, Jimeng Zheng, Yong Xu, Meng Yu, Dan Su, Yuexian Zou, Dong Yu:
Neural Spatial Filter: Target Speaker Speech Separation Assisted with Directional Information. INTERSPEECH 2019: 4290-4294
[c2]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/BahmaninezhadWG19
Fahimeh Bahmaninezhad, Jian Wu, Rongzhi Gu, Shi-Xiong Zhang, Yong Xu, Meng Yu, Dong Yu:
A Comprehensive Study of Speech Separation: Spectrogram vs Waveform Separation. INTERSPEECH 2019: 4574-4578
2017
[c1]
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/ZouGWJR17
Yuexian Zou, Rongzhi Gu, Disong Wang, Aimin Jiang, Christian H. Ritz:
Learning a robust DOA estimation model with acoustic vector sensor cues. APSIPA 2017: 1688-1691

Informal and Other Publications

2024
[i23]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2404-04947
Yi Luo, Jianwei Yu, Hangting Chen, Rongzhi Gu, Chao Weng:
Gull: A Generative Multifunctional Audio Codec. CoRR abs/2404.04947 (2024)
[i22]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2409-13216
Yaoxun Xu, Hangting Chen, Jianwei Yu, Wei Tan, Rongzhi Gu, Shun Lei, Zhiwei Lin, Zhiyong Wu:
MuCodec: Ultra Low-Bitrate Music Codec. CoRR abs/2409.13216 (2024)
2023
[i21]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2302-13462
Rongzhi Gu, Shi-Xiong Zhang, Dong Yu:
3D Neural Beamforming for Multi-channel Speech Separation Against Location Uncertainty. CoRR abs/2302.13462 (2023)
[i20]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2304-08052
Yi Luo, Rongzhi Gu:
Fast Random Approximation of Multi-channel Room Impulse Response. CoRR abs/2304.08052 (2023)
[i19]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2308-06981
Stefan Uhlich, Giorgio Fabbro, Masato Hirano, Shusuke Takahashi, Gordon Wichern, Jonathan Le Roux, Dipam Chakraborty, Sharada P. Mohanty, Kai Li, Yi Luo, Jianwei Yu, Rongzhi Gu, Roman A. Solovyev, Alexander L. Stempkovskiy, Tatiana Habruseva, Mikhail Sukhovei, Yuki Mitsufuji:
The Sound Demixing Challenge 2023 - Cinematic Demixing Track. CoRR abs/2308.06981 (2023)
[i18]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2308-11053
Hangting Chen, Jianwei Yu, Yi Luo, Rongzhi Gu, Weihua Li, Zhuocheng Lu, Chao Weng:
Ultra Dual-Path Compression For Joint Echo Cancellation And Noise Suppression. CoRR abs/2308.11053 (2023)
[i17]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2308-16892
Rongzhi Gu, Yi Luo:
ReZero: Region-customizable Sound Extraction. CoRR abs/2308.16892 (2023)
[i16]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2312-10381
Yaoxun Xu, Hangting Chen, Jianwei Yu, Qiaochu Huang, Zhiyong Wu, Shi-Xiong Zhang, Guangzhi Li, Yi Luo, Rongzhi Gu:
SECap: Speech Emotion Captioning with Large Language Model. CoRR abs/2312.10381 (2023)
2022
[i15]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2203-16772
Li Wang, Rongzhi Gu, Weiji Zhuang, Peng Gao, Yujun Wang, Yuexian Zou:
Learning Decoupling Features Through Orthogonality Regularization. CoRR abs/2203.16772 (2022)
[i14]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2204-01355
Zifeng Zhao, Dongchao Yang, Rongzhi Gu, Haoran Zhang, Yuexian Zou:
Target Confusion in End-to-end Speaker Extraction: Analysis and Approaches. CoRR abs/2204.01355 (2022)
[i13]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2204-07375
Zifeng Zhao, Rongzhi Gu, Dongchao Yang, Jinchuan Tian, Yuexian Zou:
Speaker-Aware Mixture of Mixtures Training for Weakly Supervised Speaker Extraction. CoRR abs/2204.07375 (2022)
[i12]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2205-01280
Xinmeng Xu, Rongzhi Gu, Yuexian Zou:
Improving Dual-Microphone Speech Enhancement by Learning Cross-Channel Features with Multi-Head Attention. CoRR abs/2205.01280 (2022)
[i11]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2210-16032
Junyi Peng, Themos Stafylakis, Rongzhi Gu, Oldrich Plchot, Ladislav Mosner, Lukás Burget, Jan Cernocký:
Parameter-efficient transfer learning of pre-trained Transformer models for speaker verification using adapters. CoRR abs/2210.16032 (2022)
[i10]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2212-08348
Rongzhi Gu, Shi-Xiong Zhang, Yuexian Zou, Dong Yu:
Towards Unified All-Neural Beamforming for Time and Frequency Domain Speech Separation. CoRR abs/2212.08348 (2022)
2021
[i9]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2104-12359
Rongzhi Gu, Shi-Xiong Zhang, Yuexian Zou, Dong Yu:
Complex Neural Spatial Filter: Enhancing Multi-channel Target Speech Separation in Complex Domain. CoRR abs/2104.12359 (2021)
[i8]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2105-00812
Jinchuan Tian, Rongzhi Gu, Helin Wang, Yuexian Zou:
Layer Reduction: Accelerating Conformer-Based Self-Supervised Model via Layer Consistency. CoRR abs/2105.00812 (2021)
[i7]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2108-05516
Li Wang, Rongzhi Gu, Nuo Chen, Yuexian Zou:
Text Anchor Based Metric Learning for Small-footprint Keyword Spotting. CoRR abs/2108.05516 (2021)
2020
[i6]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2001-00391
Rongzhi Gu, Yuexian Zou:
Temporal-Spatial Neural Filter: Direction Informed End-to-End Multi-channel Target Speech Separation. CoRR abs/2001.00391 (2020)
[i5]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2003-03927
Rongzhi Gu, Shi-Xiong Zhang, Lianwu Chen, Yong Xu, Meng Yu, Dan Su, Yuexian Zou, Dong Yu:
Enhancing End-to-End Multi-channel Speech Separation via Spatial Feature Learning. CoRR abs/2003.03927 (2020)
[i4]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2003-07032
Rongzhi Gu, Shi-Xiong Zhang, Yong Xu, Lianwu Chen, Yuexian Zou, Dong Yu:
Multi-modal Multi-channel Target Speech Separation. CoRR abs/2003.07032 (2020)
[i3]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2005-08571
Jianwei Yu, Bo Wu, Rongzhi Gu, Shi-Xiong Zhang, Lianwu Chen, Yong Xu, Meng Yu, Dan Su, Dong Yu, Xunying Liu, Helen Meng:
Audio-visual Multi-channel Recognition of Overlapped Speech. CoRR abs/2005.08571 (2020)
2019
[i2]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1905-06286
Rongzhi Gu, Jian Wu, Shi-Xiong Zhang, Lianwu Chen, Yong Xu, Meng Yu, Dan Su, Yuexian Zou, Dong Yu:
End-to-End Multi-Channel Speech Separation. CoRR abs/1905.06286 (2019)
[i1]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1905-07497
Fahimeh Bahmaninezhad, Jian Wu, Rongzhi Gu, Shi-Xiong Zhang, Yong Xu, Meng Yu, Dong Yu:
A comprehensive study of speech separation: spectrogram vs waveform separation. CoRR abs/1905.07497 (2019)
