


Остановите войну!
for scientists:


default search action
Haizhou Li 0001
李海洲
Person information

- unicode name: 李海洲
- affiliation: Chinese University of Hong Kong (Shenzhen), China
- affiliation: National University of Singapore, Department of Electrical and Computer Engineering, Singapore
- affiliation (2006 - 2016): Nanyang Technological University, Singapore
- affiliation (2003 - 2016): Institute for Infocomm Research, A*STAR, Singapore
- affiliation (2011): University of New South Wales, Sydney, Australia
- affiliation (2009): University of Eastern Finland, Kuopio, Finland
- affiliation (PhD 1990): South China University of Technology, Guangzhou, China
Other persons with the same name
- Haizhou Li 0002 — Blaise Pascal University, Clermont-Ferrand, France
- Haizhou Li 0003 — City University of Hong Kong, Department of Computer Science, Hong Kong
Refine list

refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
showing all ?? records
2020 – today
- 2023
- [j160]Tao Luo, Weng-Fai Wong, Rick Siow Mong Goh, Anh Tuan Do, Zhixian Chen, Haizhou Li, Wenyu Jiang, Weiyun Yau:
Achieving Green AI with Energy-Efficient Deep Learning Using Neuromorphic Computing. Commun. ACM 66(7): 52-57 (2023) - [j159]Buddhi Wickramasinghe
, Eliathamby Ambikairajah, Vidhyasaharan Sethu, Julien Epps, Haizhou Li, Ting Dang:
DNN controlled adaptive front-end for replay attack detection systems. Speech Commun. 154: 102973 (2023) - [j158]Tingting Wang
, Zexu Pan, Meng Ge
, Zhen Yang
, Haizhou Li
:
Time-Domain Speech Separation Networks With Graph Encoding Auxiliary. IEEE Signal Process. Lett. 30: 110-114 (2023) - [j157]Yi Zhou
, Zhizheng Wu
, Mingyang Zhang
, Xiaohai Tian
, Haizhou Li
:
TTS-Guided Training for Accent Conversion Without Parallel Data. IEEE Signal Process. Lett. 30: 533-537 (2023) - [j156]Mingyang Zhang
, Xuehao Zhou, Zhizheng Wu
, Haizhou Li:
Towards Zero-Shot Multi-Speaker Multi-Accent Text-to-Speech Synthesis. IEEE Signal Process. Lett. 30: 947-951 (2023) - [j155]Kun Zhou
, Berrak Sisman
, Rajib Rana
, Björn W. Schuller
, Haizhou Li
:
Emotion Intensity and its Control for Emotional Voice Conversion. IEEE Trans. Affect. Comput. 14(1): 31-48 (2023) - [j154]Kun Zhou, Berrak Sisman, Rajib Rana, Björn W. Schuller, Haizhou Li:
Speech Synthesis With Mixed Emotions. IEEE Trans. Affect. Comput. 14(4): 3120-3134 (2023) - [j153]Hui Tian
, Yiqin Qiu
, Wojciech Mazurczyk
, Haizhou Li
, Zhenxing Qian
:
STFF-SM: Steganalysis Model Based on Spatial and Temporal Feature Fusion for Speech Streams. IEEE ACM Trans. Audio Speech Lang. Process. 31: 277-289 (2023) - [j152]Qiquan Zhang
, Xinyuan Qian
, Zhaoheng Ni, Aaron Nicolson, Eliathamby Ambikairajah
, Haizhou Li:
A Time-Frequency Attention Module for Neural Speech Enhancement. IEEE ACM Trans. Audio Speech Lang. Process. 31: 462-475 (2023) - [j151]Xinyuan Qian
, Zhengdong Wang, Jiadong Wang
, Guohui Guan, Haizhou Li
:
Audio-Visual Cross-Attention Network for Robotic Speaker Tracking. IEEE ACM Trans. Audio Speech Lang. Process. 31: 550-562 (2023) - [j150]Chen Zhang
, Luis Fernando D'Haro
, Qiquan Zhang
, Thomas Friedrichs, Haizhou Li
:
PoE: A Panel of Experts for Generalized Automatic Dialogue Assessment. IEEE ACM Trans. Audio Speech Lang. Process. 31: 1234-1250 (2023) - [j149]Ruijie Tao
, Kong Aik Lee
, Rohan Kumar Das
, Ville Hautamäki, Haizhou Li:
Self-Supervised Training of Speaker Encoder With Multi-Modal Diverse Positive Pairs. IEEE ACM Trans. Audio Speech Lang. Process. 31: 1706-1719 (2023) - [j148]Yi Zhou
, Zhizheng Wu
, Xiaohai Tian
, Haizhou Li
:
Optimization of Cross-Lingual Voice Conversion With Linguistics Losses to Reduce Foreign Accents. IEEE ACM Trans. Audio Speech Lang. Process. 31: 1916-1926 (2023) - [j147]Xiaoxue Gao
, Chitralekha Gupta
, Haizhou Li:
PoLyScriber: Integrated Fine-Tuning of Extractor and Lyrics Transcriber for Polyphonic Music. IEEE ACM Trans. Audio Speech Lang. Process. 31: 1968-1981 (2023) - [j146]Zhenyu Weng
, Huiping Zhuang
, Haizhou Li
, Balakrishnan Ramalingam
, Rajesh Elara Mohan
, Zhiping Lin
:
Online Multi-Face Tracking With Multi-Modality Cascaded Matching. IEEE Trans. Circuits Syst. Video Technol. 33(6): 2738-2752 (2023) - [j145]Yiqin Qiu
, Hui Tian
, Haizhou Li
, Chin-Chen Chang
, Athanasios V. Vasilakos
:
Separable Convolution Network With Dual-Stream Pyramid Enhanced Strategy for Speech Steganalysis. IEEE Trans. Inf. Forensics Secur. 18: 2737-2750 (2023) - [j144]Jibin Wu
, Yansong Chua
, Malu Zhang
, Guoqi Li
, Haizhou Li
, Kay Chen Tan:
A Tandem Learning Rule for Effective Training and Rapid Inference of Deep Spiking Neural Networks. IEEE Trans. Neural Networks Learn. Syst. 34(1): 446-460 (2023) - [c655]Yiming Chen, Simin Chen, Zexin Li, Wei Yang, Cong Liu, Robby T. Tan, Haizhou Li:
Dynamic Transformers Provide a False Sense of Efficiency. ACL (1) 2023: 7164-7180 - [c654]Mingyang Zhang, Yi Zhou, Zhizheng Wu, Haizhou Li:
Zero-shot multi-speaker accent TTS with limited accent data. APSIPA ASC 2023: 1931-1936 - [c653]Jiawei Du, Yidi Jiang, Vincent Y. F. Tan, Joey Tianyi Zhou, Haizhou Li:
Minimizing the Accumulated Trajectory Error to Improve Dataset Distillation. CVPR 2023: 3749-3758 - [c652]Jiadong Wang, Xinyuan Qian, Malu Zhang, Robby T. Tan, Haizhou Li:
Seeing What You Said: Talking Face Generation Guided by a Lip Reading Expert. CVPR 2023: 14653-14662 - [c651]Marvin Borsdorf, Saurav Pahuja, Gabriel Ivucic, Siqi Cai, Haizhou Li, Tanja Schultz:
Multi-Head Attention and GRU for Improved Match-Mismatch Classification of Speech Stimulus and EEG Response. ICASSP 2023: 1-2 - [c650]Xiaoxue Gao, Xianghu Yue, Haizhou Li:
Self-Transriber: Few-Shot Lyrics Transcription With Self-Training. ICASSP 2023: 1-5 - [c649]Zexu Pan, Wupeng Wang, Marvin Borsdorf, Haizhou Li:
ImagineNet: Target Speaker Extraction with Intermittent Visual Cue Through Embedding Inpainting. ICASSP 2023: 1-5 - [c648]Ruijie Tao, Kong Aik Lee, Zhan Shi, Haizhou Li:
Speaker Recognition with Two-Step Multi-Modal Deep Cleansing. ICASSP 2023: 1-5 - [c647]Xianghu Yue, Junyi Ao, Xiaoxue Gao, Haizhou Li:
Token2vec: A Joint Self-Supervised Pre-Training Framework Using Unpaired Speech and Text. ICASSP 2023: 1-5 - [c646]Qiquan Zhang, Hongxu Zhu, Qi Song, Xinyuan Qian, Zhaoheng Ni, Haizhou Li:
Ripple Sparse Self-Attention for Monaural Speech Enhancement. ICASSP 2023: 1-5 - [c645]Haolin Zuo, Rui Liu, Jinming Zhao, Guanglai Gao, Haizhou Li:
Exploiting Modality-Invariant Feature for Robust Multimodal Emotion Recognition with Missing Modalities. ICASSP 2023: 1-5 - [c644]Yuke Si, Yan Zhang, Yuhang Li, Xiaobao Wang, Longbiao Wang, Jianwu Dang, Eng Siong Chng, Haizhou Li:
Local and Global Context Modeling with Relation Matching Task for Dialog Act Recognition. IJCNN 2023: 1-8 - [c643]Chuang Li, Hengchang Hu, Yan Zhang, Min-Yen Kan, Haizhou Li:
A Conversation is Worth A Thousand Recommendations: A Survey of Holistic Conversational Recommendation Systems. KaRS@RecSys 2023: 7-20 - [c642]Xueyi Zhang
, Chengwei Zhang
, Tao Wang
, Jun Tang
, Songyang Lao
, Haizhou Li
:
Slow-Fast Time Parameter Aggregation Network for Class-Incremental Lip Reading. ACM Multimedia 2023: 747-756 - [c641]Saurav Pahuja, Siqi Cai, Tanja Schultz, Haizhou Li:
XAnet: Cross-Attention Between EEG of Left and Right Brain for Auditory Attention Decoding. NER 2023: 1-4 - [c640]Yaxin Fan, Feng Jiang, Peifeng Li, Haizhou Li:
GrammarGPT: Exploring Open-Source LLMs for Native Chinese Grammatical Error Correction with Supervised Fine-Tuning. NLPCC (3) 2023: 69-80 - [c639]Bin Wang, Haizhou Li:
Relational Sentence Embedding for Flexible Semantic Matching. RepL4NLP@ACL 2023: 238-252 - [i163]Jiadong Wang, Xinyuan Qian, Malu Zhang, Robby T. Tan, Haizhou Li:
Seeing What You Said: Talking Face Generation Guided by a Lip Reading Expert. CoRR abs/2303.17480 (2023) - [i162]Zhihong Chen, Feng Jiang, Junying Chen, Tiannan Wang, Fei Yu, Guiming Chen, Hongbo Zhang, Juhao Liang, Chen Zhang, Zhiyi Zhang, Jianquan Li, Xiang Wan, Benyou Wang, Haizhou Li:
Phoenix: Democratizing ChatGPT across Languages. CoRR abs/2304.10453 (2023) - [i161]Xuehao Zhou, Mingyang Zhang, Yi Zhou, Zhizheng Wu, Haizhou Li:
Accented Text-to-Speech Synthesis with Limited Data. CoRR abs/2305.04816 (2023) - [i160]Qiquan Zhang, Hongxu Zhu, Qi Song, Xinyuan Qian, Zhaoheng Ni, Haizhou Li:
Ripple sparse self-attention for monaural speech enhancement. CoRR abs/2305.08541 (2023) - [i159]Yiming Chen, Simin Chen, Zexin Li, Wei Yang, Cong Liu, Robby T. Tan, Haizhou Li:
Dynamic Transformers Provide a False Sense of Efficiency. CoRR abs/2305.12228 (2023) - [i158]Yidi Jiang, Ruijie Tao, Zexu Pan, Haizhou Li:
Target Active Speaker Detection with Audio-visual Cues. CoRR abs/2305.12831 (2023) - [i157]Feng Jiang, Longwang He, Peifeng Li, Qiaoming Zhu, Haizhou Li:
Topic-driven Distant Supervision Framework for Macro-level Discourse Parsing. CoRR abs/2305.13755 (2023) - [i156]Jiangyan Yi, Jianhua Tao, Ruibo Fu, Xinrui Yan, Chenglong Wang, Tao Wang, Chu Yuan Zhang, Xiaohui Zhang, Yan Zhao, Yong Ren, Le Xu, Junzuo Zhou, Hao Gu, Zhengqi Wen, Shan Liang, Zheng Lian, Shuai Nie, Haizhou Li:
ADD 2023: the Second Audio Deepfake Detection Challenge. CoRR abs/2305.13774 (2023) - [i155]Danqing Luo, Chen Zhang, Jiahui Xu, Bin Wang, Yiming Chen, Yan Zhang, Haizhou Li:
Enhancing Black-Box Few-Shot Text Classification with Prompt-Based Data Augmentation. CoRR abs/2305.13785 (2023) - [i154]Feng Jiang, Weihao Liu, Xiaomin Chu, Peifeng Li, Qiaoming Zhu, Haizhou Li:
Advancing Topic Segmentation and Outline Generation in Chinese Texts: The Paragraph-level Topic Representation, Corpus, and Benchmark. CoRR abs/2305.14790 (2023) - [i153]Hongbo Zhang, Junying Chen, Feng Jiang, Fei Yu, Zhihong Chen, Jianquan Li, Guiming Chen, Xiangbo Wu, Zhiyi Zhang, Qingying Xiao, Xiang Wan, Benyou Wang, Haizhou Li:
HuatuoGPT, towards Taming Language Model to Be a Doctor. CoRR abs/2305.15075 (2023) - [i152]Rui Liu, Jinhua Zhang, Guanglai Gao, Haizhou Li:
Betray Oneself: A Novel Audio DeepFake Detection Model via Mono-to-Stereo Conversion. CoRR abs/2305.16353 (2023) - [i151]Xinyi Chen, Qu Yang, Jibin Wu, Haizhou Li, Kay Chen Tan:
A Hybrid Neural Coding Approach for Pattern Recognition with Spiking Neural Networks. CoRR abs/2305.16594 (2023) - [i150]Zhenyu Weng, Huiping Zhuang, Haizhou Li, Zhiping Lin:
Constant Sequence Extension for Fast Search Using Weighted Hamming Distance. CoRR abs/2306.03612 (2023) - [i149]Junchen Lu, Berrak Sisman, Mingyang Zhang, Haizhou Li:
High-Quality Automatic Voice Over with Accurate Alignment: Supervision through Self-Supervised Discrete Speech Units. CoRR abs/2306.17005 (2023) - [i148]Shimin Zhang, Qu Yang, Chenxiang Ma, Jibin Wu, Haizhou Li, Kay Chen Tan:
Long Short-term Memory with Two-Compartment Spiking Neuron. CoRR abs/2307.07231 (2023) - [i147]Lingyi Yang, Feng Jiang, Haizhou Li:
Is ChatGPT Involved in Texts? Measure the Polish Ratio to Detect ChatGPT-Generated Text. CoRR abs/2307.11380 (2023) - [i146]Yaxin Fan, Feng Jiang, Peifeng Li, Haizhou Li:
GrammarGPT: Exploring Open-Source LLMs for Native Chinese Grammatical Error Correction with Supervised Fine-Tuning. CoRR abs/2307.13923 (2023) - [i145]Xidong Wang, Guiming Hardy Chen, Dingjie Song, Zhiyi Zhang, Zhihong Chen, Qingying Xiao, Feng Jiang, Jianquan Li, Xiang Wan, Benyou Wang, Haizhou Li:
CMB: A Comprehensive Medical Benchmark in Chinese. CoRR abs/2308.08833 (2023) - [i144]Shimin Zhang, Qu Yang, Chenxiang Ma, Jibin Wu, Haizhou Li, Kay Chen Tan:
TC-LIF: A Two-Compartment Spiking Neuron Model for Long-term Sequential Modelling. CoRR abs/2308.13250 (2023) - [i143]Hongxu Zhu, Siqi Cai, Yidi Jiang, Qiquan Zhang, Haizhou Li:
EEG-Derived Voice Signature for Attended Speaker Detection. CoRR abs/2308.14774 (2023) - [i142]Qinghua Liu, Meng Ge, Zhizheng Wu, Haizhou Li:
PIAVE: A Pose-Invariant Audio-Visual Speaker Extraction Network. CoRR abs/2309.06723 (2023) - [i141]Chuang Li, Hengchang Hu, Yan Zhang, Min-Yen Kan, Haizhou Li:
A Conversation is Worth A Thousand Recommendations: A Survey of Holistic Conversational Recommender Systems. CoRR abs/2309.07682 (2023) - [i140]Junjie Li, Ruijie Tao, Zexu Pan, Meng Ge, Shuai Wang, Haizhou Li:
Audio-Visual Active Speaker Extraction for Sparsely Overlapped Multi-talker Speech. CoRR abs/2309.08408 (2023) - [i139]Zeyang Song, Jibin Wu, Malu Zhang, Mike Zheng Shou, Haizhou Li:
Spiking-LEAF: A Learnable Auditory front-end for Spiking Neural Networks. CoRR abs/2309.09469 (2023) - [i138]Junyi Ao, Mehmet Sinan Yildirim, Meng Ge, Shuai Wang, Ruijie Tao, Yanmin Qian, Liqun Deng, Longshuai Xiao, Haizhou Li:
USED: Universal Speaker Extraction and Diarization. CoRR abs/2309.10674 (2023) - [i137]Rui Liu, Bin Liu, Haizhou Li:
Emotion-Aware Prosodic Phrasing for Expressive Text-to-Speech. CoRR abs/2309.11724 (2023) - [i136]Rui Liu, Jiatian Xi, Ziyue Jiang, Haizhou Li:
FluentEditor: Text-based Speech Editing by Considering Acoustic and Prosody Consistency. CoRR abs/2309.11725 (2023) - [i135]Shuai Wang, Qibing Bai, Qi Liu, Jianwei Yu, Zhengyang Chen, Bing Han, Yanmin Qian, Haizhou Li:
Leveraging In-the-Wild Data for Effective Self-Supervised Pretraining in Speaker Recognition. CoRR abs/2309.11730 (2023) - [i134]Huang Huang, Fei Yu, Jianqing Zhu, Xuening Sun, Hao Cheng, Dingjie Song, Zhihong Chen, Abdulmohsen Alharthi, Bang An, Ziche Liu, Zhiyi Zhang, Junying Chen, Jianquan Li, Benyou Wang, Lian Zhang, Ruoyu Sun, Xiang Wan, Haizhou Li, Jinchao Xu:
AceGPT, Localizing Large Language Models in Arabic. CoRR abs/2309.12053 (2023) - [i133]Tianchi Liu, Kong Aik Lee, Qiongqiong Wang, Haizhou Li:
Disentangling Voice and Content with Self-Supervision for Speaker Recognition. CoRR abs/2310.01128 (2023) - [i132]Chen Zhang, Luis Fernando D'Haro, Chengguang Tang, Ke Shi, Guohua Tang, Haizhou Li:
xDial-Eval: A Multilingual Open-Domain Dialogue Evaluation Benchmark. CoRR abs/2310.08958 (2023) - [i131]Chuang Li, Yan Zhang, Min-Yen Kan, Haizhou Li:
UNO-DST: Leveraging Unlabelled Data in Zero-Shot Dialogue State Tracking. CoRR abs/2310.10492 (2023) - [i130]Yu Chen, Xinyuan Qian, Zexu Pan, Kainan Chen, Haizhou Li:
LocSelect: Target Speaker Localization with an Auditory Selective Hearing Mechanism. CoRR abs/2310.10497 (2023) - [i129]Yaxin Fan, Feng Jiang, Peifeng Li, Haizhou Li:
Quantify Health-Related Atomic Knowledge in Chinese Medical Large Language Models: A Computational Analysis. CoRR abs/2310.11722 (2023) - [i128]Qu Yang, Malu Zhang, Jibin Wu, Kay Chen Tan, Haizhou Li:
LC-TTFS: Towards Lossless Network Conversion for Spiking Neural Networks with TTFS Coding. CoRR abs/2310.14978 (2023) - [i127]Yan Zhang, Zhaopeng Feng, Zhiyang Teng, Zuozhu Liu, Haizhou Li:
How Well Do Text Embedding Models Understand Syntax? CoRR abs/2311.07996 (2023) - [i126]Junying Chen, Xidong Wang, Anningzhe Gao, Feng Jiang, Shunian Chen, Hongbo Zhang, Dingjie Song, Wenya Xie, Chuyi Kong, Jianquan Li, Xiang Wan, Haizhou Li, Benyou Wang:
HuatuoGPT-II, One-stage Training for Medical Adaption of LLMs. CoRR abs/2311.09774 (2023) - 2022
- [j143]Xianghu Yue
, Jingru Lin, Fabian Ritter Gutierrez, Haizhou Li:
Self-Supervised Learning With Segmental Masking for Speech Representation. IEEE J. Sel. Top. Signal Process. 16(6): 1367-1379 (2022) - [j142]Hongqiang Du
, Lei Xie, Haizhou Li:
Noise-robust voice conversion with domain adversarial training. Neural Networks 148: 74-84 (2022) - [j141]Jibin Wu
, Chenglin Xu, Xiao Han, Daquan Zhou, Malu Zhang
, Haizhou Li
, Kay Chen Tan
:
Progressive Tandem Learning for Pattern Recognition With Deep Spiking Neural Networks. IEEE Trans. Pattern Anal. Mach. Intell. 44(11): 7824-7840 (2022) - [j140]Kun Zhou
, Berrak Sisman
, Rui Liu
, Haizhou Li
:
Emotional voice conversion: Theory, databases and ESD. Speech Commun. 137: 1-18 (2022) - [j139]Hongning Zhu
, Kong Aik Lee
, Haizhou Li
:
Discriminative speaker embedding with serialized multi-layer multi-head attention. Speech Commun. 144: 89-100 (2022) - [j138]Tianchi Liu
, Rohan Kumar Das
, Kong Aik Lee
, Haizhou Li
:
Neural Acoustic-Phonetic Approach for Speaker Verification With Phonetic Attention Mask. IEEE Signal Process. Lett. 29: 782-786 (2022) - [j137]Zexu Pan
, Xinyuan Qian
, Haizhou Li
:
Speaker Extraction With Co-Speech Gestures Cue. IEEE Signal Process. Lett. 29: 1467-1471 (2022) - [j136]Haizhou Li:
A Unique ICASSP 2022: During an Unusual Time [Conference Highlights]. IEEE Signal Process. Mag. 39(2): 159-160 (2022) - [j135]Zexu Pan
, Ruijie Tao, Chenglin Xu
, Haizhou Li
:
Selective Listening by Synchronizing Speech With Lips. IEEE ACM Trans. Audio Speech Lang. Process. 30: 1650-1664 (2022) - [j134]Rui Liu
, Berrak Sisman
, Guanglai Gao, Haizhou Li
:
Decoding Knowledge Transfer for Neural Text-to-Speech Training. IEEE ACM Trans. Audio Speech Lang. Process. 30: 1789-1802 (2022) - [j133]Xiaoxue Gao
, Chitralekha Gupta
, Haizhou Li
:
Automatic Lyrics Transcription of Polyphonic Music With Lyrics-Chord Multi-Task Learning. IEEE ACM Trans. Audio Speech Lang. Process. 30: 2280-2294 (2022) - [j132]Chitralekha Gupta
, Haizhou Li
, Masataka Goto
:
Deep Learning Approaches in Topics of Singing Information Processing. IEEE ACM Trans. Audio Speech Lang. Process. 30: 2422-2451 (2022) - [j131]Zexu Pan
, Meng Ge
, Haizhou Li
:
USEV: Universal Speaker Extraction With Visual Cue. IEEE ACM Trans. Audio Speech Lang. Process. 30: 3032-3045 (2022) - [j130]Enze Su
, Siqi Cai
, Longhan Xie
, Haizhou Li
, Tanja Schultz
:
STAnet: A Spatiotemporal Attention Network for Decoding Auditory Spatial Attention From EEG. IEEE Trans. Biomed. Eng. 69(7): 2233-2242 (2022) - [j129]Siqi Cai
, Enze Su
, Longhan Xie
, Haizhou Li
:
EEG-Based Auditory Attention Detection via Frequency and Channel Neural Attention. IEEE Trans. Hum. Mach. Syst. 52(2): 256-266 (2022) - [j128]Malu Zhang
, Jiadong Wang
, Jibin Wu
, Ammar Belatreche
, Burin Amornpaisannon, Zhixuan Zhang, Venkata Pavan Kumar Miriyala
, Hong Qu
, Yansong Chua
, Trevor E. Carlson
, Haizhou Li
:
Rectified Linear Postsynaptic Potential Function for Backpropagation in Deep Spiking Neural Networks. IEEE Trans. Neural Networks Learn. Syst. 33(5): 1947-1958 (2022) - [c638]Chen Zhang
, Luis Fernando D'Haro, Thomas Friedrichs, Haizhou Li:
MDD-Eval: Self-Training on Augmented Data for Multi-Domain Dialogue Evaluation. AAAI 2022: 11657-11666 - [c637]Jinming Zhao, Tenggan Zhang, Jingwen Hu, Yuchen Liu, Qin Jin, Xinchao Wang, Haizhou Li:
M3ED: Multi-modal Multi-scene Multi-label Emotional Dialogue Database. ACL (1) 2022: 5699-5710 - [c636]Bin Wang, C.-C. Jay Kuo, Haizhou Li:
Just Rank: Rethinking Evaluation with Word and Sentence Similarities. ACL (1) 2022: 6060-6077 - [c635]Kristen Grauman, Andrew Westbury, Eugene Byrne, Zachary Chavis, Antonino Furnari, Rohit Girdhar, Jackson Hamburger, Hao Jiang, Miao Liu, Xingyu Liu, Miguel Martin, Tushar Nagarajan, Ilija Radosavovic, Santhosh Kumar Ramakrishnan, Fiona Ryan, Jayant Sharma, Michael Wray, Mengmeng Xu, Eric Zhongcong Xu
, Chen Zhao, Siddhant Bansal, Dhruv Batra, Vincent Cartillier, Sean Crane, Tien Do, Morrie Doulaty, Akshay Erapalli, Christoph Feichtenhofer, Adriano Fragomeni, Qichen Fu, Abrham Gebreselasie, Cristina González, James Hillis, Xuhua Huang
, Yifei Huang, Wenqi Jia, Weslie Khoo, Jáchym Kolár, Satwik Kottur, Anurag Kumar, Federico Landini, Chao Li, Yanghao Li, Zhenqiang Li, Karttikeya Mangalam, Raghava Modhugu, Jonathan Munro, Tullie Murrell, Takumi Nishiyasu, Will Price, Paola Ruiz Puentes, Merey Ramazanova
, Leda Sari, Kiran Somasundaram, Audrey Southerland, Yusuke Sugano, Ruijie Tao, Minh Vo, Yuchen Wang, Xindi Wu, Takuma Yagi, Ziwei Zhao, Yunyi Zhu, Pablo Arbeláez, David Crandall, Dima Damen, Giovanni Maria Farinella, Christian Fuegen, Bernard Ghanem
, Vamsi Krishna Ithapu, C. V. Jawahar, Hanbyul Joo, Kris Kitani, Haizhou Li, Richard A. Newcombe, Aude Oliva, Hyun Soo Park, James M. Rehg, Yoichi Sato, Jianbo Shi, Mike Zheng Shou, Antonio Torralba, Lorenzo Torresani, Mingfei Yan, Jitendra Malik:
Ego4D: Around the World in 3, 000 Hours of Egocentric Video. CVPR 2022: 18973-18990 - [c634]Chen Zhang
, Luis Fernando D'Haro, Qiquan Zhang, Thomas Friedrichs, Haizhou Li:
FineD-Eval: Fine-grained Automatic Dialogue-Level Evaluation. EMNLP 2022: 3336-3355 - [c633]Bin Wang, Chen Zhang, Yan Zhang, Yiming Chen, Haizhou Li:
Analyzing and Evaluating Faithfulness in Dialogue Summarization. EMNLP 2022: 4897-4908 - [c632]Yiming Chen, Yan Zhang, Bin Wang, Zuozhu Liu, Haizhou Li:
Generate, Discriminate and Contrast: A Semi-Supervised Sentence Representation Learning Framework. EMNLP 2022: 8150-8161 - [c631]Xiaoxue Gao
, Chitralekha Gupta, Haizhou Li:
Genre-Conditioned Acoustic Models for Automatic Lyrics Transcription of Polyphonic Music. ICASSP 2022: 791-795 - [c630]Marvin Borsdorf, Kevin Scheck, Haizhou Li, Tanja Schultz
:
Experts Versus All-Rounders: Target Language Extraction for Multiple Target Languages. ICASSP 2022: 846-850 - [c629]Jinming Zhao, Ruichen Li, Qin Jin, Xinchao Wang
, Haizhou Li:
Memobert: Pre-Training Model with Prompt-Based Learning for Multimodal Emotion Recognition. ICASSP 2022: 4703-4707 - [c628]Ruijie Tao, Kong Aik Lee, Rohan Kumar Das, Ville Hautamäki, Haizhou Li:
Self-Supervised Speaker Recognition with Loss-Gated Learning. ICASSP 2022: 6142-6146 - [c627]Meng Ge, Chenglin Xu, Longbiao Wang, Eng Siong Chng, Jianwu Dang, Haizhou Li:
L-SpEx: Localized Target Speaker Extraction. ICASSP 2022: 7287-7291 - [c626]Tianchi Liu
, Rohan Kumar Das, Kong Aik Lee, Haizhou Li:
MFA: TDNN with Multi-Scale Frequency-Channel Attention for Text-Independent Speaker Verification with Short Utterances. ICASSP 2022: 7517-7521 - [c625]Qiquan Zhang, Qi Song, Zhaoheng Ni, Aaron Nicolson, Haizhou Li:
Time-Frequency Attention for Monaural Speech Enhancement. ICASSP 2022: 7852-7856 - [c624]Junchen Lu
, Berrak Sisman, Rui Liu, Mingyang Zhang, Haizhou Li:
Visualtts: TTS with Accurate Lip-Speech Synchronization for Automatic Voice Over. ICASSP 2022: 8032-8036 - [c623]Jiadong Wang, Jibin Wu, Malu Zhang, Qi Liu, Haizhou Li:
A Hybrid Learning Framework for Deep Spiking Neural Networks with One-Spike Temporal Coding. ICASSP 2022: 8942-8946 - [c622]Jiangyan Yi, Ruibo Fu, Jianhua Tao, Shuai Nie, Haoxin Ma, Chenglong Wang, Tao Wang, Zhengkun Tian, Ye Bai, Cunhang Fan, Shan Liang, Shiming Wang, Shuai Zhang, Xinrui Yan, Le Xu
, Zhengqi Wen, Haizhou Li:
ADD 2022: the first Audio Deep Synthesis Detection Challenge. ICASSP 2022: 9216-9220 - [c621]