default search action
Heng Tao Shen
Hengtao Shen – 申恒涛
Person information
- unicode name: 申恒涛
- affiliation: University of Electronic Science and Technology of China, School of Computer Science and Engineering, Chengdu, China
- affiliation (2004 - 2017): University of Queensland, Brisbane, Australia
- affiliation (PhD 2004): National University of Singapore, Singapore
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
showing all ?? records
2020 – today
- 2025
- [j276]Yujie Mo, Heng Tao Shen, Xiaofeng Zhu:
Unsupervised multi-view graph representation learning with dual weight-net. Inf. Fusion 114: 102669 (2025) - [j275]Yujie Mo, Heng Tao Shen, Xiaofeng Zhu:
Efficient self-supervised heterogeneous graph representation learning with reconstruction. Inf. Fusion 117: 102846 (2025) - 2024
- [j274]Lifeng Sun, Xinhang Song, Shuqiang Jiang, Lili Wang, Hengtao Shen:
Preface to the Special Issue on Multimodal Collaborative Perception and Fusion Technology. Int. J. Softw. Informatics 14(2): 119-122 (2024) - [j273]Jingjing Li, Zhiqi Yu, Zhekai Du, Lei Zhu, Heng Tao Shen:
A Comprehensive Survey on Source-Free Domain Adaptation. IEEE Trans. Pattern Anal. Mach. Intell. 46(8): 5743-5762 (2024) - [j272]Yuhui Wu, Guoqing Wang, Shaochong Liu, Yang Yang, Wei Liu, Xiongxin Tang, Shuhang Gu, Chongyi Li, Heng Tao Shen:
Towards a Flexible Semantic Guided Model for Single Image Enhancement and Restoration. IEEE Trans. Pattern Anal. Mach. Intell. 46(12): 9921-9939 (2024) - [j271]Chaofan Zheng, Lianli Gao, Xinyu Lyu, Pengpeng Zeng, Abdulmotaleb El-Saddik, Heng Tao Shen:
Dual-Branch Hybrid Learning Network for Unbiased Scene Graph Generation. IEEE Trans. Circuits Syst. Video Technol. 34(3): 1743-1756 (2024) - [j270]Shenshen Li, Xing Xu, Xun Jiang, Fumin Shen, Xin Liu, Heng Tao Shen:
Multi-Grained Attention Network With Mutual Exclusion for Composed Query-Based Image Retrieval. IEEE Trans. Circuits Syst. Video Technol. 34(4): 2959-2972 (2024) - [j269]Zeyu Ma, Ziqiang Zheng, Jiwei Wei, Yang Yang, Heng Tao Shen:
Instance-Dictionary Learning for Open-World Object Detection in Autonomous Driving Scenarios. IEEE Trans. Circuits Syst. Video Technol. 34(5): 3395-3408 (2024) - [j268]Haonan Zhang, Pengpeng Zeng, Lianli Gao, Xinyu Lyu, Jingkuan Song, Heng Tao Shen:
SPT: Spatial Pyramid Transformer for Image Captioning. IEEE Trans. Circuits Syst. Video Technol. 34(6): 4829-4842 (2024) - [j267]Ziqiang Zheng, Hao Ren, Yang Wu, Weichuan Zhang, Hong Lu, Yang Yang, Heng Tao Shen:
Fully Unsupervised Domain-Agnostic Image Retrieval. IEEE Trans. Circuits Syst. Video Technol. 34(6): 5077-5090 (2024) - [j266]Yin Tang, Tao Chen, Xiruo Jiang, Yazhou Yao, Guo-Sen Xie, Heng Tao Shen:
Holistic Prototype Attention Network for Few-Shot Video Object Segmentation. IEEE Trans. Circuits Syst. Video Technol. 34(8): 6699-6709 (2024) - [j265]Yahui Xu, Jiwei Wei, Yi Bin, Yang Yang, Zeyu Ma, Heng Tao Shen:
Set of Diverse Queries With Uncertainty Regularization for Composed Image Retrieval. IEEE Trans. Circuits Syst. Video Technol. 34(10): 10494-10506 (2024) - [j264]Mingfeng Zha, Feiyang Fu, Yunqiang Pei, Guoqing Wang, Tianyu Li, Xiongxin Tang, Yang Yang, Heng Tao Shen:
Dual Domain Perception and Progressive Refinement for Mirror Detection. IEEE Trans. Circuits Syst. Video Technol. 34(11): 11942-11953 (2024) - [j263]Haonan Zhang, Pengpeng Zeng, Lianli Gao, Jingkuan Song, Heng Tao Shen:
Ump: Unified Modality-Aware Prompt Tuning for Text-Video Retrieval. IEEE Trans. Circuits Syst. Video Technol. 34(11): 11954-11964 (2024) - [j262]Yixuan Zhou, Yi Qu, Xing Xu, Fumin Shen, Jingkuan Song, Heng Tao Shen:
BatchNorm-Based Weakly Supervised Video Anomaly Detection. IEEE Trans. Circuits Syst. Video Technol. 34(12): 13642-13654 (2024) - [j261]Feiyu Chen, Jie Shao, Anjie Zhu, Deqiang Ouyang, Xueliang Liu, Heng Tao Shen:
Modeling Hierarchical Uncertainty for Multimodal Emotion Recognition in Conversation. IEEE Trans. Cybern. 54(1): 187-198 (2024) - [j260]Yujie Li, Xun Jiang, Xing Xu, Huimin Lu, Heng Tao Shen:
Fuzzy Multimodal Graph Reasoning for Human-Centric Instructional Video Grounding. IEEE Trans. Fuzzy Syst. 32(9): 5046-5059 (2024) - [j259]Quan Rui, Shiyuan He, Tianyu Li, Guoqing Wang, Ningjuan Ruan, Lin Mei, Yang Yang, Heng Tao Shen:
Density-Aware Cloud Removal of Remote Sensing Imagery Using a Global-Local Fusion Transformer. IEEE Trans. Geosci. Remote. Sens. 62: 1-11 (2024) - [j258]Jiefu Chen, Tong Chen, Xing Xu, Jingran Zhang, Yang Yang, Heng Tao Shen:
Coreset Learning-Based Sparse Black-Box Adversarial Attack for Video Recognition. IEEE Trans. Inf. Forensics Secur. 19: 1547-1560 (2024) - [j257]Guobao Xiao, Zhimin Tang, Hanlin Guo, Jun Yu, Heng Tao Shen:
FAFusion: Learning for Infrared and Visible Image Fusion via Frequency Awareness. IEEE Trans. Instrum. Meas. 73: 1-11 (2024) - [j256]Mengmeng Jing, Jingjing Li, Ke Lu, Lei Zhu, Heng Tao Shen:
Visually Source-Free Domain Adaptation via Adversarial Style Matching. IEEE Trans. Image Process. 33: 1032-1044 (2024) - [j255]Zheng Wang, Xing Xu, Jiwei Wei, Ning Xie, Yang Yang, Heng Tao Shen:
Semantics Disentangling for Cross-Modal Retrieval. IEEE Trans. Image Process. 33: 2226-2237 (2024) - [j254]Xuanhan Wang, Xiaojia Chen, Lianli Gao, Jingkuan Song, Heng Tao Shen:
CPI-Parser: Integrating Causal Properties Into Multiple Human Parsing. IEEE Trans. Image Process. 33: 5771-5782 (2024) - [j253]Kumie Gedamu, Yanli Ji, Yang Yang, Jie Shao, Heng Tao Shen:
Self-Supervised Sub-Action Parsing Network for Semi-Supervised Action Quality Assessment. IEEE Trans. Image Process. 33: 6057-6070 (2024) - [j252]Lei Zhu, Chaoqun Zheng, Weili Guan, Jingjing Li, Yang Yang, Heng Tao Shen:
Multi-Modal Hashing for Efficient Multimedia Retrieval: A Survey. IEEE Trans. Knowl. Data Eng. 36(1): 239-260 (2024) - [j251]Yang Xu, Lei Zhu, Jingjing Li, Fengling Li, Heng Tao Shen:
Temporal Social Graph Network Hashing for Efficient Recommendation. IEEE Trans. Knowl. Data Eng. 36(7): 3541-3555 (2024) - [j250]Yan Dai, Xiaojia Chen, Xuanhan Wang, Minghui Pang, Lianli Gao, Heng Tao Shen:
ReSParser: Fully Convolutional Multiple Human Parsing With Representative Sets. IEEE Trans. Multim. 26: 1384-1394 (2024) - [j249]Shuaiqi Jing, Haonan Zhang, Pengpeng Zeng, Lianli Gao, Jingkuan Song, Heng Tao Shen:
Memory-Based Augmentation Network for Video Captioning. IEEE Trans. Multim. 26: 2367-2379 (2024) - [j248]Jinghan Ru, Jun Tian, Chengwei Xiao, Jingjing Li, Heng Tao Shen:
Imbalanced Open Set Domain Adaptation via Moving-Threshold Estimation and Gradual Alignment. IEEE Trans. Multim. 26: 2504-2514 (2024) - [j247]Congrui Li, Ziqiang Zheng, Yi Bin, Guoqing Wang, Yang Yang, Xuesheng Li, Heng Tao Shen:
Pixel Bleach Network for Detecting Face Forgery Under Compression. IEEE Trans. Multim. 26: 2585-2597 (2024) - [j246]Yan Dai, Beitao Chen, Lianli Gao, Jingkuan Song, Heng Tao Shen:
DMH-CL: Dynamic Model Hardness Based Curriculum Learning for Complex Pose Estimation. IEEE Trans. Multim. 26: 3180-3193 (2024) - [j245]Jiwei Wei, Yang Yang, Xiang Guan, Xing Xu, Guoqing Wang, Heng Tao Shen:
Runge-Kutta Guided Feature Augmentation for Few-Sample Learning. IEEE Trans. Multim. 26: 7349-7358 (2024) - [j244]Huafeng Liu, Mengmeng Sheng, Zeren Sun, Yazhou Yao, Xian-Sheng Hua, Heng Tao Shen:
Learning With Imbalanced Noisy Data by Preventing Bias in Sample Selection. IEEE Trans. Multim. 26: 7426-7437 (2024) - [j243]Shiyuan He, Jiwei Wei, Chaoning Zhang, Xing Xu, Jingkuan Song, Yang Yang, Heng Tao Shen:
Boosting Adversarial Training with Hardness-Guided Attack Strategy. IEEE Trans. Multim. 26: 7748-7760 (2024) - [j242]Jian Huang, Yanli Ji, Zhen Qin, Yang Yang, Heng Tao Shen:
Dominant SIngle-Modal SUpplementary Fusion (SIMSUF) for Multimodal Sentiment Analysis. IEEE Trans. Multim. 26: 8383-8394 (2024) - [j241]Xun Jiang, Xing Xu, Zailei Zhou, Yang Yang, Fumin Shen, Heng Tao Shen:
Zero-Shot Video Moment Retrieval With Angular Reconstructive Text Embeddings. IEEE Trans. Multim. 26: 9657-9670 (2024) - [j240]Yahui Xu, Yi Bin, Jiwei Wei, Yang Yang, Guoqing Wang, Heng Tao Shen:
Align and Retrieve: Composition and Decomposition Learning in Image Retrieval With Text Feedback. IEEE Trans. Multim. 26: 9936-9948 (2024) - [j239]Zheng Wang, Zhenwei Gao, Mengqun Han, Yang Yang, Heng Tao Shen:
Estimating the Semantics via Sector Embedding for Image-Text Retrieval. IEEE Trans. Multim. 26: 10342-10353 (2024) - [j238]Jiwei Wei, Chen Pan, Shiyuan He, Guoqing Wang, Yang Yang, Heng Tao Shen:
Towards Robust Person Re-Identification by Adversarial Training With Dynamic Attack Strategy. IEEE Trans. Multim. 26: 10367-10380 (2024) - [j237]Dan Zhang, Zhekai Du, Jingjing Li, Lei Zhu, Heng Tao Shen:
Domain-Adaptive Energy-Based Models for Generalizable Face Anti-Spoofing. IEEE Trans. Multim. 26: 10474-10488 (2024) - [j236]Ke Liu, Jiwei Wei, Jie Zou, Peng Wang, Yang Yang, Heng Tao Shen:
Improving Pre-Trained Model-Based Speech Emotion Recognition From a Low-Level Speech Feature Perspective. IEEE Trans. Multim. 26: 10623-10636 (2024) - [j235]Ran Ran, Jiwei Wei, Chaoning Zhang, Guoqing Wang, Yang Yang, Heng Tao Shen:
Adaptive Multi-scale Degradation-Based Attack for Boosting the Adversarial Transferability. IEEE Trans. Multim. 26: 10979-10990 (2024) - [j234]Xiruo Jiang, Yazhou Yao, Xili Dai, Fumin Shen, Liqiang Nie, Heng Tao Shen:
Anti-Collapse Loss for Deep Metric Learning. IEEE Trans. Multim. 26: 11139-11150 (2024) - [j233]Yalan Ye, Tongjie Pan, Qianhe Meng, Jingjing Li, Heng Tao Shen:
Online Unsupervised Domain Adaptation via Reducing Inter- and Intra-Domain Discrepancies. IEEE Trans. Neural Networks Learn. Syst. 35(1): 884-898 (2024) - [j232]Xun Jiang, Xing Xu, Jingran Zhang, Fumin Shen, Zuo Cao, Heng Tao Shen:
SDN: Semantic Decoupling Network for Temporal Language Grounding. IEEE Trans. Neural Networks Learn. Syst. 35(5): 6598-6612 (2024) - [j231]Zheng Wang, Xing Xu, Yin Zhang, Yang Yang, Heng Tao Shen:
Complex Relation Embedding for Scene Graph Generation. IEEE Trans. Neural Networks Learn. Syst. 35(6): 8321-8335 (2024) - [j230]Liang Peng, Yujie Mo, Jie Xu, Jialie Shen, Xiaoshuang Shi, Xiaoxiao Li, Heng Tao Shen, Xiaofeng Zhu:
GRLC: Graph Representation Learning With Constraints. IEEE Trans. Neural Networks Learn. Syst. 35(6): 8609-8622 (2024) - [j229]Yan Dai, Xuanhan Wang, Lianli Gao, Jingkuan Song, Feng Zheng, Heng Tao Shen:
Overcoming Data Deficiency for Multi-Person Pose Estimation. IEEE Trans. Neural Networks Learn. Syst. 35(8): 10857-10868 (2024) - [j228]Haonan Luo, Guosheng Lin, Fumin Shen, Xingguo Huang, Yazhou Yao, Hengtao Shen:
Robust-EQA: Robust Learning for Embodied Question Answering With Noisy Labels. IEEE Trans. Neural Networks Learn. Syst. 35(9): 12083-12094 (2024) - [j227]Hongzu Su, Jingjing Li, Zhekai Du, Lei Zhu, Ke Lu, Heng Tao Shen:
Cross-domain Recommendation via Dual Adversarial Adaptation. ACM Trans. Inf. Syst. 42(3): 83:1-83:26 (2024) - [j226]Tianshi Wang, Fengling Li, Lei Zhu, Jingjing Li, Zheng Zhang, Heng Tao Shen:
Invisible Black-Box Backdoor Attack against Deep Cross-Modal Hashing Retrieval. ACM Trans. Inf. Syst. 42(4): 111:1-111:27 (2024) - [c261]Shenshen Li, Chen He, Xing Xu, Fumin Shen, Yang Yang, Heng Tao Shen:
Adaptive Uncertainty-Based Learning for Text-Based Person Retrieval. AAAI 2024: 3172-3180 - [c260]Ziyang Lu, Yunqiang Pei, Guoqing Wang, Peiwei Li, Yang Yang, Yinjie Lei, Heng Tao Shen:
ScanERU: Interactive 3D Visual Grounding Based on Embodied Reference Understanding. AAAI 2024: 3936-3944 - [c259]Mingfeng Zha, Yunqiang Pei, Guoqing Wang, Tianyu Li, Yang Yang, Wenbin Qian, Heng Tao Shen:
Weakly-Supervised Mirror Detection via Scribble Annotations. AAAI 2024: 6953-6961 - [c258]Lei Wang, Yi Hu, Jiabang He, Xing Xu, Ning Liu, Hui Liu, Heng Tao Shen:
T-SciQ: Teaching Multimodal Chain-of-Thought Reasoning via Large Language Model Signals for Science Question Answering. AAAI 2024: 19162-19170 - [c257]Fei Kong, Jinhao Duan, Lichao Sun, Hao Cheng, Renjing Xu, Hengtao Shen, Xiaofeng Zhu, Xiaoshuang Shi, Kaidi Xu:
ACT-Diffusion: Efficient Adversarial Consistency Training for One-Step Diffusion Models. CVPR 2024: 8890-8899 - [c256]Ji Zhang, Shihan Wu, Lianli Gao, Heng Tao Shen, Jingkuan Song:
DePT: Decoupled Prompt Tuning. CVPR 2024: 12924-12933 - [c255]Kaipeng Fang, Jingkuan Song, Lianli Gao, Pengpeng Zeng, Zhi-Qi Cheng, Xiyao Li, Heng Tao Shen:
ProS: Prompting-to-Simulate Generalized Knowledge for Universal Cross-Domain Retrieval. CVPR 2024: 17292-17301 - [c254]Bowen Tang, Zheng Wang, Yi Bin, Qi Dou, Yang Yang, Heng Tao Shen:
Ensemble Diversity Facilitates Adversarial Transferability. CVPR 2024: 24377-24386 - [c253]Zixian Gao, Xun Jiang, Xing Xu, Fumin Shen, Yujie Li, Heng Tao Shen:
Embracing Unimodal Aleatoric Uncertainty for Robust Multimodal Fusion. CVPR 2024: 26866-26875 - [c252]Renming Huang, Yunqiang Pei, Guoqing Wang, Yangming Zhang, Yang Yang, Peng Wang, Hengtao Shen:
Diffusion Models as Optimizers for Efficient Planning in Offline RL. ECCV (51) 2024: 1-17 - [c251]Zhiyuan Wang, Jinhao Duan, Lu Cheng, Yue Zhang, Qingni Wang, Xiaoshuang Shi, Kaidi Xu, Heng Tao Shen, Xiaofeng Zhu:
ConU: Conformal Uncertainty in Large Language Models with Correctness Coverage Guarantees. EMNLP (Findings) 2024: 6886-6898 - [c250]Zetao Zheng, Jie Shao, Shilong Deng, Anjie Zhu, Heng Tao Shen, Xiaofang Zhou:
Cross-Insight Trader: A Trading Approach Integrating Policies with Diverse Investment Horizons for Portfolio Management. ICDE 2024: 4685-4698 - [c249]Zetao Zheng, Jie Shao, Feiyu Chen, Anjie Zhu, Shilong Deng, Heng Tao Shen:
HIT: Solving Partial Index Tracking via Hierarchical Reinforcement Learning. ICDE 2024: 4709-4721 - [c248]Fei Kong, Jinhao Duan, Ruipeng Ma, Heng Tao Shen, Xiaoshuang Shi, Xiaofeng Zhu, Kaidi Xu:
An Efficient Membership Inference Attack for the Diffusion Model by Proximal Initialization. ICLR 2024 - [c247]Yujie Mo, Feiping Nie, Ping Hu, Heng Tao Shen, Zheng Zhang, Xinchao Wang, Xiaofeng Zhu:
Self-Supervised Heterogeneous Graph Learning: a Homophily and Heterogeneity View. ICLR 2024 - [c246]Mengmeng Zhan, Zongqian Wu, Rongyao Hu, Ping Hu, Heng Tao Shen, Xiaofeng Zhu:
Towards Dynamic-Prompting Collaboration for Source-Free Domain Adaptation. IJCAI 2024: 1643-1651 - [c245]Yunqiang Pei, Kaiyue Zhang, Hongrong Yang, Yong Tao, Qihang Tang, Jialei Tang, Guoqing Wang, Zhitao Liu, Ning Xie, Peng Wang, Yang Yang, Hengtao Shen:
Improving Interaction Comfort in Authoring Task in AR-HRI through Dynamic Dual-Layer Interaction Adjustment. ACM Multimedia 2024: 88-97 - [c244]Haonan Zhang, Pengpeng Zeng, Lianli Gao, Jingkuan Song, Heng Tao Shen:
MPT: Multi-grained Prompt Tuning for Text-Video Retrieval. ACM Multimedia 2024: 1206-1214 - [c243]Yuhui Wu, Guoqing Wang, Zhiwen Wang, Yang Yang, Tianyu Li, Malu Zhang, Chongyi Li, Heng Tao Shen:
JoReS-Diff: Joint Retinex and Semantic Priors in Diffusion Model for Low-light Image Enhancement. ACM Multimedia 2024: 1810-1818 - [c242]Zhiwen Wang, Yuhui Wu, Zheng Wang, Jiwei Wei, Tianyu Li, Guoqing Wang, Yang Yang, Hengtao Shen:
Cascaded Adversarial Attack: Simultaneously Fooling Rain Removal and Semantic Segmentation Networks. ACM Multimedia 2024: 2136-2145 - [c241]Yi Bin, Junrong Liao, Yujuan Ding, Haoxuan Li, Yang Yang, See-Kiong Ng, Heng Tao Shen:
Leveraging Weak Cross-Modal Guidance for Coherence Modelling via Iterative Learning. ACM Multimedia 2024: 4630-4639 - [c240]Yunqiang Pei, Jialei Tang, Qihang Tang, Mingfeng Zha, Dongyu Xie, Guoqing Wang, Zhitao Liu, Ning Xie, Peng Wang, Yang Yang, Hengtao Shen:
Emotion Recognition in HMDs: A Multi-task Approach Using Physiological Signals and Occluded Faces. ACM Multimedia 2024: 5977-5986 - [c239]Xun Jiang, Zhuoyuan Wei, Shenshen Li, Xing Xu, Jingkuan Song, Heng Tao Shen:
Counterfactually Augmented Event Matching for De-biased Temporal Sentence Grounding. ACM Multimedia 2024: 6472-6481 - [c238]Jin Sun, Xiaoshuang Shi, Zhiyuan Wang, Kaidi Xu, Heng Tao Shen, Xiaofeng Zhu:
Caterpillar: A Pure-MLP Architecture with Shifted-Pillars-Concatenation. ACM Multimedia 2024: 7123-7132 - [c237]Yi Bin, Wenhao Shi, Yujuan Ding, Zhiqiang Hu, Zheng Wang, Yang Yang, See-Kiong Ng, Heng Tao Shen:
GalleryGPT: Analyzing Paintings with Large Multimodal Models. ACM Multimedia 2024: 7734-7743 - [c236]Peng Yin, Xiaosu Zhu, Jingkuan Song, Lianli Gao, Heng Tao Shen:
SI-BiViT: Binarizing Vision Transformers with Spatial Interaction. ACM Multimedia 2024: 8169-8178 - [c235]Zixian Gao, Disen Hu, Xun Jiang, Huimin Lu, Heng Tao Shen, Xing Xu:
Enhanced Experts with Uncertainty-Aware Routing for Multimodal Sentiment Analysis. ACM Multimedia 2024: 9650-9659 - [c234]Kai Wang, Jiayang Liu, Xing Xu, Jingkuan Song, Xin Liu, Heng Tao Shen:
Unsupervised Cross-Domain Image Retrieval with Semantic-Attended Mixture-of-Experts. SIGIR 2024: 197-207 - [c233]Yunqiang Pei, Bowen Jiang, Kaiyue Zhang, Ziyang Lu, Mingfeng Zha, Guoqing Wang, Zhitao Liu, Ning Xie, Yang Yang, Hengtao Shen:
Toward Optimized AR-Based Human-Robot Interaction Ergonomics: Modeling and Predicting Interaction Comfort. VR Workshops 2024: 797-798 - [i119]Huafeng Liu, Mengmeng Sheng, Zeren Sun, Yazhou Yao, Xian-Sheng Hua, Heng Tao Shen:
Learning with Imbalanced Noisy Data by Preventing Bias in Sample Selection. CoRR abs/2402.11242 (2024) - [i118]Cheng Chen, Junchen Zhu, Xu Luo, Hengtao Shen, Lianli Gao, Jingkuan Song:
CoIN: A Benchmark of Continual Instruction tuNing for Multimodel Large Language Model. CoRR abs/2403.08350 (2024) - [i117]Meixuan Li, Tianyu Li, Guoqing Wang, Peng Wang, Yang Yang, Heng Tao Shen:
Region-aware Distribution Contrast: A Novel Approach to Multi-Task Partially Supervised Learning. CoRR abs/2403.10252 (2024) - [i116]Beitao Chen, Xinyu Lyu, Lianli Gao, Jingkuan Song, Heng Tao Shen:
Alleviating Hallucinations in Large Vision-Language Models through Hallucination-Induced Optimization. CoRR abs/2405.15356 (2024) - [i115]Zhiyuan Wang, Jinhao Duan, Lu Cheng, Yue Zhang, Qingni Wang, Hengtao Shen, Xiaofeng Zhu, Xiaoshuang Shi, Kaidi Xu:
ConU: Conformal Uncertainty in Large Language Models with Correctness Coverage Guarantees. CoRR abs/2407.00499 (2024) - [i114]Xiruo Jiang, Yazhou Yao, Xili Dai, Fumin Shen, Xian-Sheng Hua, Heng Tao Shen:
Anti-Collapse Loss for Deep Metric Learning Based on Coding Rate Metric. CoRR abs/2407.03106 (2024) - [i113]Renming Huang, Yunqiang Pei, Guoqing Wang, Yangming Zhang, Yang Yang, Peng Wang, Hengtao Shen:
Diffusion Models as Optimizers for Efficient Planning in Offline RL. CoRR abs/2407.16142 (2024) - [i112]Yi Bin, Junrong Liao, Yujuan Ding, Haoxuan Li, Yang Yang, See-Kiong Ng, Heng Tao Shen:
Leveraging Weak Cross-Modal Guidance for Coherence Modelling via Iterative Learning. CoRR abs/2408.00305 (2024) - [i111]Yi Bin, Wenhao Shi, Yujuan Ding, Zhiqiang Hu, Zheng Wang, Yang Yang, See-Kiong Ng, Heng Tao Shen:
GalleryGPT: Analyzing Paintings with Large Multimodal Models. CoRR abs/2408.00491 (2024) - [i110]Yujia Wu, Yiming Shi, Jiwei Wei, Chengwei Sun, Yuyang Zhou, Yang Yang, Heng Tao Shen:
DiffLoRA: Generating Personalized Low-Rank Adaptation Weights with Diffusion. CoRR abs/2408.06740 (2024) - [i109]Yixuan Zhou, Xing Xu, Zhe Sun, Jingkuan Song, Andrzej Cichocki, Heng Tao Shen:
VQ-Flow: Taming Normalizing Flows for Multi-Class Anomaly Detection via Hierarchical Vector Quantization. CoRR abs/2409.00942 (2024) - [i108]Renming Huang, Shaochong Liu, Yunqiang Pei, Peng Wang, Guoqing Wang, Yang Yang, Hengtao Shen:
Goal-Reaching Policy Learning from Non-Expert Observations via Effective Subgoal Guidance. CoRR abs/2409.03996 (2024) - [i107]Run Luo, Haonan Zhang, Longze Chen, Ting-En Lin, Xiong Liu, Yuchuan Wu, Min Yang, Minzheng Wang, Pengpeng Zeng, Lianli Gao, Heng Tao Shen, Yunshui Li, Xiaobo Xia, Fei Huang, Jingkuan Song, Yongbin Li:
MMEvol: Empowering Multimodal Large Language Models with Evol-Instruct. CoRR abs/2409.05840 (2024) - [i106]Xiaorui Sun, Jun Liu, Heng Tao Shen, Xiaofeng Zhu, Ping Hu:
On Efficient Variants of Segment Anything Model: A Survey. CoRR abs/2410.04960 (2024) - [i105]Xiao Cai, Pengpeng Zeng, Lianli Gao, Junchen Zhu, Jiaxin Zhang, Sitong Su, Heng Tao Shen, Jingkuan Song:
SeMv-3D: Towards Semantic and Mutil-view Consistency simultaneously for General Text-to-3D Generation with Triplane Priors. CoRR abs/2410.07658 (2024) - [i104]Wei Dong, Yuan Sun, Yiting Yang, Xing Zhang, Zhijun Lin, Qingsen Yan, Haokui Zhang, Peng Wang, Yang Yang, Hengtao Shen:
Efficient Adaptation of Pre-trained Vision Transformer via Householder Transformation. CoRR abs/2410.22952 (2024) - 2023
- [j225]Xuemeng Song, Liqiang Nie, Hengtao Shen, Qi Tian, Hua Huang:
Preface to the Special Issue on Multimodal Learning Integrated with Pre-training Techniques. Int. J. Softw. Informatics 13(2): 139-142 (2023) - [j224]Xing Xu, Jialiang Sun, Zuo Cao, Yin Zhang, Xiaofeng Zhu, Heng Tao Shen:
TFUN: Trilinear Fusion Network for Ternary Image-Text Retrieval. Inf. Fusion 91: 327-337 (2023) - [j223]Jie Xu, Yazhou Ren, Xiaoshuang Shi, Heng Tao Shen, Xiaofeng Zhu:
UNTIE: Clustering analysis with disentanglement in multi-view information fusion. Inf. Fusion 100: 101937 (2023) - [j222]Junchen Zhu, Lianli Gao, Jingkuan Song, Yuan-Fang Li, Feng Zheng, Xuelong Li, Heng Tao Shen:
Label-Guided Generative Adversarial Network for Realistic Image Synthesis. IEEE Trans. Pattern Anal. Mach. Intell. 45(3): 3311-3328 (2023) - [j221]Xinyu Lyu, Lianli Gao, Pengpeng Zeng, Heng Tao Shen, Jingkuan Song:
Adaptive Fine-Grained Predicates Learning for Scene Graph Generation. IEEE Trans. Pattern Anal. Mach. Intell. 45(11): 13921-13940 (2023) - [j220]Kumie Gedamu, Yanli Ji, Lingling Gao, Yang Yang, Heng Tao Shen:
Relation-mining self-attention network for skeleton-based human action recognition. Pattern Recognit. 139: 109455 (2023) - [j219]