default search action
Errui Ding
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
showing all ?? records
2020 – today
- 2024
- [j9]Keyao Wang, Guosheng Zhang, Haixiao Yue, Yanyan Liang, Mouxiao Huang, Gang Zhang, Junyu Han, Errui Ding, Jingdong Wang:
CSDG-FAS: Closed-Space Domain Generalization for Face Anti-spoofing. Int. J. Comput. Vis. 132(11): 4866-4879 (2024) - [j8]Huixin Sun, Yunhao Wang, Xiaodi Wang, Bin Zhang, Ying Xin, Baochang Zhang, Xianbin Cao, Errui Ding, Shumin Han:
MAFormer: A transformer network with multi-scale attention fusion for visual recognition. Neurocomputing 595: 127828 (2024) - [j7]Pengyuan Lyu, Chengquan Zhang, Shanshan Liu, Meina Qiao, Yangliu Xu, Liang Wu, Kun Yao, Junyu Han, Errui Ding, Jingdong Wang:
MaskOCR: Scene Text Recognition with Masked Vision-Language Pre-training. Trans. Mach. Learn. Res. 2024 (2024) - [c134]Keyao Wang, Guosheng Zhang, Haixiao Yue, Ajian Liu, Gang Zhang, Haocheng Feng, Junyu Han, Errui Ding, Jingdong Wang:
Multi-Domain Incremental Learning for Face Presentation Attack Detection. AAAI 2024: 5499-5507 - [c133]Jialun Liu, Chenming Wu, Xinqi Liu, Xing Liu, Jinbo Wu, Haotian Peng, Chen Zhao, Haocheng Feng, Jingtuo Liu, Errui Ding:
TexOct: Generating Textures of 3D Models with Octree-based Diffusion. CVPR 2024: 4284-4293 - [c132]Yu Wang, Xin Li, Shengzhao Weng, Gang Zhang, Haixiao Yue, Haocheng Feng, Junyu Han, Errui Ding:
KD-DETR: Knowledge Distillation for Detection Transformer with Consistent Distillation Points Sampling. CVPR 2024: 16016-16025 - [c131]Jiacheng Zhang, Jiaming Li, Xiangru Lin, Wei Zhang, Xiao Tan, Junyu Han, Errui Ding, Jingdong Wang, Guanbin Li:
Decoupled Pseudo-Labeling for Semi-Supervised Monocular 3D Object Detection. CVPR 2024: 16923-16932 - [c130]Chuyang Zhao, Yifan Sun, Wenhao Wang, Qiang Chen, Errui Ding, Yi Yang, Jingdong Wang:
MS-DETR: Efficient DETR Training with Mixed Supervision. CVPR 2024: 17027-17036 - [c129]Yanpeng Sun, Jiahui Chen, Shan Zhang, Xinyu Zhang, Qiang Chen, Gang Zhang, Errui Ding, Jingdong Wang, Zechao Li:
VRP-SAM: SAM with Visual Reference Prompt. CVPR 2024: 23565-23574 - [c128]Rui Zhang, Xiangru Lin, Wei Zhang, Jincheng Lu, Xuekuan Wang, Xiao Tan, Yingying Li, Errui Ding, Jingdong Wang, Guanbin Li:
Interactive 3D Object Detection with Prompts. ECCV (17) 2024: 140-157 - [c127]Jinghua Hou, Tong Wang, Xiaoqing Ye, Zhe Liu, Shi Gong, Xiao Tan, Errui Ding, Jingdong Wang, Xiang Bai:
OPEN: Object-Wise Position Embedding for Multi-view 3D Object Detection. ECCV (26) 2024: 146-162 - [c126]Penghui Du, Yu Wang, Yifan Sun, Luting Wang, Yue Liao, Gang Zhang, Errui Ding, Yan Wang, Jingdong Wang, Si Liu:
LaMI-DETR: Open-Vocabulary Detection with Language Model Instruction. ECCV (23) 2024: 312-328 - [c125]Hao Li, Yuanyuan Gao, Chenming Wu, Dingwen Zhang, Yalun Dai, Chen Zhao, Haocheng Feng, Errui Ding, Jingdong Wang, Junwei Han:
GGRt: Towards Pose-Free Generalizable 3D Gaussian Splatting in Real-Time. ECCV (71) 2024: 325-341 - [c124]Jiazhi Guan, Zhiliang Xu, Hang Zhou, Kaisiyuan Wang, Shengyi He, Zhanwang Zhang, Borong Liang, Haocheng Feng, Errui Ding, Jingtuo Liu, Jingdong Wang, Youjian Zhao, Ziwei Liu:
ReSyncer: Rewiring Style-Based Generator for Unified Audio-Visually Synced Facial Performer. ECCV (41) 2024: 348-367 - [c123]Xingyu Wan, Chengquan Zhang, Pengyuan Lyu, Sen Fan, Zihan Ni, Kun Yao, Errui Ding, Jingdong Wang:
Towards Unified Multi-granularity Text Detection with Interactive Attention. ICML 2024 - [c122]Jiazhi Guan, Quanwei Yang, Kaisiyuan Wang, Hang Zhou, Shengyi He, Zhiliang Xu, Haocheng Feng, Errui Ding, Jingdong Wang, Hongtao Xie, Youjian Zhao, Ziwei Liu:
TALK-Act: Enhance Textural-Awareness for 2D Speaking Avatar Reenactment with Diffusion Model. SIGGRAPH Asia 2024: 109:1-109:11 - [c121]Jinbo Wu, Xiaobo Gao, Xing Liu, Zhengyang Shen, Chen Zhao, Haocheng Feng, Jingtuo Liu, Errui Ding:
HD-Fusion: Detailed Text-to-3D Generation Leveraging Multiple Noise Estimation. WACV 2024: 3190-3199 - [i149]Chuyang Zhao, Yifan Sun, Wenhao Wang, Qiang Chen, Errui Ding, Yi Yang, Jingdong Wang:
MS-DETR: Efficient DETR Training with Mixed Supervision. CoRR abs/2401.03989 (2024) - [i148]Xinqi Liu, Chenming Wu, Jialun Liu, Xing Liu, Jinbo Wu, Chen Zhao, Haocheng Feng, Errui Ding, Jingdong Wang:
GVA: Reconstructing Vivid 3D Gaussian Avatars from Monocular Videos. CoRR abs/2402.16607 (2024) - [i147]Yanpeng Sun, Jiahui Chen, Shan Zhang, Xinyu Zhang, Qiang Chen, Gang Zhang, Errui Ding, Jingdong Wang, Zechao Li:
VRP-SAM: SAM with Visual Reference Prompt. CoRR abs/2402.17726 (2024) - [i146]Hao Li, Yuanyuan Gao, Chenming Wu, Dingwen Zhang, Yalun Dai, Chen Zhao, Haocheng Feng, Errui Ding, Jingdong Wang, Junwei Han:
GGRt: Towards Pose-free Generalizable 3D Gaussian Splatting in Real-time. CoRR abs/2403.10147 (2024) - [i145]Jinbo Wu, Xing Liu, Chenming Wu, Xiaobo Gao, Jialun Liu, Xinqi Liu, Chen Zhao, Haocheng Feng, Errui Ding, Jingdong Wang:
TexRO: Generating Delicate Textures of 3D Models by Recursive Optimization. CoRR abs/2403.15009 (2024) - [i144]Jiaming Li, Xiangru Lin, Wei Zhang, Xiao Tan, Yingying Li, Junyu Han, Errui Ding, Jingdong Wang, Guanbin Li:
Gradient-based Sampling for Class Imbalanced Semi-supervised Object Detection. CoRR abs/2403.15127 (2024) - [i143]Jiacheng Zhang, Jiaming Li, Xiangru Lin, Wei Zhang, Xiao Tan, Junyu Han, Errui Ding, Jingdong Wang, Guanbin Li:
Decoupled Pseudo-labeling for Semi-Supervised Monocular 3D Object Detection. CoRR abs/2403.17387 (2024) - [i142]Xingyu Wan, Chengquan Zhang, Pengyuan Lyu, Sen Fan, Zihan Ni, Kun Yao, Errui Ding, Jingdong Wang:
Towards Unified Multi-granularity Text Detection with Interactive Attention. CoRR abs/2405.19765 (2024) - [i141]Pengyuan Lyu, Yulin Li, Hao Zhou, Weihong Ma, Xingyu Wan, Qunyi Xie, Liang Wu, Chengquan Zhang, Kun Yao, Errui Ding, Jingdong Wang:
StrucTexTv3: An Efficient Vision-Language Model for Text-rich Image Perception, Comprehension, and Beyond. CoRR abs/2405.21013 (2024) - [i140]Yanmin Wu, Jiarui Meng, Haijie Li, Chenming Wu, Yahao Shi, Xinhua Cheng, Chen Zhao, Haocheng Feng, Errui Ding, Jingdong Wang, Jian Zhang:
OpenGaussian: Towards Point-Level 3D Gaussian-based Open Vocabulary Understanding. CoRR abs/2406.02058 (2024) - [i139]Qiang Chen, Xiangbo Su, Xinyu Zhang, Jian Wang, Jiahui Chen, Yunpeng Shen, Chuchu Han, Ziliang Chen, Weixiang Xu, Fanrong Li, Shan Zhang, Kun Yao, Errui Ding, Gang Zhang, Jingdong Wang:
LW-DETR: A Transformer Replacement to YOLO for Real-Time Detection. CoRR abs/2406.03459 (2024) - [i138]Zhengqi Zhao, Xiaohu Huang, Hao Zhou, Kun Yao, Errui Ding, Jingdong Wang, Xinggang Wang, Wenyu Liu, Bin Feng:
Skim then Focus: Integrating Contextual and Fine-grained Views for Repetitive Action Counting. CoRR abs/2406.08814 (2024) - [i137]Hao Li, Jingfeng Li, Dingwen Zhang, Chenming Wu, Jieqi Shi, Chen Zhao, Haocheng Feng, Errui Ding, Jingdong Wang, Junwei Han:
VDG: Vision-Only Dynamic Gaussian for Driving Simulation. CoRR abs/2406.18198 (2024) - [i136]Hao Li, Ming Yuan, Yan Zhang, Chenming Wu, Chen Zhao, Chunyu Song, Haocheng Feng, Errui Ding, Dingwen Zhang, Jingdong Wang:
XLD: A Cross-Lane Dataset for Benchmarking Novel Driving View Synthesis. CoRR abs/2406.18360 (2024) - [i135]Yu Wang, Xiangbo Su, Qiang Chen, Xinyu Zhang, Teng Xi, Kun Yao, Errui Ding, Gang Zhang, Jingdong Wang:
OVLW-DETR: Open-Vocabulary Light-Weighted Detection Transformer. CoRR abs/2407.10655 (2024) - [i134]Jinghua Hou, Tong Wang, Xiaoqing Ye, Zhe Liu, Shi Gong, Xiao Tan, Errui Ding, Jingdong Wang, Xiang Bai:
OPEN: Object-wise Position Embedding for Multi-view 3D Object Detection. CoRR abs/2407.10753 (2024) - [i133]Penghui Du, Yu Wang, Yifan Sun, Luting Wang, Yue Liao, Gang Zhang, Errui Ding, Yan Wang, Jingdong Wang, Si Liu:
LaMI-DETR: Open-Vocabulary Detection with Language Model Instruction. CoRR abs/2407.11335 (2024) - [i132]Lingfeng Yang, Xinyu Zhang, Xiang Li, Jinwen Chen, Kun Yao, Gang Zhang, Errui Ding, Lingqiao Liu, Jingdong Wang, Jian Yang:
Add-SD: Rational Generation without Manual Reference. CoRR abs/2407.21016 (2024) - [i131]Jiazhi Guan, Zhiliang Xu, Hang Zhou, Kaisiyuan Wang, Shengyi He, Zhanwang Zhang, Borong Liang, Haocheng Feng, Errui Ding, Jingtuo Liu, Jingdong Wang, Youjian Zhao, Ziwei Liu:
ReSyncer: Rewiring Style-based Generator for Unified Audio-Visually Synced Facial Performer. CoRR abs/2408.03284 (2024) - [i130]Jing Hao, Yuxiang Zhao, Song Chen, Yanpeng Sun, Qiang Chen, Gang Zhang, Kun Yao, Errui Ding, Jingdong Wang:
FullAnno: A Data Engine for Enhancing Image Comprehension of MLLMs. CoRR abs/2409.13540 (2024) - [i129]Chuyang Zhao, Yuxing Song, Wenhao Wang, Haocheng Feng, Errui Ding, Yifan Sun, Xinyan Xiao, Jingdong Wang:
MonoFormer: One Transformer for Both Diffusion and Autoregression. CoRR abs/2409.16280 (2024) - [i128]Yubin Wang, Zhikang Zou, Xiaoqing Ye, Xiao Tan, Errui Ding, Cairong Zhao:
Uni2Det: Unified and Universal Framework for Prompt-Guided Multi-dataset 3D Detection. CoRR abs/2409.20558 (2024) - [i127]Jing Yang, Minyue Jiang, Sen Yang, Xiao Tan, Yingying Li, Errui Ding, Hanli Wang, Jingdong Wang:
MGMapNet: Multi-Granularity Representation Learning for End-to-End Vectorized HD Map Construction. CoRR abs/2410.07733 (2024) - [i126]Jiazhi Guan, Quanwei Yang, Kaisiyuan Wang, Hang Zhou, Shengyi He, Zhiliang Xu, Haocheng Feng, Errui Ding, Jingdong Wang, Hongtao Xie, Youjian Zhao, Ziwei Liu:
TALK-Act: Enhance Textural-Awareness for 2D Speaking Avatar Reenactment with Diffusion Model. CoRR abs/2410.10696 (2024) - [i125]Linger Deng, Yuliang Liu, Bohan Li, Dongliang Luo, Liang Wu, Chengquan Zhang, Pengyuan Lyu, Ziyang Zhang, Gang Zhang, Errui Ding, Yingying Zhu, Xiang Bai:
R-CoT: Reverse Chain-of-Thought Problem Generation for Geometric Reasoning in Large Multimodal Models. CoRR abs/2410.17885 (2024) - 2023
- [j6]Cong Cao, Tianwei Lin, Dongliang He, Fu Li, Huanjing Yue, Jing-Yu Yang, Errui Ding:
Adversarial Dual-Student With Differentiable Spatial Warping for Semi-Supervised Semantic Segmentation. IEEE Trans. Circuits Syst. Video Technol. 33(2): 793-803 (2023) - [j5]Xinyu Zhang, Jiahui Chen, Junkun Yuan, Qiang Chen, Jian Wang, Xiaodi Wang, Shumin Han, Xiaokang Chen, Jimin Pi, Kun Yao, Junyu Han, Errui Ding, Jingdong Wang:
CAE v2: Context Autoencoder with CLIP Latent Alignment. Trans. Mach. Learn. Res. 2023 (2023) - [j4]Zhuoqi Ma, Tianwei Lin, Xin Li, Fu Li, Dongliang He, Errui Ding, Nannan Wang, Xinbo Gao:
Dual-Affinity Style Embedding Network for Semantic-Aligned Image Style Transfer. IEEE Trans. Neural Networks Learn. Syst. 34(10): 7404-7417 (2023) - [c120]Zhe Liu, Xiaoqing Ye, Xiao Tan, Errui Ding, Xiang Bai:
StereoDistill: Pick the Cream from LiDAR for Distilling Stereo-Based 3D Object Detection. AAAI 2023: 1790-1798 - [c119]Kaisiyuan Wang, Changcheng Liang, Hang Zhou, Jiaxiang Tang, Qianyi Wu, Dongliang He, Zhibin Hong, Jingtuo Liu, Errui Ding, Ziwei Liu, Jingdong Wang:
Robust Video Portrait Reenactment via Personalized Representation Quantization. AAAI 2023: 2564-2572 - [c118]Haixiao Yue, Keyao Wang, Guosheng Zhang, Haocheng Feng, Junyu Han, Errui Ding, Jingdong Wang:
Cyclically Disentangled Feature Translation for Face Anti-spoofing. AAAI 2023: 3358-3366 - [c117]Jiazhi Guan, Zhanwang Zhang, Hang Zhou, Tianshu Hu, Kaisiyuan Wang, Dongliang He, Haocheng Feng, Jingtuo Liu, Errui Ding, Ziwei Liu, Jingdong Wang:
StyleSync: High-Fidelity Generalized and Personalized Lip Sync in Style-Based Generator. CVPR 2023: 1505-1515 - [c116]Chang Liu, Weiming Zhang, Xiangru Lin, Wei Zhang, Xiao Tan, Junyu Han, Xiaomao Li, Errui Ding, Jingdong Wang:
Ambiguity-Resistant Semi-Supervised Learning for Dense Object Detection. CVPR 2023: 15579-15588 - [c115]Zhongwei Qiu, Qiansheng Yang, Jian Wang, Haocheng Feng, Junyu Han, Errui Ding, Chang Xu, Dongmei Fu, Jingdong Wang:
PSVT: End-to-End Multi-Person 3D Pose and Shape Estimation with Progressive Video Transformers. CVPR 2023: 21254-21263 - [c114]Kaixin Xiong, Shi Gong, Xiaoqing Ye, Xiao Tan, Ji Wan, Errui Ding, Jingdong Wang, Xiang Bai:
CAPE: Camera View Position Embedding for Multi-View 3D Object Detection. CVPR 2023: 21570-21579 - [c113]Jiacheng Zhang, Xiangru Lin, Wei Zhang, Kuo Wang, Xiao Tan, Junyu Han, Errui Ding, Jingdong Wang, Guanbin Li:
Semi-DETR: Semi-Supervised Object Detection with Detection Transformers. CVPR 2023: 23809-23818 - [c112]Tailin Chen, Desen Zhou, Jian Wang, Shidong Wang, Qian He, Chuanyang Hu, Errui Ding, Yu Guan, Xuming He:
Part-aware Prototypical Graph Network for One-shot Skeleton-based Action Recognition. FG 2023: 1-8 - [c111]Qiang Chen, Xiaokang Chen, Jian Wang, Shan Zhang, Kun Yao, Haocheng Feng, Junyu Han, Errui Ding, Gang Zeng, Jingdong Wang:
Group DETR: Fast DETR Training with Group-Wise One-to-Many Assignment. ICCV 2023: 6610-6619 - [c110]Lin Zhang, Xin Li, Dongliang He, Fu Li, Errui Ding, Zhaoxiang Zhang:
LMR: A Large-Scale Multi-Reference Dataset for Reference-based Super-Resolution. ICCV 2023: 13072-13081 - [c109]Huan Liu, Qiang Chen, Zichang Tan, Jiang-Jiang Liu, Jian Wang, Xiangbo Su, Xiaolong Li, Kun Yao, Junyu Han, Errui Ding, Yao Zhao, Jingdong Wang:
Group Pose: A Simple Baseline for End-to-End Multi-person Pose Estimation. ICCV 2023: 14983-14992 - [c108]Xiang Guo, Jiadai Sun, Yuchao Dai, Guanying Chen, Xiaoqing Ye, Xiao Tan, Errui Ding, Yumeng Zhang, Jingdong Wang:
Forward Flow for Novel View Synthesis of Dynamic Scenes. ICCV 2023: 15976-15987 - [c107]Shuo Li, Yue He, Weiming Zhang, Wei Zhang, Xiao Tan, Junyu Han, Errui Ding, Jingdong Wang:
CFCG: Semi-Supervised Semantic Segmentation via Cross-Fusion and Contour Guidance Supervision. ICCV 2023: 16302-16312 - [c106]Jiaming Li, Xiangru Lin, Wei Zhang, Xiao Tan, Yingying Li, Junyu Han, Errui Ding, Jingdong Wang, Guanbin Li:
Gradient-based Sampling for Class Imbalanced Semi-supervised Object Detection. ICCV 2023: 16344-16354 - [c105]Jiaxiang Tang, Hang Zhou, Xiaokang Chen, Tianshu Hu, Errui Ding, Jingdong Wang, Gang Zeng:
Delicate Textured Mesh Recovery from NeRF via Adaptive Surface Refinement. ICCV 2023: 17693-17703 - [c104]Wenwen Yu, Chengquan Zhang, Haoyu Cao, Wei Hua, Bohan Li, Huang Chen, Mingyu Liu, Mingrui Chen, Jianfeng Kuang, Mengjun Cheng, Yuning Du, Shikun Feng, Xiaoguang Hu, Pengyuan Lyu, Kun Yao, Yuechen Yu, Yuliang Liu, Wanxiang Che, Errui Ding, Cheng-Lin Liu, Jiebo Luo, Shuicheng Yan, Min Zhang, Dimosthenis Karatzas, Xing Sun, Jingdong Wang, Xiang Bai:
ICDAR 2023 Competition on Structured Text Extraction from Visually-Rich Document Images. ICDAR (2) 2023: 536-552 - [c103]Xiaohu Huang, Hao Zhou, Jian Wang, Haocheng Feng, Junyu Han, Errui Ding, Jingdong Wang, Xinggang Wang, Wenyu Liu, Bin Feng:
Graph Contrastive Learning for Skeleton-based Action Recognition. ICLR 2023 - [c102]Yuechen Yu, Yulin Li, Chengquan Zhang, Xiaoqiang Zhang, Zengyuan Guo, Xiameng Qin, Kun Yao, Junyu Han, Errui Ding, Jingdong Wang:
StrucTexTv2: Masked Visual-Textual Prediction for Document Image Pre-training. ICLR 2023 - [c101]Wei Xu, Kangkang Wang, Ziliang Chen, Bin He, Bi Li, Haocheng Feng, Gang Zhang, Jingtuo Liu, Junyu Han, Errui Ding:
MSAbox: A spatially stable face detector. ICME 2023: 1745-1750 - [c100]Junkun Yuan, Xinyu Zhang, Hao Zhou, Jian Wang, Zhongwei Qiu, Zhiyin Shao, Shaofeng Zhang, Sifan Long, Kun Kuang, Kun Yao, Junyu Han, Errui Ding, Lanfen Lin, Fei Wu, Jingdong Wang:
HAP: Structure-Aware Masked Image Modeling for Human-Centric Perception. NeurIPS 2023 - [c99]Kaisiyuan Wang, Hang Zhou, Qianyi Wu, Jiaxiang Tang, Zhiliang Xu, Borong Liang, Tianshu Hu, Errui Ding, Jingtuo Liu, Ziwei Liu, Jingdong Wang:
Efficient Video Portrait Reenactment via Grid-based Codebook. SIGGRAPH (Conference Paper Track) 2023: 66:1-66:9 - [c98]Zhihong Pan, Baopu Li, Dongliang He, Wenhao Wu, Errui Ding:
Effective Invertible Arbitrary Image Rescaling. WACV 2023: 5405-5414 - [i124]Zhe Liu, Xiaoqing Ye, Xiao Tan, Errui Ding, Xiang Bai:
StereoDistill: Pick the Cream from LiDAR for Distilling Stereo-based 3D Object Detection. CoRR abs/2301.01615 (2023) - [i123]Xiaohu Huang, Hao Zhou, Bin Feng, Xinggang Wang, Wenyu Liu, Jian Wang, Haocheng Feng, Junyu Han, Errui Ding, Jingdong Wang:
Graph Contrastive Learning for Skeleton-based Action Recognition. CoRR abs/2301.10900 (2023) - [i122]Yasheng Sun, Qianyi Wu, Hang Zhou, Kaisiyuan Wang, Tianshu Hu, Chen-Chieh Liao, Dongliang He, Jingtuo Liu, Errui Ding, Jingdong Wang, Shio Miyafuji, Ziwei Liu, Hideki Koike:
Make Your Brief Stroke Real and Stereoscopic: 3D-Aware Simplified Sketch to Portrait Generation. CoRR abs/2302.06857 (2023) - [i121]Zhichao Liu, Leshan Wang, Desen Zhou, Jian Wang, Songyang Zhang, Yang Bai, Errui Ding, Rui Fan:
Temporal Segment Transformer for Action Segmentation. CoRR abs/2302.13074 (2023) - [i120]Yuechen Yu, Yulin Li, Chengquan Zhang, Xiaoqiang Zhang, Zengyuan Guo, Xiameng Qin, Kun Yao, Junyu Han, Errui Ding, Jingdong Wang:
StrucTexTv2: Masked Visual-Textual Prediction for Document Image Pre-training. CoRR abs/2303.00289 (2023) - [i119]Jiaxiang Tang, Hang Zhou, Xiaokang Chen, Tianshu Hu, Errui Ding, Jingdong Wang, Gang Zeng:
Delicate Textured Mesh Recovery from NeRF via Adaptive Surface Refinement. CoRR abs/2303.02091 (2023) - [i118]Lin Zhang, Xin Li, Dongliang He, Errui Ding, Zhaoxiang Zhang:
LMR: A Large-Scale Multi-Reference Dataset for Reference-based Super-Resolution. CoRR abs/2303.04970 (2023) - [i117]Zhongwei Qiu, Qiansheng Yang, Jian Wang, Haocheng Feng, Junyu Han, Errui Ding, Chang Xu, Dongmei Fu, Jingdong Wang:
PSVT: End-to-End Multi-person 3D Pose and Shape Estimation with Progressive Video Transformers. CoRR abs/2303.09187 (2023) - [i116]Kaixin Xiong, Shi Gong, Xiaoqing Ye, Xiao Tan, Ji Wan, Errui Ding, Jingdong Wang, Xiang Bai:
CAPE: Camera View Position Embedding for Multi-View 3D Object Detection. CoRR abs/2303.10209 (2023) - [i115]Chang Liu, Weiming Zhang, Xiangru Lin, Wei Zhang, Xiao Tan, Junyu Han, Xiaomao Li, Errui Ding, Jingdong Wang:
Ambiguity-Resistant Semi-Supervised Learning for Dense Object Detection. CoRR abs/2303.14960 (2023) - [i114]Yifu Zhang, Xinggang Wang, Xiaoqing Ye, Wei Zhang, Jincheng Lu, Xiao Tan, Errui Ding, Peize Sun, Jingdong Wang:
ByteTrackV2: 2D and 3D Multi-Object Tracking by Associating Every Detection Box. CoRR abs/2303.15334 (2023) - [i113]Jiazhi Guan, Zhanwang Zhang, Hang Zhou, Tianshu Hu, Kaisiyuan Wang, Dongliang He, Haocheng Feng, Jingtuo Liu, Errui Ding, Ziwei Liu, Jingdong Wang:
StyleSync: High-Fidelity Generalized and Personalized Lip Sync in Style-based Generator. CoRR abs/2305.05445 (2023) - [i112]Zhe Liu, Xiaoqing Ye, Zhikang Zou, Xinwei He, Xiao Tan, Errui Ding, Jingdong Wang, Xiang Bai:
Multi-Modal 3D Object Detection by Box Matching. CoRR abs/2305.07713 (2023) - [i111]Jiazhi Guan, Tianshu Hu, Hang Zhou, Zhizhi Guo, Lirui Deng, Chengbin Quan, Errui Ding, Youjian Zhao:
Building an Invisible Shield for Your Portrait against Deepfakes. CoRR abs/2305.12881 (2023) - [i110]Wenwen Yu, Chengquan Zhang, Haoyu Cao, Wei Hua, Bohan Li, Huang Chen, Mingyu Liu, Mingrui Chen, Jianfeng Kuang, Mengjun Cheng, Yuning Du, Shikun Feng, Xiaoguang Hu, Pengyuan Lyu, Kun Yao, Yuechen Yu, Yuliang Liu, Wanxiang Che, Errui Ding, Cheng-Lin Liu, Jiebo Luo, Shuicheng Yan, Min Zhang, Dimosthenis Karatzas, Xing Sun, Jingdong Wang, Xiang Bai:
ICDAR 2023 Competition on Structured Text Extraction from Visually-Rich Document Images. CoRR abs/2306.03287 (2023) - [i109]Zhongwei Qiu, Qiansheng Yang, Jian Wang, Xiyu Wang, Chang Xu, Dongmei Fu, Kun Yao, Junyu Han, Errui Ding, Jingdong Wang:
Learning Structure-Guided Diffusion Model for 2D Human Pose Estimation. CoRR abs/2306.17074 (2023) - [i108]Jiacheng Zhang, Xiangru Lin, Wei Zhang, Kuo Wang, Xiao Tan, Junyu Han, Errui Ding, Jingdong Wang, Guanbin Li:
Semi-DETR: Semi-Supervised Object Detection with Detection Transformers. CoRR abs/2307.08095 (2023) - [i107]Jinbo Wu, Xiaobo Gao, Xing Liu, Zhengyang Shen, Chen Zhao, Haocheng Feng, Jingtuo Liu, Errui Ding:
HD-Fusion: Detailed Text-to-3D Generation Leveraging Multiple Noise Estimation. CoRR abs/2307.16183 (2023) - [i106]Huan Liu, Qiang Chen, Zichang Tan, Jiang-Jiang Liu, Jian Wang, Xiangbo Su, Xiaolong Li, Kun Yao, Junyu Han, Errui Ding, Yao Zhao, Jingdong Wang:
Group Pose: A Simple Baseline for End-to-End Multi-person Pose Estimation. CoRR abs/2308.07313 (2023) - [i105]Xin Li, Wenqing Chu, Ye Wu, Weihang Yuan, Fanglong Liu, Qi Zhang, Fu Li, Haocheng Feng, Errui Ding, Jingdong Wang:
VideoGen: A Reference-Guided Latent Diffusion Approach for High Definition Text-to-Video Generation. CoRR abs/2309.00398 (2023) - [i104]Xiang Guo, Jiadai Sun, Yuchao Dai, Guanying Chen, Xiaoqing Ye, Xiao Tan, Errui Ding, Yumeng Zhang, Jingdong Wang:
Forward Flow for Novel View Synthesis of Dynamic Scenes. CoRR abs/2309.17390 (2023) - [i103]Deli Yu, Teng Xi, Jianwei Li, Baopu Li, Gang Zhang, Haocheng Feng, Junyu Han, Jingtuo Liu, Errui Ding, Jingdong Wang:
Accelerating Vision Transformers Based on Heterogeneous Attention Patterns. CoRR abs/2310.07664 (2023) - [i102]Junkun Yuan, Xinyu Zhang, Hao Zhou, Jian Wang, Zhongwei Qiu, Zhiyin Shao, Shaofeng Zhang, Sifan Long, Kun Kuang, Kun Yao, Junyu Han, Errui Ding, Lanfen Lin, Fei Wu, Jingdong Wang:
HAP: Structure-Aware Masked Image Modeling for Human-Centric Perception. CoRR abs/2310.20695 (2023) - [i101]Zipeng Qi, Guoxi Huang, Zebin Huang, Qin Guo, Jinwen Chen, Junyu Han, Jian Wang, Gang Zhang, Lufei Liu, Errui Ding, Jingdong Wang:
Layered Rendering Diffusion Model for Zero-Shot Guided Image Synthesis. CoRR abs/2311.18435 (2023) - [i100]Yahao Shi, Yanmin Wu, Chenming Wu, Xing Liu, Chen Zhao, Haocheng Feng, Jingtuo Liu, Liangjun Zhang, Jian Zhang, Bin Zhou, Errui Ding, Jingdong Wang:
GIR: 3D Gaussian Inverse Rendering for Relightable Scene Factorization. CoRR abs/2312.05133 (2023) - 2022
- [j3]Liang Du, Xiaoqing Ye, Xiao Tan, Edward Johns, Bo Chen, Errui Ding, Xiangyang Xue, Jianfeng Feng:
AGO-Net: Association-Guided 3D Point Cloud Object Detection Network. IEEE Trans. Pattern Anal. Mach. Intell. 44(11): 8097-8109 (2022) - [c97]Zhiliang Xu, Zhibin Hong, Changxing Ding, Zhen Zhu, Junyu Han, Jingtuo Liu, Errui Ding:
MobileFaceSwap: A Lightweight Framework for Video Face Swapping. AAAI 2022: 2973-2981 - [c96]Xiang Guo, Guanying Chen, Yuchao Dai, Xiaoqing Ye, Jiadai Sun, Xiao Tan, Errui Ding:
Neural Deformable Voxel Grid for Fast Optimization of Dynamic View Synthesis. ACCV (1) 2022: 450-468 - [c95]Xipeng Yang, Jin Ye, Jincheng Lu, Chenting Gong, Minyue Jiang, Xiangru Lin, Wei Zhang, Xiao Tan, Yingying Li, Xiaoqing Ye, Errui Ding:
Box-Grained Reranking Matching for Multi-Camera Multi-Target Tracking. CVPR Workshops 2022: 3095-3105 - [c94]Jiacheng Zhang, Xiangru Lin, Minyue Jiang, Yue Yu, Chenting Gong, Wei Zhang, Xiao Tan, Yingying Li, Errui Ding, Guanbin Li:
A Multi-granularity Retrieval System for Natural Language-based Vehicle Retrieval. CVPR Workshops 2022: 3215-3224 - [c93]Borong Liang, Yan Pan, Zhizhi Guo, Hang Zhou, Zhibin Hong, Xiaoguang Han, Junyu Han, Jingtuo Liu, Errui Ding, Jingdong Wang:
Expressive Talking Head Generation with Granular Audio-Visual Control. CVPR 2022: 3377-3386 - [c92]Mengjun Cheng, Yipeng Sun, Longchao Wang, Xiongwei Zhu, Kun Yao, Jie Chen, Guoli Song, Junyu Han, Jingtuo Liu, Errui Ding, Jingdong Wang:
ViSTA: Vision and Scene Text Aggregation for Cross-Modal Retrieval. CVPR 2022: 5174-5183 - [c91]Qiang Chen, Qiman Wu, Jian Wang, Qinghao Hu, Tao Hu, Errui Ding, Jian Cheng, Jingdong Wang:
MixFormer: Mixing Features across Windows and Dimensions. CVPR 2022: 5239-5249 - [c90]