Остановите войну!
for scientists:
default search action
Peng Gao 0007
Person information
- affiliation: Shanghai Artificial Intelligence Laboratory, China
- affiliation (PhD 2021): Chinese University of Hong Kong, Hong Kong
Other persons with the same name
- Peng Gao — disambiguation page
- Peng Gao 0001 — China Mobile Group Design Institute Co., Ltd, Division of Research, China
- Peng Gao 0002 — University of South Carolina, Department of Geography, Columbia, SC, USA
- Peng Gao 0003 — University at Buffalo, Department of Geography, NY, USA
- Peng Gao 0004 — Jilin University, Institute of Mathematics, Changchun, China
- Peng Gao 0005 — Qufu Normal University, School of Cyber Science and Engineering, China (and 1 more)
- Peng Gao 0006 — Berlin Institute of Technology, Germany
- Peng Gao 0008 — Virginia Tech, Department of Computer Science, Blacksburg, VA, USA (and 2 more)
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
Journal Articles
- 2024
- [j9]Kexue Fu, Peng Gao, Shaolei Liu, Linhao Qu, Longxiang Gao, Manning Wang:
POS-BERT: Point cloud one-stage BERT pre-training. Expert Syst. Appl. 240: 122563 (2024) - [j8]Peng Gao, Shijie Geng, Renrui Zhang, Teli Ma, Rongyao Fang, Yongfeng Zhang, Hongsheng Li, Yu Qiao:
CLIP-Adapter: Better Vision-Language Models with Feature Adapters. Int. J. Comput. Vis. 132(2): 581-595 (2024) - [j7]Peng Gao, Ziyi Lin, Renrui Zhang, Rongyao Fang, Hongyang Li, Hongsheng Li, Yu Qiao:
Mimic before Reconstruct: Enhancing Masked Autoencoders with Feature Mimicking. Int. J. Comput. Vis. 132(5): 1546-1556 (2024) - 2023
- [j6]Kunchang Li, Yali Wang, Junhao Zhang, Peng Gao, Guanglu Song, Yu Liu, Hongsheng Li, Yu Qiao:
UniFormer: Unifying Convolution and Self-Attention for Visual Recognition. IEEE Trans. Pattern Anal. Mach. Intell. 45(10): 12581-12600 (2023) - [j5]Weicong Su, Yali Wang, Kunchang Li, Peng Gao, Yu Qiao:
Hybrid token transformer for deep face recognition. Pattern Recognit. 139: 109443 (2023) - [j4]Guanqun Wang, He Chen, Liang Chen, Yin Zhuang, Shanghang Zhang, Tong Zhang, Hao Dong, Peng Gao:
P2FEViT: Plug-and-Play CNN Feature Embedded Hybrid Vision Transformer for Remote Sensing Image Classification. Remote. Sens. 15(7): 1773 (2023) - [j3]Tong Zhang, Yin Zhuang, He Chen, Liang Chen, Guanqun Wang, Peng Gao, Hao Dong:
Object-Centric Masked Image Modeling-Based Self-Supervised Pretraining for Remote Sensing Object Detection. IEEE J. Sel. Top. Appl. Earth Obs. Remote. Sens. 16: 5013-5025 (2023) - 2022
- [j2]Jianhao Li, Yin Zhuang, Shan Dong, Peng Gao, Hao Dong, He Chen, Liang Chen, Lianlin Li:
Hierarchical Disentangling Network for Building Extraction from Very High Resolution Optical Remote Sensing Imagery. Remote. Sens. 14(7): 1767 (2022) - [j1]Tong Zhang, Peng Gao, Hao Dong, Yin Zhuang, Guanqun Wang, Wei Zhang, He Chen:
Consecutive Pre-Training: A Knowledge Transfer Learning Strategy with Relevant Unlabeled Data for Remote Sensing Domain. Remote. Sens. 14(22): 5675 (2022)
Conference and Workshop Papers
- 2024
- [c32]Shilin Yan, Renrui Zhang, Ziyu Guo, Wenchao Chen, Wei Zhang, Hongyang Li, Yu Qiao, Hao Dong, Zhongjiang He, Peng Gao:
Referred by Multi-Modality: A Unified Temporal Transformer for Video Object Segmentation. AAAI 2024: 6449-6457 - 2023
- [c31]Sheng Xu, Yanjing Li, Teli Ma, Mingbao Lin, Hao Dong, Baochang Zhang, Peng Gao, Jinhu Lu:
Resilient Binary Neural Network. AAAI 2023: 10620-10628 - [c30]Sheng Xu, Yanjing Li, Mingbao Lin, Peng Gao, Guodong Guo, Jinhu Lü, Baochang Zhang:
Q-DETR: An Efficient Low-Bit Quantized Detection Transformer. CVPR 2023: 3842-3851 - [c29]Hongwei Xue, Peng Gao, Hongyang Li, Yu Qiao, Hao Sun, Houqiang Li, Jiebo Luo:
Stare at What You See: Masked Image Modeling without Reconstruction. CVPR 2023: 22732-22741 - [c28]Xiangyang Zhu, Renrui Zhang, Bowei He, Aojun Zhou, Dong Wang, Bin Zhao, Peng Gao:
Not All Features Matter: Enhancing Few-shot CLIP with Adaptive Prior Refinement. ICCV 2023: 2605-2615 - [c27]Renrui Zhang, Han Qiu, Tai Wang, Ziyu Guo, Ziteng Cui, Yu Qiao, Hongsheng Li, Peng Gao:
MonoDETR: Depth-guided Transformer for Monocular 3D Object Detection. ICCV 2023: 9121-9132 - [c26]Aojun Zhou, Yang Li, Zipeng Qin, Jianbo Liu, Junting Pan, Renrui Zhang, Rui Zhao, Peng Gao, Hongsheng Li:
SparseMAE: Sparse Training Meets Masked Autoencoders. ICCV 2023: 16130-16140 - [c25]Yongjing Cui, Yin Zhuang, Shan Dong, Xinyi Zhang, Peng Gao, He Chen, Liang Chen:
Hybrid Transformer Network for Change Detection Under Self-Supervised Pretraining. IGARSS 2023: 6652-6655 - 2022
- [c24]Ziteng Cui, Kunchang Li, Lin Gu, Shenghan Su, Peng Gao, Zhengkai Jiang, Yu Qiao, Tatsuya Harada:
You Only Need 90K Parameters to Adapt Light: a Light Weight Transformer for Image Enhancement and Exposure Correction. BMVC 2022: 238 - [c23]Teli Ma, Shijie Geng, Mengmeng Wang, Sheng Xu, Hongsheng Li, Baochang Zhang, Peng Gao, Yu Qiao:
Unleashing the Potential of Vision-Language Models for Long-Tailed Visual Recognition. BMVC 2022: 481 - [c22]Renrui Zhang, Ziyu Guo, Wei Zhang, Kunchang Li, Xupeng Miao, Bin Cui, Yu Qiao, Peng Gao, Hongsheng Li:
PointCLIP: Point Cloud Understanding by CLIP. CVPR 2022: 8542-8552 - [c21]Sheng Xu, Yanjing Li, Tiancheng Wang, Teli Ma, Baochang Zhang, Peng Gao, Yu Qiao, Jinhu Lü, Guodong Guo:
Recurrent Bilinear Optimization for Binary Neural Networks. ECCV (24) 2022: 19-35 - [c20]Zhengkai Jiang, Yuxi Li, Ceyuan Yang, Peng Gao, Yabiao Wang, Ying Tai, Chengjie Wang:
Prototypical Contrast Adaptation for Domain Adaptive Semantic Segmentation. ECCV (34) 2022: 36-54 - [c19]Sheng Xu, Yanjing Li, Bohan Zeng, Teli Ma, Baochang Zhang, Xianbin Cao, Peng Gao, Jinhu Lü:
IDa-Det: An Information Discrepancy-Aware Distillation for 1-Bit Detectors. ECCV (11) 2022: 346-361 - [c18]Ziyi Lin, Shijie Geng, Renrui Zhang, Peng Gao, Gerard de Melo, Xiaogang Wang, Jifeng Dai, Yu Qiao, Hongsheng Li:
Frozen CLIP Models are Efficient Video Learners. ECCV (35) 2022: 388-404 - [c17]Renrui Zhang, Wei Zhang, Rongyao Fang, Peng Gao, Kunchang Li, Jifeng Dai, Yu Qiao, Hongsheng Li:
Tip-Adapter: Training-Free Adaption of CLIP for Few-Shot Classification. ECCV (35) 2022: 493-510 - [c16]Kunchang Li, Yali Wang, Peng Gao, Guanglu Song, Yu Liu, Hongsheng Li, Yu Qiao:
UniFormer: Unified Transformer for Efficient Spatial-Temporal Representation Learning. ICLR 2022 - [c15]Shanjunyu Liu, Yin Zhuang, Hao Dong, Peng Gao, Guanqun Wang, Tong Zhang, Liang Chen, He Chen, Lianlin Li:
Adaptive Local Context Embedding for Small Vehicle Detection from Aerial Optical Remote Sensing Images. IGARSS 2022: 1712-1715 - [c14]Peng Gao, Teli Ma, Hongsheng Li, Ziyi Lin, Jifeng Dai, Yu Qiao:
MCMAE: Masked Convolution Meets Masked Autoencoders. NeurIPS 2022 - [c13]Yanjing Li, Sheng Xu, Baochang Zhang, Xianbin Cao, Peng Gao, Guodong Guo:
Q-ViT: Accurate and Fully Quantized Low-bit Vision Transformer. NeurIPS 2022 - [c12]Renrui Zhang, Ziyu Guo, Peng Gao, Rongyao Fang, Bin Zhao, Dong Wang, Yu Qiao, Hongsheng Li:
Point-M2AE: Multi-scale Masked Autoencoders for Hierarchical Point Cloud Pre-training. NeurIPS 2022 - 2021
- [c11]Shijie Geng, Peng Gao, Moitreya Chatterjee, Chiori Hori, Jonathan Le Roux, Yongfeng Zhang, Hongsheng Li, Anoop Cherian:
Dynamic Graph Representation Learning for Video Dialog via Multi-Modal Shuffled Transformers. AAAI 2021: 1415-1423 - [c10]Minghang Zheng, Peng Gao, Renrui Zhang, Kunchang Li, Hongsheng Li, Hao Dong:
End-to-End Object Detection with Adaptive Clustering Transformer. BMVC 2021: 226 - [c9]Peng Gao, Minghang Zheng, Xiaogang Wang, Jifeng Dai, Hongsheng Li:
Fast Convergence of DETR with Spatially Modulated Co-Attention. ICCV 2021: 3601-3610 - [c8]Lei Shi, Kai Shuang, Shijie Geng, Peng Gao, Zuohui Fu, Gerard de Melo, Yunpeng Chen, Sen Su:
Dense Contrastive Visual-Linguistic Pretraining. ACM Multimedia 2021: 5203-5212 - [c7]Peng Gao, Jiasen Lu, Hongsheng Li, Roozbeh Mottaghi, Aniruddha Kembhavi:
Container: Context Aggregation Networks. NeurIPS 2021: 19160-19171 - [c6]Mingyuan Mao, Peng Gao, Renrui Zhang, Honghui Zheng, Teli Ma, Yan Peng, Errui Ding, Baochang Zhang, Shumin Han:
Dual-stream Network for Visual Recognition. NeurIPS 2021: 25346-25358 - 2020
- [c5]Zhengkai Jiang, Yu Liu, Ceyuan Yang, Jihao Liu, Peng Gao, Qian Zhang, Shiming Xiang, Chunhong Pan:
Learning Where to Focus for Efficient Video Object Detection. ECCV (16) 2020: 18-34 - 2019
- [c4]Zhengkai Jiang, Peng Gao, Chaoxu Guo, Qian Zhang, Shiming Xiang, Chunhong Pan:
Video Object Detection with Locally-Weighted Deformable Neighbors. AAAI 2019: 8529-8536 - [c3]Peng Gao, Zhengkai Jiang, Haoxuan You, Pan Lu, Steven C. H. Hoi, Xiaogang Wang, Hongsheng Li:
Dynamic Fusion With Intra- and Inter-Modality Attention Flow for Visual Question Answering. CVPR 2019: 6639-6648 - [c2]Peng Gao, Haoxuan You, Zhanpeng Zhang, Xiaogang Wang, Hongsheng Li:
Multi-Modality Latent Interaction Network for Visual Question Answering. ICCV 2019: 5824-5834 - 2018
- [c1]Peng Gao, Hongsheng Li, Shuang Li, Pan Lu, Yikang Li, Steven C. H. Hoi, Xiaogang Wang:
Question-Guided Hybrid Convolution for Visual Question Answering. ECCV (1) 2018: 485-501
Informal and Other Publications
- 2024
- [i69]Peng Gao, Renrui Zhang, Chris Liu, Longtian Qiu, Siyuan Huang, Weifeng Lin, Shitian Zhao, Shijie Geng, Ziyi Lin, Peng Jin, Kaipeng Zhang, Wenqi Shao, Chao Xu, Conghui He, Junjun He, Hao Shao, Pan Lu, Hongsheng Li, Yu Qiao:
SPHINX-X: Scaling Data and Parameters for a Family of Multi-modal Large Language Models. CoRR abs/2402.05935 (2024) - [i68]Siyuan Huang, Iaroslav Ponomarenko, Zhengkai Jiang, Xiaoqi Li, Xiaobin Hu, Peng Gao, Hongsheng Li, Hao Dong:
ManipVQA: Injecting Robotic Affordance and Physically Grounded Information into Multi-Modal Large Language Models. CoRR abs/2403.11289 (2024) - [i67]Weifeng Lin, Xinyu Wei, Ruichuan An, Peng Gao, Bocheng Zou, Yulin Luo, Siyuan Huang, Shanghang Zhang, Hongsheng Li:
Draw-and-Understand: Leveraging Visual Prompts to Enable MLLMs to Comprehend What You Want. CoRR abs/2403.20271 (2024) - [i66]Xiangyang Zhu, Renrui Zhang, Bowei He, Ziyu Guo, Jiaming Liu, Han Xiao, Chaoyou Fu, Hao Dong, Peng Gao:
No Time to Train: Empowering Non-Parametric Networks for Few-shot 3D Scene Segmentation. CoRR abs/2404.04050 (2024) - [i65]Peng Gao, Le Zhuo, Dongyang Liu, Ruoyi Du, Xu Luo, Longtian Qiu, Yuhang Zhang, Chen Lin, Rongjie Huang, Shijie Geng, Renrui Zhang, Junlin Xi, Wenqi Shao, Zhengkai Jiang, Tianshuo Yang, Weicai Ye, He Tong, Jingwen He, Yu Qiao, Hongsheng Li:
Lumina-T2X: Transforming Text into Any Modality, Resolution, and Duration via Flow-based Large Diffusion Transformers. CoRR abs/2405.05945 (2024) - [i64]Xudong Lu, Aojun Zhou, Ziyi Lin, Qi Liu, Yuhui Xu, Renrui Zhang, Yafei Wen, Shuai Ren, Peng Gao, Junchi Yan, Hongsheng Li:
TerDiT: Ternary Diffusion Models with Transformers. CoRR abs/2405.14854 (2024) - [i63]Xudong Lu, Aojun Zhou, Yuhui Xu, Renrui Zhang, Peng Gao, Hongsheng Li:
SPP: Sparsity-Preserved Parameter-Efficient Fine-Tuning for Large Language Models. CoRR abs/2405.16057 (2024) - [i62]Fu-Yun Wang, Zhaoyang Huang, Alexander William Bergman, Dazhong Shen, Peng Gao, Michael Lingelbach, Keqiang Sun, Weikang Bian, Guanglu Song, Yu Liu, Hongsheng Li, Xiaogang Wang:
Phased Consistency Model. CoRR abs/2405.18407 (2024) - 2023
- [i61]Sheng Xu, Yanjing Li, Teli Ma, Mingbao Lin, Hao Dong, Baochang Zhang, Peng Gao, Jinhu Lv:
Resilient Binary Neural Network. CoRR abs/2302.00956 (2023) - [i60]Rongyao Fang, Peng Gao, Aojun Zhou, Yingjie Cai, Si Liu, Jifeng Dai, Hongsheng Li:
FeatAug-DETR: Enriching One-to-Many Matching for DETRs with Feature Augmentation. CoRR abs/2303.01503 (2023) - [i59]Renrui Zhang, Xiangfei Hu, Bohao Li, Siyuan Huang, Hanqiu Deng, Hongsheng Li, Yu Qiao, Peng Gao:
Prompt, Generate, then Cache: Cascade of Foundation Models makes Strong Few-shot Learners. CoRR abs/2303.02151 (2023) - [i58]Peng Gao, Renrui Zhang, Rongyao Fang, Ziyi Lin, Hongyang Li, Hongsheng Li, Qiao Yu:
Mimic before Reconstruct: Enhancing Masked Autoencoders with Feature Mimicking. CoRR abs/2303.05475 (2023) - [i57]Renrui Zhang, Liuhui Wang, Yali Wang, Peng Gao, Hongsheng Li, Jianbo Shi:
Parameter is Not All You Need: Starting from Non-Parametric Networks for 3D Point Cloud Analysis. CoRR abs/2303.08134 (2023) - [i56]Renrui Zhang, Jiaming Han, Aojun Zhou, Xiangfei Hu, Shilin Yan, Pan Lu, Hongsheng Li, Peng Gao, Yu Qiao:
LLaMA-Adapter: Efficient Fine-tuning of Language Models with Zero-init Attention. CoRR abs/2303.16199 (2023) - [i55]Sheng Xu, Yanjing Li, Mingbao Lin, Peng Gao, Guodong Guo, Jinhu Lu, Baochang Zhang:
Q-DETR: An Efficient Low-Bit Quantized Detection Transformer. CoRR abs/2304.00253 (2023) - [i54]Xiangyang Zhu, Renrui Zhang, Bowei He, Aojun Zhou, Dong Wang, Bin Zhao, Peng Gao:
Not All Features Matter: Enhancing Few-shot CLIP with Adaptive Prior Refinement. CoRR abs/2304.01195 (2023) - [i53]Peng Gao, Jiaming Han, Renrui Zhang, Ziyi Lin, Shijie Geng, Aojun Zhou, Wei Zhang, Pan Lu, Conghui He, Xiangyu Yue, Hongsheng Li, Yu Qiao:
LLaMA-Adapter V2: Parameter-Efficient Visual Instruction Model. CoRR abs/2304.15010 (2023) - [i52]Renrui Zhang, Zhengkai Jiang, Ziyu Guo, Shilin Yan, Junting Pan, Hao Dong, Peng Gao, Hongsheng Li:
Personalize Segment Anything Model with One Shot. CoRR abs/2305.03048 (2023) - [i51]Siyuan Huang, Bo Zhang, Botian Shi, Peng Gao, Yikang Li, Hongsheng Li:
SUG: Single-dataset Unified Generalization for 3D Point Cloud Classification. CoRR abs/2305.09160 (2023) - [i50]Siyuan Huang, Zhengkai Jiang, Hao Dong, Yu Qiao, Peng Gao, Hongsheng Li:
Instruct2Act: Mapping Multi-modality Instructions to Robotic Actions with Large Language Model. CoRR abs/2305.11176 (2023) - [i49]Shilin Yan, Renrui Zhang, Ziyu Guo, Wenchao Chen, Wei Zhang, Hongyang Li, Yu Qiao, Zhongjiang He, Peng Gao:
Referred by Multi-Modality: A Unified Temporal Transformer for Video Object Segmentation. CoRR abs/2305.16318 (2023) - [i48]Peng Xu, Wenqi Shao, Kaipeng Zhang, Peng Gao, Shuo Liu, Meng Lei, Fanqing Meng, Siyuan Huang, Yu Qiao, Ping Luo:
LVLM-eHub: A Comprehensive Evaluation Benchmark for Large Vision-Language Models. CoRR abs/2306.09265 (2023) - [i47]Wenqi Shao, Yutao Hu, Peng Gao, Meng Lei, Kaipeng Zhang, Fanqing Meng, Peng Xu, Siyuan Huang, Hongsheng Li, Yu Qiao, Ping Luo:
Tiny LVLM-eHub: Early Multimodal Experiments with Bard. CoRR abs/2308.03729 (2023) - [i46]Xiangyang Zhu, Renrui Zhang, Bowei He, Ziyu Guo, Jiaming Liu, Hao Dong, Peng Gao:
Less is More: Towards Efficient Few-shot 3D Semantic Segmentation via Training-free Networks. CoRR abs/2308.12961 (2023) - [i45]Wenqi Shao, Mengzhao Chen, Zhaoyang Zhang, Peng Xu, Lirui Zhao, Zhiqian Li, Kaipeng Zhang, Peng Gao, Yu Qiao, Ping Luo:
OmniQuant: Omnidirectionally Calibrated Quantization for Large Language Models. CoRR abs/2308.13137 (2023) - [i44]Ziyu Guo, Renrui Zhang, Xiangyang Zhu, Yiwen Tang, Xianzheng Ma, Jiaming Han, Kexin Chen, Peng Gao, Xianzhi Li, Hongsheng Li, Pheng-Ann Heng:
Point-Bind & Point-LLM: Aligning Point Cloud with Multi-modality for 3D Understanding, Generation, and Instruction Following. CoRR abs/2309.00615 (2023) - [i43]Ziyi Lin, Chris Liu, Renrui Zhang, Peng Gao, Longtian Qiu, Han Xiao, Han Qiu, Chen Lin, Wenqi Shao, Keqin Chen, Jiaming Han, Siyuan Huang, Yichi Zhang, Xuming He, Hongsheng Li, Yu Qiao:
SPHINX: The Joint Mixing of Weights, Tasks, and Visual Embeddings for Multi-modal Large Language Models. CoRR abs/2311.07575 (2023) - [i42]Xiaowei Chi, Yijiang Liu, Zhengkai Jiang, Rongyu Zhang, Ziyi Lin, Renrui Zhang, Peng Gao, Chaoyou Fu, Shanghang Zhang, Qifeng Liu, Yike Guo:
ChatIllusion: Efficient-Aligning Interleaved Generation ability with Visual Instruction Model. CoRR abs/2311.17963 (2023) - [i41]Chaoyou Fu, Renrui Zhang, Zihan Wang, Yubo Huang, Zhengye Zhang, Longtian Qiu, Gaoxiang Ye, Yunhang Shen, Mengdan Zhang, Peixian Chen, Sirui Zhao, Shaohui Lin, Deqiang Jiang, Di Yin, Peng Gao, Ke Li, Hongsheng Li, Xing Sun:
A Challenger to GPT-4V? Early Explorations of Gemini in Visual Expertise. CoRR abs/2312.12436 (2023) - 2022
- [i40]Ziteng Cui, Yingying Zhu, Lin Gu, Guo-Jun Qi, Xiaoxiao Li, Peng Gao, Zenghui Zhang, Tatsuya Harada:
RestoreDet: Degradation Equivariant Representation for Object Detection in Low Resolution Images. CoRR abs/2201.02314 (2022) - [i39]Kunchang Li, Yali Wang, Peng Gao, Guanglu Song, Yu Liu, Hongsheng Li, Yu Qiao:
UniFormer: Unified Transformer for Efficient Spatiotemporal Representation Learning. CoRR abs/2201.04676 (2022) - [i38]Sheng Xu, Yanjing Li, Teli Ma, Bohan Zeng, Baochang Zhang, Peng Gao, Jinhu Lv:
TerViT: An Efficient Ternary Vision Transformer. CoRR abs/2201.08050 (2022) - [i37]Kunchang Li, Yali Wang, Junhao Zhang, Peng Gao, Guanglu Song, Yu Liu, Hongsheng Li, Yu Qiao:
UniFormer: Unifying Convolution and Self-attention for Visual Recognition. CoRR abs/2201.09450 (2022) - [i36]Kexue Fu, Peng Gao, Renrui Zhang, Hongsheng Li, Yu Qiao, Manning Wang:
Distillation with Contrast is All You Need for Self-Supervised Point Cloud Representation Learning. CoRR abs/2202.04241 (2022) - [i35]Renrui Zhang, Han Qiu, Tai Wang, Xuanzhuo Xu, Ziyu Guo, Yu Qiao, Peng Gao, Hongsheng Li:
MonoDETR: Depth-aware Transformer for Monocular 3D Object Detection. CoRR abs/2203.13310 (2022) - [i34]Kexue Fu, Peng Gao, Shaolei Liu, Renrui Zhang, Yu Qiao, Manning Wang:
POS-BERT: Point Cloud One-Stage BERT Pre-Training. CoRR abs/2204.00989 (2022) - [i33]Peng Gao, Teli Ma, Hongsheng Li, Ziyi Lin, Jifeng Dai, Yu Qiao:
ConvMAE: Masked Convolution Meets Masked Autoencoders. CoRR abs/2205.03892 (2022) - [i32]Renrui Zhang, Ziyu Guo, Peng Gao, Rongyao Fang, Bin Zhao, Dong Wang, Yu Qiao, Hongsheng Li:
Point-M2AE: Multi-scale Masked Autoencoders for Hierarchical Point Cloud Pre-training. CoRR abs/2205.14401 (2022) - [i31]Ziteng Cui, Kunchang Li, Lin Gu, Shenghan Su, Peng Gao, Zhengkai Jiang, Yu Qiao, Tatsuya Harada:
Illumination Adaptive Transformer. CoRR abs/2205.14871 (2022) - [i30]Tong Zhang, Peng Gao, Hao Dong, Yin Zhuang, Guanqun Wang, Wei Zhang, He Chen:
Consecutive Pretraining: A Knowledge Transfer Learning Strategy with Relevant Unlabeled Data for Remote Sensing Domain. CoRR abs/2207.03860 (2022) - [i29]Zhengkai Jiang, Yuxi Li, Ceyuan Yang, Peng Gao, Yabiao Wang, Ying Tai, Chengjie Wang:
Prototypical Contrast Adaptation for Domain Adaptive Semantic Segmentation. CoRR abs/2207.06654 (2022) - [i28]Renrui Zhang, Zhang Wei, Rongyao Fang, Peng Gao, Kunchang Li, Jifeng Dai, Yu Qiao, Hongsheng Li:
Tip-Adapter: Training-free Adaption of CLIP for Few-shot Classification. CoRR abs/2207.09519 (2022) - [i27]Ziyi Lin, Shijie Geng, Renrui Zhang, Peng Gao, Gerard de Melo, Xiaogang Wang, Jifeng Dai, Yu Qiao, Hongsheng Li:
Frozen CLIP Models are Efficient Video Learners. CoRR abs/2208.03550 (2022) - [i26]Sheng Xu, Yanjing Li, Tiancheng Wang, Teli Ma, Baochang Zhang, Peng Gao, Yu Qiao, Jinhu Lv, Guodong Guo:
Recurrent Bilinear Optimization for Binary Neural Networks. CoRR abs/2209.01542 (2022) - [i25]Renrui Zhang, Hanqiu Deng, Bohao Li, Wei Zhang, Hao Dong, Hongsheng Li, Peng Gao, Yu Qiao:
Collaboration of Pre-trained Models Makes Better Few-shot Learner. CoRR abs/2209.12255 (2022) - [i24]Sheng Xu, Yanjing Li, Bohan Zeng, Teli Ma, Baochang Zhang, Xianbin Cao, Peng Gao, Jinhu Lv:
IDa-Det: An Information Discrepancy-aware Distillation for 1-bit Detectors. CoRR abs/2210.03477 (2022) - [i23]Yanjing Li, Sheng Xu, Baochang Zhang, Xianbin Cao, Peng Gao, Guodong Guo:
Q-ViT: Accurate and Fully Quantized Low-bit Vision Transformer. CoRR abs/2210.06707 (2022) - [i22]Hongwei Xue, Peng Gao, Hongyang Li, Yu Qiao, Hao Sun, Houqiang Li, Jiebo Luo:
Stare at What You See: Masked Image Modeling without Reconstruction. CoRR abs/2211.08887 (2022) - [i21]Xiangyang Zhu, Renrui Zhang, Bowei He, Ziyao Zeng, Shanghang Zhang, Peng Gao:
PointCLIP V2: Adapting CLIP for Powerful 3D Open-world Learning. CoRR abs/2211.11682 (2022) - [i20]Renrui Zhang, Liuhui Wang, Yu Qiao, Peng Gao, Hongsheng Li:
Learning 3D Representations from 2D Pre-trained Models via Image-to-Point Masked Autoencoders. CoRR abs/2212.06785 (2022) - 2021
- [i19]Peng Gao, Minghang Zheng, Xiaogang Wang, Jifeng Dai, Hongsheng Li:
Fast Convergence of DETR with Spatially Modulated Co-Attention. CoRR abs/2101.07448 (2021) - [i18]Shijie Geng, Peng Gao, Zuohui Fu, Yongfeng Zhang:
RomeBERT: Robust Training of Multi-Exit BERT. CoRR abs/2101.09755 (2021) - [i17]Mingyuan Mao, Renrui Zhang, Honghui Zheng, Peng Gao, Teli Ma, Yan Peng, Errui Ding, Shumin Han:
Dual-stream Network for Visual Recognition. CoRR abs/2105.14734 (2021) - [i16]Peng Gao, Jiasen Lu, Hongsheng Li, Roozbeh Mottaghi, Aniruddha Kembhavi:
Container: Context Aggregation Network. CoRR abs/2106.01401 (2021) - [i15]Peng Gao, Shijie Geng, Yu Qiao, Xiaogang Wang, Jifeng Dai, Hongsheng Li:
Scalable Transformers for Neural Machine Translation. CoRR abs/2106.02242 (2021) - [i14]Teli Ma, Mingyuan Mao, Honghui Zheng, Peng Gao, Xiaodi Wang, Shumin Han, Errui Ding, Baochang Zhang, David S. Doermann:
Oriented Object Detection with Transformer. CoRR abs/2106.03146 (2021) - [i13]Peng Gao, Minghang Zheng, Xiaogang Wang, Jifeng Dai, Hongsheng Li:
Fast Convergence of DETR with Spatially Modulated Co-Attention. CoRR abs/2108.02404 (2021) - [i12]Lei Shi, Kai Shuang, Shijie Geng, Peng Gao, Zuohui Fu, Gerard de Melo, Yunpeng Chen, Sen Su:
Dense Contrastive Visual-Linguistic Pretraining. CoRR abs/2109.11778 (2021) - [i11]Peng Gao, Shijie Geng, Renrui Zhang, Teli Ma, Rongyao Fang, Yongfeng Zhang, Hongsheng Li, Yu Qiao:
CLIP-Adapter: Better Vision-Language Models with Feature Adapters. CoRR abs/2110.04544 (2021) - [i10]Renrui Zhang, Rongyao Fang, Wei Zhang, Peng Gao, Kunchang Li, Jifeng Dai, Yu Qiao, Hongsheng Li:
Tip-Adapter: Training-free CLIP-Adapter for Better Vision-Language Modeling. CoRR abs/2111.03930 (2021) - [i9]Teli Ma, Shijie Geng, Mengmeng Wang, Jing Shao, Jiasen Lu, Hongsheng Li, Peng Gao, Yu Qiao:
A Simple Long-Tailed Recognition Baseline via Vision-Language Model. CoRR abs/2111.14745 (2021) - [i8]Renrui Zhang, Ziyu Guo, Wei Zhang, Kunchang Li, Xupeng Miao, Bin Cui, Yu Qiao, Peng Gao, Hongsheng Li:
PointCLIP: Point Cloud Understanding by CLIP. CoRR abs/2112.02413 (2021) - 2020
- [i7]Shijie Geng, Ji Zhang, Zuohui Fu, Peng Gao, Hang Zhang, Gerard de Melo:
Character Matters: Video Story Understanding with Character-Aware Relations. CoRR abs/2005.08646 (2020) - [i6]Peng Su, Shixiang Tang, Peng Gao, Di Qiu, Ni Zhao, Xiaogang Wang:
Gradient Regularized Contrastive Learning for Continual Domain Adaptation. CoRR abs/2007.12942 (2020) - [i5]Lei Shi, Kai Shuang, Shijie Geng, Peng Su, Zhengkai Jiang, Peng Gao, Zuohui Fu, Gerard de Melo, Sen Su:
Contrastive Visual-Linguistic Pretraining. CoRR abs/2007.13135 (2020) - [i4]Minghang Zheng, Peng Gao, Xiaogang Wang, Hongsheng Li, Hao Dong:
End-to-End Object Detection with Adaptive Clustering Transformer. CoRR abs/2011.09315 (2020) - 2019
- [i3]Peng Gao, Haoxuan You, Zhanpeng Zhang, Xiaogang Wang, Hongsheng Li:
Multi-modality Latent Interaction Network for Visual Question Answering. CoRR abs/1908.04289 (2019) - 2018
- [i2]Peng Gao, Pan Lu, Hongsheng Li, Shuang Li, Yikang Li, Steven C. H. Hoi, Xiaogang Wang:
Question-Guided Hybrid Convolution for Visual Question Answering. CoRR abs/1808.02632 (2018) - [i1]Peng Gao, Hongsheng Li, Haoxuan You, Zhengkai Jiang, Pan Lu, Steven C. H. Hoi, Xiaogang Wang:
Dynamic Fusion with Intra- and Inter- Modality Attention Flow for Visual Question Answering. CoRR abs/1812.05252 (2018)
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-07-19 00:25 CEST by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint