default search action
Peng Gao 0007
Person information
- affiliation: Shanghai Artificial Intelligence Laboratory, China
- affiliation (PhD 2021): Chinese University of Hong Kong, Hong Kong
Other persons with the same name
- Peng Gao — disambiguation page
- Peng Gao 0001 — China Mobile Group Design Institute Co., Ltd, Division of Research, China
- Peng Gao 0002 — University of South Carolina, Department of Geography, Columbia, SC, USA
- Peng Gao 0003 — University at Buffalo, Department of Geography, NY, USA
- Peng Gao 0004 — Jilin University, Institute of Mathematics, Changchun, China
- Peng Gao 0005 — Qufu Normal University, School of Cyber Science and Engineering, China (and 1 more)
- Peng Gao 0006 — Berlin Institute of Technology, Germany
- Peng Gao 0008 — Virginia Tech, Department of Computer Science, Blacksburg, VA, USA (and 2 more)
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
showing all ?? records
2020 – today
- 2024
- [j13]Kexue Fu, Peng Gao, Shaolei Liu, Linhao Qu, Longxiang Gao, Manning Wang:
POS-BERT: Point cloud one-stage BERT pre-training. Expert Syst. Appl. 240: 122563 (2024) - [j12]Peng Gao, Shijie Geng, Renrui Zhang, Teli Ma, Rongyao Fang, Yongfeng Zhang, Hongsheng Li, Yu Qiao:
CLIP-Adapter: Better Vision-Language Models with Feature Adapters. Int. J. Comput. Vis. 132(2): 581-595 (2024) - [j11]Peng Gao, Ziyi Lin, Renrui Zhang, Rongyao Fang, Hongyang Li, Hongsheng Li, Yu Qiao:
Mimic before Reconstruct: Enhancing Masked Autoencoders with Feature Mimicking. Int. J. Comput. Vis. 132(5): 1546-1556 (2024) - [j10]Rongyao Fang, Peng Gao, Aojun Zhou, Yingjie Cai, Si Liu, Jifeng Dai, Hongsheng Li:
FeatAug-DETR: Enriching One-to-Many Matching for DETRs With Feature Augmentation. IEEE Trans. Pattern Anal. Mach. Intell. 46(9): 6402-6415 (2024) - [c70]Shilin Yan, Renrui Zhang, Ziyu Guo, Wenchao Chen, Wei Zhang, Hongyang Li, Yu Qiao, Hao Dong, Zhongjiang He, Peng Gao:
Referred by Multi-Modality: A Unified Temporal Transformer for Video Object Segmentation. AAAI 2024: 6449-6457 - [c69]Fanqing Meng, Wenqi Shao, Quanfeng Lu, Peng Gao, Kaipeng Zhang, Yu Qiao, Ping Luo:
ChartAssistant: A Universal Chart Multimodal Language Model via Chart-to-Table Pre-training and Multitask Instruction Tuning. ACL (Findings) 2024: 7775-7803 - [c68]Zhongang Cai, Jianping Jiang, Zhongfei Qing, Xinying Guo, Mingyuan Zhang, Zhengyu Lin, Haiyi Mei, Chen Wei, Ruisi Wang, Wanqi Yin, Liang Pan, Xiangyu Fan, Han Du, Peng Gao, Zhitao Yang, Yang Gao, Jiaqi Li, Tianxiang Ren, Yukun Wei, Xiaogang Wang, Chen Change Loy, Lei Yang, Ziwei Liu:
Digital Life Project: Autonomous 3D Characters with Social Intelligence. CVPR 2024: 582-592 - [c67]Xiangyang Zhu, Renrui Zhang, Bowei He, Ziyu Guo, Jiaming Liu, Han Xiao, Chaoyou Fu, Hao Dong, Peng Gao:
No Time to Train: Empowering Non-Parametric Networks for Few-Shot 3D Scene Segmentation. CVPR 2024: 3838-3847 - [c66]Wenqi Shao, Mengzhao Chen, Zhaoyang Zhang, Peng Xu, Lirui Zhao, Zhiqian Li, Kaipeng Zhang, Peng Gao, Yu Qiao, Ping Luo:
OmniQuant: Omnidirectionally Calibrated Quantization for Large Language Models. ICLR 2024 - [c65]Peng Xu, Wenqi Shao, Mengzhao Chen, Shitao Tang, Kaipeng Zhang, Peng Gao, Fengwei An, Yu Qiao, Ping Luo:
BESA: Pruning Large Language Models with Blockwise Parameter-Efficient Sparsity Allocation. ICLR 2024 - [c64]Renrui Zhang, Zhengkai Jiang, Ziyu Guo, Shilin Yan, Junting Pan, Hao Dong, Yu Qiao, Peng Gao, Hongsheng Li:
Personalize Segment Anything Model with One Shot. ICLR 2024 - [c63]Renrui Zhang, Jiaming Han, Chris Liu, Aojun Zhou, Pan Lu, Yu Qiao, Hongsheng Li, Peng Gao:
LLaMA-Adapter: Efficient Fine-tuning of Large Language Models with Zero-initialized Attention. ICLR 2024 - [c62]Zehan Wang, Ziang Zhang, Xize Cheng, Rongjie Huang, Luping Liu, Zhenhui Ye, Haifeng Huang, Yang Zhao, Tao Jin, Peng Gao, Zhou Zhao:
FreeBind: Free Lunch in Unified Multimodal Space via Knowledge Fusion. ICML 2024 - [c61]Rongjie Huang, Ruofan Hu, Yongqi Wang, Zehan Wang, Xize Cheng, Ziyue Jiang, Zhenhui Ye, Dongchao Yang, Luping Liu, Peng Gao, Zhou Zhao:
InstructSpeech: Following Speech Editing Instructions via Large Language Models. ICML 2024 - [c60]Dongyang Liu, Renrui Zhang, Longtian Qiu, Siyuan Huang, Weifeng Lin, Shitian Zhao, Shijie Geng, Ziyi Lin, Peng Jin, Kaipeng Zhang, Wenqi Shao, Chao Xu, Conghui He, Junjun He, Hao Shao, Pan Lu, Yu Qiao, Hongsheng Li, Peng Gao:
SPHINX-X: Scaling Data and Parameters for a Family of Multi-modal Large Language Models. ICML 2024 - [c59]Xudong Lu, Aojun Zhou, Yuhui Xu, Renrui Zhang, Peng Gao, Hongsheng Li:
SPP: Sparsity-Preserved Parameter-Efficient Fine-Tuning for Large Language Models. ICML 2024 - [c58]Kaining Ying, Fanqing Meng, Jin Wang, Zhiqian Li, Han Lin, Yue Yang, Hao Zhang, Wenbo Zhang, Yuqi Lin, Shuo Liu, Jiayi Lei, Quanfeng Lu, Runjian Chen, Peng Xu, Renrui Zhang, Haozhe Zhang, Peng Gao, Yali Wang, Yu Qiao, Ping Luo, Kaipeng Zhang, Wenqi Shao:
MMT-Bench: A Comprehensive Multimodal Benchmark for Evaluating Large Vision-Language Models Towards Multitask AGI. ICML 2024 - [c57]Wenzhe Cai, Siyuan Huang, Guangran Cheng, Yuxing Long, Peng Gao, Changyin Sun, Hao Dong:
Bridging Zero-shot Object Navigation and Foundation Models through Pixel-Guided Navigation Skill. ICRA 2024: 5228-5234 - [i105]Fanqing Meng, Wenqi Shao, Quanfeng Lu, Peng Gao, Kaipeng Zhang, Yu Qiao, Ping Luo:
ChartAssisstant: A Universal Chart Multimodal Language Model via Chart-to-Table Pre-training and Multitask Instruction Tuning. CoRR abs/2401.02384 (2024) - [i104]Dingning Liu, Xiaoshui Huang, Yuenan Hou, Zhihui Wang, Zhenfei Yin, Yongshun Gong, Peng Gao, Wanli Ouyang:
Uni3D-LLM: Unifying Point Cloud Perception, Generation and Editing with Large Language Models. CoRR abs/2402.03327 (2024) - [i103]Peng Gao, Renrui Zhang, Chris Liu, Longtian Qiu, Siyuan Huang, Weifeng Lin, Shitian Zhao, Shijie Geng, Ziyi Lin, Peng Jin, Kaipeng Zhang, Wenqi Shao, Chao Xu, Conghui He, Junjun He, Hao Shao, Pan Lu, Hongsheng Li, Yu Qiao:
SPHINX-X: Scaling Data and Parameters for a Family of Multi-modal Large Language Models. CoRR abs/2402.05935 (2024) - [i102]Peng Gao, Xiao Liu, Yu Wang, Ru-Yue Yuan:
Searching a Lightweight Network Architecture for Thermal Infrared Pedestrian Tracking. CoRR abs/2402.16570 (2024) - [i101]Peng Xu, Wenqi Shao, Mengzhao Chen, Shitao Tang, Kaipeng Zhang, Peng Gao, Fengwei An, Yu Qiao, Ping Luo:
BESA: Pruning Large Language Models with Blockwise Parameter-Efficient Sparsity Allocation. CoRR abs/2402.16880 (2024) - [i100]Siyuan Huang, Iaroslav Ponomarenko, Zhengkai Jiang, Xiaoqi Li, Xiaobin Hu, Peng Gao, Hongsheng Li, Hao Dong:
ManipVQA: Injecting Robotic Affordance and Physically Grounded Information into Multi-Modal Large Language Models. CoRR abs/2403.11289 (2024) - [i99]Renrui Zhang, Dongzhi Jiang, Yichi Zhang, Haokun Lin, Ziyu Guo, Pengshuo Qiu, Aojun Zhou, Pan Lu, Kai-Wei Chang, Peng Gao, Hongsheng Li:
MathVerse: Does Your Multi-modal LLM Truly See the Diagrams in Visual Math Problems? CoRR abs/2403.14624 (2024) - [i98]Weifeng Lin, Xinyu Wei, Ruichuan An, Peng Gao, Bocheng Zou, Yulin Luo, Siyuan Huang, Shanghang Zhang, Hongsheng Li:
Draw-and-Understand: Leveraging Visual Prompts to Enable MLLMs to Comprehend What You Want. CoRR abs/2403.20271 (2024) - [i97]Xiangyang Zhu, Renrui Zhang, Bowei He, Ziyu Guo, Jiaming Liu, Han Xiao, Chaoyou Fu, Hao Dong, Peng Gao:
No Time to Train: Empowering Non-Parametric Networks for Few-shot 3D Scene Segmentation. CoRR abs/2404.04050 (2024) - [i96]Kaining Ying, Fanqing Meng, Jin Wang, Zhiqian Li, Han Lin, Yue Yang, Hao Zhang, Wenbo Zhang, Yuqi Lin, Shuo Liu, Jiayi Lei, Quanfeng Lu, Runjian Chen, Peng Xu, Renrui Zhang, Haozhe Zhang, Peng Gao, Yali Wang, Yu Qiao, Ping Luo, Kaipeng Zhang, Wenqi Shao:
MMT-Bench: A Comprehensive Multimodal Benchmark for Evaluating Large Vision-Language Models Towards Multitask AGI. CoRR abs/2404.16006 (2024) - [i95]Zehan Wang, Ziang Zhang, Xize Cheng, Rongjie Huang, Luping Liu, Zhenhui Ye, Haifeng Huang, Yang Zhao, Tao Jin, Peng Gao, Zhou Zhao:
FreeBind: Free Lunch in Unified Multimodal Space via Knowledge Fusion. CoRR abs/2405.04883 (2024) - [i94]Peng Gao, Le Zhuo, Dongyang Liu, Ruoyi Du, Xu Luo, Longtian Qiu, Yuhang Zhang, Chen Lin, Rongjie Huang, Shijie Geng, Renrui Zhang, Junlin Xi, Wenqi Shao, Zhengkai Jiang, Tianshuo Yang, Weicai Ye, He Tong, Jingwen He, Yu Qiao, Hongsheng Li:
Lumina-T2X: Transforming Text into Any Modality, Resolution, and Duration via Flow-based Large Diffusion Transformers. CoRR abs/2405.05945 (2024) - [i93]Xudong Lu, Aojun Zhou, Ziyi Lin, Qi Liu, Yuhui Xu, Renrui Zhang, Yafei Wen, Shuai Ren, Peng Gao, Junchi Yan, Hongsheng Li:
TerDiT: Ternary Diffusion Models with Transformers. CoRR abs/2405.14854 (2024) - [i92]Xudong Lu, Aojun Zhou, Yuhui Xu, Renrui Zhang, Peng Gao, Hongsheng Li:
SPP: Sparsity-Preserved Parameter-Efficient Fine-Tuning for Large Language Models. CoRR abs/2405.16057 (2024) - [i91]Fu-Yun Wang, Zhaoyang Huang, Alexander William Bergman, Dazhong Shen, Peng Gao, Michael Lingelbach, Keqiang Sun, Weikang Bian, Guanglu Song, Yu Liu, Hongsheng Li, Xiaogang Wang:
Phased Consistency Model. CoRR abs/2405.18407 (2024) - [i90]Siyuan Huang, Haonan Chang, Yuhan Liu, Yimeng Zhu, Hao Dong, Peng Gao, Abdeslam Boularias, Hongsheng Li:
A3VLM: Actionable Articulation-Aware Vision Language Model. CoRR abs/2406.07549 (2024) - [i89]Le Zhuo, Ruoyi Du, Han Xiao, Yangguang Li, Dongyang Liu, Rongjie Huang, Wenze Liu, Lirui Zhao, Fu-Yun Wang, Zhanyu Ma, Xu Luo, Zehan Wang, Kaipeng Zhang, Xiangyang Zhu, Si Liu, Xiangyu Yue, Dingning Liu, Wanli Ouyang, Ziwei Liu, Yu Qiao, Hongsheng Li, Peng Gao:
Lumina-Next: Making Lumina-T2X Stronger and Faster with Next-DiT. CoRR abs/2406.18583 (2024) - [i88]Jingwen He, Tianfan Xue, Dongyang Liu, Xinqi Lin, Peng Gao, Dahua Lin, Yu Qiao, Wanli Ouyang, Ziwei Liu:
VEnhancer: Generative Space-Time Enhancement for Video Generation. CoRR abs/2407.07667 (2024) - [i87]Renrui Zhang, Xinyu Wei, Dongzhi Jiang, Yichi Zhang, Ziyu Guo, Chengzhuo Tong, Jiaming Liu, Aojun Zhou, Bin Wei, Shanghang Zhang, Peng Gao, Hongsheng Li:
MAVIS: Mathematical Visual Instruction Tuning. CoRR abs/2407.08739 (2024) - [i86]Mengzhao Chen, Wenqi Shao, Peng Xu, Jiahao Wang, Peng Gao, Kaipeng Zhang, Yu Qiao, Ping Luo:
EfficientQAT: Efficient Quantization-Aware Training for Large Language Models. CoRR abs/2407.11062 (2024) - [i85]Dongyang Liu, Shitian Zhao, Le Zhuo, Weifeng Lin, Yu Qiao, Hongsheng Li, Peng Gao:
Lumina-mGPT: Illuminate Flexible Photorealistic Text-to-Image Generation with Multimodal Generative Pretraining. CoRR abs/2408.02657 (2024) - [i84]Ziyu Guo, Renrui Zhang, Xiangyang Zhu, Chengzhuo Tong, Peng Gao, Chunyuan Li, Pheng-Ann Heng:
SAM2Point: Segment Any 3D as Videos in Zero-shot and Promptable Manners. CoRR abs/2408.16768 (2024) - 2023
- [j9]Ruiqiang Lu, Jun Wang, Pengyong Li, Yuquan Li, Shuoyan Tan, Yiting Pan, Huanxiang Liu, Peng Gao, Guotong Xie, Xiaojun Yao:
Improving drug-target affinity prediction via feature fusion and knowledge distillation. Briefings Bioinform. 24(3) (2023) - [j8]Kunchang Li, Yali Wang, Junhao Zhang, Peng Gao, Guanglu Song, Yu Liu, Hongsheng Li, Yu Qiao:
UniFormer: Unifying Convolution and Self-Attention for Visual Recognition. IEEE Trans. Pattern Anal. Mach. Intell. 45(10): 12581-12600 (2023) - [j7]Weicong Su, Yali Wang, Kunchang Li, Peng Gao, Yu Qiao:
Hybrid token transformer for deep face recognition. Pattern Recognit. 139: 109443 (2023) - [j6]Guanqun Wang, He Chen, Liang Chen, Yin Zhuang, Shanghang Zhang, Tong Zhang, Hao Dong, Peng Gao:
P2FEViT: Plug-and-Play CNN Feature Embedded Hybrid Vision Transformer for Remote Sensing Image Classification. Remote. Sens. 15(7): 1773 (2023) - [j5]Tong Zhang, Yin Zhuang, He Chen, Liang Chen, Guanqun Wang, Peng Gao, Hao Dong:
Object-Centric Masked Image Modeling-Based Self-Supervised Pretraining for Remote Sensing Object Detection. IEEE J. Sel. Top. Appl. Earth Obs. Remote. Sens. 16: 5013-5025 (2023) - [c56]Sheng Xu, Yanjing Li, Teli Ma, Mingbao Lin, Hao Dong, Baochang Zhang, Peng Gao, Jinhu Lu:
Resilient Binary Neural Network. AAAI 2023: 10620-10628 - [c55]Sheng Xu, Yanjing Li, Mingbao Lin, Peng Gao, Guodong Guo, Jinhu Lü, Baochang Zhang:
Q-DETR: An Efficient Low-Bit Quantized Detection Transformer. CVPR 2023: 3842-3851 - [c54]Renrui Zhang, Liuhui Wang, Yali Wang, Peng Gao, Hongsheng Li, Jianbo Shi:
Starting from Non-Parametric Networks for 3D Point Cloud Analysis. CVPR 2023: 5344-5353 - [c53]Renrui Zhang, Xiangfei Hu, Bohao Li, Siyuan Huang, Hanqiu Deng, Yu Qiao, Peng Gao, Hongsheng Li:
Prompt, Generate, Then Cache: Cascade of Foundation Models Makes Strong Few-Shot Learners. CVPR 2023: 15211-15222 - [c52]Renrui Zhang, Liuhui Wang, Yu Qiao, Peng Gao, Hongsheng Li:
Learning 3D Representations from 2D Pre-Trained Models via Image-to-Point Masked Autoencoders. CVPR 2023: 21769-21780 - [c51]Hongwei Xue, Peng Gao, Hongyang Li, Yu Qiao, Hao Sun, Houqiang Li, Jiebo Luo:
Stare at What You See: Masked Image Modeling without Reconstruction. CVPR 2023: 22732-22741 - [c50]Xiaorui Wang, Jun Wang, Xin Tang, Peng Gao, Rui Fang, Guotong Xie:
Filter Pruning Via Filters Similarity in Consecutive Layers. ICASSP 2023: 1-5 - [c49]Xiangyang Zhu, Renrui Zhang, Bowei He, Aojun Zhou, Dong Wang, Bin Zhao, Peng Gao:
Not All Features Matter: Enhancing Few-shot CLIP with Adaptive Prior Refinement. ICCV 2023: 2605-2615 - [c48]Xiangyang Zhu, Renrui Zhang, Bowei He, Ziyu Guo, Ziyao Zeng, Zipeng Qin, Shanghang Zhang, Peng Gao:
PointCLIP V2: Prompting CLIP and GPT for Powerful 3D Open-world Learning. ICCV 2023: 2639-2650 - [c47]Renrui Zhang, Han Qiu, Tai Wang, Ziyu Guo, Ziteng Cui, Yu Qiao, Hongsheng Li, Peng Gao:
MonoDETR: Depth-guided Transformer for Monocular 3D Object Detection. ICCV 2023: 9121-9132 - [c46]Aojun Zhou, Yang Li, Zipeng Qin, Jianbo Liu, Junting Pan, Renrui Zhang, Rui Zhao, Peng Gao, Hongsheng Li:
SparseMAE: Sparse Training Meets Masked Autoencoders. ICCV 2023: 16130-16140 - [c45]Yongjing Cui, Yin Zhuang, Shan Dong, Xinyi Zhang, Peng Gao, He Chen, Liang Chen:
Hybrid Transformer Network for Change Detection Under Self-Supervised Pretraining. IGARSS 2023: 6652-6655 - [c44]Siyuan Huang, Bo Zhang, Botian Shi, Hongsheng Li, Yikang Li, Peng Gao:
SUG: Single-dataset Unified Generalization for 3D Point Cloud Classification. ACM Multimedia 2023: 8644-8652 - [i83]Sheng Xu, Yanjing Li, Teli Ma, Mingbao Lin, Hao Dong, Baochang Zhang, Peng Gao, Jinhu Lv:
Resilient Binary Neural Network. CoRR abs/2302.00956 (2023) - [i82]Rongyao Fang, Peng Gao, Aojun Zhou, Yingjie Cai, Si Liu, Jifeng Dai, Hongsheng Li:
FeatAug-DETR: Enriching One-to-Many Matching for DETRs with Feature Augmentation. CoRR abs/2303.01503 (2023) - [i81]Renrui Zhang, Xiangfei Hu, Bohao Li, Siyuan Huang, Hanqiu Deng, Hongsheng Li, Yu Qiao, Peng Gao:
Prompt, Generate, then Cache: Cascade of Foundation Models makes Strong Few-shot Learners. CoRR abs/2303.02151 (2023) - [i80]Peng Gao, Renrui Zhang, Rongyao Fang, Ziyi Lin, Hongyang Li, Hongsheng Li, Qiao Yu:
Mimic before Reconstruct: Enhancing Masked Autoencoders with Feature Mimicking. CoRR abs/2303.05475 (2023) - [i79]Renrui Zhang, Liuhui Wang, Yali Wang, Peng Gao, Hongsheng Li, Jianbo Shi:
Parameter is Not All You Need: Starting from Non-Parametric Networks for 3D Point Cloud Analysis. CoRR abs/2303.08134 (2023) - [i78]Renrui Zhang, Jiaming Han, Aojun Zhou, Xiangfei Hu, Shilin Yan, Pan Lu, Hongsheng Li, Peng Gao, Yu Qiao:
LLaMA-Adapter: Efficient Fine-tuning of Language Models with Zero-init Attention. CoRR abs/2303.16199 (2023) - [i77]Sheng Xu, Yanjing Li, Mingbao Lin, Peng Gao, Guodong Guo, Jinhu Lu, Baochang Zhang:
Q-DETR: An Efficient Low-Bit Quantized Detection Transformer. CoRR abs/2304.00253 (2023) - [i76]Xiangyang Zhu, Renrui Zhang, Bowei He, Aojun Zhou, Dong Wang, Bin Zhao, Peng Gao:
Not All Features Matter: Enhancing Few-shot CLIP with Adaptive Prior Refinement. CoRR abs/2304.01195 (2023) - [i75]Xiaorui Wang, Jun Wang, Xin Tang, Peng Gao, Rui Fang, Guotong Xie:
Filter Pruning via Filters Similarity in Consecutive Layers. CoRR abs/2304.13397 (2023) - [i74]Peng Gao, Jiaming Han, Renrui Zhang, Ziyi Lin, Shijie Geng, Aojun Zhou, Wei Zhang, Pan Lu, Conghui He, Xiangyu Yue, Hongsheng Li, Yu Qiao:
LLaMA-Adapter V2: Parameter-Efficient Visual Instruction Model. CoRR abs/2304.15010 (2023) - [i73]Renrui Zhang, Zhengkai Jiang, Ziyu Guo, Shilin Yan, Junting Pan, Hao Dong, Peng Gao, Hongsheng Li:
Personalize Segment Anything Model with One Shot. CoRR abs/2305.03048 (2023) - [i72]Siyuan Huang, Bo Zhang, Botian Shi, Peng Gao, Yikang Li, Hongsheng Li:
SUG: Single-dataset Unified Generalization for 3D Point Cloud Classification. CoRR abs/2305.09160 (2023) - [i71]Siyuan Huang, Zhengkai Jiang, Hao Dong, Yu Qiao, Peng Gao, Hongsheng Li:
Instruct2Act: Mapping Multi-modality Instructions to Robotic Actions with Large Language Model. CoRR abs/2305.11176 (2023) - [i70]Shilin Yan, Renrui Zhang, Ziyu Guo, Wenchao Chen, Wei Zhang, Hongyang Li, Yu Qiao, Zhongjiang He, Peng Gao:
Referred by Multi-Modality: A Unified Temporal Transformer for Video Object Segmentation. CoRR abs/2305.16318 (2023) - [i69]Peng Xu, Wenqi Shao, Kaipeng Zhang, Peng Gao, Shuo Liu, Meng Lei, Fanqing Meng, Siyuan Huang, Yu Qiao, Ping Luo:
LVLM-eHub: A Comprehensive Evaluation Benchmark for Large Vision-Language Models. CoRR abs/2306.09265 (2023) - [i68]Wenqi Shao, Yutao Hu, Peng Gao, Meng Lei, Kaipeng Zhang, Fanqing Meng, Peng Xu, Siyuan Huang, Hongsheng Li, Yu Qiao, Ping Luo:
Tiny LVLM-eHub: Early Multimodal Experiments with Bard. CoRR abs/2308.03729 (2023) - [i67]Xiangyang Zhu, Renrui Zhang, Bowei He, Ziyu Guo, Jiaming Liu, Hao Dong, Peng Gao:
Less is More: Towards Efficient Few-shot 3D Semantic Segmentation via Training-free Networks. CoRR abs/2308.12961 (2023) - [i66]Wenqi Shao, Mengzhao Chen, Zhaoyang Zhang, Peng Xu, Lirui Zhao, Zhiqian Li, Kaipeng Zhang, Peng Gao, Yu Qiao, Ping Luo:
OmniQuant: Omnidirectionally Calibrated Quantization for Large Language Models. CoRR abs/2308.13137 (2023) - [i65]Ziyu Guo, Renrui Zhang, Xiangyang Zhu, Yiwen Tang, Xianzheng Ma, Jiaming Han, Kexin Chen, Peng Gao, Xianzhi Li, Hongsheng Li, Pheng-Ann Heng:
Point-Bind & Point-LLM: Aligning Point Cloud with Multi-modality for 3D Understanding, Generation, and Instruction Following. CoRR abs/2309.00615 (2023) - [i64]Jiaming Han, Renrui Zhang, Wenqi Shao, Peng Gao, Peng Xu, Han Xiao, Kaipeng Zhang, Chris Liu, Song Wen, Ziyu Guo, Xudong Lu, Shuai Ren, Yafei Wen, Xiaoxin Chen, Xiangyu Yue, Hongsheng Li, Yu Qiao:
ImageBind-LLM: Multi-modality Instruction Tuning. CoRR abs/2309.03905 (2023) - [i63]Wenzhe Cai, Siyuan Huang, Guangran Cheng, Yuxing Long, Peng Gao, Changyin Sun, Hao Dong:
Bridging Zero-shot Object Navigation and Foundation Models through Pixel-Guided Navigation Skill. CoRR abs/2309.10309 (2023) - [i62]Song Wen, Guian Fang, Renrui Zhang, Peng Gao, Hao Dong, Dimitris N. Metaxas:
Improving Compositional Text-to-image Generation with Large Vision-Language Models. CoRR abs/2310.06311 (2023) - [i61]Ziyi Lin, Chris Liu, Renrui Zhang, Peng Gao, Longtian Qiu, Han Xiao, Han Qiu, Chen Lin, Wenqi Shao, Keqin Chen, Jiaming Han, Siyuan Huang, Yichi Zhang, Xuming He, Hongsheng Li, Yu Qiao:
SPHINX: The Joint Mixing of Weights, Tasks, and Visual Embeddings for Multi-modal Large Language Models. CoRR abs/2311.07575 (2023) - [i60]Xiaowei Chi, Yijiang Liu, Zhengkai Jiang, Rongyu Zhang, Ziyi Lin, Renrui Zhang, Peng Gao, Chaoyou Fu, Shanghang Zhang, Qifeng Liu, Yike Guo:
ChatIllusion: Efficient-Aligning Interleaved Generation ability with Visual Instruction Model. CoRR abs/2311.17963 (2023) - [i59]Jiaming Han, Kaixiong Gong, Yiyuan Zhang, Jiaqi Wang, Kaipeng Zhang, Dahua Lin, Yu Qiao, Peng Gao, Xiangyu Yue:
OneLLM: One Framework to Align All Modalities with Language. CoRR abs/2312.03700 (2023) - [i58]Zhongang Cai, Jianping Jiang, Zhongfei Qing, Xinying Guo, Mingyuan Zhang, Zhengyu Lin, Haiyi Mei, Chen Wei, Ruisi Wang, Wanqi Yin, Xiangyu Fan, Han Du, Liang Pan, Peng Gao, Zhitao Yang, Yang Gao, Jiaqi Li, Tianxiang Ren, Yukun Wei, Xiaogang Wang, Chen Change Loy, Lei Yang, Ziwei Liu:
Digital Life Project: Autonomous 3D Characters with Social Intelligence. CoRR abs/2312.04547 (2023) - [i57]Dingning Liu, Xiaomeng Dong, Renrui Zhang, Xu Luo, Peng Gao, Xiaoshui Huang, Yongshun Gong, Zhihui Wang:
3DAxiesPrompts: Unleashing the 3D Spatial Task Capabilities of GPT-4V. CoRR abs/2312.09738 (2023) - [i56]Chaoyou Fu, Renrui Zhang, Zihan Wang, Yubo Huang, Zhengye Zhang, Longtian Qiu, Gaoxiang Ye, Yunhang Shen, Mengdan Zhang, Peixian Chen, Sirui Zhao, Shaohui Lin, Deqiang Jiang, Di Yin, Peng Gao, Ke Li, Hongsheng Li, Xing Sun:
A Challenger to GPT-4V? Early Explorations of Gemini in Visual Expertise. CoRR abs/2312.12436 (2023) - 2022
- [j4]Jianhao Li, Yin Zhuang, Shan Dong, Peng Gao, Hao Dong, He Chen, Liang Chen, Lianlin Li:
Hierarchical Disentangling Network for Building Extraction from Very High Resolution Optical Remote Sensing Imagery. Remote. Sens. 14(7): 1767 (2022) - [j3]Tong Zhang, Peng Gao, Hao Dong, Yin Zhuang, Guanqun Wang, Wei Zhang, He Chen:
Consecutive Pre-Training: A Knowledge Transfer Learning Strategy with Relevant Unlabeled Data for Remote Sensing Domain. Remote. Sens. 14(22): 5675 (2022) - [c43]Ziteng Cui, Kunchang Li, Lin Gu, Shenghan Su, Peng Gao, Zhengkai Jiang, Yu Qiao, Tatsuya Harada:
You Only Need 90K Parameters to Adapt Light: a Light Weight Transformer for Image Enhancement and Exposure Correction. BMVC 2022: 238 - [c42]Teli Ma, Shijie Geng, Mengmeng Wang, Sheng Xu, Hongsheng Li, Baochang Zhang, Peng Gao, Yu Qiao:
Unleashing the Potential of Vision-Language Models for Long-Tailed Visual Recognition. BMVC 2022: 481 - [c41]Renrui Zhang, Ziyu Guo, Wei Zhang, Kunchang Li, Xupeng Miao, Bin Cui, Yu Qiao, Peng Gao, Hongsheng Li:
PointCLIP: Point Cloud Understanding by CLIP. CVPR 2022: 8542-8552 - [c40]Sheng Xu, Yanjing Li, Tiancheng Wang, Teli Ma, Baochang Zhang, Peng Gao, Yu Qiao, Jinhu Lü, Guodong Guo:
Recurrent Bilinear Optimization for Binary Neural Networks. ECCV (24) 2022: 19-35 - [c39]Zhengkai Jiang, Yuxi Li, Ceyuan Yang, Peng Gao, Yabiao Wang, Ying Tai, Chengjie Wang:
Prototypical Contrast Adaptation for Domain Adaptive Semantic Segmentation. ECCV (34) 2022: 36-54 - [c38]Sheng Xu, Yanjing Li, Bohan Zeng, Teli Ma, Baochang Zhang, Xianbin Cao, Peng Gao, Jinhu Lü:
IDa-Det: An Information Discrepancy-Aware Distillation for 1-Bit Detectors. ECCV (11) 2022: 346-361 - [c37]Ziyi Lin, Shijie Geng, Renrui Zhang, Peng Gao, Gerard de Melo, Xiaogang Wang, Jifeng Dai, Yu Qiao, Hongsheng Li:
Frozen CLIP Models are Efficient Video Learners. ECCV (35) 2022: 388-404 - [c36]Renrui Zhang, Wei Zhang, Rongyao Fang, Peng Gao, Kunchang Li, Jifeng Dai, Yu Qiao, Hongsheng Li:
Tip-Adapter: Training-Free Adaption of CLIP for Few-Shot Classification. ECCV (35) 2022: 493-510 - [c35]Ankit P. Shah, Shijie Geng, Peng Gao, Anoop Cherian, Takaaki Hori, Tim K. Marks, Jonathan Le Roux, Chiori Hori:
Audio-Visual Scene-Aware Dialog and Reasoning Using Audio-Visual Transformers with Joint Student-Teacher Learning. ICASSP 2022: 7732-7736 - [c34]Kunchang Li, Yali Wang, Peng Gao, Guanglu Song, Yu Liu, Hongsheng Li, Yu Qiao:
UniFormer: Unified Transformer for Efficient Spatial-Temporal Representation Learning. ICLR 2022 - [c33]Shanjunyu Liu, Yin Zhuang, Hao Dong, Peng Gao, Guanqun Wang, Tong Zhang, Liang Chen, He Chen, Lianlin Li:
Adaptive Local Context Embedding for Small Vehicle Detection from Aerial Optical Remote Sensing Images. IGARSS 2022: 1712-1715 - [c32]Peng Gao, Teli Ma, Hongsheng Li, Ziyi Lin, Jifeng Dai, Yu Qiao:
MCMAE: Masked Convolution Meets Masked Autoencoders. NeurIPS 2022 - [c31]Yanjing Li, Sheng Xu, Baochang Zhang, Xianbin Cao, Peng Gao, Guodong Guo:
Q-ViT: Accurate and Fully Quantized Low-bit Vision Transformer. NeurIPS 2022 - [c30]