Остановите войну!
for scientists:
default search action
Yu Qiao 0006
Person information
- affiliation: Shanghai AI Laboratory, China
Other persons with the same name
- Yu Qiao — disambiguation page
- Yu Qiao 0001 — Chinese Academy of Sciences, Shenzhen Institutes of Advanced Technology, China (and 2 more)
- Yu Qiao 0002 — Biomedical Imaging Lab, Singapore
- Yu Qiao 0003 — Shanghai Jiao Tong University, Department of Automation, Institute of Image Processing and Pattern Recognition, China (and 1 more)
- Yu Qiao 0004 — Kyung Hee University, School of Computing, Department of Artificial Intelligence, Yongin, South Korea (and 1 more)
- Yu Qiao 0005 — RWTH Aachen University, Germany
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [i64]Xin Ma, Yaohui Wang, Gengyun Jia, Xinyuan Chen, Ziwei Liu, Yuan-Fang Li, Cunjian Chen, Yu Qiao:
Latte: Latent Diffusion Transformer for Video Generation. CoRR abs/2401.03048 (2024) - 2023
- [c26]Chenxin Tao, Xizhou Zhu, Weijie Su, Gao Huang, Bin Li, Jie Zhou, Yu Qiao, Xiaogang Wang, Jifeng Dai:
Siamese Image Modeling for Self-Supervised Vision Representation Learning. CVPR 2023: 2132-2141 - [c25]Hao Li, Jinguo Zhu, Xiaohu Jiang, Xizhou Zhu, Hongsheng Li, Chun Yuan, Xiaohua Wang, Yu Qiao, Xiaogang Wang, Wenhai Wang, Jifeng Dai:
Uni-Perceiver v2: A Generalist Model for Large-Scale Vision and Vision-Language Tasks. CVPR 2023: 2691-2700 - [c24]Bo Zhang, Jiakang Yuan, Botian Shi, Tao Chen, Yikang Li, Yu Qiao:
Uni3D: A Unified Baseline for Multi-Dataset 3D Object Detection. CVPR 2023: 9253-9262 - [c23]Wenhai Wang, Jifeng Dai, Zhe Chen, Zhenhang Huang, Zhiqi Li, Xizhou Zhu, Xiaowei Hu, Tong Lu, Lewei Lu, Hongsheng Li, Xiaogang Wang, Yu Qiao:
InternImage: Exploring Large-Scale Vision Foundation Models with Deformable Convolutions. CVPR 2023: 14408-14419 - [c22]Jiakang Yuan, Bo Zhang, Xiangchao Yan, Tao Chen, Botian Shi, Yikang Li, Yu Qiao:
Bi3D: Bi-Domain Active Learning for Cross-Domain 3D Object Detection. CVPR 2023: 15599-15608 - [c21]Xin Li, Tao Ma, Yuenan Hou, Botian Shi, Yuchen Yang, Youquan Liu, Xingjiao Wu, Qin Chen, Yikang Li, Yu Qiao, Liang He:
LoGoNet: Towards Accurate 3D Object Detection with Local-to-Global Cross- Modal Fusion. CVPR 2023: 17524-17534 - [c20]Chenyu Yang, Yuntao Chen, Hao Tian, Chenxin Tao, Xizhou Zhu, Zhaoxiang Zhang, Gao Huang, Hongyang Li, Yu Qiao, Lewei Lu, Jie Zhou, Jifeng Dai:
BEVFormer v2: Adapting Modern Image Backbones to Bird's-Eye-View Recognition via Perspective Supervision. CVPR 2023: 17830-17839 - [c19]Jiaqi Xu, Xiaowei Hu, Lei Zhu, Qi Dou, Jifeng Dai, Yu Qiao, Pheng-Ann Heng:
Video Dehazing via a Multi-Range Temporal Alignment Network with Physical Prior. CVPR 2023: 18053-18062 - [c18]Hongwei Xue, Peng Gao, Hongyang Li, Yu Qiao, Hao Sun, Houqiang Li, Jiebo Luo:
Stare at What You See: Masked Image Modeling without Reconstruction. CVPR 2023: 22732-22741 - [c17]Tao Ma, Xuemeng Yang, Hongbin Zhou, Xin Li, Botian Shi, Junjie Liu, Yuchen Yang, Zhizheng Liu, Liang He, Yu Qiao, Yikang Li, Hongsheng Li:
DetZero: Rethinking Offboard 3D Object Detection with Long-term Sequential Point Clouds. ICCV 2023: 6713-6724 - [c16]Zhe Chen, Yuchen Duan, Wenhai Wang, Junjun He, Tong Lu, Jifeng Dai, Yu Qiao:
Vision Transformer Adapter for Dense Predictions. ICLR 2023 - [c15]Runjian Chen, Yao Mu, Runsen Xu, Wenqi Shao, Chenhan Jiang, Hang Xu, Yu Qiao, Zhenguo Li, Ping Luo:
CO3: Cooperative Unsupervised 3D Representation Learning for Autonomous Driving. ICLR 2023 - [c14]Penghao Wu, Li Chen, Hongyang Li, Xiaosong Jia, Junchi Yan, Yu Qiao:
Policy Pre-training for Autonomous Driving via Self-supervised Geometric Modeling. ICLR 2023 - [c13]Jiashuo Yu, Yaohui Wang, Xinyuan Chen, Xiao Sun, Yu Qiao:
Long-Term Rhythmic Video Soundtracker. ICML 2023: 40339-40353 - [c12]Yao Mu, Qinglong Zhang, Mengkang Hu, Wenhai Wang, Mingyu Ding, Jun Jin, Bin Wang, Jifeng Dai, Yu Qiao, Ping Luo:
EmbodiedGPT: Vision-Language Pre-Training via Embodied Chain of Thought. NeurIPS 2023 - [i63]Penghao Wu, Li Chen, Hongyang Li, Xiaosong Jia, Junchi Yan, Yu Qiao:
Policy Pre-training for End-to-end Autonomous Driving via Self-supervised Geometric Modeling. CoRR abs/2301.01006 (2023) - [i62]Xin Li, Tao Ma, Yuenan Hou, Botian Shi, Yuchen Yang, Youquan Liu, Xingjiao Wu, Qin Chen, Yikang Li, Yu Qiao, Liang He:
LoGoNet: Towards Accurate 3D Object Detection with Local-to-Global Cross-Modal Fusion. CoRR abs/2303.03595 (2023) - [i61]Jiakang Yuan, Bo Zhang, Xiangchao Yan, Tao Chen, Botian Shi, Yikang Li, Yu Qiao:
Bi3D: Bi-domain Active Learning for Cross-domain 3D Object Detection. CoRR abs/2303.05886 (2023) - [i60]Bo Zhang, Jiakang Yuan, Botian Shi, Tao Chen, Yikang Li, Yu Qiao:
Uni3D: A Unified Baseline for Multi-dataset 3D Object Detection. CoRR abs/2303.06880 (2023) - [i59]Jiaqi Xu, Xiaowei Hu, Lei Zhu, Qi Dou, Jifeng Dai, Yu Qiao, Pheng-Ann Heng:
Video Dehazing via a Multi-Range Temporal Alignment Network with Physical Prior. CoRR abs/2303.09757 (2023) - [i58]Renrui Zhang, Jiaming Han, Aojun Zhou, Xiangfei Hu, Shilin Yan, Pan Lu, Hongsheng Li, Peng Gao, Yu Qiao:
LLaMA-Adapter: Efficient Fine-tuning of Language Models with Zero-init Attention. CoRR abs/2303.16199 (2023) - [i57]Tianyu Li, Li Chen, Xiangwei Geng, Huijie Wang, Yang Li, Zhenbo Liu, Shengyin Jiang, Yuting Wang, Hang Xu, Chunjing Xu, Feng Wen, Ping Luo, Junchi Yan, Wei Zhang, Xiaogang Wang, Yu Qiao, Hongyang Li:
Topology Reasoning for Driving Scenes. CoRR abs/2304.05277 (2023) - [i56]Huijie Wang, Zhenbo Liu, Yang Li, Tianyu Li, Li Chen, Chonghao Sima, Yuting Wang, Shengyin Jiang, Feng Wen, Hang Xu, Ping Luo, Junchi Yan, Wei Zhang, Jun Yao, Yu Qiao, Hongyang Li:
Road Genome: A Topology Reasoning Benchmark for Scene Understanding in Autonomous Driving. CoRR abs/2304.10440 (2023) - [i55]Peng Gao, Jiaming Han, Renrui Zhang, Ziyi Lin, Shijie Geng, Aojun Zhou, Wei Zhang, Pan Lu, Conghui He, Xiangyu Yue, Hongsheng Li, Yu Qiao:
LLaMA-Adapter V2: Parameter-Efficient Visual Instruction Model. CoRR abs/2304.15010 (2023) - [i54]Zhaoyang Liu, Yinan He, Wenhai Wang, Weiyun Wang, Yi Wang, Shoufa Chen, Qinglong Zhang, Zeqiang Lai, Yang Yang, Qingyun Li, Jiashuo Yu, Kunchang Li, Zhe Chen, Xue Yang, Xizhou Zhu, Yali Wang, Limin Wang, Ping Luo, Jifeng Dai, Yu Qiao:
InternGPT: Solving Vision-Centric Tasks by Interacting with Chatbots Beyond Language. CoRR abs/2305.05662 (2023) - [i53]Kunchang Li, Yinan He, Yi Wang, Yizhuo Li, Wenhai Wang, Ping Luo, Yali Wang, Limin Wang, Yu Qiao:
VideoChat: Chat-Centric Video Understanding. CoRR abs/2305.06355 (2023) - [i52]Wenhai Wang, Zhe Chen, Xiaokang Chen, Jiannan Wu, Xizhou Zhu, Gang Zeng, Ping Luo, Tong Lu, Jie Zhou, Yu Qiao, Jifeng Dai:
VisionLLM: Large Language Model is also an Open-Ended Decoder for Vision-Centric Tasks. CoRR abs/2305.11175 (2023) - [i51]Yao Mu, Qinglong Zhang, Mengkang Hu, Wenhai Wang, Mingyu Ding, Jun Jin, Bin Wang, Jifeng Dai, Yu Qiao, Ping Luo:
EmbodiedGPT: Vision-Language Pre-Training via Embodied Chain of Thought. CoRR abs/2305.15021 (2023) - [i50]Shilin Yan, Renrui Zhang, Ziyu Guo, Wenchao Chen, Wei Zhang, Hongyang Li, Yu Qiao, Zhongjiang He, Peng Gao:
Referred by Multi-Modality: A Unified Temporal Transformer for Video Object Segmentation. CoRR abs/2305.16318 (2023) - [i49]Xizhou Zhu, Yuntao Chen, Hao Tian, Chenxin Tao, Weijie Su, Chenyu Yang, Gao Huang, Bin Li, Lewei Lu, Xiaogang Wang, Yu Qiao, Zhaoxiang Zhang, Jifeng Dai:
Ghost in the Minecraft: Generally Capable Agents for Open-World Environments via Large Language Models with Text-based Knowledge and Memory. CoRR abs/2305.17144 (2023) - [i48]Mengzhao Chen, Wenqi Shao, Peng Xu, Mingbao Lin, Kaipeng Zhang, Fei Chao, Rongrong Ji, Yu Qiao, Ping Luo:
DiffRate : Differentiable Compression Rate for Efficient Vision Transformers. CoRR abs/2305.17997 (2023) - [i47]Jiakang Yuan, Bo Zhang, Xiangchao Yan, Tao Chen, Botian Shi, Yikang Li, Yu Qiao:
AD-PT: Autonomous Driving Pre-Training with Large-scale Point Cloud Dataset. CoRR abs/2306.00612 (2023) - [i46]Zeqiang Lai, Yuchen Duan, Jifeng Dai, Ziheng Li, Ying Fu, Hongsheng Li, Yu Qiao, Wenhai Wang:
Denoising Diffusion Semantic Segmentation with Mask Prior Modeling. CoRR abs/2306.01721 (2023) - [i45]Tao Ma, Xuemeng Yang, Hongbin Zhou, Xin Li, Botian Shi, Junjie Liu, Yuchen Yang, Zhizheng Liu, Liang He, Yu Qiao, Yikang Li, Hongsheng Li:
DetZero: Rethinking Offboard 3D Object Detection with Long-term Sequential Point Clouds. CoRR abs/2306.06023 (2023) - [i44]Peng Xu, Wenqi Shao, Kaipeng Zhang, Peng Gao, Shuo Liu, Meng Lei, Fanqing Meng, Siyuan Huang, Yu Qiao, Ping Luo:
LVLM-eHub: A Comprehensive Evaluation Benchmark for Large Vision-Language Models. CoRR abs/2306.09265 (2023) - [i43]Yue Yang, Kaipeng Zhang, Yuying Ge, Wenqi Shao, Zeyue Xue, Yu Qiao, Ping Luo:
Align, Adapt and Inject: Sound-guided Unified Image Generation. CoRR abs/2306.11504 (2023) - [i42]Junting Pan, Keqiang Sun, Yuying Ge, Hao Li, Haodong Duan, Xiaoshi Wu, Renrui Zhang, Aojun Zhou, Zipeng Qin, Yi Wang, Jifeng Dai, Yu Qiao, Hongsheng Li:
JourneyDB: A Benchmark for Generative Image Understanding. CoRR abs/2307.00716 (2023) - [i41]Yi Wang, Yinan He, Yizhuo Li, Kunchang Li, Jiashuo Yu, Xin Ma, Xinyuan Chen, Yaohui Wang, Ping Luo, Ziwei Liu, Yali Wang, Limin Wang, Yu Qiao:
InternVid: A Large-scale Video-Text Dataset for Multimodal Understanding and Generation. CoRR abs/2307.06942 (2023) - [i40]Daocheng Fu, Xin Li, Licheng Wen, Min Dou, Pinlong Cai, Botian Shi, Yu Qiao:
Drive Like a Human: Rethinking Autonomous Driving with Large Language Models. CoRR abs/2307.07162 (2023) - [i39]Yiyuan Zhang, Kaixiong Gong, Kaipeng Zhang, Hongsheng Li, Yu Qiao, Wanli Ouyang, Xiangyu Yue:
Meta-Transformer: A Unified Framework for Multimodal Learning. CoRR abs/2307.10802 (2023) - [i38]Zun Wang, Jialu Li, Yicong Hong, Yi Wang, Qi Wu, Mohit Bansal, Stephen Gould, Hao Tan, Yu Qiao:
Scaling Data Generation in Vision-and-Language Navigation. CoRR abs/2307.15644 (2023) - [i37]Weiyun Wang, Min Shi, Qingyun Li, Wenhai Wang, Zhenhang Huang, Linjie Xing, Zhe Chen, Hao Li, Xizhou Zhu, Zhiguo Cao, Yushi Chen, Tong Lu, Jifeng Dai, Yu Qiao:
The All-Seeing Project: Towards Panoptic Visual Recognition and Understanding of the Open World. CoRR abs/2308.01907 (2023) - [i36]Wenqi Shao, Mengzhao Chen, Zhaoyang Zhang, Peng Xu, Lirui Zhao, Zhiqian Li, Kaipeng Zhang, Peng Gao, Yu Qiao, Ping Luo:
OmniQuant: Omnidirectionally Calibrated Quantization for Large Language Models. CoRR abs/2308.13137 (2023) - [i35]Xinqi Lin, Jingwen He, Ziyan Chen, Zhaoyang Lyu, Ben Fei, Bo Dai, Wanli Ouyang, Yu Qiao, Chao Dong:
DiffBIR: Towards Blind Image Restoration with Generative Diffusion Prior. CoRR abs/2308.15070 (2023) - [i34]Bo Zhang, Xinyu Cai, Jiakang Yuan, Donglin Yang, Jianfei Guo, Xiangchao Yan, Renqiu Xia, Botian Shi, Min Dou, Tao Chen, Si Liu, Junchi Yan, Yu Qiao:
ReSimAD: Zero-Shot 3D Domain Transfer for Autonomous Driving with Source Reconstruction and Target Simulation. CoRR abs/2309.05527 (2023) - [i33]Xiangchao Yan, Runjian Chen, Bo Zhang, Jiakang Yuan, Xinyu Cai, Botian Shi, Wenqi Shao, Junchi Yan, Ping Luo, Yu Qiao:
SPOT: Scalable 3D Pre-training via Occupancy Prediction for Autonomous Driving. CoRR abs/2309.10527 (2023) - [i32]Renqiu Xia, Bo Zhang, Haoyang Peng, Ning Liao, Peng Ye, Botian Shi, Junchi Yan, Yu Qiao:
StructChart: Perception, Structuring, Reasoning for Visual Chart Understanding. CoRR abs/2309.11268 (2023) - [i31]Licheng Wen, Daocheng Fu, Xin Li, Xinyu Cai, Tao Ma, Pinlong Cai, Min Dou, Botian Shi, Liang He, Yu Qiao:
DiLu: A Knowledge-Driven Approach to Autonomous Driving with Large Language Models. CoRR abs/2309.16292 (2023) - [i30]Mengkang Hu, Yao Mu, Xinmiao Yu, Mingyu Ding, Shiguang Wu, Wenqi Shao, Qiguang Chen, Bin Wang, Yu Qiao, Ping Luo:
Tree-Planner: Efficient Close-loop Task Planning with Large Language Models. CoRR abs/2310.08582 (2023) - [i29]Zhiyu Zhao, Bingkun Huang, Sen Xing, Gangshan Wu, Yu Qiao, Limin Wang:
Asymmetric Masked Distillation for Pre-Training Small Foundation Models. CoRR abs/2311.03149 (2023) - [i28]Licheng Wen, Xuemeng Yang, Daocheng Fu, Xiaofeng Wang, Pinlong Cai, Xin Li, Tao Ma, Yingxuan Li, Linran Xu, Dengke Shang, Zheng Zhu, Shaoyan Sun, Yeqi Bai, Xinyu Cai, Min Dou, Shuanglu Hu, Botian Shi, Yu Qiao:
On the Road with GPT-4V(ision): Early Explorations of Visual-Language Model on Autonomous Driving. CoRR abs/2311.05332 (2023) - [i27]Zhihang Zhong, Gurunandan Krishnan, Xiao Sun, Yu Qiao, Sizhuo Ma, Jian Wang:
Clearer Frames, Anytime: Resolving Velocity Ambiguity in Video Frame Interpolation. CoRR abs/2311.08007 (2023) - [i26]Xin Li, Yeqi Bai, Pinlong Cai, Licheng Wen, Daocheng Fu, Bo Zhang, Xuemeng Yang, Xinyu Cai, Tao Ma, Jianfei Guo, Xing Gao, Min Dou, Yikang Li, Botian Shi, Yong Liu, Liang He, Yu Qiao:
Towards Knowledge-driven Autonomous Driving. CoRR abs/2312.04316 (2023) - [i25]Zhe Chen, Jiannan Wu, Wenhai Wang, Weijie Su, Guo Chen, Sen Xing, Muyan Zhong, Qinglong Zhang, Xizhou Zhu, Lewei Lu, Bin Li, Ping Luo, Tong Lu, Yu Qiao, Jifeng Dai:
InternVL: Scaling up Vision Foundation Models and Aligning for Generic Visual-Linguistic Tasks. CoRR abs/2312.14238 (2023) - 2022
- [c11]Xiaosong Jia, Li Chen, Penghao Wu, Jia Zeng, Junchi Yan, Hongyang Li, Yu Qiao:
Towards Capturing the Temporal Dynamics for Trajectory Prediction: a Coarse-to-Fine Approach. CoRL 2022: 910-920 - [c10]Zhiqi Li, Wenhai Wang, Hongyang Li, Enze Xie, Chonghao Sima, Tong Lu, Yu Qiao, Jifeng Dai:
BEVFormer: Learning Bird's-Eye-View Representation from Multi-camera Images via Spatiotemporal Transformers. ECCV (9) 2022: 1-18 - [c9]Changyao Tian, Wenhai Wang, Xizhou Zhu, Jifeng Dai, Yu Qiao:
VL-LTR: Learning Class-wise Visual-Linguistic Representation for Long-Tailed Visual Recognition. ECCV (25) 2022: 73-91 - [c8]Ziyi Lin, Shijie Geng, Renrui Zhang, Peng Gao, Gerard de Melo, Xiaogang Wang, Jifeng Dai, Yu Qiao, Hongsheng Li:
Frozen CLIP Models are Efficient Video Learners. ECCV (35) 2022: 388-404 - [c7]Renrui Zhang, Wei Zhang, Rongyao Fang, Peng Gao, Kunchang Li, Jifeng Dai, Yu Qiao, Hongsheng Li:
Tip-Adapter: Training-Free Adaption of CLIP for Few-Shot Classification. ECCV (35) 2022: 493-510 - [c6]Li Chen, Chonghao Sima, Yang Li, Zehan Zheng, Jiajie Xu, Xiangwei Geng, Hongyang Li, Conghui He, Jianping Shi, Yu Qiao, Junchi Yan:
PersFormer: 3D Lane Detection via Perspective Transformer and the OpenLane Benchmark. ECCV (38) 2022: 550-567 - [c5]Peng Gao, Teli Ma, Hongsheng Li, Ziyi Lin, Jifeng Dai, Yu Qiao:
MCMAE: Masked Convolution Meets Masked Autoencoders. NeurIPS 2022 - [c4]Penghao Wu, Xiaosong Jia, Li Chen, Junchi Yan, Hongyang Li, Yu Qiao:
Trajectory-guided Control Prediction for End-to-end Autonomous Driving: A Simple yet Strong Baseline. NeurIPS 2022 - [i24]Li Chen, Chonghao Sima, Yang Li, Zehan Zheng, Jiajie Xu, Xiangwei Geng, Hongyang Li, Conghui He, Jianping Shi, Yu Qiao, Junchi Yan:
PersFormer: 3D Lane Detection via Perspective Transformer and the OpenLane Benchmark. CoRR abs/2203.11089 (2022) - [i23]Peng Gao, Teli Ma, Hongsheng Li, Ziyi Lin, Jifeng Dai, Yu Qiao:
ConvMAE: Masked Convolution Meets Masked Autoencoders. CoRR abs/2205.03892 (2022) - [i22]Zhe Chen, Yuchen Duan, Wenhai Wang, Junjun He, Tong Lu, Jifeng Dai, Yu Qiao:
Vision Transformer Adapter for Dense Predictions. CoRR abs/2205.08534 (2022) - [i21]Chenxin Tao, Xizhou Zhu, Gao Huang, Yu Qiao, Xiaogang Wang, Jifeng Dai:
Siamese Image Modeling for Self-Supervised Vision Representation Learning. CoRR abs/2206.01204 (2022) - [i20]Penghao Wu, Xiaosong Jia, Li Chen, Junchi Yan, Hongyang Li, Yu Qiao:
Trajectory-guided Control Prediction for End-to-end Autonomous Driving: A Simple yet Strong Baseline. CoRR abs/2206.08129 (2022) - [i19]Li Chen, Tutian Tang, Zhitian Cai, Yang Li, Penghao Wu, Hongyang Li, Jianping Shi, Junchi Yan, Yu Qiao:
Level 2 Autonomous Driving on a Single Device: Diving into the Devils of Openpilot. CoRR abs/2206.08176 (2022) - [i18]Renrui Zhang, Zhang Wei, Rongyao Fang, Peng Gao, Kunchang Li, Jifeng Dai, Yu Qiao, Hongsheng Li:
Tip-Adapter: Training-free Adaption of CLIP for Few-shot Classification. CoRR abs/2207.09519 (2022) - [i17]Ziyi Lin, Shijie Geng, Renrui Zhang, Peng Gao, Gerard de Melo, Xiaogang Wang, Jifeng Dai, Yu Qiao, Hongsheng Li:
Frozen CLIP Models are Efficient Video Learners. CoRR abs/2208.03550 (2022) - [i16]Hongyang Li, Chonghao Sima, Jifeng Dai, Wenhai Wang, Lewei Lu, Huijie Wang, Enze Xie, Zhiqi Li, Hanming Deng, Hao Tian, Xizhou Zhu, Li Chen, Yulu Gao, Xiangwei Geng, Jia Zeng, Yang Li, Jiazhi Yang, Xiaosong Jia, Bohan Yu, Yu Qiao, Dahua Lin, Si Liu, Junchi Yan, Jianping Shi, Ping Luo:
Delving into the Devils of Bird's-eye-view Perception: A Review, Evaluation and Recipe. CoRR abs/2209.05324 (2022) - [i15]Wenhai Wang, Jifeng Dai, Zhe Chen, Zhenhang Huang, Zhiqi Li, Xizhou Zhu, Xiaowei Hu, Tong Lu, Lewei Lu, Hongsheng Li, Xiaogang Wang, Yu Qiao:
InternImage: Exploring Large-Scale Vision Foundation Models with Deformable Convolutions. CoRR abs/2211.05778 (2022) - [i14]Jifeng Dai, Min Shi, Weiyun Wang, Sitong Wu, Linjie Xing, Wenhai Wang, Xizhou Zhu, Lewei Lu, Jie Zhou, Xiaogang Wang, Yu Qiao, Xiaowei Hu:
Demystify Transformers & Convolutions in Modern Image Deep Networks. CoRR abs/2211.05781 (2022) - [i13]Hongwei Xue, Peng Gao, Hongyang Li, Yu Qiao, Hao Sun, Houqiang Li, Jiebo Luo:
Stare at What You See: Masked Image Modeling without Reconstruction. CoRR abs/2211.08887 (2022) - [i12]Guo Chen, Sen Xing, Zhe Chen, Yi Wang, Kunchang Li, Yizhuo Li, Yi Liu, Jiahao Wang, Yin-Dong Zheng, Bingkun Huang, Zhiyu Zhao, Junting Pan, Yifei Huang, Zun Wang, Jiashuo Yu, Yinan He, Hongjie Zhang, Tong Lu, Yali Wang, Limin Wang, Yu Qiao:
InternVideo-Ego4D: A Pack of Champion Solutions to Ego4D Challenges. CoRR abs/2211.09529 (2022) - [i11]Weijie Su, Xizhou Zhu, Chenxin Tao, Lewei Lu, Bin Li, Gao Huang, Yu Qiao, Xiaogang Wang, Jie Zhou, Jifeng Dai:
Towards All-in-one Pre-training via Maximizing Multi-modal Mutual Information. CoRR abs/2211.09807 (2022) - [i10]Hao Li, Jinguo Zhu, Xiaohu Jiang, Xizhou Zhu, Hongsheng Li, Chun Yuan, Xiaohua Wang, Yu Qiao, Xiaogang Wang, Wenhai Wang, Jifeng Dai:
Uni-Perceiver v2: A Generalist Model for Large-Scale Vision and Vision-Language Tasks. CoRR abs/2211.09808 (2022) - [i9]Chenyu Yang, Yuntao Chen, Hao Tian, Chenxin Tao, Xizhou Zhu, Zhaoxiang Zhang, Gao Huang, Hongyang Li, Yu Qiao, Lewei Lu, Jie Zhou, Jifeng Dai:
BEVFormer v2: Adapting Modern Image Backbones to Bird's-Eye-View Recognition via Perspective Supervision. CoRR abs/2211.10439 (2022) - [i8]Yihan Hu, Jiazhi Yang, Li Chen, Keyu Li, Chonghao Sima, Xizhou Zhu, Siqi Chai, Senyao Du, Tianwei Lin, Wenhai Wang, Lewei Lu, Xiaosong Jia, Qiang Liu, Jifeng Dai, Yu Qiao, Hongyang Li:
Goal-oriented Autonomous Driving. CoRR abs/2212.10156 (2022) - [i7]Ben Fei, Siyuan Huang, Jiakang Yuan, Botian Shi, Bo Zhang, Tao Chen, Min Dou, Yu Qiao:
ADAS: A Simple Active-and-Adaptive Baseline for Cross-Domain 3D Semantic Segmentation. CoRR abs/2212.10390 (2022) - 2021
- [i6]Peng Gao, Shijie Geng, Yu Qiao, Xiaogang Wang, Jifeng Dai, Hongsheng Li:
Scalable Transformers for Neural Machine Translation. CoRR abs/2106.02242 (2021) - [i5]Renrui Zhang, Rongyao Fang, Wei Zhang, Peng Gao, Kunchang Li, Jifeng Dai, Yu Qiao, Hongsheng Li:
Tip-Adapter: Training-free CLIP-Adapter for Better Vision-Language Modeling. CoRR abs/2111.03930 (2021) - [i4]Jing Shao, Siyu Chen, Yangguang Li, Kun Wang, Zhenfei Yin, Yinan He, Jianing Teng, Qinghong Sun, Mengya Gao, Jihao Liu, Gengshi Huang, Guanglu Song, Yichao Wu, Yuming Huang, Fenggang Liu, Huan Peng, Shuo Qin, Chengyu Wang, Yujie Wang, Conghui He, Ding Liang, Yu Liu, Fengwei Yu, Junjie Yan, Dahua Lin, Xiaogang Wang, Yu Qiao:
INTERN: A New Learning Paradigm Towards General Vision. CoRR abs/2111.08687 (2021) - [i3]Changyao Tian, Wenhai Wang, Xizhou Zhu, Xiaogang Wang, Jifeng Dai, Yu Qiao:
VL-LTR: Learning Class-wise Visual-Linguistic Representation for Long-Tailed Visual Recognition. CoRR abs/2111.13579 (2021)
2010 – 2019
- 2019
- [c3]Xiao Zhang, Rui Zhao, Junjie Yan, Mengya Gao, Yu Qiao, Xiaogang Wang, Hongsheng Li:
P2SGrad: Refined Gradients for Optimizing Deep Face Models. CVPR 2019: 9906-9914 - [c2]Xiao Zhang, Rui Zhao, Yu Qiao, Xiaogang Wang, Hongsheng Li:
AdaCos: Adaptively Scaling Cosine Logits for Effectively Learning Deep Face Representations. CVPR 2019: 10823-10832 - [i2]Xiao Zhang, Rui Zhao, Yu Qiao, Xiaogang Wang, Hongsheng Li:
AdaCos: Adaptively Scaling Cosine Logits for Effectively Learning Deep Face Representations. CoRR abs/1905.00292 (2019) - [i1]Xiao Zhang, Rui Zhao, Junjie Yan, Mengya Gao, Yu Qiao, Xiaogang Wang, Hongsheng Li:
P2SGrad: Refined Gradients for Optimizing Deep Face Models. CoRR abs/1905.02479 (2019) - 2016
- [j1]Xixuan Wu, Yu Qiao, Xiaogang Wang, Xiaoou Tang:
Bridging Music and Image via Cross-Modal Ranking Analysis. IEEE Trans. Multim. 18(7): 1305-1318 (2016) - 2012
- [c1]Xixuan Wu, Yu Qiao, Xiaogang Wang, Xiaoou Tang:
Cross matching of music and image. ACM Multimedia 2012: 837-840
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-04-26 00:04 CEST by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint