Yehao Li
2020 – today
- 2024
- [j13] Ting Yao, Yehao Li, Yingwei Pan, Tao Mei: HIRI-ViT: Scaling Vision Transformer With High Resolution Inputs. IEEE Trans. Pattern Anal. Mach. Intell. 46(9): 6431-6442 (2024)
- [c32] Rui Zhu, Yingwei Pan, Yehao Li, Ting Yao, Zhenglong Sun, Tao Mei, Chang Wen Chen: SD-DiT: Unleashing the Power of Self-Supervised Discrimination in Diffusion Transformer*. CVPR 2024: 8435-8445
- [c31] Yurui Qian, Qi Cai, Yingwei Pan, Yehao Li, Ting Yao, Qibin Sun, Tao Mei: Boosting Diffusion Models with Moving Average Sampling in Frequency Domain. CVPR 2024: 8911-8920
- [c30] Yifu Chen, Jingwen Chen, Yingwei Pan, Yehao Li, Ting Yao, Zhineng Chen, Tao Mei: Improving Text-Guided Object Inpainting with Semantic Pre-inpainting. ECCV (46) 2024: 110-126
- [c29] Jianjie Luo, Jingwen Chen, Yehao Li, Yingwei Pan, Jianlin Feng, Hongyang Chao, Ting Yao: Unleashing Text-to-Image Diffusion Prior for Zero-Shot Image Captioning. ECCV (57) 2024: 237-254
- [i32] Ting Yao, Yehao Li, Yingwei Pan, Tao Mei: HIRI-ViT: Scaling Vision Transformer with High Resolution Inputs. CoRR abs/2403.11999 (2024)
- [i31] Rui Zhu, Yingwei Pan, Yehao Li, Ting Yao, Zhenglong Sun, Tao Mei, Chang Wen Chen: SD-DiT: Unleashing the Power of Self-supervised Discrimination in Diffusion Transformer. CoRR abs/2403.17004 (2024)
- [i30] Yurui Qian, Qi Cai, Yingwei Pan, Yehao Li, Ting Yao, Qibin Sun, Tao Mei: Boosting Diffusion Models with Moving Average Sampling in Frequency Domain. CoRR abs/2403.17870 (2024)
- [i29] Siqi Wan, Yehao Li, Jingwen Chen, Yingwei Pan, Ting Yao, Yang Cao, Tao Mei: Improving Virtual Try-On with Garment-focused Diffusion Models. CoRR abs/2409.08258 (2024)
- [i28] Yifu Chen, Jingwen Chen, Yingwei Pan, Yehao Li, Ting Yao, Zhineng Chen, Tao Mei: Improving Text-guided Object Inpainting with Semantic Pre-inpainting. CoRR abs/2409.08260 (2024)
- 2023
- [j12] Chaowei Wang, Yehao Li, Feifei Gao, Danhao Deng, Jisong Xu, Yuhan Liu, Weidong Wang: Adaptive Semantic-Bit Communication for Extended Reality Interactions. IEEE J. Sel. Top. Signal Process. 17(5): 1080-1092 (2023)
- [j11] Yehao Li, Ting Yao, Yingwei Pan, Tao Mei: Contextual Transformer Networks for Visual Recognition. IEEE Trans. Pattern Anal. Mach. Intell. 45(2): 1489-1500 (2023)
- [j10] Ting Yao, Yehao Li, Yingwei Pan, Yu Wang, Xiao-Ping Zhang, Tao Mei: Dual Vision Transformer. IEEE Trans. Pattern Anal. Mach. Intell. 45(9): 10870-10882 (2023)
- [j9] Jingwen Chen, Jianjie Luo, Yingwei Pan, Yehao Li, Ting Yao, Hongyang Chao, Tao Mei: Boosting Vision-and-Language Navigation with Direction Guiding and Backtracing. ACM Trans. Multim. Comput. Commun. Appl. 19(1): 9:1-9:16 (2023)
- [j8] Jingwen Chen, Yingwei Pan, Yehao Li, Ting Yao, Hongyang Chao, Tao Mei: Retrieval Augmented Convolutional Encoder-decoder Networks for Video Captioning. ACM Trans. Multim. Comput. Commun. Appl. 19(1s): 48:1-48:24 (2023)
- [j7] Xuewei Ding, Yingwei Pan, Yehao Li, Ting Yao, Dan Zeng, Tao Mei: Boosting Relationship Detection in Images with Multi-Granular Self-Supervised Learning. ACM Trans. Multim. Comput. Commun. Appl. 19(2s): 88:1-88:18 (2023)
- [j6] Yingwei Pan, Yehao Li, Ting Yao, Tao Mei: Bottom-up and Top-down Object Inference Networks for Image Captioning. ACM Trans. Multim. Comput. Commun. Appl. 19(5): 161:1-161:18 (2023)
- [c28] Ting Yao, Yehao Li, Yingwei Pan, Tao Mei: HGNet: Learning Hierarchical Geometry from Points, Edges, and Surfaces. CVPR 2023: 21846-21855
- [c27] Jianjie Luo, Yehao Li, Yingwei Pan, Ting Yao, Jianlin Feng, Hongyang Chao, Tao Mei: Semantic-Conditional Diffusion Networks for Image Captioning. CVPR 2023: 23359-23368
- [c26] Zhongbai Jiang, Yanwei Sun, Beibei Li, Bowen Sun, Shenduo Xiong, Yehao Li, Wenyue Du: Threat-Aware Data Transmission in Software-Defined Networks. DSC 2023: 443-449
- [c25] Yang Chen, Yingwei Pan, Yehao Li, Ting Yao, Tao Mei: Control3D: Towards Controllable Text-to-3D Generation. ACM Multimedia 2023: 1148-1156
- [i27] Yang Chen, Yingwei Pan, Yehao Li, Ting Yao, Tao Mei: Control3D: Towards Controllable Text-to-3D Generation. CoRR abs/2311.05461 (2023)
- 2022
- [j5] Jing Wang, Yehao Li, Yingwei Pan, Ting Yao, Jinhui Tang, Tao Mei: Contextual and selective attention networks for image captioning. Sci. China Inf. Sci. 65(12) (2022)
- [j4] Huixia Ben, Yingwei Pan, Yehao Li, Ting Yao, Richang Hong, Meng Wang, Tao Mei: Unpaired Image Captioning With semantic-Constrained Self-Learning. IEEE Trans. Multim. 24: 904-916 (2022)
- [j3] Yehao Li, Jiahao Fan, Yingwei Pan, Ting Yao, Weiyao Lin, Tao Mei: Uni-EDEN: Universal Encoder-Decoder Network by Multi-Granular Vision-Language Pre-training. ACM Trans. Multim. Comput. Commun. Appl. 18(2): 48:1-48:16 (2022)
- [c24] Yehao Li, Chaowei Wang, Danhao Deng, Mingliang Pang, Weidong Wang, Lexi Xu: An Elite Genetic Algorithm for Power Allocation in Cell-Free Massive MIMO Systems. ChinaCom 2022: 283-293
- [c23] Mingliang Pang, Chaowei Wang, Danhao Deng, Yehao Li, Weidong Wang, Lexi Xu: Interference-aware Spectrum and Power Coordination in Satellite-aided Cell-free Massive MIMO System. ChinaCom 2022: 375-389
- [c22] Yehao Li, Yingwei Pan, Ting Yao, Tao Mei: Comprehending and Ordering Semantics for Image Captioning. CVPR 2022: 17969-17978
- [c21] Ting Yao, Yingwei Pan, Yehao Li, Chong-Wah Ngo, Tao Mei: Wave-ViT: Unifying Wavelet and Transformers for Visual Representation Learning. ECCV (25) 2022: 328-345
- [c20] Zhaofan Qiu, Yehao Li, Yu Wang, Yingwei Pan, Ting Yao, Tao Mei: SPE-Net: Boosting Point Cloud Analysis via Rotation Robustness Enhancement. ECCV (3) 2022: 593-609
- [c19] Danhao Deng, Chaowei Wang, Lexi Xu, Yehao Li, Weidong Wang, Zhi Zhang, Ping Zhang: Flexible User Duplexing in Cell-Free Massive MIMO: A Deep Reinforcement Learning Approach. ICCC 2022: 296-301
- [c18] Yingwei Pan, Yehao Li, Jianjie Luo, Jun Xu, Ting Yao, Tao Mei: Auto-captions on GIF: A Large-scale Video-sentence Dataset for Vision-language Pre-training. ACM Multimedia 2022: 7070-7074
- [i26] Yehao Li, Jiahao Fan, Yingwei Pan, Ting Yao, Weiyao Lin, Tao Mei: Uni-EDEN: Universal Encoder-Decoder Network by Multi-Granular Vision-Language Pre-training. CoRR abs/2201.04026 (2022)
- [i25] Yingwei Pan, Yehao Li, Yiheng Zhang, Qi Cai, Fuchen Long, Zhaofan Qiu, Ting Yao, Tao Mei: Silver-Bullet-3D at ManiSkill 2021: Learning-from-Demonstrations and Heuristic Rule-based Methods for Object Manipulation. CoRR abs/2206.06289 (2022)
- [i24] Yehao Li, Yingwei Pan, Ting Yao, Tao Mei: Comprehending and Ordering Semantics for Image Captioning. CoRR abs/2206.06930 (2022)
- [i23] Ting Yao, Yehao Li, Yingwei Pan, Yu Wang, Xiao-Ping Zhang, Tao Mei: Dual Vision Transformer. CoRR abs/2207.04976 (2022)
- [i22] Ting Yao, Yingwei Pan, Yehao Li, Chong-Wah Ngo, Tao Mei: Wave-ViT: Unifying Wavelet and Transformers for Visual Representation Learning. CoRR abs/2207.04978 (2022)
- [i21] Zhaofan Qiu, Yehao Li, Yu Wang, Yingwei Pan, Ting Yao, Tao Mei: SPE-Net: Boosting Point Cloud Analysis via Rotation Robustness Enhancement. CoRR abs/2211.08250 (2022)
- [i20] Jianjie Luo, Yehao Li, Yingwei Pan, Ting Yao, Jianlin Feng, Hongyang Chao, Tao Mei: Semantic-Conditional Diffusion Networks for Image Captioning. CoRR abs/2212.03099 (2022)
- 2021
- [c17] Yehao Li, Yingwei Pan, Ting Yao, Jingwen Chen, Tao Mei: Scheduled Sampling in Vision-Language Pretraining with Decoupled Encoder-Decoder Network. AAAI 2021: 8518-8526
- [c16] Yehao Li, Yingwei Pan, Jingwen Chen, Ting Yao, Tao Mei: X-modaler: A Versatile and High-performance Codebase for Cross-modal Analytics. ACM Multimedia 2021: 3799-3802
- [c15] Jianjie Luo, Yehao Li, Yingwei Pan, Ting Yao, Hongyang Chao, Tao Mei: CoCo-BERT: Improving Video-Language Pre-training with Contrastive Cross-modal Matching and Denoising. ACM Multimedia 2021: 5600-5608
- [i19] Yehao Li, Yingwei Pan, Ting Yao, Jingwen Chen, Tao Mei: Scheduled Sampling in Vision-Language Pretraining with Decoupled Encoder-Decoder Network. CoRR abs/2101.11562 (2021)
- [i18] Yehao Li, Ting Yao, Yingwei Pan, Tao Mei: Contextual Transformer Networks for Visual Recognition. CoRR abs/2107.12292 (2021)
- [i17] Yehao Li, Yingwei Pan, Jingwen Chen, Ting Yao, Tao Mei: X-modaler: A Versatile and High-performance Codebase for Cross-modal Analytics. CoRR abs/2108.08217 (2021)
- [i16] Jianjie Luo, Yehao Li, Yingwei Pan, Ting Yao, Hongyang Chao, Tao Mei: CoCo-BERT: Improving Video-Language Pre-training with Contrastive Cross-modal Matching and Denoising. CoRR abs/2112.07515 (2021)
- 2020
- [j2] Yehao Li, Ting Yao, Yingwei Pan, Hongyang Chao, Tao Mei: Deep Metric Learning With Density Adaptivity. IEEE Trans. Multim. 22(5): 1285-1297 (2020)
- [c14] Yingwei Pan, Ting Yao, Yehao Li, Tao Mei: X-Linear Attention Networks for Image Captioning. CVPR 2020: 10968-10977
- [c13] Yingwei Pan, Ting Yao, Yehao Li, Chong-Wah Ngo, Tao Mei: Exploring Category-Agnostic Clusters for Open-Set Domain Adaptation. CVPR 2020: 13864-13872
- [c12] Xuewei Ding, Yehao Li, Yingwei Pan, Dan Zeng, Ting Yao: Exploring Depth Information for Spatial Relation Recognition. MIPR 2020: 279-284
- [i15] Yingwei Pan, Ting Yao, Yehao Li, Tao Mei: X-Linear Attention Networks for Image Captioning. CoRR abs/2003.14080 (2020)
- [i14] Yingwei Pan, Ting Yao, Yehao Li, Chong-Wah Ngo, Tao Mei: Exploring Category-Agnostic Clusters for Open-Set Domain Adaptation. CoRR abs/2006.06567 (2020)
- [i13] Yingwei Pan, Yehao Li, Jianjie Luo, Jun Xu, Ting Yao, Tao Mei: Auto-captions on GIF: A Large-scale Video-sentence Dataset for Vision-language Pre-training. CoRR abs/2007.02375 (2020)
- [i12] Yingwei Pan, Jun Xu, Yehao Li, Ting Yao, Tao Mei: Pre-training for Video Captioning Challenge 2020 Summary. CoRR abs/2008.00947 (2020)
2010 – 2019
- 2019
- [j1] Yehao Li, Yingwei Pan, Ting Yao, Hongyang Chao, Yong Rui, Tao Mei: Learning Click-Based Deep Structure-Preserving Embeddings with Visual Attention. ACM Trans. Multim. Comput. Commun. Appl. 15(3): 78:1-78:19 (2019)
- [c11] Jingwen Chen, Yingwei Pan, Yehao Li, Ting Yao, Hongyang Chao, Tao Mei: Temporal Deformable Convolutional Encoder-Decoder Networks for Video Captioning. AAAI 2019: 8167-8174
- [c10] Yingwei Pan, Ting Yao, Yehao Li, Yu Wang, Chong-Wah Ngo, Tao Mei: Transferrable Prototypical Networks for Unsupervised Domain Adaptation. CVPR 2019: 2239-2247
- [c9] Yehao Li, Ting Yao, Yingwei Pan, Hongyang Chao, Tao Mei: Pointing Novel Objects in Image Captioning. CVPR 2019: 12497-12506
- [c8] Ting Yao, Yingwei Pan, Yehao Li, Tao Mei: Hierarchy Parsing for Image Captioning. ICCV 2019: 2621-2629
- [i11] Yingwei Pan, Ting Yao, Yehao Li, Yu Wang, Chong-Wah Ngo, Tao Mei: Transferrable Prototypical Networks for Unsupervised Domain Adaptation. CoRR abs/1904.11227 (2019)
- [i10] Yehao Li, Ting Yao, Yingwei Pan, Hongyang Chao, Tao Mei: Pointing Novel Objects in Image Captioning. CoRR abs/1904.11251 (2019)
- [i9] Jingwen Chen, Yingwei Pan, Yehao Li, Ting Yao, Hongyang Chao, Tao Mei: Temporal Deformable Convolutional Encoder-Decoder Networks for Video Captioning. CoRR abs/1905.01077 (2019)
- [i8] Zhaofan Qiu, Dong Li, Yehao Li, Qi Cai, Yingwei Pan, Ting Yao: Trimmed Action Recognition, Dense-Captioning Events in Videos, and Spatio-temporal Action Localization with Focus on ActivityNet Challenge 2019. CoRR abs/1906.07016 (2019)
- [i7] Yehao Li, Ting Yao, Yingwei Pan, Hongyang Chao, Tao Mei: Deep Metric Learning with Density Adaptivity. CoRR abs/1909.03909 (2019)
- [i6] Ting Yao, Yingwei Pan, Yehao Li, Tao Mei: Hierarchy Parsing for Image Captioning. CoRR abs/1909.03918 (2019)
- [i5] Yingwei Pan, Yehao Li, Qi Cai, Yang Chen, Ting Yao: Multi-Source Domain Adaptation and Semi-Supervised Domain Adaptation with Focus on Visual Domain Adaptation Challenge 2019. CoRR abs/1910.03548 (2019)
- 2018
- [c7] Yehao Li, Ting Yao, Yingwei Pan, Hongyang Chao, Tao Mei: Jointly Localizing and Describing Events for Dense Video Captioning. CVPR 2018: 7492-7500
- [c6] Ting Yao, Yingwei Pan, Yehao Li, Tao Mei: Exploring Visual Relationship for Image Captioning. ECCV (14) 2018: 711-727
- [i4] Yehao Li, Ting Yao, Yingwei Pan, Hongyang Chao, Tao Mei: Jointly Localizing and Describing Events for Dense Video Captioning. CoRR abs/1804.08274 (2018)
- [i3] Ting Yao, Yingwei Pan, Yehao Li, Tao Mei: Exploring Visual Relationship for Image Captioning. CoRR abs/1809.07041 (2018)
- 2017
- [c5] Ting Yao, Yingwei Pan, Yehao Li, Tao Mei: Incorporating Copying Mechanism in Image Captioning for Learning Novel Objects. CVPR 2017: 5263-5271
- [c4] Ting Yao, Yingwei Pan, Yehao Li, Zhaofan Qiu, Tao Mei: Boosting Image Captioning with Attributes. ICCV 2017: 4904-4912
- [i2] Ting Yao, Yingwei Pan, Yehao Li, Tao Mei: Incorporating Copying Mechanism in Image Captioning for Learning Novel Objects. CoRR abs/1708.05271 (2017)
- 2016
- [c3] Yingwei Pan, Yehao Li, Ting Yao, Tao Mei, Houqiang Li, Yong Rui: Learning Deep Intrinsic Video Representation by Exploring Temporal Coherence and Graph Structure. IJCAI 2016: 3832-3838
- [c2] Yehao Li, Ting Yao, Rui Hu, Tao Mei, Yong Rui: Video ChatBot: Triggering Live Social Interactions by Automatic Video Commenting. ACM Multimedia 2016: 757-758
- [c1] Yehao Li, Ting Yao, Tao Mei, Hongyang Chao, Yong Rui: Share-and-Chat: Achieving Human-Level Video Commenting by Search and Multi-View Embedding. ACM Multimedia 2016: 928-937
- [i1] Ting Yao, Yingwei Pan, Yehao Li, Zhaofan Qiu, Tao Mei: Boosting Image Captioning with Attributes. CoRR abs/1611.01646 (2016)
last updated on 2024-10-15 00:21 CEST by the dblp team
all metadata released as open data under CC0 1.0 license