default search action
Benjamin Z. Yao
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [c15]Mamshad Nayeem Rizve, Fan Fei, Jayakrishnan Unnikrishnan, Son Tran, Benjamin Z. Yao, Belinda Zeng, Mubarak Shah, Trishul Chilimbi:
VidLA: Video-Language Alignment at Scale. CVPR 2024: 14043-14055 - [c14]Sirnam Swetha, Jinyu Yang, Tal Neiman, Mamshad Nayeem Rizve, Son Tran, Benjamin Z. Yao, Trishul Chilimbi, Mubarak Shah:
X-Former: Unifying Contrastive and Reconstruction Learning for MLLMs. ECCV (6) 2024: 146-162 - [c13]Rohit Gupta, Mamshad Nayeem Rizve, Jayakrishnan Unnikrishnan, Ashish Tawari, Son Tran, Mubarak Shah, Benjamin Z. Yao, Trishul Chilimbi:
Open Vocabulary Multi-label Video Classification. ECCV (39) 2024: 276-293 - [c12]Changyou Chen, Han Ding, Bunyamin Sisman, Yi Xu, Ouye Xie, Benjamin Z. Yao, Son Dinh Tran, Belinda Zeng:
Diffusion Models for Multi-Task Generative Modeling. ICLR 2024 - [c11]Xinliang Zhu, Sheng-Wei Huang, Han Ding, Jinyu Yang, Kelvin Chen, Tao Zhou, Tal Neiman, Ouye Xie, Son Tran, Benjamin Z. Yao, Douglas Gray, Anuj Bindal, Arnab Dhua:
Bringing Multimodality to Amazon Visual Search System. KDD 2024: 6390-6399 - [i4]Mamshad Nayeem Rizve, Fan Fei, Jayakrishnan Unnikrishnan, Son Tran, Benjamin Z. Yao, Belinda Zeng, Mubarak Shah, Trishul Chilimbi:
VidLA: Video-Language Alignment at Scale. CoRR abs/2403.14870 (2024) - [i3]Rohit Gupta, Mamshad Nayeem Rizve, Jayakrishnan Unnikrishnan, Ashish Tawari, Son Tran, Mubarak Shah, Benjamin Z. Yao, Trishul Chilimbi:
Open Vocabulary Multi-Label Video Classification. CoRR abs/2407.09073 (2024) - [i2]Sirnam Swetha, Jinyu Yang, Tal Neiman, Mamshad Nayeem Rizve, Son Tran, Benjamin Z. Yao, Trishul Chilimbi, Mubarak Shah:
X-Former: Unifying Contrastive and Reconstruction Learning for MLLMs. CoRR abs/2407.13851 (2024) - [i1]Changyou Chen, Han Ding, Bunyamin Sisman, Yi Xu, Ouye Xie, Benjamin Z. Yao, Son Dinh Tran, Belinda Zeng:
Diffusion Models for Multi-Task Generative Modeling. CoRR abs/2407.17571 (2024)
2010 – 2019
- 2016
- [j5]Liang Lin, Jason J. Corso, Wangmeng Zuo, David Zhang, Benjamin Z. Yao:
Compositional models and Structured learning for visual recognition. Pattern Recognit. 59: 1-4 (2016) - 2014
- [j4]Benjamin Z. Yao, Bruce X. Nie, Zicheng Liu, Song-Chun Zhu:
Animated Pose Templates for Modeling and Detecting Human Actions. IEEE Trans. Pattern Anal. Mach. Intell. 36(3): 436-452 (2014) - 2013
- [j3]Mingtao Pei, Zhangzhang Si, Benjamin Z. Yao, Song-Chun Zhu:
Learning and parsing video events with goal and intent prediction. Comput. Vis. Image Underst. 117(10): 1369-1383 (2013) - [j2]Jiangen Zhang, Benjamin Z. Yao, Yongtian Wang:
Auto learning temporal atomic actions for activity classification. Pattern Recognit. 46(7): 1789-1798 (2013) - 2012
- [c10]Jiangen Zhang, Benjamin Z. Yao, Yongtian Wang:
Modelling Atomic Actions for Activity Classification. ICME 2012: 278-283 - [c9]Yang Lv, Benjamin Z. Yao, Yongtian Wang, Song-Chun Zhu:
Reconfigurable templates for robust vehicle detection and classification. WACV 2012: 321-328 - 2011
- [c8]Zhangzhang Si, Mingtao Pei, Benjamin Z. Yao, Song-Chun Zhu:
Unsupervised learning of event AND-OR grammar and semantics from video. ICCV 2011: 41-48 - [c7]Jiangen Zhang, Wenze Hu, Benjamin Z. Yao, Yongtian Wang, Song-Chun Zhu:
Inferring social roles in long timespan video sequence. ICCV Workshops 2011: 1456-1463 - 2010
- [j1]Benjamin Z. Yao, Xiong Yang, Liang Lin, Mun Wai Lee, Song Chun Zhu:
I2T: Image Parsing to Text Description. Proc. IEEE 98(8): 1485-1508 (2010) - [c6]Liangliang Cao, Yingli Tian, Zicheng Liu, Benjamin Z. Yao, Zhengyou Zhang, Thomas S. Huang:
Action detection using multiple spatial-temporal interest point features. ICME 2010: 340-345
2000 – 2009
- 2009
- [c5]Benjamin Z. Yao, Xiong Yang, Tianfu Wu:
Image parsing with stochastic grammar: The Lotus Hill dataset and inference scheme. CVPR Workshops 2009: 8 - [c4]Benjamin Z. Yao, Song Chun Zhu:
Learning deformable action templates from cluttered videos. ICCV 2009: 1507-1514 - 2008
- [c3]Jake Porway, Kristy Wang, Benjamin Z. Yao, Song Chun Zhu:
A hierarchical and contextual model for aerial image understanding. CVPR 2008 - [c2]Benjamin Z. Yao, Liang Wang, Song-Chun Zhu:
Learning a scene contextual model for tracking and abnormality detection. CVPR Workshops 2008: 1-8 - 2007
- [c1]Benjamin Z. Yao, Xiong Yang, Song Chun Zhu:
Introduction to a Large-Scale General Purpose Ground Truth Database: Methodology, Annotation Tool and Benchmarks. EMMCVPR 2007: 169-183
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-10-23 21:23 CEST by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint