Chen Sun 0002
Person information
- affiliation: Brown University, Department of Computer Science, Providence, RI, USA
- affiliation (former): Google Research, USA
- affiliation (former): Facebook AI Research, USA
- affiliation (former): University of Southern California, Los Angeles, CA, USA
Other persons with the same name
- Chen Sun — disambiguation page
- Chen Sun 0001 — National Institute of Information and Communications Technology, Yokosuka, Japan
- Chen Sun 0003 — Intel Corporation, USA (and 3 more)
- Chen Sun 0004 — Southeast University, National Mobile Communications Research Laboratory and Purple Mountain Laboratory, Nanjing, China (and 2 more)
- Chen Sun 0005 — Alibaba Group, Hangzhou, China (and 1 more)
- Chen Sun 0006 — Sony China Research Laboratory, Beijing, China (and 3 more)
- Chen Sun 0007 — University of Montréal, Mila, QC, Canada (and 1 more)
- Chen Sun 0008 — University of Waterloo, Department of Mechanical and Mechatronics Engineering, Faculty of Engineering, ON, Canada
- Chen Sun 0010 — National University of Singapore, NUS, Department of Electrical and Computer Engineering, Singapore
- Chen Sun 0011 — University of Manchester, UK (and 2 more)
- Chen Sun 0012 — Beihang University Aeronautics and Astronautics, School of Energy and Power Engineering, Beijing, China
- Chen Sun 0013 — Jiangsu Normal University, School of Geography, Geomatics and Planning, Xuzhou, China
- Chen Sun 0014 — Pennsylvania State University, Department of Computer Science and Engineering, PA, USA
- Chen Sun 0015 — Huazhong University of Science and Technology, State Key Laboratory of Digital Manufacturing Equipment and Technology, China
2020 – today
- 2024
- [c67]Jiarui Xu, Xingyi Zhou, Shen Yan, Xiuye Gu, Anurag Arnab, Chen Sun, Xiaolong Wang, Cordelia Schmid:
Pixel Aligned Language Models. CVPR 2024: 13030-13039 - [c66]Alexey A. Gritsenko, Xuehan Xiong, Josip Djolonga, Mostafa Dehghani, Chen Sun, Mario Lucic, Cordelia Schmid, Anurag Arnab:
End-to-End Spatio-Temporal Action Localisation with Video Transformers. CVPR 2024: 18373-18383 - [c65]Qi Zhao, Shijie Wang, Ce Zhang, Changcheng Fu, Minh Quan Do, Nakul Agarwal, Kwonjoon Lee, Chen Sun:
AntGPT: Can Large Language Models Help Long-term Action Anticipation from Videos? ICLR 2024 - [c64]Nate Gillman, Michael Freeman, Daksh Aggarwal, Chia-Hong Hsu, Calvin Luo, Yonglong Tian, Chen Sun:
Self-Correcting Self-Consuming Loops for Generative Model Training. ICML 2024 - [c63]Yunhao Luo, Chen Sun, Joshua B. Tenenbaum, Yilun Du:
Potential Based Diffusion Motion Planning. ICML 2024 - [c62]Ce Zhang, Changcheng Fu, Shijie Wang, Nakul Agarwal, Kwonjoon Lee, Chiho Choi, Chen Sun:
Object-centric Video Representation for Long-term Action Anticipation. WACV 2024: 6737-6747 - [i70]Nate Gillman, Michael Freeman, Daksh Aggarwal, Chia-Hong Hsu, Calvin Luo, Yonglong Tian, Chen Sun:
Self-Correcting Self-Consuming Loops for Generative Model Training. CoRR abs/2402.07087 (2024) - [i69]Yuan Zang, Tian Yun, Hao Tan, Trung Bui, Chen Sun:
Pre-trained Vision-Language Models Learn Discoverable Visual Concepts. CoRR abs/2404.12652 (2024) - [i68]Calvin Luo, Mandy He, Zilai Zeng, Chen Sun:
Text-Aware Diffusion for Policy Learning. CoRR abs/2407.01903 (2024) - [i67]Yunhao Luo, Chen Sun, Joshua B. Tenenbaum, Yilun Du:
Potential Based Diffusion Motion Planning. CoRR abs/2407.06169 (2024) - [i66]Shijie Wang, Dahun Kim, Ali Taalimi, Chen Sun, Weicheng Kuo:
Learning Visual Grounding from Generative Vision and Language Model. CoRR abs/2407.14563 (2024) - [i65]Megan Wei, Michael Freeman, Chris Donahue, Chen Sun:
Do Music Generation Models Encode Music Theory? CoRR abs/2410.00872 (2024)
- 2023
- [j2]Tian Yun, Usha Bhalla, Ellie Pavlick, Chen Sun:
Do Vision-Language Pretrained Models Learn Composable Primitive Concepts? Trans. Mach. Learn. Res. 2023 (2023) - [c61]Xingyi Zhou, Anurag Arnab, Chen Sun, Cordelia Schmid:
How can objects help action recognition? CVPR 2023: 2353-2362 - [c60]Ziniu Hu, Ahmet Iscen, Chen Sun, Zirui Wang, Kai-Wei Chang, Yizhou Sun, Cordelia Schmid, David A. Ross, Alireza Fathi:
Reveal: Retrieval-Augmented Visual-Language Pre-Training with Multi-Source Multimodal Knowledge Memory. CVPR 2023: 23369-23379 - [c59]Apoorv Khandelwal, Ellie Pavlick, Chen Sun:
Analyzing Modular Approaches for Visual Question Decomposition. EMNLP 2023: 2590-2603 - [c58]Tian Yun, Zilai Zeng, Kunal Handa, Ashish V. Thapliyal, Bo Pang, Ellie Pavlick, Chen Sun:
Emergence of Abstract State Representations in Embodied Sequence Modeling. EMNLP 2023: 12190-12205 - [c57]Chen Sun, Calvin Luo, Xingyi Zhou, Anurag Arnab, Cordelia Schmid:
Does Visual Pretraining Help End-to-End Reasoning? NeurIPS 2023 - [c56]Ziniu Hu, Ahmet Iscen, Chen Sun, Kai-Wei Chang, Yizhou Sun, David Ross, Cordelia Schmid, Alireza Fathi:
AVIS: Autonomous Visual Information Seeking with Large Language Model Agent. NeurIPS 2023 - [c55]Zilai Zeng, Ce Zhang, Shijie Wang, Chen Sun:
Goal-Conditioned Predictive Coding for Offline Reinforcement Learning. NeurIPS 2023 - [i64]Sangnie Bhardwaj, Willie McClinton, Tongzhou Wang, Guillaume Lajoie, Chen Sun, Phillip Isola, Dilip Krishnan:
Steerable Equivariant Representation Learning. CoRR abs/2302.11349 (2023) - [i63]Dylan Ebert, Chen Sun, Ellie Pavlick:
Comparing Trajectory and Vision Modalities for Verb Representation. CoRR abs/2303.12737 (2023) - [i62]Alexey A. Gritsenko, Xuehan Xiong, Josip Djolonga, Mostafa Dehghani, Chen Sun, Mario Lucic, Cordelia Schmid, Anurag Arnab:
End-to-End Spatio-Temporal Action Localisation with Video Transformers. CoRR abs/2304.12160 (2023) - [i61]Ziniu Hu, Ahmet Iscen, Chen Sun, Kai-Wei Chang, Yizhou Sun, David A. Ross, Cordelia Schmid, Alireza Fathi:
AVIS: Autonomous Visual Information Seeking with Large Language Models. CoRR abs/2306.08129 (2023) - [i60]Xingyi Zhou, Anurag Arnab, Chen Sun, Cordelia Schmid:
How can objects help action recognition? CoRR abs/2306.11726 (2023) - [i59]Xingyi Zhou, Anurag Arnab, Chen Sun, Cordelia Schmid:
Dense Video Object Captioning from Disjoint Supervision. CoRR abs/2306.11729 (2023) - [i58]Zilai Zeng, Ce Zhang, Shijie Wang, Chen Sun:
Goal-Conditioned Predictive Coding as an Implicit Planner for Offline Reinforcement Learning. CoRR abs/2307.03406 (2023) - [i57]Chen Sun, Calvin Luo, Xingyi Zhou, Anurag Arnab, Cordelia Schmid:
Does Visual Pretraining Help End-to-End Reasoning? CoRR abs/2307.08506 (2023) - [i56]Qi Zhao, Ce Zhang, Shijie Wang, Changcheng Fu, Nakul Agarwal, Kwonjoon Lee, Chen Sun:
AntGPT: Can Large Language Models Help Long-term Action Anticipation from Videos? CoRR abs/2307.16368 (2023) - [i55]Ce Zhang, Changcheng Fu, Shijie Wang, Nakul Agarwal, Kwonjoon Lee, Chiho Choi, Chen Sun:
Object-centric Video Representation for Long-term Action Anticipation. CoRR abs/2311.00180 (2023) - [i54]Tian Yun, Zilai Zeng, Kunal Handa, Ashish V. Thapliyal, Bo Pang, Ellie Pavlick, Chen Sun:
Emergence of Abstract State Representations in Embodied Sequence Modeling. CoRR abs/2311.02171 (2023) - [i53]Calvin Luo, Boqing Gong, Ting Chen, Chen Sun:
Towards A Unified Neural Architecture for Visual Recognition and Reasoning. CoRR abs/2311.06386 (2023) - [i52]Apoorv Khandelwal, Ellie Pavlick, Chen Sun:
Analyzing Modular Approaches for Visual Question Decomposition. CoRR abs/2311.06411 (2023) - [i51]Shijie Wang, Qi Zhao, Minh Quan Do, Nakul Agarwal, Kwonjoon Lee, Chen Sun:
Vamos: Versatile Action Models for Video Understanding. CoRR abs/2311.13627 (2023) - [i50]Jiarui Xu, Xingyi Zhou, Shen Yan, Xiuye Gu, Anurag Arnab, Chen Sun, Xiaolong Wang, Cordelia Schmid:
Pixel Aligned Language Models. CoRR abs/2312.09237 (2023)
- 2022
- [c54]Shen Yan, Xuehan Xiong, Anurag Arnab, Zhichao Lu, Mi Zhang, Chen Sun, Cordelia Schmid:
Multiview Transformers for Video Recognition. CVPR 2022: 3323-3333 - [c53]Arsha Nagrani, Paul Hongsuck Seo, Bryan Seybold, Anja Hauth, Santiago Manen, Chen Sun, Cordelia Schmid:
Learning Audio-Video Modalities from Image Captions. ECCV (14) 2022: 407-426 - [c52]Medhini Narasimhan, Arsha Nagrani, Chen Sun, Michael Rubinstein, Trevor Darrell, Anna Rohrbach, Cordelia Schmid:
TL;DW? Summarizing Instructional Videos with Task Relevance and Cross-Modal Saliency. ECCV (34) 2022: 540-557 - [c51]Valentin Gabeur, Paul Hongsuck Seo, Arsha Nagrani, Chen Sun, Karteek Alahari, Cordelia Schmid:
AVATAR: Unconstrained Audiovisual Speech Recognition. INTERSPEECH 2022: 2818-2822 - [c50]Dylan Ebert, Chen Sun, Ellie Pavlick:
Do Trajectories Encode Verb Meaning? NAACL-HLT 2022: 2860-2871 - [c49]Valentin Gabeur, Arsha Nagrani, Chen Sun, Karteek Alahari, Cordelia Schmid:
Masking Modalities for Cross-modal Video Retrieval. WACV 2022: 2111-2120 - [i49]Shen Yan, Xuehan Xiong, Anurag Arnab, Zhichao Lu, Mi Zhang, Chen Sun, Cordelia Schmid:
Multiview Transformers for Video Recognition. CoRR abs/2201.04288 (2022) - [i48]Tian Yun, Usha Bhalla, Ellie Pavlick, Chen Sun:
Do Vision-Language Pretrained Models Learn Primitive Concepts? CoRR abs/2203.17271 (2022) - [i47]Arsha Nagrani, Paul Hongsuck Seo, Bryan Seybold, Anja Hauth, Santiago Manen, Chen Sun, Cordelia Schmid:
Learning Audio-Video Modalities from Image Captions. CoRR abs/2204.00679 (2022) - [i46]Valentin Gabeur, Paul Hongsuck Seo, Arsha Nagrani, Chen Sun, Karteek Alahari, Cordelia Schmid:
AVATAR: Unconstrained Audiovisual Speech Recognition. CoRR abs/2206.07684 (2022) - [i45]Dylan Ebert, Chen Sun, Ellie Pavlick:
Do Trajectories Encode Verb Meaning? CoRR abs/2206.11953 (2022) - [i44]Anurag Arnab, Xuehan Xiong, Alexey A. Gritsenko, Rob Romijnders, Josip Djolonga, Mostafa Dehghani, Chen Sun, Mario Lucic, Cordelia Schmid:
Beyond Transfer Learning: Co-finetuning for Action Localisation. CoRR abs/2207.03807 (2022) - [i43]Medhini Narasimhan, Arsha Nagrani, Chen Sun, Michael Rubinstein, Trevor Darrell, Anna Rohrbach, Cordelia Schmid:
TL;DW? Summarizing Instructional Videos with Task Relevance & Cross-Modal Saliency. CoRR abs/2208.06773 (2022) - [i42]Ziniu Hu, Ahmet Iscen, Chen Sun, Zirui Wang, Kai-Wei Chang, Yizhou Sun, Cordelia Schmid, David A. Ross, Alireza Fathi:
REVEAL: Retrieval-Augmented Visual-Language Pre-Training with Multi-Source Multimodal Knowledge Memory. CoRR abs/2212.05221 (2022)
- 2021
- [c48]Lu Mi, Hang Zhao, Charlie Nash, Xiaohan Jin, Jiyang Gao, Chen Sun, Cordelia Schmid, Nir Shavit, Yuning Chai, Dragomir Anguelov:
HDMapGen: A Hierarchical Graph Generative Model of High Definition Maps. CVPR 2021: 4227-4236 - [c47]Tian Yun, Chen Sun, Ellie Pavlick:
Does Vision-and-Language Pretraining Improve Lexical Grounding? EMNLP (Findings) 2021: 4357-4366 - [c46]Dave Epstein, Jiajun Wu, Cordelia Schmid, Chen Sun:
Learning Temporal Dynamics from Cycles in Narrated Video. ICCV 2021: 1460-1469 - [c45]Anurag Arnab, Mostafa Dehghani, Georg Heigold, Chen Sun, Mario Lucic, Cordelia Schmid:
ViViT: A Video Vision Transformer. ICCV 2021: 6816-6826 - [c44]Anurag Arnab, Chen Sun, Cordelia Schmid:
Unified Graph Structured Models for Video Understanding. ICCV 2021: 8097-8106 - [c43]Chen Sun, Arsha Nagrani, Yonglong Tian, Cordelia Schmid:
Composable Augmentation Encoding for Video Representation Learning. ICCV 2021: 8814-8824 - [c42]Junru Gu, Chen Sun, Hang Zhao:
DenseTNT: End-to-end Trajectory Prediction from Dense Goal Sets. ICCV 2021: 15283-15292 - [c41]Alexander Pashevich, Cordelia Schmid, Chen Sun:
Episodic Transformer for Vision-and-Language Navigation. ICCV 2021: 15922-15932 - [c40]Arsha Nagrani, Shan Yang, Anurag Arnab, Aren Jansen, Cordelia Schmid, Chen Sun:
Attention Bottlenecks for Multimodal Fusion. NeurIPS 2021: 14200-14213 - [i41]Dave Epstein, Jiajun Wu, Cordelia Schmid, Chen Sun:
Learning Temporal Dynamics from Cycles in Narrated Video. CoRR abs/2101.02337 (2021) - [i40]Anurag Arnab, Chen Sun, Cordelia Schmid:
Unified Graph Structured Models for Video Understanding. CoRR abs/2103.15662 (2021) - [i39]Anurag Arnab, Mostafa Dehghani, Georg Heigold, Chen Sun, Mario Lucic, Cordelia Schmid:
ViViT: A Video Vision Transformer. CoRR abs/2103.15691 (2021) - [i38]Chen Sun, Arsha Nagrani, Yonglong Tian, Cordelia Schmid:
Composable Augmentation Encoding for Video Representation Learning. CoRR abs/2104.00616 (2021) - [i37]Jack Valmadre, Alex Bewley, Jonathan Huang, Chen Sun, Cristian Sminchisescu, Cordelia Schmid:
Local Metrics for Multi-Object Tracking. CoRR abs/2104.02631 (2021) - [i36]Alexander Pashevich, Cordelia Schmid, Chen Sun:
Episodic Transformer for Vision-and-Language Navigation. CoRR abs/2105.06453 (2021) - [i35]Lu Mi, Hang Zhao, Charlie Nash, Xiaohan Jin, Jiyang Gao, Chen Sun, Cordelia Schmid, Nir Shavit, Yuning Chai, Dragomir Anguelov:
HDMapGen: A Hierarchical Graph Generative Model of High Definition Maps. CoRR abs/2106.14880 (2021) - [i34]Arsha Nagrani, Shan Yang, Anurag Arnab, Aren Jansen, Cordelia Schmid, Chen Sun:
Attention Bottlenecks for Multimodal Fusion. CoRR abs/2107.00135 (2021) - [i33]Junru Gu, Chen Sun, Hang Zhao:
DenseTNT: End-to-end Trajectory Prediction from Dense Goal Sets. CoRR abs/2108.09640 (2021) - [i32]Tian Yun, Chen Sun, Ellie Pavlick:
Does Vision-and-Language Pretraining Improve Lexical Grounding? CoRR abs/2109.10246 (2021) - [i31]Valentin Gabeur, Arsha Nagrani, Chen Sun, Karteek Alahari, Cordelia Schmid:
Masking Modalities for Cross-modal Video Retrieval. CoRR abs/2111.01300 (2021)
- 2020
- [c39]Hang Zhao, Jiyang Gao, Tian Lan, Chen Sun, Benjamin Sapp, Balakrishnan Varadarajan, Yue Shen, Yi Shen, Yuning Chai, Cordelia Schmid, Congcong Li, Dragomir Anguelov:
TNT: Target-driven Trajectory Prediction. CoRL 2020: 895-904 - [c38]Arsha Nagrani, Chen Sun, David Ross, Rahul Sukthankar, Cordelia Schmid, Andrew Zisserman:
Speech2Action: Cross-Modal Supervision for Action Recognition. CVPR 2020: 10314-10323 - [c37]Jiyang Gao, Chen Sun, Hang Zhao, Yi Shen, Dragomir Anguelov, Congcong Li, Cordelia Schmid:
VectorNet: Encoding HD Maps and Agent Dynamics From Vectorized Representation. CVPR 2020: 11522-11530 - [c36]Valentin Gabeur, Chen Sun, Karteek Alahari, Cordelia Schmid:
Multi-modal Transformer for Video Retrieval. ECCV (4) 2020: 214-229 - [c35]Anurag Arnab, Chen Sun, Arsha Nagrani, Cordelia Schmid:
Uncertainty-Aware Weakly Supervised Action Detection from Untrimmed Videos. ECCV (10) 2020: 751-768 - [c34]Yonglong Tian, Chen Sun, Ben Poole, Dilip Krishnan, Cordelia Schmid, Phillip Isola:
What Makes for Good Views for Contrastive Learning? NeurIPS 2020 - [c33]Jonathan C. Stroud, David A. Ross, Chen Sun, Jia Deng, Rahul Sukthankar:
D3D: Distilled 3D Networks for Video Action Recognition. WACV 2020: 614-623 - [i30]Arsha Nagrani, Chen Sun, David Ross, Rahul Sukthankar, Cordelia Schmid, Andrew Zisserman:
Speech2Action: Cross-modal Supervision for Action Recognition. CoRR abs/2003.13594 (2020) - [i29]Jiyang Gao, Chen Sun, Hang Zhao, Yi Shen, Dragomir Anguelov, Congcong Li, Cordelia Schmid:
VectorNet: Encoding HD Maps and Agent Dynamics from Vectorized Representation. CoRR abs/2005.04259 (2020) - [i28]Yonglong Tian, Chen Sun, Ben Poole, Dilip Krishnan, Cordelia Schmid, Phillip Isola:
What makes for good views for contrastive learning. CoRR abs/2005.10243 (2020) - [i27]Valentin Gabeur, Chen Sun, Karteek Alahari, Cordelia Schmid:
Multi-modal Transformer for Video Retrieval. CoRR abs/2007.10639 (2020) - [i26]Anurag Arnab, Chen Sun, Arsha Nagrani, Cordelia Schmid:
Uncertainty-Aware Weakly Supervised Action Detection from Untrimmed Videos. CoRR abs/2007.10703 (2020) - [i25]Jonathan C. Stroud, David A. Ross, Chen Sun, Jia Deng, Rahul Sukthankar, Cordelia Schmid:
Learning Video Representations from Textual Web Supervision. CoRR abs/2007.14937 (2020) - [i24]Samuel Albanie, Yang Liu, Arsha Nagrani, Antoine Miech, Ernesto Coto, Ivan Laptev, Rahul Sukthankar, Bernard Ghanem, Andrew Zisserman, Valentin Gabeur, Chen Sun, Karteek Alahari, Cordelia Schmid, Shizhe Chen, Yida Zhao, Qin Jin, Kaixu Cui, Hui Liu, Chen Wang, Yudong Jiang, Xiaoshuai Hao:
The End-of-End-to-End: A Video Understanding Pentathlon Challenge (2020). CoRR abs/2008.00744 (2020) - [i23]Hang Zhao, Jiyang Gao, Tian Lan, Chen Sun, Benjamin Sapp, Balakrishnan Varadarajan, Yue Shen, Yi Shen, Yuning Chai, Cordelia Schmid, Congcong Li, Dragomir Anguelov:
TNT: Target-driveN Trajectory Prediction. CoRR abs/2008.08294 (2020)
2010 – 2019
- 2019
- [c32]Manan Shah, Krishnamurthy Viswanathan, Chun-Ta Lu, Ariel Fuxman, Zhen Li, Aleksei Timofeev, Chao Jia, Chen Sun:
Inferring Context from Pixels for Multimodal Image Classification. CIKM 2019: 189-198 - [c31]Chen Sun, Abhinav Shrivastava, Carl Vondrick, Rahul Sukthankar, Kevin Murphy, Cordelia Schmid:
Relational Action Forecasting. CVPR 2019: 273-283 - [c30]Nam Vo, Lu Jiang, Chen Sun, Kevin Murphy, Li-Jia Li, Li Fei-Fei, James Hays:
Composing Text and Image for Image Retrieval - an Empirical Odyssey. CVPR 2019: 6439-6448 - [c29]Chen Sun, Austin Myers, Carl Vondrick, Kevin Murphy, Cordelia Schmid:
VideoBERT: A Joint Model for Video and Language Representation Learning. ICCV 2019: 7463-7472 - [c28]Chen Sun, Per Karlsson, Jiajun Wu, Joshua B. Tenenbaum, Kevin Murphy:
Stochastic Prediction of Multi-Agent Interactions from Partial Observations. ICLR (Poster) 2019 - [c27]Zhenjia Xu, Zhijian Liu, Chen Sun, Kevin Murphy, William T. Freeman, Joshua B. Tenenbaum, Jiajun Wu:
Unsupervised Discovery of Parts, Structure, and Dynamics. ICLR (Poster) 2019 - [c26]Matthias Minderer, Chen Sun, Ruben Villegas, Forrester Cole, Kevin P. Murphy, Honglak Lee:
Unsupervised learning of object structure and dynamics from videos. NeurIPS 2019: 92-102 - [i22]Chen Sun, Per Karlsson, Jiajun Wu, Joshua B. Tenenbaum, Kevin Murphy:
Stochastic Prediction of Multi-Agent Interactions from Partial Observations. CoRR abs/1902.09641 (2019) - [i21]Zhenjia Xu, Zhijian Liu, Chen Sun, Kevin Murphy, William T. Freeman, Joshua B. Tenenbaum, Jiajun Wu:
Unsupervised Discovery of Parts, Structure, and Dynamics. CoRR abs/1903.05136 (2019) - [i20]Chen Sun, Austin Myers, Carl Vondrick, Kevin Murphy, Cordelia Schmid:
VideoBERT: A Joint Model for Video and Language Representation Learning. CoRR abs/1904.01766 (2019) - [i19]Chen Sun, Abhinav Shrivastava, Carl Vondrick, Rahul Sukthankar, Kevin Murphy, Cordelia Schmid:
Relational Action Forecasting. CoRR abs/1904.04231 (2019) - [i18]Chen Sun, Fabien Baradel, Kevin Murphy, Cordelia Schmid:
Contrastive Bidirectional Transformer for Temporal Representation Learning. CoRR abs/1906.05743 (2019) - [i17]Matthias Minderer, Chen Sun, Ruben Villegas, Forrester Cole, Kevin Murphy, Honglak Lee:
Unsupervised Learning of Object Structure and Dynamics from Videos. CoRR abs/1906.07889 (2019)
- 2018
- [c25]Yin Cui, Yang Song, Chen Sun, Andrew Howard, Serge J. Belongie:
Large Scale Fine-Grained Categorization and Domain-Specific Transfer Learning. CVPR 2018: 4109-4118 - [c24]Chunhui Gu, Chen Sun, David A. Ross, Carl Vondrick, Caroline Pantofaru, Yeqing Li, Sudheendra Vijayanarasimhan, George Toderici, Susanna Ricco, Rahul Sukthankar, Cordelia Schmid, Jitendra Malik:
AVA: A Video Dataset of Spatio-Temporally Localized Atomic Visual Actions. CVPR 2018: 6047-6056 - [c23]Grant Van Horn, Oisin Mac Aodha, Yang Song, Yin Cui, Chen Sun, Alexander Shepard, Hartwig Adam, Pietro Perona, Serge J. Belongie:
The INaturalist Species Classification and Detection Dataset. CVPR 2018: 8769-8778 - [c22]Saining Xie, Chen Sun, Jonathan Huang, Zhuowen Tu, Kevin Murphy:
Rethinking Spatiotemporal Feature Learning: Speed-Accuracy Trade-offs in Video Classification. ECCV (15) 2018: 318-335 - [c21]Chen Sun, Abhinav Shrivastava, Carl Vondrick, Kevin Murphy, Rahul Sukthankar, Cordelia Schmid:
Actor-Centric Relation Network. ECCV (11) 2018: 335-351 - [i16]Unaiza Ahsan, Chen Sun, Irfan A. Essa:
DiscrimNet: Semi-Supervised Action Recognition from Videos using Generative Adversarial Networks. CoRR abs/1801.07230 (2018) - [i15]Yin Cui, Yang Song, Chen Sun, Andrew Howard, Serge J. Belongie:
Large Scale Fine-Grained Categorization and Domain-Specific Transfer Learning. CoRR abs/1806.06193 (2018) - [i14]Chen Sun, Abhinav Shrivastava, Carl Vondrick, Kevin Murphy, Rahul Sukthankar, Cordelia Schmid:
Actor-Centric Relation Network. CoRR abs/1807.10982 (2018) - [i13]Nam Vo, Lu Jiang, Chen Sun, Kevin Murphy, Li-Jia Li, Li Fei-Fei, James Hays:
Composing Text and Image for Image Retrieval - An Empirical Odyssey. CoRR abs/1812.07119 (2018) - [i12]Jonathan C. Stroud, David A. Ross, Chen Sun, Jia Deng, Rahul Sukthankar:
D3D: Distilled 3D Networks for Video Action Recognition. CoRR abs/1812.08249 (2018)
- 2017
- [c20]Chuang Gan, Chen Sun, Ram Nevatia:
DECK: Discovering Event Composition Knowledge from Web Images for Zero-Shot Event Detection and Recounting in Videos. AAAI 2017: 4032-4038 - [c19]Jonathan Huang, Vivek Rathod, Chen Sun, Menglong Zhu, Anoop Korattikara, Alireza Fathi, Ian Fischer, Zbigniew Wojna, Yang Song, Sergio Guadarrama, Kevin Murphy:
Speed/Accuracy Trade-Offs for Modern Convolutional Object Detectors. CVPR 2017: 3296-3297 - [c18]Chuang Gan, Yandong Li, Haoxiang Li, Chen Sun, Boqing Gong:
VQS: Linking Segmentations to Questions and Answers for Supervised Attention in VQA and Question-Focused Semantic Segmentation. ICCV 2017: 1829-1838 - [c17]Jiyang Gao, Zhenheng Yang, Chen Sun, Kan Chen, Ram Nevatia:
TURN TAP: Temporal Unit Regression Network for Temporal Action Proposals. ICCV 2017: 3648-3656 - [c16]Jiyang Gao, Chen Sun, Zhenheng Yang, Ram Nevatia:
TALL: Temporal Activity Localization via Language Query. ICCV 2017: 5277-5285 - [c15]Unaiza Ahsan, Chen Sun, James Hays, Irfan A. Essa:
Complex Event Recognition from Images with Few Training Examples. WACV 2017: 669-678 - [i11]Unaiza Ahsan, Chen Sun, James Hays, Irfan A. Essa:
Complex Event Recognition from Images with Few Training Examples. CoRR abs/1701.04769 (2017) - [i10]Jiyang Gao, Zhenheng Yang, Chen Sun, Kan Chen, Ram Nevatia:
TURN TAP: Temporal Unit Regression Network for Temporal Action Proposals. CoRR abs/1703.06189 (2017) - [i9]Jiyang Gao, Chen Sun, Zhenheng Yang, Ram Nevatia:
TALL: Temporal Activity Localization via Language Query. CoRR abs/1705.02101 (2017) - [i8]Chunhui Gu, Chen Sun, Sudheendra Vijayanarasimhan, Caroline Pantofaru, David A. Ross, George Toderici, Yeqing Li, Susanna Ricco, Rahul Sukthankar, Cordelia Schmid, Jitendra Malik:
AVA: A Video Dataset of Spatio-temporally Localized Atomic Visual Actions. CoRR abs/1705.08421 (2017) - [i7]Chuang Gan, Yandong Li, Haoxiang Li, Chen Sun, Boqing Gong:
VQS: Linking Segmentations to Questions and Answers for Supervised Attention in VQA and Question-Focused Semantic Segmentation. CoRR abs/1708.04686 (2017) - [i6]Saining Xie, Chen Sun, Jonathan Huang, Zhuowen Tu, Kevin Murphy:
Rethinking Spatiotemporal Feature Learning For Video Understanding. CoRR abs/1712.04851 (2017)
- 2016
- [c14]Chen Sun, Manohar Paluri, Ronan Collobert, Ram Nevatia, Lubomir D. Bourdev:
ProNet: Learning to Propose Object-Specific Boxes for Cascaded Neural Networks. CVPR 2016: 3485-3493 - [c13]Chuang Gan, Chen Sun, Lixin Duan, Boqing Gong:
Webly-Supervised Video Recognition by Mutually Voting for Relevant Web Images and Web Video Frames. ECCV (3) 2016: 849-866 - [c12]Jiyang Gao, Chen Sun, Ram Nevatia:
ACD: Action Concept Discovery from Image-Sentence Corpora. ICMR 2016: 31-38 - [i5]Jiyang Gao, Chen Sun, Ram Nevatia:
ACD: Action Concept Discovery from Image-Sentence Corpora. CoRR abs/1604.04784 (2016) - [i4]Jonathan Huang, Vivek Rathod, Chen Sun, Menglong Zhu, Anoop Korattikara, Alireza Fathi, Ian Fischer, Zbigniew Wojna, Yang Song, Sergio Guadarrama, Kevin Murphy:
Speed/accuracy trade-offs for modern convolutional object detectors. CoRR abs/1611.10012 (2016)
- 2015
- [c11]Chen Sun, Chuang Gan, Ram Nevatia:
Automatic Concept Discovery from Parallel Text and Visual Corpora. ICCV 2015: 2596-2604 - [c10]Chen Sun, Sanketh Shetty, Rahul Sukthankar, Ram Nevatia:
Temporal Localization of Fine-Grained Actions in Videos by Domain Transfer from Web Images. ACM Multimedia 2015: 371-380 - [i3]Chen Sun, Sanketh Shetty, Rahul Sukthankar, Ram Nevatia:
Temporal Localization of Fine-Grained Actions in Videos by Domain Transfer from Web Images. CoRR abs/1504.00983 (2015) - [i2]Chen Sun, Chuang Gan, Ram Nevatia:
Automatic Concept Discovery from Parallel Text and Visual Corpora. CoRR abs/1509.07225 (2015) - [i1]Chen Sun, Manohar Paluri, Ronan Collobert, Ram Nevatia, Lubomir D. Bourdev:
ProNet: Learning to Propose Object-specific Boxes for Cascaded Neural Networks. CoRR abs/1511.03776 (2015)
- 2014
- [j1]Gregory K. Myers, Ramesh Nallapati, Julien van Hout, Stephanie Pancoast, Ramakant Nevatia, Chen Sun, AmirHossein Habibian, Dennis C. Koelma, Koen E. A. van de Sande, Arnold W. M. Smeulders, Cees G. M. Snoek:
Evaluating multimedia features and fusion for example-based event detection. Mach. Vis. Appl. 25(1): 17-32 (2014) - [c9]Chen Sun, Ramakant Nevatia:
DISCOVER: Discovering Important Segments for Classification of Video Events and Recounting. CVPR 2014: 2569-2576 - [c8]Chen Sun, Ram Nevatia:
Semantic Aware Video Transcription Using Random Forest Classifiers. ECCV (1) 2014: 772-786 - [c7]Julien van Hout, Eric Yeh, Dennis C. Koelma, Cees G. M. Snoek, Chen Sun, Ramakant Nevatia, Julie Wong, Gregory K. Myers:
Late fusion and calibration for multimedia event detection using few examples. ICASSP 2014: 4598-4602 - [c6]Chen Sun, J. Brian Burns, Ram Nevatia, Cees Snoek, Bob Bolles, Gregory K. Myers, Wen Wang, Eric Yeh:
ISOMER: Informative Segment Observations for Multimedia Event Recounting. ICMR 2014: 241 - [p1]Gregory K. Myers, Cees G. M. Snoek, Ramakant Nevatia, Ramesh Nallapati, Julien van Hout, Stephanie Pancoast, Chen Sun, AmirHossein Habibian, Dennis C. Koelma, Koen E. A. van de Sande, Arnold W. M. Smeulders:
Evaluating Multimedia Features and Fusion for Example-Based Event Detection. Fusion in Computer Vision 2014: 109-133
- 2013
- [c5]Chen Sun, Ram Nevatia:
ACTIVE: Activity Concept Transitions in Video Event Classification. ICCV 2013: 913-920 - [c4]Robert C. Bolles, J. Brian Burns, James A. Herson, Gregory K. Myers, Stephanie Pancoast, Julien van Hout, Wen Wang, Julie Wong, Eric Yeh, AmirHossein Habibian, Dennis C. Koelma, Zhenyang Li, Masoud Mazloom, Silvia-Laura Pintea, Arnold W. M. Smeulders, Cees G. M. Snoek, Sung Chun Lee, Ram Nevatia, Pramod Sharma, Chen Sun, Rémi Trichet:
The 2013 SESAME Multimedia Event Detection and Recounting System. TRECVID 2013 - [c3]Chen Sun, Ram Nevatia:
Large-scale web video event classification by use of Fisher Vectors. WACV 2013: 15-22
- 2012
- [c2]Murat Akbacak, Robert C. Bolles, J. Brian Burns, Mark Elliot, Aaron Heller, James A. Herson, Gregory K. Myers, Ramesh Nallapati, Stephanie Pancoast, Julien van Hout, Eric Yeh, AmirHossein Habibian, Dennis C. Koelma, Zhenyang Li, Masoud Mazloom, Silvia-Laura Pintea, Koen E. A. van de Sande, Arnold W. M. Smeulders, Cees G. M. Snoek, Sung Chun Lee, Ram Nevatia, Pramod Sharma, Chen Sun, Rémi Trichet:
The 2012 SESAME Multimedia Event Detection (MED) System. TRECVID 2012
- 2011
- [c1]Murat Akbacak, Robert C. Bolles, J. Brian Burns, Mark Elliot, Aaron Heller, James A. Herson, Gregory K. Myers, Ramesh Nallapati, Eric Yeh, Dennis C. Koelma, Xirong Li, Masoud Mazloom, Koen E. A. van de Sande, Arnold W. M. Smeulders, Cees G. M. Snoek, Sung Chun Lee, Ram Nevatia, Pramod Sharma, Chen Sun, Rémi Trichet:
The SESAME MED System. TRECVID 2011
last updated on 2024-11-08 21:28 CET by the dblp team
all metadata released as open data under CC0 1.0 license