Devi Parikh
2020 – today
- 2024
- [c153]Shelly Sheynin, Adam Polyak, Uriel Singer, Yuval Kirstain, Amit Zohar, Oron Ashual, Devi Parikh, Yaniv Taigman:
Emu Edit: Precise Image Editing via Recognition and Generation Tasks. CVPR 2024: 8871-8879 - [c152]Rohit Girdhar, Mannat Singh, Andrew Brown, Quentin Duval, Samaneh Azadi, Sai Saketh Rambhatla, Akbar Shah, Xi Yin, Devi Parikh, Ishan Misra:
Factorizing Text-to-Video Generation by Explicit Image Conditioning. ECCV (62) 2024: 205-224 - [c151]Uriel Singer, Amit Zohar, Yuval Kirstain, Shelly Sheynin, Adam Polyak, Devi Parikh, Yaniv Taigman:
Video Editing via Factorized Diffusion Distillation. ECCV (76) 2024: 450-466 - [i128]Uriel Singer, Amit Zohar, Yuval Kirstain, Shelly Sheynin, Adam Polyak, Devi Parikh, Yaniv Taigman:
Video Editing via Factorized Diffusion Distillation. CoRR abs/2403.09334 (2024)
- 2023
- [c150]Omri Avrahami, Thomas Hayes, Oran Gafni, Sonal Gupta, Yaniv Taigman, Devi Parikh, Dani Lischinski, Ohad Fried, Xi Yin:
SpaText: Spatio-Textual Representation for Controllable Image Generation. CVPR 2023: 18370-18380 - [c149]Samaneh Azadi, Akbar Shah, Thomas Hayes, Devi Parikh, Sonal Gupta:
Make-An-Animation: Large-Scale Text-conditional 3D Human Motion Generation. ICCV 2023: 14993-15002 - [c148]Felix Kreuk, Gabriel Synnaeve, Adam Polyak, Uriel Singer, Alexandre Défossez, Jade Copet, Devi Parikh, Yaniv Taigman, Yossi Adi:
AudioGen: Textually Guided Audio Generation. ICLR 2023 - [c147]Uriel Singer, Adam Polyak, Thomas Hayes, Xi Yin, Jie An, Songyang Zhang, Qiyuan Hu, Harry Yang, Oron Ashual, Oran Gafni, Devi Parikh, Sonal Gupta, Yaniv Taigman:
Make-A-Video: Text-to-Video Generation without Text-Video Data. ICLR 2023 - [c146]Uriel Singer, Shelly Sheynin, Adam Polyak, Oron Ashual, Iurii Makarov, Filippos Kokkinos, Naman Goyal, Andrea Vedaldi, Devi Parikh, Justin Johnson, Yaniv Taigman:
Text-To-4D Dynamic Scene Generation. ICML 2023: 31915-31929 - [i127]Uriel Singer, Shelly Sheynin, Adam Polyak, Oron Ashual, Iurii Makarov, Filippos Kokkinos, Naman Goyal, Andrea Vedaldi, Devi Parikh, Justin Johnson, Yaniv Taigman:
Text-To-4D Dynamic Scene Generation. CoRR abs/2301.11280 (2023) - [i126]Samaneh Azadi, Thomas Hayes, Akbar Shah, Guan Pang, Devi Parikh, Sonal Gupta:
Text-Conditional Contextualized Avatars For Zero-Shot Personalization. CoRR abs/2304.07410 (2023) - [i125]Samaneh Azadi, Akbar Shah, Thomas Hayes, Devi Parikh, Sonal Gupta:
Make-An-Animation: Large-Scale Text-conditional 3D Human Motion Generation. CoRR abs/2305.09662 (2023) - [i124]Xiaoliang Dai, Ji Hou, Chih-Yao Ma, Sam S. Tsai, Jialiang Wang, Rui Wang, Peizhao Zhang, Simon Vandenhende, Xiaofang Wang, Abhimanyu Dubey, Matthew Yu, Abhishek Kadian, Filip Radenovic, Dhruv Mahajan, Kunpeng Li, Yue Zhao, Vladan Petrovic, Mitesh Kumar Singh, Simran Motwani, Yi Wen, Yiwen Song, Roshan Sumbaly, Vignesh Ramanathan, Zijian He, Peter Vajda, Devi Parikh:
Emu: Enhancing Image Generation Models Using Photogenic Needles in a Haystack. CoRR abs/2309.15807 (2023) - [i123]Shelly Sheynin, Adam Polyak, Uriel Singer, Yuval Kirstain, Amit Zohar, Oron Ashual, Devi Parikh, Yaniv Taigman:
Emu Edit: Precise Image Editing via Recognition and Generation Tasks. CoRR abs/2311.10089 (2023) - [i122]Rohit Girdhar, Mannat Singh, Andrew Brown, Quentin Duval, Samaneh Azadi, Sai Saketh Rambhatla, Akbar Shah, Xi Yin, Devi Parikh, Ishan Misra:
Emu Video: Factorizing Text-to-Video Generation by Explicit Image Conditioning. CoRR abs/2311.10709 (2023)
- 2022
- [c145]Ayush Shrivastava, Karthik Gopalakrishnan, Yang Liu, Robinson Piramuthu, Gökhan Tür, Devi Parikh, Dilek Hakkani-Tür:
VISITRON: Visual Semantics-Aligned Interactively Trained Object-Navigator. ACL (Findings) 2022: 1984-1994 - [c144]Samyak Datta, Sameer Dharur, Vincent Cartillier, Ruta Desai, Mukul Khanna, Dhruv Batra, Devi Parikh:
Episodic Memory Question Answering. CVPR 2022: 19097-19106 - [c143]Oran Gafni, Adam Polyak, Oron Ashual, Shelly Sheynin, Devi Parikh, Yaniv Taigman:
Make-A-Scene: Scene-Based Text-to-Image Generation with Human Priors. ECCV (15) 2022: 89-106 - [c142]Songwei Ge, Thomas Hayes, Harry Yang, Xi Yin, Guan Pang, David Jacobs, Jia-Bin Huang, Devi Parikh:
Long Video Generation with Time-Agnostic VQGAN and Time-Sensitive Transformer. ECCV (17) 2022: 102-118 - [c141]Thomas Hayes, Songyang Zhang, Xi Yin, Guan Pang, Sasha Sheng, Harry Yang, Songwei Ge, Qiyuan Hu, Devi Parikh:
MUGEN: A Playground for Video-Audio-Text Multimodal Understanding and GENeration. ECCV (8) 2022: 431-449 - [i121]Oran Gafni, Adam Polyak, Oron Ashual, Shelly Sheynin, Devi Parikh, Yaniv Taigman:
Make-A-Scene: Scene-Based Text-to-Image Generation with Human Priors. CoRR abs/2203.13131 (2022) - [i120]Songwei Ge, Thomas Hayes, Harry Yang, Xi Yin, Guan Pang, David Jacobs, Jia-Bin Huang, Devi Parikh:
Long Video Generation with Time-Agnostic VQGAN and Time-Sensitive Transformer. CoRR abs/2204.03638 (2022) - [i119]Thomas Hayes, Songyang Zhang, Xi Yin, Guan Pang, Sasha Sheng, Harry Yang, Songwei Ge, Qiyuan Hu, Devi Parikh:
MUGEN: A Playground for Video-Audio-Text Multimodal Understanding and GENeration. CoRR abs/2204.08058 (2022) - [i118]Samyak Datta, Sameer Dharur, Vincent Cartillier, Ruta Desai, Mukul Khanna, Dhruv Batra, Devi Parikh:
Episodic Memory Question Answering. CoRR abs/2205.01652 (2022) - [i117]Uriel Singer, Adam Polyak, Thomas Hayes, Xi Yin, Jie An, Songyang Zhang, Qiyuan Hu, Harry Yang, Oron Ashual, Oran Gafni, Devi Parikh, Sonal Gupta, Yaniv Taigman:
Make-A-Video: Text-to-Video Generation without Text-Video Data. CoRR abs/2209.14792 (2022) - [i116]Felix Kreuk, Gabriel Synnaeve, Adam Polyak, Uriel Singer, Alexandre Défossez, Jade Copet, Devi Parikh, Yaniv Taigman, Yossi Adi:
AudioGen: Textually Guided Audio Generation. CoRR abs/2209.15352 (2022) - [i115]Omri Avrahami, Thomas Hayes, Oran Gafni, Sonal Gupta, Yaniv Taigman, Devi Parikh, Dani Lischinski, Ohad Fried, Xi Yin:
SpaText: Spatio-Textual Representation for Controllable Image Generation. CoRR abs/2211.14305 (2022)
- 2021
- [c140]Devi Parikh:
AI-assisted Human creativity. AffCon@AAAI 2021 - [c139]Xudong Lin, Gedas Bertasius, Jue Wang, Shih-Fu Chang, Devi Parikh, Lorenzo Torresani:
Vx2Text: End-to-End Learning of Video-Based Text Generation From Multimodal Inputs. CVPR 2021: 7005-7015 - [c138]Kenneth Marino, Xinlei Chen, Devi Parikh, Abhinav Gupta, Marcus Rohrbach:
KRISP: Integrating Implicit and Symbolic Knowledge for Open-Domain Knowledge-Based VQA. CVPR 2021: 14111-14121 - [c137]Songwei Ge, Devi Parikh:
Visual Conceptual Blending with Large-Scale Language and Vision Models. ICCC 2021: 6-10 - [c136]Yash Kant, Abhinav Moudgil, Dhruv Batra, Devi Parikh, Harsh Agrawal:
Contrast and Classify: Training Robust VQA Models. ICCV 2021: 1584-1593 - [c135]Songwei Ge, Vedanuj Goswami, Larry Zitnick, Devi Parikh:
Creative Sketch Generation. ICLR 2021 - [c134]Sameer Dharur, Purva Tendulkar, Dhruv Batra, Devi Parikh, Ramprasaath R. Selvaraju:
SOrT-ing VQA Models : Contrastive Gradient Learning for Improved Consistency. NAACL-HLT 2021: 3103-3111 - [c133]Abhishek Das, Muhammed Shuaibi, Aini Palizhati, Siddharth Goyal, Aditya Grover, Adeesh Kolluru, Janice Lan, Ammar Rizvi, Anuroop Sriram, Brandon M. Wood, Devi Parikh, Zachary W. Ulissi, C. Lawrence Zitnick, Guolin Ke, Shuxin Zheng, Yu Shi, Di He, Tie-Yan Liu, Chengxuan Ying, Jiacheng You, Yihan He, Rostislav Grigoriev, Ruslan Lukin, Adel Yarullin, Max Faleev:
The Open Catalyst Challenge 2021: Competition Report. NeurIPS (Competition and Demos) 2021: 29-40 - [c132]Sasha Sheng, Amanpreet Singh, Vedanuj Goswami, Jose Alberto Lopez Magana, Tristan Thrush, Wojciech Galuba, Devi Parikh, Douwe Kiela:
Human-Adversarial Visual Question Answering. NeurIPS 2021: 20346-20359 - [i114]Xudong Lin, Gedas Bertasius, Jue Wang, Shih-Fu Chang, Devi Parikh, Lorenzo Torresani:
VX2TEXT: End-to-End Learning of Video-Based Text Generation From Multimodal Inputs. CoRR abs/2101.12059 (2021) - [i113]Weihua Hu, Muhammed Shuaibi, Abhishek Das, Siddharth Goyal, Anuroop Sriram, Jure Leskovec, Devi Parikh, C. Lawrence Zitnick:
ForceNet: A Graph Neural Network for Large-Scale Quantum Calculations. CoRR abs/2103.01436 (2021) - [i112]Ayush Shrivastava, Karthik Gopalakrishnan, Yang Liu, Robinson Piramuthu, Gökhan Tür, Devi Parikh, Dilek Hakkani-Tür:
VISITRON: Visual Semantics-Aligned Interactively Trained Object-Navigator. CoRR abs/2105.11589 (2021) - [i111]Sasha Sheng, Amanpreet Singh, Vedanuj Goswami, Jose Alberto Lopez Magana, Wojciech Galuba, Devi Parikh, Douwe Kiela:
Human-Adversarial Visual Question Answering. CoRR abs/2106.02280 (2021) - [i110]Ramya Srinivasan, Devi Parikh:
Building Bridges: Generative Artworks to Explore AI Ethics. CoRR abs/2106.13901 (2021) - [i109]Songwei Ge, Devi Parikh:
Visual Conceptual Blending with Large-scale Language and Vision Models. CoRR abs/2106.14127 (2021) - [i108]Gunjan Aggarwal, Devi Parikh:
Dance2Music: Automatic Dance-driven Music Generation. CoRR abs/2107.06252 (2021) - [i107]Safinah Ali, Devi Parikh:
Telling Creative Stories Using Generative Visual Aids. CoRR abs/2110.14810 (2021)
- 2020
- [j16]Ramprasaath R. Selvaraju, Michael Cogswell, Abhishek Das, Ramakrishna Vedantam, Devi Parikh, Dhruv Batra:
Grad-CAM: Visual Explanations from Deep Networks via Gradient-Based Localization. Int. J. Comput. Vis. 128(2): 336-359 (2020) - [c131]Samyak Datta, Oleksandr Maksymets, Judy Hoffman, Stefan Lee, Dhruv Batra, Devi Parikh:
Integrating Egocentric Localization for More Realistic Point-Goal Navigation Agents. CoRL 2020: 313-328 - [c130]Peter Anderson, Ayush Shrivastava, Joanne Truong, Arjun Majumdar, Devi Parikh, Dhruv Batra, Stefan Lee:
Sim-to-Real Transfer for Vision-and-Language Navigation. CoRL 2020: 671-681 - [c129]Ramprasaath R. Selvaraju, Purva Tendulkar, Devi Parikh, Eric Horvitz, Marco Túlio Ribeiro, Besmira Nushi, Ece Kamar:
SQuINTing at VQA Models: Introspecting VQA Models With Sub-Questions. CVPR 2020: 10000-10008 - [c128]Jiasen Lu, Vedanuj Goswami, Marcus Rohrbach, Devi Parikh, Stefan Lee:
12-in-1: Multi-Task Vision and Language Representation Learning. CVPR 2020: 10434-10443 - [c127]Arjun Majumdar, Ayush Shrivastava, Stefan Lee, Peter Anderson, Devi Parikh, Dhruv Batra:
Improving Vision-and-Language Navigation with Image-Text Pairs from the Web. ECCV (6) 2020: 259-274 - [c126]Vishvak Murahari, Dhruv Batra, Devi Parikh, Abhishek Das:
Large-Scale Pretraining for Visual Dialog: A Simple State-of-the-Art Baseline. ECCV (18) 2020: 336-352 - [c125]Medhini Narasimhan, Erik Wijmans, Xinlei Chen, Trevor Darrell, Dhruv Batra, Devi Parikh, Amanpreet Singh:
Seeing the Un-Scene: Learning Amodal Semantic Maps for Room Navigation. ECCV (18) 2020: 513-529 - [c124]Yash Kant, Dhruv Batra, Peter Anderson, Alexander G. Schwing, Devi Parikh, Jiasen Lu, Harsh Agrawal:
Spatially Aware Multimodal Transformers for TextVQA. ECCV (9) 2020: 715-732 - [c123]Meera Hahn, Jacob Krantz, Dhruv Batra, Devi Parikh, James M. Rehg, Stefan Lee, Peter Anderson:
Where Are You? Localization from Embodied Dialog. EMNLP (1) 2020: 806-822 - [c122]Devi Parikh, C. Lawrence Zitnick:
Exploring Crowd Co-creation Scenarios for Sketches. ICCC 2020: 73-76 - [c121]Purva Tendulkar, Abhishek Das, Aniruddha Kembhavi, Devi Parikh:
Feel The Music: Automatically Generating A Dance For An Input Song. ICCC 2020: 292-295 - [c120]X. Alice Li, Devi Parikh:
Lemotif: An Affective Visual Journal Using Deep Neural Networks. ICCC 2020: 453-460 - [c119]Devi Parikh:
Predicting A Creator's Preferences In, and From, Interactive Generative Art. ICCC 2020: 484-487 - [c118]Gunjan Aggarwal, Devi Parikh:
Neuro-Symbolic Generative Art: A Preliminary Study. ICCC 2020: 492-495 - [c117]Erik Wijmans, Abhishek Kadian, Ari Morcos, Stefan Lee, Irfan Essa, Devi Parikh, Manolis Savva, Dhruv Batra:
DD-PPO: Learning Near-Perfect PointGoal Navigators from 2.5 Billion Frames. ICLR 2020 - [c116]Nirbhay Modhe, Prithvijit Chattopadhyay, Mohit Sharma, Abhishek Das, Devi Parikh, Dhruv Batra, Ramakrishna Vedantam:
IR-VIC: Unsupervised Discovery of Sub-goals for Transfer in RL. IJCAI 2020: 2022-2028 - [c115]Devendra Singh Chaplot, Lisa Lee, Ruslan Salakhutdinov, Devi Parikh, Dhruv Batra:
Embodied Multimodal Multitask Learning. IJCAI 2020: 2442-2448 - [c114]Michael Cogswell, Jiasen Lu, Rishabh Jain, Stefan Lee, Devi Parikh, Dhruv Batra:
Dialog without Dialog Data: Learning Visual Dialog Agents from VQA Data. NeurIPS 2020 - [c113]Douwe Kiela, Hamed Firooz, Aravind Mohan, Vedanuj Goswami, Amanpreet Singh, Casey A. Fitzpatrick, Peter Bull, Greg Lipstein, Tony Nelli, Ron Zhu, Niklas Muennighoff, Riza Velioglu, Jewgeni Rose, Phillip Lippe, Nithin Holla, Shantanu Chandra, Santhosh Rajamanickam, Georgios Antoniou, Ekaterina Shutova, Helen Yannakoudakis, Vlad Sandulescu, Umut Ozertem, Patrick Pantel, Lucia Specia, Devi Parikh:
The Hateful Memes Challenge: Competition Report. NeurIPS (Competition and Demos) 2020: 344-360 - [i106]Ramprasaath R. Selvaraju, Purva Tendulkar, Devi Parikh, Eric Horvitz, Marco Ribeiro, Besmira Nushi, Ece Kamar:
SQuINTing at VQA Models: Interrogating VQA Models with Sub-Questions. CoRR abs/2001.06927 (2020) - [i105]Devi Parikh:
Predicting A Creator's Preferences In, and From, Interactive Generative Art. CoRR abs/2003.01274 (2020) - [i104]Amanpreet Singh, Vedanuj Goswami, Devi Parikh:
Are we pretraining it right? Digging deeper into visio-linguistic pretraining. CoRR abs/2004.08744 (2020) - [i103]Arjun Majumdar, Ayush Shrivastava, Stefan Lee, Peter Anderson, Devi Parikh, Dhruv Batra:
Improving Vision-and-Language Navigation with Image-Text Pairs from the Web. CoRR abs/2004.14973 (2020) - [i102]Devi Parikh, C. Lawrence Zitnick:
Exploring Crowd Co-creation Scenarios for Sketches. CoRR abs/2005.07328 (2020) - [i101]Purva Tendulkar, Abhishek Das, Aniruddha Kembhavi, Devi Parikh:
Feel The Music: Automatically Generating A Dance For An Input Song. CoRR abs/2006.11905 (2020) - [i100]Gunjan Aggarwal, Devi Parikh:
Neuro-Symbolic Generative Art: A Preliminary Study. CoRR abs/2007.02171 (2020) - [i99]Medhini Narasimhan, Erik Wijmans, Xinlei Chen, Trevor Darrell, Dhruv Batra, Devi Parikh, Amanpreet Singh:
Seeing the Un-Scene: Learning Amodal Semantic Maps for Room Navigation. CoRR abs/2007.09841 (2020) - [i98]Yash Kant, Dhruv Batra, Peter Anderson, Alexander G. Schwing, Devi Parikh, Jiasen Lu, Harsh Agrawal:
Spatially Aware Multimodal Transformers for TextVQA. CoRR abs/2007.12146 (2020) - [i97]Michael Cogswell, Jiasen Lu, Rishabh Jain, Stefan Lee, Devi Parikh, Dhruv Batra:
Dialog without Dialog Data: Learning Visual Dialog Agents from VQA Data. CoRR abs/2007.12750 (2020) - [i96]Samyak Datta, Oleksandr Maksymets, Judy Hoffman, Stefan Lee, Dhruv Batra, Devi Parikh:
Integrating Egocentric Localization for More Realistic Point-Goal Navigation Agents. CoRR abs/2009.03231 (2020) - [i95]Yash Kant, Abhinav Moudgil, Dhruv Batra, Devi Parikh, Harsh Agrawal:
Contrast and Classify: Alternate Training for Robust VQA. CoRR abs/2010.06087 (2020) - [i94]C. Lawrence Zitnick, Lowik Chanussot, Abhishek Das, Siddharth Goyal, Javier Heras-Domingo, Caleb Ho, Weihua Hu, Thibaut Lavril, Aini Palizhati, Morgane Riviere, Muhammed Shuaibi, Anuroop Sriram, Kevin Tran, Brandon M. Wood, Junwoong Yoon, Devi Parikh, Zachary W. Ulissi:
An Introduction to Electrocatalyst Design using Machine Learning for Renewable Energy Storage. CoRR abs/2010.09435 (2020) - [i93]Lowik Chanussot, Abhishek Das, Siddharth Goyal, Thibaut Lavril, Muhammed Shuaibi, Morgane Riviere, Kevin Tran, Javier Heras-Domingo, Caleb Ho, Weihua Hu, Aini Palizhati, Anuroop Sriram, Brandon M. Wood, Junwoong Yoon, Devi Parikh, C. Lawrence Zitnick, Zachary W. Ulissi:
The Open Catalyst 2020 (OC20) Dataset and Community Challenges. CoRR abs/2010.09990 (2020) - [i92]Sameer Dharur, Purva Tendulkar, Dhruv Batra, Devi Parikh, Ramprasaath R. Selvaraju:
SOrT-ing VQA Models : Contrastive Gradient Learning for Improved Consistency. CoRR abs/2010.10038 (2020) - [i91]Peter Anderson, Ayush Shrivastava, Joanne Truong, Arjun Majumdar, Devi Parikh, Dhruv Batra, Stefan Lee:
Sim-to-Real Transfer for Vision-and-Language Navigation. CoRR abs/2011.03807 (2020) - [i90]Meera Hahn, Jacob Krantz, Dhruv Batra, Devi Parikh, James M. Rehg, Stefan Lee, Peter Anderson:
Where Are You? Localization from Embodied Dialog. CoRR abs/2011.08277 (2020) - [i89]Songwei Ge, Vedanuj Goswami, C. Lawrence Zitnick, Devi Parikh:
Creative Sketch Generation. CoRR abs/2011.10039 (2020) - [i88]Kenneth Marino, Xinlei Chen, Devi Parikh, Abhinav Gupta, Marcus Rohrbach:
KRISP: Integrating Implicit and Symbolic Knowledge for Open-Domain Knowledge-Based VQA. CoRR abs/2012.11014 (2020) - [i87]Jianwei Yang, Jiayuan Mao, Jiajun Wu, Devi Parikh, David D. Cox, Joshua B. Tenenbaum, Chuang Gan:
Object-Centric Diagnosis of Visual Reasoning. CoRR abs/2012.11587 (2020)
2010 – 2019
- 2019
- [j15]Yash Goyal, Tejas Khot, Aishwarya Agrawal, Douglas Summers-Stay, Dhruv Batra, Devi Parikh:
Making the V in VQA Matter: Elevating the Role of Image Understanding in Visual Question Answering. Int. J. Comput. Vis. 127(4): 398-414 (2019) - [j14]Abhishek Das, Satwik Kottur, Khushi Gupta, Avi Singh, Deshraj Yadav, Stefan Lee, José M. F. Moura, Devi Parikh, Dhruv Batra:
Visual Dialog. IEEE Trans. Pattern Anal. Mach. Intell. 41(5): 1242-1256 (2019) - [c112]Jin-Hwa Kim, Nikita Kitaev, Xinlei Chen, Marcus Rohrbach, Byoung-Tak Zhang, Yuandong Tian, Dhruv Batra, Devi Parikh:
CoDraw: Collaborative Drawing as a Testbed for Grounded Goal-driven Communication. ACL (1) 2019: 6495-6513 - [c111]Meet Shah, Xinlei Chen, Marcus Rohrbach, Devi Parikh:
Cycle-Consistency for Robust Visual Question Answering. CVPR 2019: 6649-6658 - [c110]Erik Wijmans, Samyak Datta, Oleksandr Maksymets, Abhishek Das, Georgia Gkioxari, Stefan Lee, Irfan Essa, Devi Parikh, Dhruv Batra:
Embodied Question Answering in Photorealistic Environments With Point Cloud Perception. CVPR 2019: 6659-6668 - [c109]Huda AlAmri, Vincent Cartillier, Abhishek Das, Jue Wang, Anoop Cherian, Irfan Essa, Dhruv Batra, Tim K. Marks, Chiori Hori, Peter Anderson, Stefan Lee, Devi Parikh:
Audio Visual Scene-Aware Dialog. CVPR 2019: 7558-7567 - [c108]Amanpreet Singh, Vivek Natarajan, Meet Shah, Yu Jiang, Xinlei Chen, Dhruv Batra, Devi Parikh, Marcus Rohrbach:
Towards VQA Models That Can Read. CVPR 2019: 8317-8326 - [c107]Vishvak Murahari, Prithvijit Chattopadhyay, Dhruv Batra, Devi Parikh, Abhishek Das:
Improving Generative Visual Dialog by Answering Diverse Questions. EMNLP/IJCNLP (1) 2019: 1449-1454 - [c106]Chiori Hori, Huda AlAmri, Jue Wang, Gordon Wichern, Takaaki Hori, Anoop Cherian, Tim K. Marks, Vincent Cartillier, Raphael Gontijo Lopes, Abhishek Das, Irfan Essa, Dhruv Batra, Devi Parikh:
End-to-end Audio Visual Scene-aware Dialog Using Multimodal Attention-based Video Features. ICASSP 2019: 2352-2356 - [c105]Purva Tendulkar, Kalpesh Krishna, Ramprasaath R. Selvaraju, Devi Parikh:
Trick or TReAT : Thematic Reinforcement for Artistic Typography. ICCC 2019: 188-195 - [c104]Daniel Gordon, Abhishek Kadian, Devi Parikh, Judy Hoffman, Dhruv Batra:
SplitNet: Sim2Sim and Task2Task Transfer for Embodied Visual Navigation. ICCV 2019: 1022-1031 - [c103]Jianwei Yang, Zhile Ren, Mingze Xu, Xinlei Chen, David J. Crandall, Devi Parikh, Dhruv Batra:
Embodied Amodal Recognition: Learning to Move to Perceive Objects. ICCV 2019: 2040-2050 - [c102]Ramprasaath Ramasamy Selvaraju, Stefan Lee, Yilin Shen, Hongxia Jin, Shalini Ghosh, Larry P. Heck, Dhruv Batra, Devi Parikh:
Taking a HINT: Leveraging Explanations to Make Vision and Language Models More Grounded. ICCV 2019: 2591-2600 - [c101]Samyak Datta, Karan Sikka, Anirban Roy, Karuna Ahuja, Devi Parikh, Ajay Divakaran:
Align2Ground: Weakly Supervised Phrase Grounding Guided by Image-Caption Alignment. ICCV 2019: 2601-2610 - [c100]Wei-Lin Hsiao, Isay Katsman, Chao-Yuan Wu, Devi Parikh, Kristen Grauman:
Fashion++: Minimal Edits for Outfit Improvement. ICCV 2019: 5046-5055 - [c99]Harsh Agrawal, Peter Anderson, Karan Desai, Yufei Wang, Xinlei Chen, Rishabh Jain, Mark Johnson, Dhruv Batra, Devi Parikh, Stefan Lee:
nocaps: novel object captioning at scale. ICCV 2019: 8947-8956 - [c98]Manolis Savva, Jitendra Malik, Devi Parikh, Dhruv Batra, Abhishek Kadian, Oleksandr Maksymets, Yili Zhao, Erik Wijmans, Bhavana Jain, Julian Straub, Jia Liu, Vladlen Koltun:
Habitat: A Platform for Embodied AI Research. ICCV 2019: 9338-9346 - [c97]Nan Rosemary Ke, Amanpreet Singh, Ahmed Touati, Anirudh Goyal, Yoshua Bengio, Devi Parikh, Dhruv Batra:
Modeling the Long Term Future in Model-Based Reinforcement Learning. ICLR (Poster) 2019 - [c96]Abhishek Das, Théophile Gervet, Joshua Romoff, Dhruv Batra, Devi Parikh, Mike Rabbat, Joelle Pineau:
TarMAC: Targeted Multi-Agent Communication. ICML 2019: 1538-1546 - [c95]Yash Goyal, Ziyan Wu, Jan Ernst, Dhruv Batra, Devi Parikh, Stefan Lee:
Counterfactual Visual Explanations. ICML 2019: 2376-2384 - [c94]Ramakrishna Vedantam, Karan Desai, Stefan Lee, Marcus Rohrbach, Dhruv Batra, Devi Parikh:
Probabilistic Neural Symbolic Models for Interpretable Visual Question Answering. ICML 2019: 6428-6437 - [c93]Satwik Kottur, José M. F. Moura, Devi Parikh, Dhruv Batra, Marcus Rohrbach:
CLEVR-Dialog: A Diagnostic Dataset for Multi-Round Reasoning in Visual Dialog. NAACL-HLT (1) 2019: 582-595 - [c92]Jiasen Lu, Dhruv Batra, Devi Parikh, Stefan Lee:
ViLBERT: Pretraining Task-Agnostic Visiolinguistic Representations for Vision-and-Language Tasks. NeurIPS 2019: 13-23 - [c91]Peter Anderson, Ayush Shrivastava, Devi Parikh, Dhruv Batra, Stefan Lee:
Chasing Ghosts: Instruction Following as Bayesian State Tracking. NeurIPS 2019: 369-379 - [c90]Rémi Cadène, Corentin Dancette, Hédi Ben-Younes, Matthieu Cord, Devi Parikh:
RUBi: Reducing Unimodal Biases for Visual Question Answering. NeurIPS 2019: 839-850 - [c89]Jianwei Yang, Zhile Ren, Chuang Gan, Hongyuan Zhu, Devi Parikh:
Cross-channel Communication Networks. NeurIPS 2019: 1295-1304 - [i86]Koichiro Yoshino, Chiori Hori, Julien Perez, Luis Fernando D'Haro, Lazaros Polymenakos, R. Chulaka Gunasekara, Walter S. Lasecki, Jonathan K. Kummerfeld, Michel Galley, Chris Brockett,