Peter Clark
2020 – today
- 2024
- [c129]Ruoyao Wang, Graham Todd, Ziang Xiao, Xingdi Yuan, Marc-Alexandre Côté, Peter Clark, Peter A. Jansen:
Can Language Models Serve as Text-Based World Simulators? ACL (Short Papers) 2024: 1-17
- [c128]Yuling Gu, Oyvind Tafjord, Peter Clark:
Digital Socrates: Evaluating LLMs through Explanation Critiques. ACL (1) 2024: 5559-5586
- [c127]Peter Hase, Mohit Bansal, Peter Clark, Sarah Wiegreffe:
The Unreasonable Effectiveness of Easy Training Data for Hard Tasks. ACL (1) 2024: 7002-7024
- [c126]Yash Kumar Lal, Li Zhang, Faeze Brahman, Bodhisattwa Prasad Majumder, Peter Clark, Niket Tandon:
Tailoring with Targeted Precision: Edit-Based Agents for Open-Domain Procedure Customization. ACL (Findings) 2024: 15597-15611
- [c125]Nathaniel Weir, Kate Sanders, Orion Weller, Shreya Sharma, Dongwei Jiang, Zhengping Jiang, Bhavana Dalvi Mishra, Oyvind Tafjord, Peter A. Jansen, Peter Clark, Benjamin Van Durme:
Enhancing Systematic Decompositional Natural Language Inference Using Informal Logic. EMNLP 2024: 9458-9482
- [c124]Ben Bogin, Kejuan Yang, Shashank Gupta, Kyle Richardson, Erin Bransom, Peter Clark, Ashish Sabharwal, Tushar Khot:
SUPER: Evaluating Agents on Setting Up and Executing Tasks from Research Repositories. EMNLP 2024: 12622-12645
- [c123]Shashank Gupta, Vaishnavi Shrivastava, Ameet Deshpande, Ashwin Kalyan, Peter Clark, Ashish Sabharwal, Tushar Khot:
Bias Runs Deep: Implicit Reasoning Biases in Persona-Assigned LLMs. ICLR 2024
- [c122]Bodhisattwa Prasad Majumder, Harshit Surana, Dhruv Agarwal, Sanchaita Hazra, Ashish Sabharwal, Peter Clark:
Position: Data-driven Discovery with Large Generative Models. ICML 2024
- [c121]Kolby Nottingham, Bodhisattwa Prasad Majumder, Bhavana Dalvi Mishra, Sameer Singh, Peter Clark, Roy Fox:
Skill Set Optimization: Reinforcing Language Model Behavior via Transferable Skills. ICML 2024
- [c120]Nathaniel Weir, Peter Clark, Benjamin Van Durme:
NELLIE: A Neuro-Symbolic Inference Engine for Grounded, Compositional, and Explainable Reasoning. IJCAI 2024: 3602-3612
- [c119]Vishvak Murahari, Ameet Deshpande, Peter Clark, Tanmay Rajpurohit, Ashish Sabharwal, Karthik Narasimhan, Ashwin Kalyan:
QualEval: Qualitative Evaluation for Model Improvement. NAACL-HLT 2024: 2093-2111
- [c118]Archiki Prasad, Alexander Koller, Mareike Hartmann, Peter Clark, Ashish Sabharwal, Mohit Bansal, Tushar Khot:
ADaPT: As-Needed Decomposition and Planning with Language Models. NAACL-HLT (Findings) 2024: 4226-4252
- [c117]Ben Bogin, Shivanshu Gupta, Peter Clark, Ashish Sabharwal:
Leveraging Code to Improve In-Context Learning for Semantic Parsing. NAACL-HLT 2024: 4971-5012
- [c116]Li Zhang, Peter A. Jansen, Tianyi Zhang, Peter Clark, Chris Callison-Burch, Niket Tandon:
PDDLEGO: Iterative Planning in Textual Environments. *SEM@NAACL 2024: 212-221
- [i93]Peter Hase, Mohit Bansal, Peter Clark, Sarah Wiegreffe:
The Unreasonable Effectiveness of Easy Training Data for Hard Tasks. CoRR abs/2401.06751 (2024)
- [i92]Kolby Nottingham, Bodhisattwa Prasad Majumder, Bhavana Dalvi Mishra, Sameer Singh, Peter Clark, Roy Fox:
Skill Set Optimization: Reinforcing Language Model Behavior via Transferable Skills. CoRR abs/2402.03244 (2024)
- [i91]Bodhisattwa Prasad Majumder, Harshit Surana, Dhruv Agarwal, Sanchaita Hazra, Ashish Sabharwal, Peter Clark:
Data-driven Discovery with Large Generative Models. CoRR abs/2402.13610 (2024)
- [i90]Nathaniel Weir, Kate Sanders, Orion Weller, Shreya Sharma, Dongwei Jiang, Zhengping Jiang, Bhavana Dalvi Mishra, Oyvind Tafjord, Peter A. Jansen, Peter Clark, Benjamin Van Durme:
Enhancing Systematic Decompositional Natural Language Inference Using Informal Logic. CoRR abs/2402.14798 (2024)
- [i89]Tianyi Zhang, Li Zhang, Zhaoyi Hou, Ziyu Wang, Yuling Gu, Peter Clark, Chris Callison-Burch, Niket Tandon:
PROC2PDDL: Open-Domain Planning Representations from Texts. CoRR abs/2403.00092 (2024)
- [i88]Nathaniel Weir, Muhammad Khalifa, Linlu Qiu, Orion Weller, Peter Clark:
Learning to Reason via Program Generation, Emulation, and Search. CoRR abs/2405.16337 (2024)
- [i87]Li Zhang, Peter Jansen, Tianyi Zhang, Peter Clark, Chris Callison-Burch, Niket Tandon:
PDDLEGO: Iterative Planning in Textual Environments. CoRR abs/2405.19793 (2024)
- [i86]Ruoyao Wang, Graham Todd, Ziang Xiao, Xingdi Yuan, Marc-Alexandre Côté, Peter Clark, Peter A. Jansen:
Can Language Models Serve as Text-Based World Simulators? CoRR abs/2406.06485 (2024)
- [i85]Peter A. Jansen, Marc-Alexandre Côté, Tushar Khot, Erin Bransom, Bhavana Dalvi Mishra, Bodhisattwa Prasad Majumder, Oyvind Tafjord, Peter Clark:
DISCOVERYWORLD: A Virtual Environment for Developing and Evaluating Automated Scientific Discovery Agents. CoRR abs/2406.06769 (2024)
- [i84]Bodhisattwa Prasad Majumder, Harshit Surana, Dhruv Agarwal, Bhavana Dalvi Mishra, Abhijeetsingh Meena, Aryan Prakhar, Tirth Vora, Tushar Khot, Ashish Sabharwal, Peter Clark:
DiscoveryBench: Towards Data-Driven Discovery with Large Language Models. CoRR abs/2407.01725 (2024)
- [i83]Ben Bogin, Kejuan Yang, Shashank Gupta, Kyle D. Richardson, Erin Bransom, Peter Clark, Ashish Sabharwal, Tushar Khot:
SUPER: Evaluating Agents on Setting Up and Executing Tasks from Research Repositories. CoRR abs/2409.07440 (2024)
- [i82]Yuling Gu, Oyvind Tafjord, Hyunwoo Kim, Jared Moore, Ronan Le Bras, Peter Clark, Yejin Choi:
SimpleToM: Exposing the Gap between Explicit ToM Inference and Implicit ToM Application in LLMs. CoRR abs/2410.13648 (2024)
- 2023
- [c115]Yuling Gu, Bhavana Dalvi Mishra, Peter Clark:
Do language models have coherent mental models of everyday things? ACL (1) 2023: 1892-1913
- [c114]Afra Feyza Akyürek, Ekin Akyürek, Ashwin Kalyan, Peter Clark, Derry Tanti Wijaya, Niket Tandon:
RL4F: Generating Natural Language Feedback with Reinforcement Learning for Repairing Model Outputs. ACL (1) 2023: 7716-7733
- [c113]Wenhao Yu, Meng Jiang, Peter Clark, Ashish Sabharwal:
IfQA: A Dataset for Open-domain Question Answering under Counterfactual Presuppositions. EMNLP 2023: 8276-8288
- [c112]Sarah Wiegreffe, Matthew Finlayson, Oyvind Tafjord, Peter Clark, Ashish Sabharwal:
Increasing Probability Mass on Answer Choices Does Not Always Improve Accuracy. EMNLP 2023: 8392-8417
- [c111]Nora Kassner, Oyvind Tafjord, Ashish Sabharwal, Kyle Richardson, Hinrich Schütze, Peter Clark:
Language Models with Rationality. EMNLP 2023: 14190-14201
- [c110]Zhenwen Liang, Wenhao Yu, Tanmay Rajpurohit, Peter Clark, Xiangliang Zhang, Ashwin Kalyan:
Let GPT be a Math Tutor: Teaching Math Word Problem Solvers with Customized Exercise Generation. EMNLP 2023: 14384-14396
- [c109]Yao Fu, Hao Peng, Ashish Sabharwal, Peter Clark, Tushar Khot:
Complexity-Based Prompting for Multi-step Reasoning. ICLR 2023
- [c108]Tushar Khot, Harsh Trivedi, Matthew Finlayson, Yao Fu, Kyle Richardson, Peter Clark, Ashish Sabharwal:
Decomposed Prompting: A Modular Approach for Solving Complex Tasks. ICLR 2023
- [c107]Pan Lu, Liang Qiu, Kai-Wei Chang, Ying Nian Wu, Song-Chun Zhu, Tanmay Rajpurohit, Peter Clark, Ashwin Kalyan:
Dynamic Prompt Learning via Policy Gradient for Semi-structured Mathematical Reasoning. ICLR 2023
- [c106]Aman Madaan, Niket Tandon, Prakhar Gupta, Skyler Hallinan, Luyu Gao, Sarah Wiegreffe, Uri Alon, Nouha Dziri, Shrimai Prabhumoye, Yiming Yang, Shashank Gupta, Bodhisattwa Prasad Majumder, Katherine Hermann, Sean Welleck, Amir Yazdanbakhsh, Peter Clark:
Self-Refine: Iterative Refinement with Self-Feedback. NeurIPS 2023
- [i81]Aman Madaan, Niket Tandon, Prakhar Gupta, Skyler Hallinan, Luyu Gao, Sarah Wiegreffe, Uri Alon, Nouha Dziri, Shrimai Prabhumoye, Yiming Yang, Sean Welleck, Bodhisattwa Prasad Majumder, Shashank Gupta, Amir Yazdanbakhsh, Peter Clark:
Self-Refine: Iterative Refinement with Self-Feedback. CoRR abs/2303.17651 (2023)
- [i80]Afra Feyza Akyürek, Ekin Akyürek, Aman Madaan, Ashwin Kalyan, Peter Clark, Derry Wijaya, Niket Tandon:
RL4F: Generating Natural Language Feedback with Reinforcement Learning for Repairing Model Outputs. CoRR abs/2305.08844 (2023)
- [i79]Wenhao Yu, Meng Jiang, Peter Clark, Ashish Sabharwal:
IfQA: A Dataset for Open-domain Question Answering under Counterfactual Presuppositions. CoRR abs/2305.14010 (2023)
- [i78]Nora Kassner, Oyvind Tafjord, Ashish Sabharwal, Kyle Richardson, Hinrich Schütze, Peter Clark:
Language Models with Rationality. CoRR abs/2305.14250 (2023)
- [i77]Zhenwen Liang, Wenhao Yu, Tanmay Rajpurohit, Peter Clark, Xiangliang Zhang, Ashwin Kalyan:
Let GPT be a Math Tutor: Teaching Math Word Problem Solvers with Customized Exercise Generation. CoRR abs/2305.14386 (2023)
- [i76]Sarah Wiegreffe, Matthew Finlayson, Oyvind Tafjord, Peter Clark, Ashish Sabharwal:
Attentiveness to Answer Choices Doesn't Always Entail High QA Accuracy. CoRR abs/2305.14596 (2023)
- [i75]Bodhisattwa Prasad Majumder, Bhavana Dalvi Mishra, Peter A. Jansen, Oyvind Tafjord, Niket Tandon, Li Zhang, Chris Callison-Burch, Peter Clark:
CLIN: A Continually Learning Language Agent for Rapid Task Adaptation and Generalization. CoRR abs/2310.10134 (2023)
- [i74]Vishvak Murahari, Ameet Deshpande, Peter Clark, Tanmay Rajpurohit, Ashish Sabharwal, Karthik Narasimhan, Ashwin Kalyan:
QualEval: Qualitative Evaluation for Model Improvement. CoRR abs/2311.02807 (2023)
- [i73]Shashank Gupta, Vaishnavi Shrivastava, Ameet Deshpande, Ashwin Kalyan, Peter Clark, Ashish Sabharwal, Tushar Khot:
Bias Runs Deep: Implicit Reasoning Biases in Persona-Assigned LLMs. CoRR abs/2311.04892 (2023)
- [i72]Archiki Prasad, Alexander Koller, Mareike Hartmann, Peter Clark, Ashish Sabharwal, Mohit Bansal, Tushar Khot:
ADaPT: As-Needed Decomposition and Planning with Language Models. CoRR abs/2311.05772 (2023)
- [i71]Yash Kumar Lal, Li Zhang, Faeze Brahman, Bodhisattwa Prasad Majumder, Peter Clark, Niket Tandon:
One Size Does Not Fit All: Customizing Open-Domain Procedures. CoRR abs/2311.09510 (2023)
- [i70]Ben Bogin, Shivanshu Gupta, Peter Clark, Ashish Sabharwal:
Leveraging Code to Improve In-context Learning for Semantic Parsing. CoRR abs/2311.09519 (2023)
- [i69]Yuling Gu, Oyvind Tafjord, Peter Clark:
Digital Socrates: Evaluating LLMs through explanation critiques. CoRR abs/2311.09613 (2023)
- [i68]Peter Clark, Bhavana Dalvi Mishra, Oyvind Tafjord:
BaRDa: A Belief and Reasoning Dataset that Separates Factual Accuracy and Reasoning Ability. CoRR abs/2312.07527 (2023)
- 2022
- [c105]Swaroop Mishra, Arindam Mitra, Neeraj Varshney, Bhavdeep Singh Sachdeva, Peter Clark, Chitta Baral, Ashwin Kalyan:
NumGLUE: A Suite of Fundamental yet Challenging Mathematical Reasoning Tasks. ACL (1) 2022: 3505-3523
- [c104]Matthew Finlayson, Kyle Richardson, Ashish Sabharwal, Peter Clark:
What Makes Instruction Learning Hard? An Investigation and a New Challenge in a Synthetic Environment. EMNLP 2022: 414-426
- [c103]Oyvind Tafjord, Bhavana Dalvi Mishra, Peter Clark:
Entailer: Answering Questions with Faithful and Truthful Chains of Reasoning. EMNLP 2022: 2078-2093
- [c102]Aman Madaan, Niket Tandon, Peter Clark, Yiming Yang:
Memory-assisted prompt editing to improve GPT-3 after deployment. EMNLP 2022: 2833-2861
- [c101]Swaroop Mishra, Matthew Finlayson, Pan Lu, Leonard Tang, Sean Welleck, Chitta Baral, Tanmay Rajpurohit, Oyvind Tafjord, Ashish Sabharwal, Peter Clark, Ashwin Kalyan:
LILA: A Unified Benchmark for Mathematical Reasoning. EMNLP 2022: 5807-5832
- [c100]Bhavana Dalvi Mishra, Oyvind Tafjord, Peter Clark:
Towards Teachable Reasoning Systems: Using a Dynamic Memory of User Feedback for Continual System Improvement. EMNLP 2022: 9465-9480
- [c99]Niket Tandon, Aman Madaan, Peter Clark, Yiming Yang:
Learning to repair: Repairing model output errors after deployment using a dynamic memory of feedback. NAACL-HLT (Findings) 2022: 339-352
- [c98]Yuling Gu, Bhavana Dalvi, Peter Clark:
DREAM: Improving Situational QA by First Elaborating the Situation. NAACL-HLT 2022: 1115-1127
- [c97]Pan Lu, Swaroop Mishra, Tanglin Xia, Liang Qiu, Kai-Wei Chang, Song-Chun Zhu, Oyvind Tafjord, Peter Clark, Ashwin Kalyan:
Learn to Explain: Multimodal Reasoning via Thought Chains for Science Question Answering. NeurIPS 2022
- [i67]Aman Madaan, Niket Tandon, Peter Clark, Yiming Yang:
Memory-assisted prompt editing to improve GPT-3 after deployment. CoRR abs/2201.06009 (2022)
- [i66]Swaroop Mishra, Arindam Mitra, Neeraj Varshney, Bhavdeep Singh Sachdeva, Peter Clark, Chitta Baral, Ashwin Kalyan:
NumGLUE: A Suite of Fundamental yet Challenging Mathematical Reasoning Tasks. CoRR abs/2204.05660 (2022)
- [i65]Matthew Finlayson, Kyle Richardson, Ashish Sabharwal, Peter Clark:
What Makes Instruction Learning Hard? An Investigation and a New Challenge in a Synthetic Environment. CoRR abs/2204.09148 (2022)
- [i64]Bhavana Dalvi, Oyvind Tafjord, Peter Clark:
Towards Teachable Reasoning Systems. CoRR abs/2204.13074 (2022)
- [i63]Pan Lu, Swaroop Mishra, Tony Xia, Liang Qiu, Kai-Wei Chang, Song-Chun Zhu, Oyvind Tafjord, Peter Clark, Ashwin Kalyan:
Learn to Explain: Multimodal Reasoning via Thought Chains for Science Question Answering. CoRR abs/2209.09513 (2022)
- [i62]Pan Lu, Liang Qiu, Kai-Wei Chang, Ying Nian Wu, Song-Chun Zhu, Tanmay Rajpurohit, Peter Clark, Ashwin Kalyan:
Dynamic Prompt Learning via Policy Gradient for Semi-structured Mathematical Reasoning. CoRR abs/2209.14610 (2022)
- [i61]Yao Fu, Hao Peng, Ashish Sabharwal, Peter Clark, Tushar Khot:
Complexity-Based Prompting for Multi-Step Reasoning. CoRR abs/2210.00720 (2022)
- [i60]Tushar Khot, Harsh Trivedi, Matthew Finlayson, Yao Fu, Kyle Richardson, Peter Clark, Ashish Sabharwal:
Decomposed Prompting: A Modular Approach for Solving Complex Tasks. CoRR abs/2210.02406 (2022)
- [i59]Oyvind Tafjord, Bhavana Dalvi Mishra, Peter Clark:
Entailer: Answering Questions with Faithful and Truthful Chains of Reasoning. CoRR abs/2210.12217 (2022)
- [i58]Yuling Gu, Yao Fu, Valentina Pyatkin, Ian Magnusson, Bhavana Dalvi Mishra, Peter Clark:
Just-DREAM-about-it: Figurative Language Understanding with DREAM-FLUTE. CoRR abs/2210.16407 (2022)
- [i57]Swaroop Mishra, Matthew Finlayson, Pan Lu, Leonard Tang, Sean Welleck, Chitta Baral, Tanmay Rajpurohit, Oyvind Tafjord, Ashish Sabharwal, Peter Clark, Ashwin Kalyan:
Lila: A Unified Benchmark for Mathematical Reasoning. CoRR abs/2210.17517 (2022)
- [i56]Yuling Gu, Bhavana Dalvi Mishra, Peter Clark:
Do language models have coherent mental models of everyday things? CoRR abs/2212.10029 (2022)
- 2021
- [c96]Oyvind Tafjord, Bhavana Dalvi, Peter Clark:
ProofWriter: Generating Implications, Proofs, and Abductive Statements over Natural Language. ACL/IJCNLP (Findings) 2021: 3621-3634
- [c95]Keisuke Sakaguchi, Chandra Bhagavatula, Ronan Le Bras, Niket Tandon, Peter Clark, Yejin Choi:
proScript: Partially Ordered Scripts Generation. EMNLP (Findings) 2021: 2138-2149
- [c94]Aman Madaan, Niket Tandon, Dheeraj Rajagopal, Peter Clark, Yiming Yang, Eduard H. Hovy:
Think about it! Improving defeasible reasoning by first modeling the question scenario. EMNLP (1) 2021: 6291-6310
- [c93]Ashwin Kalyan, Abhinav Kumar, Arjun Chandrasekaran, Ashish Sabharwal, Peter Clark:
How much coffee was consumed during EMNLP 2019? Fermi Problems: A New Reasoning Challenge for AI. EMNLP (1) 2021: 7318-7328
- [c92]Bhavana Dalvi, Peter Jansen, Oyvind Tafjord, Zhengnan Xie, Hannah Smith, Leighanna Pipatanangkura, Peter Clark:
Explaining Answers with Entailment Trees. EMNLP (1) 2021: 7358-7370
- [c91]Nora Kassner, Oyvind Tafjord, Hinrich Schütze, Peter Clark:
BeliefBank: Adding Memory to a Pre-Trained Language Model for a Systematic Notion of Belief. EMNLP (1) 2021: 8849-8861
- [c90]Tushar Khot, Daniel Khashabi, Kyle Richardson, Peter Clark, Ashish Sabharwal:
Text Modular Networks: Learning to Decompose Tasks in the Language of Existing Models. NAACL-HLT 2021: 1264-1279
- [i55]Sumithra Bhakthavatsalam, Daniel Khashabi, Tushar Khot, Bhavana Dalvi Mishra, Kyle Richardson, Ashish Sabharwal, Carissa Schoenick, Oyvind Tafjord, Peter Clark:
Think you have Solved Direct-Answer Question Answering? Try ARC-DA, the Direct-Answer AI2 Reasoning Challenge. CoRR abs/2102.03315 (2021)
- [i54]Dheeraj Rajagopal, Aman Madaan, Niket Tandon, Yiming Yang, Shrimai Prabhumoye, Abhilasha Ravichander, Peter Clark, Eduard H. Hovy:
CURIE: An Iterative Querying Approach for Reasoning About Situations. CoRR abs/2104.00814 (2021)
- [i53]Keisuke Sakaguchi, Chandra Bhagavatula, Ronan Le Bras, Niket Tandon, Peter Clark, Yejin Choi:
proScript: Partially Ordered Scripts Generation via Pre-trained Language Models. CoRR abs/2104.08251 (2021)
- [i52]Nora Kassner, Oyvind Tafjord, Hinrich Schütze, Peter Clark:
Enriching a Model's Notion of Belief using a Persistent Memory. CoRR abs/2104.08401 (2021)
- [i51]Bhavana Dalvi, Peter Jansen, Oyvind Tafjord, Zhengnan Xie, Hannah Smith, Leighanna Pipatanangkura, Peter Clark:
Explaining Answers with Entailment Trees. CoRR abs/2104.08661 (2021)
- [i50]Aman Madaan, Niket Tandon, Dheeraj Rajagopal, Yiming Yang, Peter Clark, Keisuke Sakaguchi, Eduard H. Hovy:
Improving Neural Model Performance through Natural Language Feedback on Their Explanations. CoRR abs/2104.08765 (2021)
- [i49]Oyvind Tafjord, Peter Clark:
General-Purpose Question-Answering with Macaw. CoRR abs/2109.02593 (2021)
- [i48]Nora Kassner, Oyvind Tafjord, Hinrich Schütze, Peter Clark:
BeliefBank: Adding Memory to a Pre-Trained Language Model for a Systematic Notion of Belief. CoRR abs/2109.14723 (2021)
- [i47]Aman Madaan, Niket Tandon, Dheeraj Rajagopal, Peter Clark, Yiming Yang, Eduard H. Hovy:
Think about it! Improving defeasible reasoning by first modeling the question scenario. CoRR abs/2110.12349 (2021)
- [i46]Ashwin Kalyan, Abhinav Kumar, Arjun Chandrasekaran, Ashish Sabharwal, Peter Clark:
How Much Coffee Was Consumed During EMNLP 2019? Fermi Problems: A New Reasoning Challenge for AI. CoRR abs/2110.14207 (2021)
- [i45]Niket Tandon, Aman Madaan, Peter Clark, Keisuke Sakaguchi, Yiming Yang:
Interscript: A dataset for interactive learning of scripts through error feedback. CoRR abs/2112.07867 (2021)
- [i44]Yuling Gu, Bhavana Dalvi Mishra, Peter Clark:
DREAM: Uncovering Mental Models behind Language Models. CoRR abs/2112.08656 (2021)
- [i43]Niket Tandon, Aman Madaan, Peter Clark, Yiming Yang:
Improving scripts with a memory of natural feedback. CoRR abs/2112.09737 (2021)
- 2020
- [j15]Peter Clark, Oren Etzioni, Tushar Khot, Daniel Khashabi, Bhavana Dalvi Mishra, Kyle Richardson, Ashish Sabharwal, Carissa Schoenick, Oyvind Tafjord, Niket Tandon, Sumithra Bhakthavatsalam, Dirk Groeneveld, Michal Guerquin, Michael Schmitz:
From 'F' to 'A' on the N.Y. Regents Science Exams: An Overview of the Aristo Project. AI Mag. 41(4): 39-53 (2020)
- [c89]Tushar Khot, Peter Clark, Michal Guerquin, Peter Jansen, Ashish Sabharwal:
QASC: A Dataset for Question Answering via Sentence Composition. AAAI 2020: 8082-8090
- [c88]Harsh Jhamtani, Peter Clark:
Learning to Explain: Datasets and Models for Identifying Valid Reasoning Chains in Multihop Question-Answering. EMNLP (1) 2020: 137-150
- [c87]Daniel Khashabi, Sewon Min, Tushar Khot, Ashish Sabharwal, Oyvind Tafjord, Peter Clark, Hannaneh Hajishirzi:
UnifiedQA: Crossing Format Boundaries With a Single QA System. EMNLP (Findings) 2020: 1896-1907
- [c86]Dheeraj Rajagopal, Niket Tandon, Peter Clark, Bhavana Dalvi, Eduard H. Hovy:
What-if I ask you to explain: Explaining the effects of perturbations in procedural text. EMNLP (Findings) 2020: 3345-3355
- [c85]Niket Tandon, Keisuke Sakaguchi, Bhavana Dalvi, Dheeraj Rajagopal, Peter Clark, Michal Guerquin, Kyle Richardson, Eduard H. Hovy:
A Dataset for Tracking Entities in Open Domain Procedural Text. EMNLP (1) 2020: 6408-6417
- [c84]Peter Clark, Oyvind Tafjord, Kyle Richardson:
Transformers as Soft Reasoners over Language. IJCAI 2020: 3882-3890
- [c83]Dongfang Xu, Peter A. Jansen, Jaycie Martin, Zhengnan Xie, Vikas Yadav, Harish Tayyar Madabushi, Oyvind Tafjord, Peter Clark:
Multi-class Hierarchical Question Classification for Multiple Choice Science Exams. LREC 2020: 5370-5382
- [c82]Alon Talmor, Oyvind Tafjord, Peter Clark, Yoav Goldberg, Jonathan Berant:
Leap-Of-Thought: Teaching Pre-Trained Models to Systematically Reason Over Implicit Knowledge. NeurIPS 2020
- [i42]Peter Clark, Oyvind Tafjord, Kyle Richardson:
Transformers as Soft Reasoners over Language. CoRR abs/2002.05867 (2020)
- [i41]Sumithra Bhakthavatsalam, Chloe Anastasiades, Peter Clark:
GenericsKB: A Knowledge Base of Generic Statements. CoRR abs/2005.00660 (2020)
- [i40]Daniel Khashabi, Tushar Khot, Ashish Sabharwal, Oyvind Tafjord, Peter Clark, Hannaneh Hajishirzi:
UnifiedQA: Crossing Format Boundaries With a Single QA System. CoRR abs/2005.00700 (2020)
- [i39]Dheeraj Rajagopal, Niket Tandon, Peter Clark, Bhavana Dalvi, Eduard H. Hovy:
What-if I ask you to explain: Explaining the effects of perturbations in procedural text. CoRR abs/2005.01526 (2020)
- [i38]Peter Clark, John A. Thompson, Bruce W. Porter:
Knowledge Patterns. CoRR abs/2005.04306 (2020)
- [i37]Alon Talmor, Oyvind Tafjord, Peter Clark, Yoav Goldberg, Jonathan Berant:
Teaching Pre-Trained Models to Systematically Reason Over Implicit Knowledge. CoRR abs/2006.06609 (2020)
- [i36]Sumithra Bhakthavatsalam, Kyle Richardson, Niket Tandon, Peter Clark:
Do Dogs have Whiskers? A New Knowledge Base of hasPart Relations. CoRR abs/2006.07510 (2020)
- [i35]Tushar Khot, Daniel Khashabi, Kyle Richardson, Peter Clark, Ashish Sabharwal:
Text Modular Networks: Learning to Decompose Tasks in the Language of Existing Models. CoRR abs/2009.00751 (2020)
- [i34]Harsh Jhamtani, Peter Clark:
Learning to Explain: Datasets and Models for Identifying Valid Reasoning Chains in Multihop Question-Answering. CoRR abs/2010.03274 (2020)
- [i33]Niket Tandon, Keisuke Sakaguchi, Bhavana Dalvi Mishra, Dheeraj Rajagopal, Peter Clark, Michal Guerquin, Kyle Richardson, Eduard H. Hovy:
A Dataset for Tracking Entities in Open Domain Procedural Text. CoRR abs/2011.08092 (2020)
- [i32]Oyvind Tafjord, Bhavana Dalvi Mishra, Peter Clark:
ProofWriter: Generating Implications, Proofs, and Abductive Statements over Natural Language. CoRR abs/2012.13048 (2020)
2010 – 2019
- 2019
- [j14]Guy Barash, Mauricio Castillo-Effen, Niyati Chhaya, Peter Clark, Huáscar Espinoza, Eitan Farchi, Christopher W. Geib, Odd Erik Gundersen, Seán Ó hÉigeartaigh, José Hernández-Orallo, Chiori Hori, Xiaowei Huang, Kokil Jaidka, Pavan Kapanipathi, Sarah Keren, Seokhwan Kim, Marc Lanctot, Danny Lange, Julian J. McAuley, David R. Martinez, Marwan Mattar, Mausam, Martin Michalowski, Reuth Mirsky, Roozbeh Mottaghi, Joseph C. Osborn, Julien Pérolat, Martin Schmid, Arash Shaban-Nejad, Onn Shehory, Biplav Srivastava,