


default search action
Benjamin Van Durme
Benjamin David Van Durme
Person information
- affiliation: Johns Hopkins University, USA
Refine list

refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
showing all ?? records
2020 – today
- 2025
- [c216]Ishani Mondal, Michelle Yuan, Anandhavelu Natarajan, Aparna Garimella, Francis Ferraro, Andrew Blair-Stanek, Benjamin Van Durme, Jordan Lee Boyd-Graber:
ADAPTIVE IE: Investigating the Complementarity of Human-AI Collaboration to Adaptively Extract Information on-the-fly. COLING 2025: 5870-5889 - [i155]Helia Hashemi, Jason Eisner, Corby Rosset, Benjamin Van Durme, Chris Kedzie:
LLM-Rubric: A Multidimensional, Calibrated Approach to Automated Evaluation of Natural Language Texts. CoRR abs/2501.00274 (2025) - 2024
- [j10]Zhengping Jiang, Anqi Liu, Benjamin Van Durme:
Addressing the Binning Problem in Calibration Assessment through Scalar Annotations. Trans. Assoc. Comput. Linguistics 12: 120-136 (2024) - [c215]Zhengping Jiang, Yining Lu, Hanjie Chen, Daniel Khashabi, Benjamin Van Durme, Anqi Liu:
RORA: Robust Free-Text Rationale Evaluation. ACL (1) 2024: 1070-1087 - [c214]Sky CH-Wang, Benjamin Van Durme, Jason Eisner, Chris Kedzie:
Do Androids Know They're Only Dreaming of Electric Sheep? ACL (Findings) 2024: 4401-4420 - [c213]Guanghui Qin, Corby Rosset, Ethan C. Chau, Nikhil Rao, Benjamin Van Durme:
Dodo: Dynamic Contextual Compression for Decoder-only LMs. ACL (1) 2024: 9961-9975 - [c212]Boshi Wang, Hao Fang, Jason Eisner, Benjamin Van Durme, Yu Su:
LLMs in the Imaginarium: Tool Learning through Simulated Trial and Error. ACL (1) 2024: 10583-10604 - [c211]Helia Hashemi, Jason Eisner, Corby Rosset, Benjamin Van Durme, Chris Kedzie:
LLM-Rubric: A Multidimensional, Calibrated Approach to Automated Evaluation of Natural Language Texts. ACL (1) 2024: 13806-13834 - [c210]Kate Sanders, Benjamin Van Durme:
A Survey of Video Datasets for Grounded Event Understanding. CVPR Workshops 2024: 7314-7327 - [c209]William Gantt, Shabnam Behzad, Hannah Youngeun An, Yunmo Chen, Aaron Steven White, Benjamin Van Durme, Mahsa Yarmohammadi:
MultiMUC: Multilingual Template Filling on MUC-4. EACL (1) 2024: 349-368 - [c208]Orion Weller, Aleem Khan, Nathaniel Weir, Dawn J. Lawrie, Benjamin Van Durme:
Defending Against Disinformation Attacks in Open-Domain Question Answering. EACL (2) 2024: 402-417 - [c207]Orion Weller, Kyle Lo, David Wadden, Dawn J. Lawrie, Benjamin Van Durme, Arman Cohan, Luca Soldaini:
When do Generative Query and Document Expansions Fail? A Comprehensive Study Across Methods, Retrievers, and Datasets. EACL (Findings) 2024: 1987-2003 - [c206]Orion Weller, Dawn J. Lawrie, Benjamin Van Durme:
NevIR: Negation in Neural Information Retrieval. EACL (1) 2024: 2274-2287 - [c205]Orion Weller, Marc Marone, Nathaniel Weir, Dawn J. Lawrie, Daniel Khashabi, Benjamin Van Durme:
"According to . . . ": Prompting Language Models Improves Quoting from Pre-Training Data. EACL (1) 2024: 2288-2301 - [c204]Zhuowan Li, Cihang Xie, Benjamin Van Durme, Alan L. Yuille:
Localization vs. Semantics: Visual Representations in Unimodal and Multimodal Models. EACL (1) 2024: 2378-2390 - [c203]Yunmo Chen, Tongfei Chen, Harsh Jhamtani, Patrick Xia, Richard Shin, Jason Eisner, Benjamin Van Durme:
Learning to Retrieve Iteratively for In-Context Learning. EMNLP 2024: 7156-7168 - [c202]Kaj Bostrom, Harsh Jhamtani, Hao Fang, Sam Thomson, Richard Shin, Patrick Xia, Benjamin Van Durme, Jason Eisner, Jacob Andreas:
Language-to-Code Translation with a Single Labeled Example. EMNLP 2024: 8101-8112 - [c201]Nathaniel Weir, Ryan Thomas, Randolph D'Amore, Kellie Hill, Benjamin Van Durme, Harsh Jhamtani:
Ontologically Faithful Generation of Non-Player Character Dialogues. EMNLP 2024: 9212-9242 - [c200]Nathaniel Weir, Kate Sanders, Orion Weller, Shreya Sharma, Dongwei Jiang, Zhengping Jiang, Bhavana Dalvi Mishra, Oyvind Tafjord, Peter A. Jansen, Peter Clark, Benjamin Van Durme:
Enhancing Systematic Decompositional Natural Language Inference Using Informal Logic. EMNLP 2024: 9458-9482 - [c199]Kate Sanders, Reno Kriz, David Etter, Hannah Recknor, Alexander Martin, Cameron Carpenter, Jingyang Lin, Benjamin Van Durme:
Grounding Partially-Defined Events in Multimodal Data. EMNLP (Findings) 2024: 15905-15927 - [c198]Kate Sanders, Nathaniel Weir, Benjamin Van Durme:
TV-TREES: Multimodal Entailment Trees for Neuro-Symbolic Video Reasoning. EMNLP 2024: 19009-19028 - [c197]Elias Stengel-Eskin, Kyle Rawlins, Benjamin Van Durme:
Zero and Few-shot Semantic Parsing with Ambiguous Inputs. ICLR 2024 - [c196]Haoran Xu, Amr Sharaf, Yunmo Chen, Weiting Tan, Lingfeng Shen, Benjamin Van Durme, Kenton Murray, Young Jin Kim:
Contrastive Preference Optimization: Pushing the Boundaries of LLM Performance in Machine Translation. ICML 2024 - [c195]Nathaniel Weir, Peter Clark, Benjamin Van Durme:
NELLIE: A Neuro-Symbolic Inference Engine for Grounded, Compositional, and Explainable Reasoning. IJCAI 2024: 3602-3612 - [c194]Harsh Jhamtani, Hao Fang, Patrick Xia, Eran Levy, Jacob Andreas, Benjamin Van Durme:
Natural Language Decomposition and Interpretation of Complex Utterances. IJCAI 2024: 6306-6314 - [c193]Weiting Tan, Haoran Xu, Lingfeng Shen, Shuyue Stella Li, Kenton Murray, Philipp Koehn, Benjamin Van Durme, Yunmo Chen:
Narrowing the Gap between Zero- and Few-shot Machine Translation by Matching Styles. NAACL-HLT (Findings) 2024: 490-502 - [c192]Nikita Moghe, Patrick Xia, Jacob Andreas, Jason Eisner, Benjamin Van Durme, Harsh Jhamtani:
Interpreting User Requests in the Context of Natural Language Standing Instructions. NAACL-HLT (Findings) 2024: 4043-4060 - [c191]Abe Bohan Hou, Jingyu Zhang, Tianxing He, Yichen Wang, Yung-Sung Chuang, Hongwei Wang, Lingfeng Shen, Benjamin Van Durme, Daniel Khashabi, Yulia Tsvetkov:
SemStamp: A Semantic Watermark with Paraphrastic Robustness for Text Generation. NAACL-HLT 2024: 4067-4082 - [c190]Siddharth Vashishtha, Alexander Martin, William Gantt, Benjamin Van Durme, Aaron Steven White:
FAMuS: Frames Across Multiple Sources. NAACL-HLT 2024: 8250-8273 - [c189]Miriam Wanner, Seth Ebner, Zhengping Jiang, Mark Dredze, Benjamin Van Durme:
A Closer Look at Claim Decomposition. *SEM@NAACL 2024: 153-175 - [i154]Xinrui Zou, Ming Zhang, Nathaniel Weir, Benjamin Van Durme, Nils Holzenberger:
Reframing Tax Law Entailment as Analogical Reasoning. CoRR abs/2401.06715 (2024) - [i153]Haoran Xu, Amr Sharaf, Yunmo Chen, Weiting Tan, Lingfeng Shen, Benjamin Van Durme, Kenton Murray, Young Jin Kim:
Contrastive Preference Optimization: Pushing the Boundaries of LLM Performance in Machine Translation. CoRR abs/2401.08417 (2024) - [i152]William Gantt, Shabnam Behzad, Hannah Youngeun An, Yunmo Chen, Aaron Steven White, Benjamin Van Durme, Mahsa Yarmohammadi:
MultiMUC: Multilingual Template Filling on MUC-4. CoRR abs/2401.16209 (2024) - [i151]Weiting Tan, Yunmo Chen, Tongfei Chen, Guanghui Qin
, Haoran Xu, Heidi C. Zhang, Benjamin Van Durme, Philipp Koehn:
Streaming Sequence Transduction through Dynamic Compression. CoRR abs/2402.01172 (2024) - [i150]Nathaniel Weir, Kate Sanders, Orion Weller, Shreya Sharma, Dongwei Jiang, Zhengping Jiang, Bhavana Dalvi Mishra, Oyvind Tafjord, Peter A. Jansen, Peter Clark, Benjamin Van Durme:
Enhancing Systematic Decompositional Natural Language Inference Using Informal Logic. CoRR abs/2402.14798 (2024) - [i149]Zhengping Jiang, Yining Lu, Hanjie Chen, Daniel Khashabi, Benjamin Van Durme, Anqi Liu:
RORA: Robust Free-Text Rationale Evaluation. CoRR abs/2402.18678 (2024) - [i148]Kate Sanders, Nathaniel Weir, Benjamin Van Durme:
TV-TREES: Multimodal Entailment Trees for Neuro-Symbolic Video Reasoning. CoRR abs/2402.19467 (2024) - [i147]Boshi Wang, Hao Fang, Jason Eisner, Benjamin Van Durme, Yu Su:
LLMs in the Imaginarium: Tool Learning through Simulated Trial and Error. CoRR abs/2403.04746 (2024) - [i146]Miriam Wanner, Seth Ebner, Zhengping Jiang, Mark Dredze, Benjamin Van Durme:
A Closer Look at Claim Decomposition. CoRR abs/2403.11903 (2024) - [i145]Kevin Xu, Yeganeh Kordi, Kate Sanders, Yizhong Wang, Adam Byerly, Jack Zhang, Benjamin Van Durme, Daniel Khashabi:
Tur[k]ingBench: A Challenge Benchmark for Web Agents. CoRR abs/2403.11905 (2024) - [i144]Jeffrey Cheng, Marc Marone, Orion Weller, Dawn J. Lawrie, Daniel Khashabi, Benjamin Van Durme:
Dated Data: Tracing Knowledge Cutoffs in Large Language Models. CoRR abs/2403.12958 (2024) - [i143]Orion Weller, Benjamin Chang
, Sean MacAvaney, Kyle Lo, Arman Cohan, Benjamin Van Durme, Dawn J. Lawrie, Luca Soldaini:
FollowIR: Evaluating and Teaching Information Retrieval Models to Follow Instructions. CoRR abs/2403.15246 (2024) - [i142]Jingyu Zhang, Marc Marone, Tianjian Li, Benjamin Van Durme, Daniel Khashabi:
Verifiable by Design: Aligning Language Models to Quote from Pre-Training Data. CoRR abs/2404.03862 (2024) - [i141]Dongwei Jiang, Jingyu Zhang, Orion Weller, Nathaniel Weir, Benjamin Van Durme, Daniel Khashabi:
SELF-[IN]CORRECT: LLMs Struggle with Refining Self-Generated Responses. CoRR abs/2404.04298 (2024) - [i140]William Fleshman, Aleem Khan, Marc Marone, Benjamin Van Durme:
AdapterSwap: Continuous Training of LLMs with Data Removal and Access-Control Guarantees. CoRR abs/2404.08417 (2024) - [i139]William Fleshman, Benjamin Van Durme:
RE-Adapt: Reverse Engineered Adaptation of Large Language Models. CoRR abs/2405.15007 (2024) - [i138]Kate Sanders, Benjamin Van Durme:
A Survey of Video Datasets for Grounded Event Understanding. CoRR abs/2406.09646 (2024) - [i137]Yunmo Chen, Tongfei Chen, Harsh Jhamtani, Patrick Xia, Richard Shin, Jason Eisner, Benjamin Van Durme:
Learning to Retrieve Iteratively for In-Context Learning. CoRR abs/2406.14739 (2024) - [i136]William Fleshman, Benjamin Van Durme:
RE-AdaptIR: Improving Information Retrieval through Reverse Engineered Adaptation. CoRR abs/2406.14764 (2024) - [i135]Abe Bohan Hou, Orion Weller, Guanghui Qin
, Eugene Yang, Dawn J. Lawrie, Nils Holzenberger, Andrew Blair-Stanek, Benjamin Van Durme:
CLERC: A Dataset for Legal Case Retrieval and Retrieval-Augmented Analysis Generation. CoRR abs/2406.17186 (2024) - [i134]Zhengping Jiang, Jingyu Zhang, Nathaniel Weir, Seth Ebner, Miriam Wanner, Kate Sanders, Daniel Khashabi, Anqi Liu, Benjamin Van Durme:
Core: Robust Factual Precision Scoring with Informative Sub-Claim Identification. CoRR abs/2407.03572 (2024) - [i133]Jiefu Ou, Arda Uzunoglu, Benjamin Van Durme, Daniel Khashabi:
WorldAPIs: The World Is Worth How Many APIs? A Thought Experiment. CoRR abs/2407.07778 (2024) - [i132]Xu Han, Felix Yu, João Sedoc, Benjamin Van Durme:
Baby Bear: Seeking a Just Right Rating Scale for Scalar Annotations. CoRR abs/2408.09765 (2024) - [i131]Abe Bohan Hou, William Jurayj, Nils Holzenberger, Andrew Blair-Stanek, Benjamin Van Durme:
Gaps or Hallucinations? Gazing into Machine-Generated Legal Analysis for Fine-grained Text Evaluations. CoRR abs/2409.09947 (2024) - [i130]Orion Weller, Benjamin Van Durme, Dawn J. Lawrie, Ashwin Paranjape, Yuhao Zhang, Jack Hessel:
Promptriever: Instruction-Trained Retrievers Can Be Prompted Like Language Models. CoRR abs/2409.11136 (2024) - [i129]Dongwei Jiang, Guoxuan Wang, Yining Lu, Andrew Wang, Jingyu Zhang, Chuyu Liu, Benjamin Van Durme, Daniel Khashabi:
RATIONALYST: Pre-training Process-Supervision for Improving Reasoning. CoRR abs/2410.01044 (2024) - [i128]Kate Sanders, Reno Kriz, David Etter, Hannah Recknor, Alexander Martin, Cameron Carpenter, Jingyang Lin, Benjamin Van Durme:
Grounding Partially-Defined Events in Multimodal Data. CoRR abs/2410.05267 (2024) - [i127]Jingyu Zhang, Ahmed Elgohary, Ahmed Magooda, Daniel Khashabi, Benjamin Van Durme:
Controllable Safety Alignment: Inference-Time Adaptation to Diverse Safety Requirements. CoRR abs/2410.08968 (2024) - [i126]Reno Kriz, Kate Sanders, David Etter, Kenton Murray, Cameron Carpenter, Kelly Van Ochten, Hannah Recknor, Jimena Guallar-Blasco, Alexander Martin, Ronald Colaianni, Nolan King, Eugene Yang, Benjamin Van Durme:
MultiVENT 2.0: A Massive Multilingual Benchmark for Event-Centric Video Retrieval. CoRR abs/2410.11619 (2024) - [i125]Millicent Li, Tongfei Chen, Benjamin Van Durme, Patrick Xia:
Multi-Field Adaptive Retrieval. CoRR abs/2410.20056 (2024) - [i124]Tong Chen, Hao Fang, Patrick Xia, Xiaodong Liu, Benjamin Van Durme, Luke Zettlemoyer, Jianfeng Gao, Hao Cheng:
Generative Adapter: Contextualizing Language Models in Parameters with A Single Forward Pass. CoRR abs/2411.05877 (2024) - [i123]Jeffrey Cheng, Benjamin Van Durme:
Compressed Chain of Thought: Efficient Reasoning Through Dense Representations. CoRR abs/2412.13171 (2024) - [i122]Miriam Wanner, Benjamin Van Durme, Mark Dredze:
DnDScore: Decontextualization and Decomposition for Factuality Verification in Long-Form Text Generation. CoRR abs/2412.13175 (2024) - [i121]Nathaniel Weir, Bhavana Dalvi Mishra, Orion Weller, Oyvind Tafjord, Sam Hornstein, Alexander Sabol, Peter Jansen, Benjamin Van Durme, Peter Clark:
From Models to Microtheories: Distilling a Model's Topical Knowledge for Grounded Question Answering. CoRR abs/2412.17701 (2024) - 2023
- [j9]Boyuan Zheng, Patrick Xia, Mahsa Yarmohammadi, Benjamin Van Durme:
Multilingual Coreference Resolution in Multiparty Dialogue. Trans. Assoc. Comput. Linguistics 11: 922-940 (2023) - [j8]Elias Stengel-Eskin, Benjamin Van Durme:
Calibrated Interpretation: Confidence Estimation in Semantic Parsing. Trans. Assoc. Comput. Linguistics 11: 1213-1231 (2023) - [c188]Dhruv Verma, Yash Kumar Lal, Shreyashee Sinha, Benjamin Van Durme, Adam Poliak:
Evaluating Paraphrastic Robustness in Textual Entailment Models. ACL (2) 2023: 880-892 - [c187]Elias Stengel-Eskin, Jimena Guallar-Blasco, Yi Zhou, Benjamin Van Durme:
Why Did the Chicken Cross the Road? Rephrasing and Analyzing Ambiguous Questions in VQA. ACL (1) 2023: 10220-10237 - [c186]Nathaniel Weir, Xingdi Yuan, Marc-Alexandre Côté, Matthew J. Hausknecht, Romain Laroche, Ida Momennejad, Harm van Seijen, Benjamin Van Durme:
One-Shot Learning from a Demonstration with Hierarchical Latent Language. AAMAS 2023: 2388-2390 - [c185]Zhuowan Li, Xingrui Wang, Elias Stengel-Eskin, Adam Kortylewski, Wufei Ma, Benjamin Van Durme, Alan L. Yuille:
Super-CLEVR: A Virtual Benchmark to Diagnose Domain Robustness in Visual Reasoning. CVPR 2023: 14963-14973 - [c184]Yunmo Chen, William Gantt, Weiwei Gu, Tongfei Chen, Aaron Steven White, Benjamin Van Durme:
Iterative Document-level Information Extraction via Imitation Learning. EACL 2023: 1850-1866 - [c183]Guanghui Qin, Yukun Feng, Benjamin Van Durme:
The NLP Task Effectiveness of Long-Range Transformers. EACL 2023: 3756-3772 - [c182]Haoran Xu, Weiting Tan, Shuyue Stella Li, Yunmo Chen, Benjamin Van Durme, Philipp Koehn, Kenton Murray:
Condensing Multilingual Knowledge with Lightweight Language-Specific Modules. EMNLP 2023: 1575-1587 - [c181]Elias Stengel-Eskin, Benjamin Van Durme:
Did You Mean...? Confidence-based Trade-offs in Semantic Parsing. EMNLP 2023: 2621-2629 - [c180]Kangda Wei, Dawn J. Lawrie, Benjamin Van Durme, Yunmo Chen, Orion Weller:
When Do Decompositions Help for Machine Reading? EMNLP 2023: 3599-3606 - [c179]Justin Payan, Swaroop Mishra, Mukul Singh, Carina Negreanu, Christian Pölitz, Chitta Baral, Subhro Roy, Rasika Chakravarthy, Benjamin Van Durme, Elnaz Nouri:
InstructExcel: A Benchmark for Natural Language Instruction in Excel. EMNLP (Findings) 2023: 4026-4043 - [c178]Yunmo Chen, William Gantt, Tongfei Chen, Aaron Steven White, Benjamin Van Durme:
A Unified View of Evaluation Metrics for Structured Prediction. EMNLP 2023: 12868-12882 - [c177]Andrew Blair-Stanek
, Nils Holzenberger
, Benjamin Van Durme
:
Can GPT-3 Perform Statutory Reasoning? ICAIL 2023: 22-31 - [c176]Guanghui Qin, Benjamin Van Durme:
Nugget: Neural Agglomerative Embeddings of Text. ICML 2023: 28337-28350 - [c175]Kate Sanders, David Etter, Reno Kriz, Benjamin Van Durme:
MultiVENT: Multilingual Videos of Events and Aligned Natural Text. NeurIPS 2023 - [c174]Marc Marone, Benjamin Van Durme:
Data Portraits: Recording Foundation Model Training Data. NeurIPS 2023 - [c173]Subhro Roy, Samuel Thomson, Tongfei Chen, Richard Shin, Adam Pauls, Jason Eisner, Benjamin Van Durme:
BenchCLAMP: A Benchmark for Evaluating Language Models on Syntactic and Semantic Parsing. NeurIPS 2023 - [i120]Andrew Blair-Stanek, Nils Holzenberger, Benjamin Van Durme:
Can GPT-3 Perform Statutory Reasoning? CoRR abs/2302.06100 (2023) - [i119]Marc Marone, Benjamin Van Durme:
Data Portraits: Recording Foundation Model Training Data. CoRR abs/2303.03919 (2023) - [i118]Elias Stengel-Eskin, Benjamin Van Durme:
Did You Mean...? Confidence-based Trade-offs in Semantic Parsing. CoRR abs/2303.16857 (2023) - [i117]Orion Weller, Dawn J. Lawrie, Benjamin Van Durme:
NevIR: Negation in Neural Information Retrieval. CoRR abs/2305.07614 (2023) - [i116]Harsh Jhamtani, Hao Fang, Patrick Xia, Eran Levy, Jacob Andreas, Benjamin Van Durme:
Natural Language Decomposition and Interpretation of Complex Utterances. CoRR abs/2305.08677 (2023) - [i115]Orion Weller, Marc Marone, Nathaniel Weir, Dawn J. Lawrie, Daniel Khashabi, Benjamin Van Durme:
"According to ..." Prompting Language Models Improves Quoting from Pre-Training Data. CoRR abs/2305.13252 (2023) - [i114]Haoran Xu, Weiting Tan, Shuyue Stella Li, Yunmo Chen, Benjamin Van Durme, Philipp Koehn, Kenton Murray:
Condensing Multilingual Knowledge with Lightweight Language-Specific Modules. CoRR abs/2305.13993 (2023) - [i113]Ishani Mondal, Michelle Yuan, Anandhavelu Natarajan, Aparna Garimella, Francis Ferraro, Andrew Blair-Stanek, Benjamin Van Durme, Jordan L. Boyd-Graber:
InteractiveIE: Towards Assessing the Strength of Human-AI Collaboration in Improving the Performance of Information Extraction. CoRR abs/2305.14659 (2023) - [i112]Elias Stengel-Eskin, Kyle Rawlins, Benjamin Van Durme:
Zero and Few-shot Semantic Parsing with Ambiguous Inputs. CoRR abs/2306.00824 (2023) - [i111]Dhruv Verma, Yash Kumar Lal, Shreyashee Sinha, Benjamin Van Durme, Adam Poliak:
Evaluating Paraphrastic Robustness in Textual Entailment Models. CoRR abs/2306.16722 (2023) - [i110]Kate Sanders, David Etter, Reno Kriz, Benjamin Van Durme:
MultiVENT: Multilingual Videos of Events with Aligned Natural Text. CoRR abs/2307.03153 (2023) - [i109]Samuel Barham, Orion Weller, Michelle Yuan, Kenton Murray, Mahsa Yarmohammadi, Zhengping Jiang, Siddharth Vashishtha, Alexander Martin
, Anqi Liu, Aaron Steven White, Jordan L. Boyd-Graber, Benjamin Van Durme:
MegaWika: Millions of reports and their sources across 50 diverse languages. CoRR abs/2307.07049 (2023) - [i108]Orion Weller, Kyle Lo, David Wadden, Dawn J. Lawrie, Benjamin Van Durme, Arman Cohan
, Luca Soldaini:
When do Generative Query and Document Expansions Fail? A Comprehensive Study Across Methods, Retrievers, and Datasets. CoRR abs/2309.08541 (2023) - [i107]Andrew Blair-Stanek, Nils Holzenberger, Benjamin Van Durme:
OpenAI Cribbed Our Tax Example, But Can GPT-4 Really Do Tax? CoRR abs/2309.09992 (2023) - [i106]Kumar Shridhar, Harsh Jhamtani, Hao Fang, Benjamin Van Durme, Jason Eisner, Patrick Xia:
SCREWS: A Modular Framework for Reasoning with Revisions. CoRR abs/2309.13075 (2023) - [i105]Guanghui Qin, Benjamin Van Durme:
Nugget: Neural Agglomerative Embeddings of Text. CoRR abs/2310.01732 (2023) - [i104]Guanghui Qin
, Corby Rosset, Ethan C. Chau, Nikhil Rao, Benjamin Van Durme:
Nugget 2D: Dynamic Contextual Compression for Scaling Decoder-only Language Models. CoRR abs/2310.02409 (2023) - [i103]Abe Bohan Hou, Jingyu Zhang, Tianxing He, Yichen Wang, Yung-Sung Chuang, Hongwei Wang, Lingfeng Shen, Benjamin Van Durme, Daniel Khashabi, Yulia Tsvetkov:
SemStamp: A Semantic Watermark with Paraphrastic Robustness for Text Generation. CoRR abs/2310.03991 (2023) - [i102]Yunmo Chen, William Gantt, Tongfei Chen, Aaron Steven White, Benjamin Van Durme:
A Unified View of Evaluation Metrics for Structured Prediction. CoRR abs/2310.13793 (2023) - [i101]Justin Payan, Swaroop Mishra, Mukul Singh, Carina Negreanu, Christian Pölitz, Chitta Baral, Subhro Roy, Rasika Chakravarthy, Benjamin Van Durme, Elnaz Nouri:
InstructExcel: A Benchmark for Natural Language Instruction in Excel. CoRR abs/2310.14495 (2023) - [i100]Weiting Tan, Haoran Xu, Lingfeng Shen, Shuyue Stella Li, Kenton Murray, Philipp Koehn, Benjamin Van Durme, Yunmo Chen:
Narrowing the Gap between Zero- and Few-shot Machine Translation by Matching Styles. CoRR abs/2311.02310 (2023) - [i99]Siddharth Vashishtha, Alexander Martin
, William Gantt, Benjamin Van Durme, Aaron Steven White:
FAMuS: Frames Across Multiple Sources. CoRR abs/2311.05601 (2023) - [i98]William Fleshman, Benjamin Van Durme:
Toucan: Token-Aware Character Level Language Modeling. CoRR abs/2311.08620 (2023) - [i97]Andrew Blair-Stanek, Nils Holzenberger, Benjamin Van Durme:
BLT: Can Large Language Models Handle Basic Legal Text? CoRR abs/2311.09693 (2023) - [i96]Nikita Moghe, Patrick Xia, Jacob Andreas, Jason Eisner, Benjamin Van Durme, Harsh Jhamtani:
Interpreting User Requests in the Context of Natural Language Standing Instructions. CoRR abs/2311.09796 (2023) - [i95]Sky CH-Wang, Benjamin Van Durme, Jason Eisner, Chris Kedzie:
Do Androids Know They're Only Dreaming of Electric Sheep? CoRR abs/2312.17249 (2023) - 2022
- [c172]Anton Belyy, Chieh-Yang Huang, Jacob Andreas, Emmanouil Antonios Platanios, Sam Thomson, Richard Shin, Subhro Roy, Aleksandr Nisnevich, Charles Chen, Benjamin Van Durme:
Guided K-best Selection for Semantic Parsing Annotation. ACL (demo) 2022: 114-126 - [c171]Kevin Yang, Olivia Deng, Charles Chen, Richard Shin, Subhro Roy, Benjamin Van Durme:
Addressing Resource and Privacy Constraints in Semantic Parsing Through Data Augmentation. ACL (Findings) 2022: 3685-3695 - [c170]Michelle Yuan, Patrick Xia, Chandler May
, Benjamin Van Durme, Jordan L. Boyd-Graber:
Adapting Coreference Resolution Models through Active Learning. ACL (1) 2022: 7533-7549 - [c169]Zhengping Jiang, Anqi Liu, Benjamin Van Durme:
Calibrating Zero-shot Cross-lingual (Un-)structured Predictions. EMNLP 2022: 2648-2674 - [c168]