default search action
NAACL-HLT Findings 2024: Mexico City, Mexico
- Kevin Duh, Helena Gómez-Adorno, Steven Bethard:
Findings of the Association for Computational Linguistics: NAACL 2024, Mexico City, Mexico, June 16-21, 2024. Association for Computational Linguistics 2024, ISBN 979-8-89176-119-3 - Honghe Zhang, Xiaolong Shi, Jingwei Sun, Guangzhong Sun:
Structured Pruning for Large Language Models Using Coupled Components Elimination and Minor Fine-tuning. 1-12 - Taiqiang Wu, Cheng Hou, Shanshan Lao, Jiayi Li, Ngai Wong, Zhe Zhao, Yujiu Yang:
Weight-Inherited Distillation for Task-Agnostic BERT Compression. 13-28 - Eugene Jang, Jian Cui, Dayeon Yim, Youngjin Jin, Jin-Woo Chung, Seungwon Shin, Yongjae Lee:
Ignore Me But Don't Replace Me: Utilizing Non-Linguistic Elements for Pretraining on the Cybersecurity Domain. 29-42 - Nachshon Cohen, Yaron Fairstein, Guy Kushilevitz:
Extremely efficient online query encoding for dense retrieval. 43-50 - Wenting Zhao, Ye Liu, Tong Niu, Yao Wan, Philip S. Yu, Shafiq Joty, Yingbo Zhou, Semih Yavuz:
DIVKNOWQA: Assessing the Reasoning Ability of LLMs via Open-Domain Question Answering over Knowledge Base and Text. 51-68 - Aleksandar Pavlovic, Emanuel Sallinger:
SpeedE: Euclidean Geometric Knowledge Graph Embedding Strikes Back. 69-92 - Hitesh Golchha, Sahil Yerawar, Dhruvesh Patel, Soham Dan, Keerthiram Murugesan:
Language Guided Exploration for RL Agents in Text Environments. 93-102 - Saranya Venkatraman, Adaku Uchendu, Dongwon Lee:
GPT-who: An Information Density-based Machine-Generated Text Detector. 103-115 - Peng Tang, Pengkai Zhu, Tian Li, Srikar Appalaraju, Vijay Mahadevan, R. Manmatha:
DEED: Dynamic Early Exit on Decoder for Accelerating Encoder-Decoder Transformer Models. 116-131 - Ta-Chung Chi, Ting-Han Fan, Alexander Rudnicky:
Attention Alignment and Flexible Positional Embeddings Improve Transformer Length Extrapolation. 132-148 - Canwen Xu, Corby Rosset, Ethan C. Chau, Luciano Del Corro, Shweti Mahajan, Julian J. McAuley, Jennifer Neville, Ahmed Awadallah, Nikhil Rao:
Automatic Pair Construction for Contrastive Post-training. 149-162 - Miaoran Li, Baolin Peng, Michel Galley, Jianfeng Gao, Zhu Zhang:
Self-Checker: Plug-and-Play Modules for Fact-Checking with Large Language Models. 163-181 - Antoine Nzeyimana:
Low-resource neural machine translation with morphological modeling. 182-195 - Zhendong Chu, Ruiyi Zhang, Tong Yu, Rajiv Jain, Vlad I. Morariu, Jiuxiang Gu, Ani Nenkova:
Self-Cleaning: Improving a Named Entity Recognizer Trained on Noisy Data with a Few Clean Instances. 196-210 - Phong Do, Son Tran, Phu Hoang, Kiet Van Nguyen, Ngan Luu-Thuy Nguyen:
VLUE: A New Benchmark and Multi-task Knowledge Transfer Learning for Vietnamese Natural Language Understanding. 211-222 - Xingyao Wang, Hao Peng, Reyhaneh Jabbarvand, Heng Ji:
LETI: Learning to Generate from Textual Interactions. 223-239 - Yonghui Kong, Cunhang Fan, Yujie Chen, Shuai Zhang, Zhao Lv, Jianhua Tao:
Bilateral Masking with prompt for Knowledge Graph Completion. 240-249 - Zhenpeng Su, Zijia Lin, Bai Xue, Hui Chen, Guiguang Ding, Wei Zhou, Songlin Hu:
MiLe Loss: a New Loss for Mitigating the Bias of Learning Difficulties in Generative Language Models. 250-262 - Jiaxin Zhang, Yashar Moshfeghi:
GOLD: Geometry Problem Solver with Natural Language Description. 263-278 - Rotaru Codrut, Nicolae-Catalin Ristea, Radu Tudor Ionescu:
RoDia: A New Dataset for Romanian Dialect Identification from Speech. 279-286 - Rochelle Choenni, Ekaterina Shutova, Dan Garrette:
Examining Modularity in Multilingual LMs via Language-Specialized Subnetworks. 287-301 - Yinger Zhang, Hui Cai, Xierui Song, Yicheng Chen, Rui Sun, Jing Zheng:
Reverse Chain: A Generic-Rule for LLMs to Master Multi-API Planning. 302-325 - Jiqun Chu, Zuoquan Lin:
Incorporating Exponential Smoothing into MLP: a Simple but Effective Sequence Model. 326-337 - Yuxuan Kuang, Hai Lin, Meng Jiang:
OpenFMNav: Towards Open-Set Zero-Shot Object Navigation via Vision-Language Foundation Models. 338-351 - Nathan Brake, Thomas Schaaf:
Comparing Two Model Designs for Clinical Note Generation; Is an LLM a Useful Evaluator of Consistency? 352-363 - Yueen Ma, Dafeng Chi, Jingjing Li, Kai Song, Yuzheng Zhuang, Irwin King:
VOLTA: Improving Generative Diversity by Variational Mutual Information Maximizing Autoencoder. 364-378 - Divya V. Sharma:
EcoSpeak: Cost-Efficient Bias Mitigation for Partially Cross-Lingual Speaker Verification. 379-394 - Rajarshi Bhowmik, Marco Ponza, Atharva Tendle, Anant Gupta, Rebecca Jiang, Xingyu Lu, Qian Zhao, Daniel Preotiuc-Pietro:
Leveraging Contextual Information for Effective Entity Salience Detection. 395-408 - Qihui Zhang, Chujie Gao, Dongping Chen, Yue Huang, Yixin Huang, Zhenyang Sun, Shilin Zhang, Weiye Li, Zhengyan Fu, Yao Wan, Lichao Sun:
LLM-as-a-Coauthor: Can Mixed Human-Written and Machine-Generated Text Be Detected? 409-436 - Ivo Verhoeven, Pushkar Mishra, Rahel Beloch, Helen Yannakoudakis, Ekaterina Shutova:
A (More) Realistic Evaluation Setup for Generalisation of Community Models on Malicious Content Detection. 437-463 - Jie Huang, Kevin Chang:
Citation: A Key to Building Responsible and Accountable Large Language Models. 464-473 - Yingji Zhang, Marco Valentino, Danilo S. Carvalho, Ian Pratt-Hartmann, André Freitas:
Graph-Induced Syntactic-Semantic Spaces in Transformer-Based Variational AutoEncoders. 474-489 - Weiting Tan, Haoran Xu, Lingfeng Shen, Shuyue Stella Li, Kenton Murray, Philipp Koehn, Benjamin Van Durme, Yunmo Chen:
Narrowing the Gap between Zero- and Few-shot Machine Translation by Matching Styles. 490-502 - Debarati Das, Ishaan Gupta, Jaideep Srivastava, Dongyeop Kang:
Which Modality should I use - Text, Motif, or Image? : Understanding Graphs with Large Language Models. 503-519 - Hieu Hoang, Huda Khayrallah, Marcin Junczys-Dowmunt:
On-the-Fly Fusion of Large Language Models and Machine Translation. 520-532 - Dawei Li, William Hogan, Jingbo Shang:
READ: Improving Relation Extraction from an ADversarial Perspective. 533-548 - Sana Ebrahimi, Nima Shahbazi, Abolfazl Asudeh:
REQUAL-LM: Reliability and Equity through Aggregation in Large Language Models. 549-560 - Hannah Chen, Yangfeng Ji, David Evans:
Addressing Both Statistical and Causal Gender Fairness in NLP Models. 561-582 - Hanjia Lyu, Song Jiang, Hanqing Zeng, Yinglong Xia, Qifan Wang, Si Zhang, Ren Chen, Christopher Leung, Jiajie Tang, Jiebo Luo:
LLM-Rec: Personalized Recommendation via Prompting Large Language Models. 583-612 - Jie Ren, Han Xu, Yiding Liu, Yingqian Cui, Shuaiqiang Wang, Dawei Yin, Jiliang Tang:
A Robust Semantics-based Watermark for Large Language Model against Paraphrasing. 613-625 - Shraddha Barke, Christian Pölitz, Carina Negreanu, Benjamin Zorn, José Cambronero, Andrew D. Gordon, Vu Le, Elnaz Nouri, Nadia Polikarpova, Advait Sarkar, Brian Slininger, Neil Toronto, Jack Williams:
Solving Data-centric Tasks using Large Language Models. 626-638 - Jiaxin Guo, Hao Yang, Zongyao Li, Daimeng Wei, Hengchao Shang, Xiaoyu Chen:
A Novel Paradigm Boosting Translation Capabilities of Large Language Models. 639-649 - Ye Yuan, Kexin Tang, Jianhao Shen, Ming Zhang, Chenguang Wang:
Measuring Social Norms of Large Language Models. 650-699 - Maxwell Yin, Boyu Wang, Charles Ling:
Source-Free Unsupervised Domain Adaptation for Question Answering via Prompt-Assisted Self-learning. 700-713 - Chenlong Zhao, Xiwen Zhou, Xiaopeng Xie, Yong Zhang:
Hierarchical Attention Graph for Scientific Document Summarization in Global and Local Level. 714-726 - Nalin Kumar, Ondrej Dusek:
LEEETs-Dial: Linguistic Entrainment in End-to-End Task-oriented Dialogue systems. 727-735 - Bogdan Dobre:
Efficient Dependency Tree Sampling Without Replacement. 736-741 - Zixuan Zhang, Revanth Gangi Reddy, Kevin Small, Tong Zhang, Heng Ji:
Towards Better Generalization in Open-Domain Question Answering by Mitigating Context Memorization. 742-753 - Yixiao Song, Kalpesh Krishna, Rajesh Bhatt, Kevin Gimpel, Mohit Iyyer:
GEE! Grammar Error Explanation with Large Language Models. 754-781 - Wanpeng Zhang, Zongqing Lu:
AdaRefiner: Refining Decisions of Language Models with Adaptive Feedback. 782-799 - Weihao Zeng, Dayuan Fu, Keqing He, Yejie Wang, Yukai Xu, Weiran Xu:
DivTOD: Unleashing the Power of LLMs for Diversifying Task-Oriented Dialogue Representations. 800-813 - Pavel Denisov, Thang Vu:
Teaching a Multilingual Large Language Model to Understand Multilingual Speech via Multi-Instructional Training. 814-834 - Wenhong Zhu, Hongkun Hao, Zhiwei He, Yunze Song, Jiao Yueyang, Yumeng Zhang, Hanxu Hu, Yiran Wei, Rui Wang, Hongyuan Lu:
CLEAN-EVAL: Clean Evaluation on Contaminated Large Language Models. 835-847 - Roshan Sharma, Ruchira Sharma, Hira Dhamyal, Rita Singh, Bhiksha Raj:
R-BASS : Relevance-aided Block-wise Adaptation for Speech Summarization. 848-857 - Fei Yu, Anningzhe Gao, Benyou Wang:
OVM, Outcome-supervised Value Models for Planning in Mathematical Reasoning. 858-875 - Lei Wang, Ee-Peng Lim:
The Whole is Better than the Sum: Using Aggregated Demonstrations in In-Context Learning for Sequential Recommendation. 876-895 - Dhruv Agarwal, Rajarshi Das, Sopan Khosla, Rashmi Gangadharaiah:
Bring Your Own KG: Self-Supervised Program Synthesis for Zero-Shot KGQA. 896-919 - Shuzhou Yuan, Michael Färber:
GraSAME: Injecting Token-Level Structural Information to Pretrained Language Models via Graph-guided Self-Attention Mechanism. 920-933 - Boxin Wang, Yibo Zhang, Yuan Cao, Bo Li, Hugh McMahan, Sewoong Oh, Zheng Xu, Manzil Zaheer:
Can Public Large Language Models Help Private Cross-device Federated Learning? 934-949 - Bowen Pan, Rameswar Panda, SouYoung Jin, Rogério Feris, Aude Oliva, Phillip Isola, Yoon Kim:
LangNav: Language as a Perceptual Representation for Navigation. 950-974 - Tenghao Huang, Dongwon Jung, Vaibhav Kumar, Mohammad Kachuee, Xiang Li, Puyang Xu, Muhao Chen:
Planning and Editing What You Retrieve for Enhanced Tool Learning. 975-988 - Victor Carbune, Hassan Mansoor, Fangyu Liu, Rahul Aralikatte, Gilles Baechler, Jindong Chen, Abhanshu Sharma:
Chart-based Reasoning: Transferring Capabilities from LLMs to VLMs. 989-1004 - Chi-Heng Lin, Shikhar Tuli, James Seale Smith, Yen-Chang Hsu, Yilin Shen, Hongxia Jin:
SLiM: Speculative Decoding with Hypothesis Reduction. 1005-1017 - Zoher Kachwala, Jisun An, Haewoon Kwak, Filippo Menczer:
REMATCH: Robust and Efficient Matching of Local Knowledge Graphs to Improve Structural and Semantic Similarity. 1018-1028 - Ben Hutchinson:
Modeling the Sacred: Considerations when Using Religious Texts in Natural Language Processing. 1029-1043 - William Macke, Michael Doyle:
Testing the Effect of Code Documentation on Large Language Model Code Understanding. 1044-1050 - Yuwei Cao, Nikhil Mehta, Xinyang Yi, Raghunandan Hulikal Keshavan, Lukasz Heldt, Lichan Hong, Ed H. Chi, Maheswaran Sathiamoorthy:
Aligning Large Language Models with Recommendation Knowledge. 1051-1066 - Yihong Liu, Peiqin Lin, Mingyang Wang, Hinrich Schütze:
OFA: A Framework of Initializing Unseen Subword Embeddings for Efficient Large-scale Multilingual Continued Pretraining. 1067-1097 - Minju Kim, Haein Jung, Myoung-Wan Koo:
SELF-EXPERTISE: Knowledge-based Instruction Dataset Augmentation for a Legal Expert Language Model. 1098-1112 - Boyi Li, Rodolfo Corona, Karttikeya Mangalam, Catherine Chen, Daniel Flaherty, Serge J. Belongie, Kilian Q. Weinberger, Jitendra Malik, Trevor Darrell, Dan Klein:
Re-evaluating the Need for Visual Signals in Unsupervised Grammar Induction. 1113-1123 - Zixiao Zhu, Junlang Qian, Zijian Feng, Hanzhang Zhou, Kezhi Mao:
EDEntail: An Entailment-based Few-shot Text Classification with Extensional Definition. 1124-1137 - KV Aditya Srivatsa, Ekaterina Kochmar:
What Makes Math Word Problems Challenging for LLMs? 1138-1148 - Lee Hyun, Kim Sung-Bin, Seungju Han, Youngjae Yu, Tae-Hyun Oh:
SMILE: Multimodal Dataset for Understanding Laughter in Video with Language Models. 1149-1167 - Wenshuo Peng, Kaipeng Zhang, Sai Qian Zhang:
T3M: Text Guided 3D Human Motion Synthesis from Speech. 1168-1177 - Miao Peng, Ben Liu, Wenjie Xu, Zihao Jiang, Jiahui Zhu, Min Peng:
Deja vu: Contrastive Historical Modeling with Prefix-tuning for Temporal Knowledge Graph Reasoning. 1178-1191 - Nishchal Prasad, Taoufiq Dkaki, Mohand Boughanem:
Explanation Extraction from Hierarchical Classification Frameworks for Long Legal Documents. 1192-1201 - Chenxi Whitehouse, Fantine Huot, Jasmijn Bastings, Mostafa Dehghani, Chu-Cheng Lin, Mirella Lapata:
Low-Rank Adaptation for Multilingual Summarization: An Empirical Study. 1202-1228 - Leonardo Ranaldi, Giulia Pucci, Federico Ranaldi, Elena Sofia Ruzzetti, Fabio Massimo Zanzotto:
A Tree-of-Thoughts to Broaden Multi-step Reasoning across Languages. 1229-1241 - Sherin Muckatira, Vijeta Deshpande, Vladislav Lialin, Anna Rumshisky:
Emergent Abilities in Reduced-Scale Generative Language Models. 1242-1257 - Clemencia Siro, Mohammad Aliannejadi, Maarten de Rijke:
Context Does Matter: Implications for Crowdsourced Evaluation Labels in Task-Oriented Dialogue Systems. 1258-1273 - Xixi Zhou, Chunbin Gu, Xin Jie, Jiajun Bu, Haishuai Wang:
Matching Varying-Length Texts via Topic-Informed and Decoupled Sentence Embeddings. 1274-1280 - Bruce W. Lee, Hyunsoo Cho, Kang Min Yoo:
Instruction Tuning with Human Curriculum. 1281-1309 - Md Masudur Rahman, Yexiang Xue:
Natural Language-based State Representation in Deep Reinforcement Learning. 1310-1319 - Junzhe Wang, Qiang Zeng, Lannan Luo:
Learning Cross-Architecture Instruction Embeddings for Binary Code Analysis in Low-Resource Architectures. 1320-1332 - Xiaodong Yu, Hao Cheng, Xiaodong Liu, Dan Roth, Jianfeng Gao:
ReEval: Automatic Hallucination Evaluation for Retrieval-Augmented Large Language Models via Transferable Adversarial Attacks. 1333-1351 - Tien-Hong Lo, Fu-An Chao, Tzu-I Wu, Yao-Ting Sung, Berlin Chen:
An Effective Automated Speaking Assessment Approach to Mitigating Data Scarcity and Imbalanced Distribution. 1352-1362 - Shen Zheng, Yuyu Zhang, Yijie Zhu, Chenguang Xi, Pengyang Gao, Zhou Xun, Kevin Chang:
GPT-Fathom: Benchmarking Large Language Models to Decipher the Evolutionary Path towards GPT-4 and Beyond. 1363-1382 - Raj Patel, Carlotta Domeniconi:
Subword Attention and Post-Processing for Rare and Unknown Contextualized Embeddings. 1383-1389 - Sagar Gubbi Venkatesh, Partha Talukdar, Srini Narayanan:
UGIF-DataSet: A New Dataset for Cross-lingual, Cross-modal Sequential actions on the UI. 1390-1399 - Hossein Hajipour, Ning Yu, Cristian-Alexandru Staicu, Mario Fritz:
SimSCOOD: Systematic Analysis of Out-of-Distribution Generalization in Fine-tuned Source Code Models. 1400-1416 - Nan Zhang, Yanchi Liu, Xujiang Zhao, Wei Cheng, Runxue Bao, Rui Zhang, Prasenjit Mitra, Haifeng Chen:
Pruning as a Domain-specific LLM Extractor. 1417-1428 - Wenda Xu, Daniel Deutsch, Mara Finkelstein, Juraj Juraska, Biao Zhang, Zhongtao Liu, William Yang Wang, Lei Li, Markus Freitag:
LLMRefine: Pinpointing and Refining Large Language Models via Fine-Grained Actionable Feedback. 1429-1445 - Pengyu Xu, Mingyang Song, Linkaida Liu, Bing Liu, Hongjian Sun, Liping Jing, Jian Yu:
Noisy Multi-Label Text Classification via Instance-Label Pair Correction. 1446-1458 - Hai Huang, Zhengyu Zhao, Michael Backes, Yun Shen, Yang Zhang:
Composite Backdoor Attacks Against Large Language Models. 1459-1472 - Jinyan Su, Claire Cardie, Preslav Nakov:
Adapting Fake News Detection to the Era of Large Language Models. 1473-1490 - Youbo Lei, Feifei He, Chen Chen, Yingbin Mo, Sijia Li, Defeng Xie, Haonan Lu:
MCAD: Multi-teacher Cross-modal Alignment Distillation for efficient image-text retrieval. 1491-1503 - Zhen Qin, Rolf Jagerman, Kai Hui, Honglei Zhuang, Junru Wu, Le Yan, Jiaming Shen, Tianqi Liu, Jialu Liu, Donald Metzler, Xuanhui Wang, Michael Bendersky:
Large Language Models are Effective Text Rankers with Pairwise Ranking Prompting. 1504-1518 - Zhihan Guo, Yifei Zhang, Zhuo Zhang, Zenglin Xu, Irwin King:
FedLFC: Towards Efficient Federated Multilingual Modeling with LoRA-based Language Family Clustering. 1519-1528 - Mohammad Mahdi Abdollah Pour, Ali Pesaranghader, Eldan Cohen, Scott Sanner:
Gaussian Process Optimization for Adaptable Multi-Objective Text Generation using Linearly-Weighted Language Models. 1529-1536 - Alessandro Stolfo:
Groundedness in Retrieval-augmented Long-form Generation: An Empirical Study. 1537-1552 - Mehrnaz Moslemi, Amal Zouaq:
TagDebias: Entity and Concept Tagging for Social Bias Mitigation in Pretrained Language Models. 1553-1567 - Edwin Thomas, Sowmya Vajjala:
Improving Absent Keyphrase Generation with Diversity Heads. 1568-1584 - Tianze Hua, Tian Yun, Ellie Pavlick:
mOthello: When Do Cross-Lingual Representation Alignment and Cross-Lingual Transfer Emerge in Multilingual Models? 1585-1598 - Farsheed Haque, Depeng Xu, Shuhan Yuan:
Discovering and Mitigating Indirect Bias in Attention-Based Model Explanations. 1599-1614 - Ziyi Yang, Mahmoud Khademi, Yichong Xu, Reid Pryzant, Yuwei Fang, Chenguang Zhu, Dongdong Chen, Yao Qian, Xuemei Gao, Yi-Ling Chen, Robert Gmyr, Naoyuki Kanda, Noel Codella, Bin Xiao, Yu Shi, Lu Yuan, Takuya Yoshioka, Michael Zeng, Xuedong Huang:
i-Code V2: An Autoregressive Generation Framework over Vision, Language, and Speech Data. 1615-1627 - Yifu Qiu, Varun Embar, Shay B. Cohen, Benjamin Han:
Think While You Write: Hypothesis Verification Promotes Faithful Knowledge-to-Text Generation. 1628-1644 - Aditi Chaudhary, Karthik Raman, Michael Bendersky:
It's All Relative! - A Synthetic Query Generation Approach for Improving Zero-Shot Relevance Prediction. 1645-1664 - Saeed Khaki, JinJin Li, Lan Ma, Liu Yang, Prathap Ramachandra:
RS-DPO: A Hybrid Rejection Sampling and Direct Preference Optimization Method for Alignment of Large Language Models. 1665-1680 - Changqun Li, Linlin Wang, Xin Lin, Shizhou Huang, Liang He:
Hypernetwork-Assisted Parameter-Efficient Fine-Tuning with Meta-Knowledge Distillation for Domain Knowledge Disentanglement. 1681-1695 - Roy Siegelmann, Ninareh Mehrabi, Palash Goyal, Prasoon Goyal, Lisa Bauer, Jwala Dhamala, Aram Galstyan, Rahul Gupta, Reza Ghanadan:
MICo: Preventative Detoxification of Large Language Models through Inhibition Control. 1696-1703 - Wendi Li, Wei Wei, Kaihe Xu, Wenfeng Xie, Dangyang Chen, Yu Cheng:
Reinforcement Learning with Token-level Feedback for Controllable Text Generation. 1704-1719 - Pei Chen, Shuai Zhang, Boran Han:
CoMM: Collaborative Multi-Agent, Multi-Reasoning-Path Prompting for Complex Problem Solving. 1720-1738 - Anaelia Ovalle, Ninareh Mehrabi, Palash Goyal, Jwala Dhamala, Kai-Wei Chang, Richard S. Zemel, Aram Galstyan, Yuval Pinter, Rahul Gupta:
Tokenization Matters: Navigating Data-Scarce Tokenization for Gender Inclusive Language Technologies. 1739-1756 - Ramit Sawhney, Shrey Pandit, Vishwa Shah, Megh Thakkar, Shafiq Joty:
AdaPT: A Set of Guidelines for Hyperbolic Multimodal Multilingual NLP. 1757-1771