


default search action
EMNLP 2024: Miami, FL, USA
- Yaser Al-Onaizan, Mohit Bansal, Yun-Nung Chen:
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, EMNLP 2024, Miami, FL, USA, November 12-16, 2024. Association for Computational Linguistics 2024, ISBN 979-8-89176-164-3 - Frontmatter.
- Juhwan Choi, Yeonghwa Kim, Seunguk Yu, Jungmin Yun, Youngbin Kim:
UniGen: Universal Domain Generalization for Sentiment Classification via Zero-shot Dataset Generation. 1-14 - Juhwan Choi, Jungmin Yun, Kyohoon Jin, Youngbin Kim:
Multi-News+: Cost-efficient Dataset Cleansing via LLM-based Data Annotation. 15-29 - Joonho Yang, Seunghyun Yoon, Byeongjeong Kim, Hwanhee Lee:
FIZZ: Factual Inconsistency Detection by Zoom-in Summary and Zoom-out Document. 30-45 - Rimon Melamed, Lucas H. McCabe, Tanay Wakhare, Yejin Kim, H. Howie Huang, Enric Boix-Adserà:
Prompts have evil twins. 46-74 - Vaishali Pal, Evangelos Kanoulas, Andrew Yates, Maarten de Rijke
:
Table Question Answering for Low-resourced Indic Languages. 75-92 - Roopal Garg, Andrea Burns, Burcu Karagol Ayan, Yonatan Bitton, Ceslee Montgomery, Yasumasa Onoe, Andrew Bunner, Ranjay Krishna, Jason Baldridge, Radu Soricut:
ImageInWords: Unlocking Hyper-Detailed Image Descriptions. 93-127 - Yihuai Lan, Zhiqiang Hu, Lei Wang, Yang Wang, Deheng Ye, Peilin Zhao, Ee-Peng Lim, Hui Xiong, Hao Wang:
LLM-Based Agent Society Investigation: Collaboration and Confrontation in Avalon Gameplay. 128-145 - Xiangyu Zhang
, Hexin Liu, Kaishuai Xu, Qiquan Zhang, Daijiao Liu, Beena Ahmed, Julien Epps:
When LLMs Meets Acoustic Landmarks: An Efficient Approach to Integrate Speech into Large Language Models for Depression Detection. 146-158 - Xiangyu Zhang
, Daijiao Liu, Hexin Liu, Qiquan Zhang, Hanyu Meng
, Leibny Paola García-Perera, Engsiong Chng, Lina Yao:
Speaking in Wavelet Domain: A Simple and Efficient Approach to Speed up Speech Diffusion Model. 159-171 - Sanne Hoeken, Sina Zarrieß, Özge Alaçam:
Hateful Word in Context Classification. 172-186 - Özge Alaçam, Sanne Hoeken, Sina Zarrieß:
Eyes Don't Lie: Subjective Hate Annotation and Detection with Gaze. 187-205 - Eli Schwartz, Leshem Choshen, Joseph Shtok, Sivan Doveh, Leonid Karlinsky, Assaf Arbelle:
NumeroLogic: Number Encoding for Enhanced LLMs' Numerical Reasoning. 206-212 - Shaz Furniturewala, Surgan Jandial, Abhinav Java, Pragyan Banerjee, Simra Shahid, Sumit Bhatia, Kokil Jaidka:
"Thinking" Fair and Slow: On the Efficacy of Structured Prompts for Debiasing Language Models. 213-227 - Wendi Zhou, Tianyi Li, Pavlos Vougiouklis, Mark Steedman, Jeff Z. Pan:
A Usage-centric Take on Intent Understanding in E-Commerce. 228-236 - Oded Ovadia, Menachem Brief, Moshik Mishaeli, Oren Elisha:
Fine-Tuning or Retrieval? Comparing Knowledge Injection in LLMs. 237-250 - Amir Taubenfeld, Yaniv Dover, Roi Reichart, Ariel Goldstein:
Systematic Biases in LLM Simulations of Debates. 251-267 - Katherine Atwell, Danielle Bragg, Malihe Alikhani:
Studying and Mitigating Biases in Sign Language Understanding Models. 268-283 - Xinmeng Huang, Shuo Li, Mengxin Yu, Matteo Sesia, Hamed Hassani, Insup Lee, Osbert Bastani, Edgar Dobriban:
Uncertainty in Language Models: Assessment through Rank-Calibration. 284-312 - Junjie Ye, Yilong Wu, Songyang Gao, Caishuang Huang, Sixian Li, Guanyu Li, Xiaoran Fan, Qi Zhang, Tao Gui, Xuanjing Huang:
RoTBench: A Multi-Level Benchmark for Evaluating the Robustness of Large Language Models in Tool Learning. 313-333 - Fangkai Jiao, Chengwei Qin, Zhengyuan Liu, Nancy F. Chen, Shafiq Joty:
Learning Planning-based Reasoning by Trajectories Collection and Process Reward Synthesizing. 334-350 - Santiago Cuervo, Ricard Marxer
:
Scaling Properties of Speech Language Models. 351-361 - Rajkumar Pujari, Chengfei Wu, Dan Goldwasser:
"We Demand Justice!": Towards Social Context Grounding of Political Texts. 362-372 - Rabindra Nath Nandi, Suman Kalyan Maity, Brian Uzzi, Sourav Medya:
An Experimental Analysis on Evaluating Patent Citations. 373-387 - Dawei Zhu, Pinzhen Chen, Miaoran Zhang, Barry Haddow, Xiaoyu Shen, Dietrich Klakow:
Fine-Tuning Large Language Models to Translate: Will a Touch of Noisy Data in Misaligned Languages Suffice? 388-409 - Le Yan, Zhen Qin, Honglei Zhuang, Rolf Jagerman, Xuanhui Wang, Michael Bendersky, Harrie Oosterhuis:
Consolidating Ranking and Relevance Predictions of Large Language Models through Post-Processing. 410-423 - Tong Zhang, Chen Huang, Yang Deng, Hongru Liang, Jia Liu, Zujie Wen, Wenqiang Lei, Tat-Seng Chua:
Strength Lies in Differences! Improving Strategy Planning for Non-collaborative Dialogues via Diversified User Simulation. 424-444 - Saiful Islam Salim, Rubin Yuchan Yang, Alexander Cooper, Suryashree Ray, Saumya Debray, Sazzadur Rahaman:
Impeding LLM-assisted Cheating in Introductory Programming Assignments via Adversarial Perturbation. 445-463 - Yuan Ge, Yilun Liu, Chi Hu, Weibin Meng, Shimin Tao, Xiaofeng Zhao, Mahong Xia, Zhang Li, Boxing Chen, Hao Yang, Bei Li, Tong Xiao, JingBo Zhu:
Clustering and Ranking: Diversity-preserved Instruction Selection through Expert-aligned Quality Estimation. 464-478 - Abhilasha Sancheti, Haozhe An, Rachel Rudinger:
On the Influence of Gender and Race in Romantic Relationship Prediction from Large Language Models. 479-494 - Maureen de Seyssel, Antony D'Avirro, Adina Williams, Emmanuel Dupoux:
EmphAssess : a Prosodic Benchmark on Assessing Emphasis Transfer in Speech-to-Speech Models. 495-507 - Xiaoxiao Ma, Yuchen Zhang, Kaize Ding, Jian Yang
, Jia Wu
, Hao Fan:
On Fake News Detection with LLM Enhanced Semantics Mining. 508-521 - Branislav Pecher
, Ivan Srba, Mária Bieliková:
On Sensitivity of Learning with Limited Labelled Data to the Effects of Randomness: Impact of Interactions and Systematic Choices. 522-556 - Zekun Li, Baolin Peng, Pengcheng He, Xifeng Yan:
Evaluating the Instruction-Following Robustness of Large Language Models to Prompt Injection. 557-568 - Valentin Barrière, Sebastian Cifuentes:
A Study of Nationality Bias in Names and Perplexity using Off-the-Shelf Affect-related Tweet Classifiers. 569-579 - Yong Lin, Hangyu Lin, Wei Xiong, Shizhe Diao, Jianmeng Liu, Jipeng Zhang, Rui Pan, Haoxiang Wang, Wenbin Hu, Hanning Zhang, Hanze Dong, Renjie Pi, Han Zhao, Nan Jiang, Heng Ji, Yuan Yao, Tong Zhang:
Mitigating the Alignment Tax of RLHF. 580-606 - Meng Li, Haoran Jin, Ruixuan Huang, Zhihao Xu, Defu Lian, Zijia Lin, Di Zhang, Xiting Wang:
Evaluating Readability and Faithfulness of Concept-based Explanations. 607-625 - Zhengyuan Liu, Stella Xin Yin, Geyu Lin, Nancy F. Chen:
Personality-aware Student Simulation for Conversational Intelligent Tutoring Systems. 626-642 - Dayuan Fu, Biqing Qi, Yihuai Gao, Che Jiang, Guanting Dong, Bowen Zhou:
MSI-Agent: Incorporating Multi-Scale Insight into Embodied Agents for Superior Planning and Decision-Making. 643-659 - Min-Hsuan Yeh, Ruyuan Wan, Ting-Hao Huang
:
CoCoLoFa: A Dataset of News Comments with Common Logical Fallacies Written by LLM-Assisted Crowds. 660-677 - Craig W. Schmidt, Varshini Reddy, Haoran Zhang, Alec Alameddine, Omri Uzan, Yuval Pinter, Chris Tanner:
Tokenization Is More Than Compression. 678-702 - Ninareh Mehrabi, Palash Goyal, Christophe Dupuy, Qian Hu, Shalini Ghosh, Richard S. Zemel, Kai-Wei Chang, Aram Galstyan, Rahul Gupta:
FLIRT: Feedback Loop In-context Red Teaming. 703-718 - Lingjun Zhao, Khanh Nguyen, Hal Daumé III:
Successfully Guiding Humans with Imperfect Instructions by Highlighting Potential Errors and Suggesting Corrections. 719-736 - Haoyuan Wu, Haisheng Zheng, Zhuolun He, Bei Yu:
Parameter-Efficient Sparsity Crafting from Dense to Mixture-of-Experts for Instruction Tuning on General Tasks. 737-749 - Shihao Cai, Keqin Bao, Hangyu Guo, Jizhi Zhang, Jun Song, Bo Zheng:
GeoGPT4V: Towards Geometric Multi-modal Large Language Models with Geometric Image Generation. 750-766 - Thong Nguyen, Shubham Chatterjee, Sean MacAvaney, Iain Mackie, Jeff Dalton, Andrew Yates:
DyVo: Dynamic Vocabularies for Learned Sparse Retrieval with Entities. 767-783 - Zihan Wang, Deli Chen, Damai Dai, Runxin Xu, Zhuoshu Li, Yu Wu:
Let the Expert Stick to His Last: Expert-Specialized Fine-Tuning for Sparse Architectural Large Language Models. 784-801 - Dawei Zhu, Liang Wang, Nan Yang, Yifan Song, Wenhao Wu, Furu Wei, Sujian Li:
LongEmbed: Extending Embedding Models for Long Context Retrieval. 802-816 - Xiangyang Liu, Junliang He, Xipeng Qiu:
Making Large Language Models Better Reasoners with Orchestrated Streaming Experiences. 817-838 - Xianlong Luo, Meng Yang, Yihao Wang
:
Overcome Noise and Bias: Segmentation-Aided Multi-Granularity Denoising and Debiasing for Enhanced Quarduples Extraction in Dialogue. 839-856 - Dongjun Lim, Yun-Gyung Cheong:
Integrating Plutchik's Theory with Mixture of Experts for Enhancing Emotion Classification. 857-867 - Chao Liang, Wei Xiang, Bang Wang:
In-context Contrastive Learning for Event Causality Identification. 868-881 - Anna Wegmann, Tijs A. van den Broek
, Dong Nguyen:
What's Mine becomes Yours: Defining, Annotating and Detecting Context-Dependent Paraphrases in News Interview Dialogs. 882-912 - Kanishka Misra, Kyle Mahowald:
Language Models Learn Rare Phenomena from Less Rare Phenomena: The Case of the Missing AANNs. 913-929 - Zhen Tan, Dawei Li, Song Wang, Alimohammad Beigi, Bohan Jiang, Amrita Bhattacharjee, Mansooreh Karami, Jundong Li, Lu Cheng, Huan Liu:
Large Language Models for Data Annotation and Synthesis: A Survey. 930-957 - Hongyuan Lu, Haoran Yang, Haoyang Huang, Dongdong Zhang, Wai Lam, Furu Wei:
Chain-of-Dictionary Prompting Elicits Translation in Large Language Models. 958-976 - Yifan Yang, Kai Zhen, Ershad Banijamali, Athanasios Mouchtaris, Zheng Zhang:
AdaZeta: Adaptive Zeroth-Order Tensor-Train Adaption for Memory-Efficient Large Language Models Fine-Tuning. 977-995 - Haoyu Wang, Tianci Liu, Ruirui Li, Monica Xiao Cheng, Tuo Zhao, Jing Gao:
RoseLoRA: Row and Column-wise Sparse Low-rank Adaptation of Pre-trained Language Model for Knowledge Editing and Fine-tuning. 996-1008 - Haoyu Wang, Ruirui Li, Haoming Jiang, Jinjin Tian, Zhengyang Wang, Chen Luo, Xianfeng Tang, Monica Xiao Cheng, Tuo Zhao, Jing Gao:
BlendFilter: Advancing Retrieval-Augmented Large Language Models via Query Generation Blending and Knowledge Filtering. 1009-1025 - Jocelyn Shen, Joel Mire, Hae Park, Cynthia Breazeal, Maarten Sap:
HEART-felt Narratives: Tracing Empathy and Narrative Style in Personal Stories with LLMs. 1026-1046 - Junru Lu, Jiazheng Li
, Siyu An, Meng Zhao, Yulan He, Di Yin, Xing Sun:
Eliminating Biased Length Reliance of Direct Preference Optimization via Down-Sampled KL Divergence. 1047-1067 - Tianyi Hu, Maria Maistro, Daniel Hershcovich:
Bridging Cultures in the Kitchen: A Framework and Benchmark for Cross-Cultural Recipe Retrieval. 1068-1080 - Peng Xia, Kangyu Zhu, Haoran Li
, Hongtu Zhu, Yun Li, Gang Li, Linjun Zhang, Huaxiu Yao:
RULE: Reliable Multimodal RAG for Factuality in Medical Vision Language Models. 1081-1093 - Yuan Li
, Bingqiao Luo, Qian Wang, Nuo Chen
, Xu Liu, Bingsheng He:
CryptoTrade: A Reflective LLM-based Agent to Guide Zero-shot Cryptocurrency Trading. 1094-1106 - Qingxiu Dong, Lei Li, Damai Dai, Ce Zheng, Jingyuan Ma, Rui Li, Heming Xia, Jingjing Xu, Zhiyong Wu, Baobao Chang, Xu Sun, Lei Li, Zhifang Sui:
A Survey on In-context Learning. 1107-1128 - Hangdi Xing, Changxu Cheng, Feiyu Gao, Zirui Shao, Zhi Yu, Jiajun Bu, Qi Zheng, Cong Yao:
DocHieNet: A Large and Diverse Dataset for Document Hierarchy Parsing. 1129-1142 - Ziyang Luo, Xin Li, Hongzhan Lin, Jing Ma, Lidong Bing:
AMR-Evol: Adaptive Modular Response Evolution Elicits Better Knowledge Distillation for Large Language Models in Code Generation. 1143-1166 - Shangyu Xing, Fei Zhao, Zhen Wu, Tuo An, Weihao Chen, Chunhui Li, Jianbing Zhang, Xinyu Dai:
EFUF: Efficient Fine-Grained Unlearning Framework for Mitigating Hallucinations in Multimodal Large Language Models. 1167-1181 - Sungbin Shin, Wonpyo Park, Jaeho Lee, Namhoon Lee:
Rethinking Pruning Large Language Models: Benefits and Pitfalls of Reconstruction Error Minimization. 1182-1191 - Roman Koshkin
, Katsuhito Sudoh
, Satoshi Nakamura:
LLMs Are Zero-Shot Context-Aware Simultaneous Translators. 1192-1207 - Yiqiao Jin, Qinlin Zhao, Yiyang Wang, Hao Chen, Kaijie Zhu, Yijia Xiao, Jindong Wang:
AgentReview: Exploring Peer Review Dynamics with LLM Agents. 1208-1226 - Kelong Mao, Chenlong Deng, Haonan Chen, Fengran Mo, Zheng Liu, Tetsuya Sakai, Zhicheng Dou:
ChatRetriever: Adapting Large Language Models for Generalized and Robust Conversational Dense Retrieval. 1227-1240 - Han Zhou
, Xingchen Wan, Yinhong Liu, Nigel Collier, Ivan Vulic, Anna Korhonen:
Fairer Preferences Elicit Improved Human-Aligned Large Language Model Judgments. 1241-1252 - Chenlong Deng, Kelong Mao, Zhicheng Dou:
Learning Interpretable Legal Case Retrieval via Knowledge-Guided Case Reformulation. 1253-1265 - Peng Wang, Xiaobin Wang, Chao Lou, Shengyu Mao, Pengjun Xie, Yong Jiang:
Effective Demonstration Annotation for In-Context Learning via Language Model-Based Determinantal Point Process. 1266-1280 - Yuhui Zhang, Brandon McKinzie, Zhe Gan, Vaishaal Shankar, Alexander Toshev:
Pre-trained Language Models Do Not Help Auto-regressive Text-to-Image Generation. 1281-1287 - Ashima Suvarna, Xiao Liu, Tanmay Parekh, Kai-Wei Chang, Nanyun Peng:
QUDSELECT: Selective Decoding for Questions Under Discussion Parsing. 1288-1299 - Peng Chen, Xiao-Yu Guo, Yuan-Fang Li, Xiaowang Zhang, Zhiyong Feng:
Mitigating Language Bias of LMMs in Social Intelligence Understanding with Virtual Counterfactual Calibration. 1300-1310 - Zihang Liu, Yuanzhe Hu, Tianyu Pang, Yefan Zhou, Pu Ren, Yaoqing Yang:
Model Balancing Helps Low-data Training and Fine-tuning. 1311-1331 - Zhaofeng Wu, Ananth Balashankar, Yoon Kim, Jacob Eisenstein, Ahmad Beirami:
Reuse Your Rewards: Reward Model Transfer for Zero-Shot Cross-Lingual Alignment. 1332-1353 - Kun Luo, Minghao Qin, Zheng Liu, Shitao Xiao, Jun Zhao, Kang Liu:
Large Language Models as Foundations for Next-Gen Dense Retrieval: A Comprehensive Empirical Assessment. 1354-1365 - Zhongwu Chen, Long Bai, Zixuan Li, Zhen Huang, Xiaolong Jin, Yong Dou:
A New Pipeline for Knowledge Graph Reasoning Enhanced by Large Language Models Without Fine-Tuning. 1366-1381 - Zhiyuan Chen, Shiqi Shen, Guangyao Shen, Gong Zhi, Xu Chen, Yankai Lin:
Towards Tool Use Alignment of Large Language Models. 1382-1400 - Ranchi Zhao, Zhen Leng Thai, Yifan Zhang, Shengding Hu, Jie Zhou, Yunqi Ba, Jie Cai, Zhiyuan Liu, Maosong Sun:
DecorateLM: Data Engineering through Corpus Rating, Tagging, and Editing with Language Models. 1401-1418 - Yung-Sung Chuang, Linlu Qiu, Cheng-Yu Hsieh, Ranjay Krishna, Yoon Kim, James R. Glass:
Lookback Lens: Detecting and Mitigating Contextual Hallucinations in Large Language Models Using Only Attention Maps. 1419-1436 - Yiju Guo, Ganqu Cui, Lifan Yuan, Ning Ding, Zexu Sun, Bowen Sun, Huimin Chen, Ruobing Xie, Jie Zhou, Yankai Lin, Zhiyuan Liu, Maosong Sun:
Controllable Preference Optimization: Toward Controllable Multi-Objective Alignment. 1437-1454 - Yongsen Zheng, Ruilin Xu, Guohua Wang, Liang Lin, Kwok-Yan Lam:
Mitigating Matthew Effect: Multi-Hypergraph Boosted Multi-Interest Self-Supervised Learning for Conversational Recommendation. 1455-1466 - Haoran Li
, Qiang Gao, Hongmei Wu, Li Huang:
Advancing Event Causality Identification via Heuristic Semantic Dependency Inquiry Network. 1467-1478 - Wenjian Ding, Yao Zhang, Jun Wang, Adam Jatowt, Zhenglu Yang:
Exploring Union and Intersection of Visual Regions for Generating Questions, Answers, and Distractors. 1479-1489 - Xiangyu Zhao, Yuehan Zhang, Wenlong Zhang, Xiao-Ming Wu:
UniFashion: A Unified Vision-Language Model for Multimodal Fashion Retrieval and Generation. 1490-1507 - Hayden S. Helm, Brandon Duderstadt, Youngser Park, Carey E. Priebe:
Tracking the perspectives of interacting language models. 1508-1519 - Zhengxuan Zhang, Yin Wu, Yuyu Luo, Nan Tang:
MAR: Matching-Augmented Reasoning for Enhancing Visual-based Entity Question Answering. 1520-1530 - Zhe Yang, Yichang Zhang, Tianyu Liu, Jian Yang, Junyang Lin, Chang Zhou, Zhifang Sui:
Can Large Language Models Always Solve Easy Problems if They Can Solve Harder Ones? 1531-1555 - Weimin Xiong, Yifan Song, Xiutian Zhao, Wenhao Wu, Xun Wang, Ke Wang, Cheng Li, Wei Peng, Sujian Li:
Watch Every Step! LLM Agent Learning via Iterative Step-level Process Refinement. 1556-1572 - Joseph Marvin Imperial
, Gail Forey
, Harish Tayyar Madabushi
:
Standardize: Aligning Language Models with Expert-Defined Standards for Content Generation. 1573-1594 - Zhihao Zhang
, Sophia Yat Mei Lee, Junshuang Wu, Dong Zhang, Shoushan Li, Erik Cambria, Guodong Zhou:
Cross-domain NER with Generated Task-Oriented Knowledge: An Empirical Study from Information Density Perspective. 1595-1609 - Zhen Tan, Chengshuai Zhao, Raha Moraffah
, Yifan Li, Song Wang, Jundong Li, Tianlong Chen, Huan Liu:
Glue pizza and eat rocks - Exploiting Vulnerabilities in Retrieval-Augmented Generative Models. 1610-1626 - Yuxuan Wang, Xiaoyuan Liu:
Predicate Debiasing in Vision-Language Models Integration for Scene Graph Generation Enhancement. 1627-1639 - Xiaoze Liu, Ting Sun
, Tianyang Xu, Feijie Wu, Cunxiang Wang, Xiaoqian Wang, Jing Gao:
SHIELD: Evaluation and Defense Strategies for Copyright Compliance in LLM Text Generation. 1640-1670 - Jiayuan Rao, Haoning Wu, Chang Liu, Yanfeng Wang, Weidi Xie:
MatchTime: Towards Automatic Soccer Game Commentary Generation. 1671-1685 - Zheng Zhan, Yushu Wu, Zhenglun Kong, Changdi Yang, Yifan Gong, Xuan Shen, Xue Lin, Pu Zhao, Yanzhi Wang:
Rethinking Token Reduction for State Space Models. 1686-1697 - Chang Zong, Yuchen Yan, Weiming Lu, Jian Shao, Yongfeng Huang, Heng Chang, Yueting Zhuang:
Triad: A Framework Leveraging a Multi-Role LLM-based Agent to Solve Knowledge Base Question Answering. 1698-1710 - Yuyan Zhou, Liang Song, Bingning Wang, Weipeng Chen:
MetaGPT: Merging Large Language Models Using Model Exclusive Task Arithmetic. 1711-1724 - Haoyu Wang, Fengze Liu, Jiayao Zhang, Dan Roth, Kyle Richardson:
Event Causality Identification with Synthetic Control. 1725-1737 - Chang Ma, Haiteng Zhao, Lin Zheng, Jiayi Xin, Qintong Li, Lijun Wu, Zhihong Deng, Yang Lu, Qi Liu, Sheng Wang, Lingpeng Kong:
Retrieved Sequence Augmentation for Protein Representation Learning. 1738-1767 - Fan Yuan, Chi Qin, Xiaogang Xu, Piji Li:
HELPD: Mitigating Hallucination of LVLMs by Hierarchical Feedback Learning with Vision-enhanced Penalty Decoding. 1768-1785 - Chengzu Li, Caiqi Zhang, Han Zhou
, Nigel Collier, Anna Korhonen, Ivan Vulic:
TopViewRS: Vision-Language Models as Top-View Spatial Reasoners. 1786-1807 - Yibo Wang, Xiangjue Dong, James Caverlee, Philip S. Yu:
DA³: A Distribution-Aware Adversarial Attack against Language Models. 1808-1825 - Xingxuan Li, Yutong Li, Lin Qiu, Shafiq Joty, Lidong Bing:
Evaluating Psychological Safety of Large Language Models. 1826-1843 - Zhuowei Chen, Lianxi Wang, Yuben Wu, Xinfeng Liao, Yujia Tian, Junyang Zhong:
An Effective Deployment of Diffusion LM for Data Augmentation in Low-Resource Sentiment Classification. 1844-1856 - Dongze Hao, Qunbo Wang, Longteng Guo, Jie Jiang, Jing Liu:
Self-Bootstrapped Visual-Language Model for Knowledge Selection and Question Answering. 1857-1868 - Libo Zhao, Jing Li, Ziqian Zeng:
PsFuture: A Pseudo-Future-based Zero-Shot Adaptive Policy for Simultaneous Machine Translation. 1869-1881 - Liang Zhang, Anwen Hu, Haiyang Xu, Ming Yan, Yichen Xu, Qin Jin, Ji Zhang, Fei Huang:
TinyChart: Efficient Chart Understanding with Program-of-Thoughts Learning and Visual Token Merging. 1882-1898 - Caiqi Zhang, Zhijiang Guo, Andreas Vlachos
:
Do We Need Language-Specific Fact-Checking Models? The Case of Chinese. 1899-1914 - Zhiyuan Li, Dongnan Liu, Chaoyi Zhang, Heng Wang, Tengfei Xue, Weidong Cai:
Enhancing Advanced Visual Reasoning Ability of Large Language Models. 1915-1929 - Zecheng Tang, Keyan Zhou, Juntao Li, Yuyang Ding, Pinzheng Wang, Yan Bowen, Renjie Hua, Min Zhang:
CMD: a framework for Context-aware Model self-Detoxification. 1930-1949 - Xiaomeng Hu, Yiming Zhang, Ru Peng, Haozhe Zhang, Chenwei Wu, Gang Chen, Junbo Zhao:
Embedding and Gradient Say Wrong: A White-Box Method for Hallucination Detection. 1950-1959 - Yu Zhang
, Ziyue Jiang, Ruiqi Li, Changhao Pan
, Jinzheng He, Rongjie Huang, Chuxin Wang, Zhou Zhao:
TCSinger: Zero-Shot Singing Voice Synthesis with Style Transfer and Multi-Level Style Control. 1960-1975 - Junlin Li, Bo Peng, Yu-Yin Hsu, Chu-Ren Huang:
Be Helpful but Don't Talk too Much - Enhancing Helpfulness in Conversations through Relevance in Multi-Turn Emotional Support. 1976-1988 - Hyuhng Joon Kim, Youna Kim, Cheonbok Park, Junyeob Kim, Choonghyun Park, Kang Min Yoo, Sang-goo Lee, Taeuk Kim:
Aligning Language Models to Explicitly Handle Ambiguity. 1989-2007 - Daiqing Qi, Handong Zhao, Zijun Wei, Sheng Li:
Tag-grounded Visual Instruction Tuning with Retrieval Augmentation. 2008-2026 - Xuanchang Zhang, Zhuosheng Zhang
, Hai Zhao:
GLaPE: Gold Label-agnostic Prompt Evaluation for Large Language Models. 2027-2039 - Runze Xia, Congchi Yin, Piji Li:
Decoding the Echoes of Vision from fMRI: Memory Disentangling for Past Semantic Information. 2040-2052 - Rui Li
, Qi Liu, Liyang He, Zheng Zhang, Hao Zhang, Shengyu Ye, Junyu Lu, Zhenya Huang:
Optimizing Code Retrieval: High-Quality and Scalable Dataset Annotation through Large Language Models. 2053-2065 - Yongjin Yang, Jongwoo Ko, Se-Young Yun:
Towards Difficulty-Agnostic Efficient Transfer Learning for Vision-Language Models. 2066-2085 - Mingqian He, Yongliang Shen, Wenqi Zhang, Zeqi Tan, Weiming Lu:
Advancing Process Verification for Large Language Models via Tree-Based Preference Learning. 2086-2099 - Yu Lin, Qizhi Zhang, Quanwei Cai, Jue Hong, Wu Ye, Huiqi Liu, Bing Duan:
An Inversion Attack Against Obfuscated Embedding Matrix in Language Model Inference. 2100-2104 - Xuan He, Dongfu Jiang, Ge Zhang, Max Ku, Achint Soni, Sherman Siu, Haonan Chen, Abhranil Chandra, Ziyan Jiang, Aaran Arulraj, Kai Wang, Quy Duc Do, Yuansheng Ni, Bohan Lyu, Yaswanth Narsupalli, Rongqi Fan, Zhiheng Lyu, Bill Yuchen Lin, Wenhu Chen:
VideoScore: Building Automatic Metrics to Simulate Fine-grained Human Feedback for Video Generation. 2105-2123 - Yuxuan Wan
, Wenxuan Wang, Yiliu Yang, Youliang Yuan, Jen-tse Huang, Pinjia He, Wenxiang Jiao, Michael R. Lyu:
LogicAsker: Evaluating and Improving the Logical Reasoning Ability of Large Language Models. 2124-2155 - Xiaoyang Yi, Yuru Bao, Jian Zhang, Yifang Qin, Faxin Lin:
Integrating Structural Semantic Knowledge for Enhanced Information Extraction Pre-training. 2156-2171 - Tianyuan Zou, Yang Liu, Peng Li, Jianqing Zhang, Jingjing Liu, Ya-Qin Zhang:
FuseGen: PLM Fusion for Data-generation based Zero-shot Learning. 2172-2190 - Cheng-Kuang Wu, Zhi Rui Tam, Chao-Chung Wu, Chieh-Yen Lin, Hung-yi Lee, Yun-Nung Chen:
I Need Help! Evaluating LLM's Ability to Ask for Users' Support: A Case Study on Text-to-SQL Generation. 2191-2199 - Michael Wiegand
, Josef Ruppenhofer:
Oddballs and Misfits: Detecting Implicit Abuse in Which Identity Groups are Depicted as Deviating from the Norm. 2200-2218 - Hyungjun Yoon, Biniyam Aschalew Tolera, Taesik Gong, Kimin Lee, Sung-Ju Lee:
By My Eyes: Grounding Multimodal Large Language Models with Sensor Data via Visual Prompting. 2219-2241 - Seungwoo Son
, Wonpyo Park, Woohyun Han, Kyuyeun Kim, Jaeho Lee:
Prefixing Attention Sinks can Mitigate Activation Outliers for Large Language Model Quantization. 2242-2252 - Fengran Mo, Abbas Ghaddar, Kelong Mao, Mehdi Rezagholizadeh, Boxing Chen, Qun Liu, Jian-Yun Nie:
CHIQ: Contextual History Enhancement for Improving Query Rewriting in Conversational Search. 2253-2268 - Jianzhao Huang, Hongzhan Lin, Ziyan Liu, Ziyang Luo, Guang Chen, Jing Ma:
Towards Low-Resource Harmful Meme Detection with LMM Agents. 2269-2293 - Zhe Hu, Yixiao Ren, Jing Li, Yu Yin:
VIVA: A Benchmark for Vision-Grounded Decision-Making with Human Values. 2294-2311 - Wentao Shi, Mengqi Yuan, Junkang Wu, Qifan Wang, Fuli Feng:
Direct Multi-Turn Preference Optimization for Language Agents. 2312-2324 - Leonardo Ranaldi
, André Freitas
:
Self-Refine Instruction-Tuning for Aligning Reasoning in Language Models. 2325-2347 - Huihan Li, Yuting Ning, Zeyi Liao, Siyuan Wang, Xiang Li, Ximing Lu, Wenting Zhao, Faeze Brahman, Yejin Choi, Xiang Ren:
In Search of the Long-Tail: Systematic Generation of Long-Tail Inferential Knowledge via Logical Rule Guided Search. 2348-2370 - Wenhao Huang, Zhouhong Gu, Chenghao Peng, Jiaqing Liang, Zhixu Li, Yanghua Xiao, Liqian Wen, Zulong Chen:
AutoScraper: A Progressive Understanding Web Agent for Web Scraper Generation. 2371-2389 - Shahar Katz, Yonatan Belinkov, Mor Geva, Lior Wolf:
Backward Lens: Projecting Language Model Gradients into the Vocabulary Space. 2390-2422 - Jiwan Chung, Sungjae Lee, Minseo Kim, Seungju Han, Ashkan Yousefpour, Jack Hessel, Youngjae Yu:
Selective Vision is the Challenge for Visual Reasoning: A Benchmark for Visual Argument Understanding. 2423-2451 - Jiwan Chung, Seungwon Lim, Jaehyun Jeon, Seungbeen Lee, Youngjae Yu:
Can visual language models resolve textual ambiguity with visual cues? Let visual puns tell you! 2452-2469 - Chunzhen Jin, Eliot Huang, Heng Chang, Yaqi Wang, Peng Cao, Osmar R. Zaïane:
Reusing Transferable Weight Increments for Low-resource Style Generation. 2470-2488 - Cheng-Han Chiang, Wei-Chih Chen, Chun-Yi Kuan, Chienchou Yang, Hung-yi Lee:
Large Language Model as an Assignment Evaluator: Insights, Feedback, and Challenges in a 1000+ Student Course. 2489-2513 - Neeladri Bhuiya, Viktor Schlegel, Stefan Winkler:
Seemingly Plausible Distractors in Multi-Hop Reasoning: Are Large Language Models Attentive Readers? 2514-2528 - Daixuan Cheng, Yuxian Gu, Shaohan Huang, Junyu Bi, Minlie Huang, Furu Wei:
Instruction Pre-Training: Language Models are Supervised Multitask Learners. 2529-2550 - Renzhi Wang, Piji Li:
LEMoE: Advanced Mixture of Experts Adaptor for Lifelong Model Editing of Large Language Models. 2551-2575 - Qiyuan Zhang, Fuyuan Lyu, Xue Liu, Chen Ma
:
Collaborative Performance Prediction for Large Language Models. 2576-2596 - Yuqi Chen, Sixuan Li, Ying Li, Mohammad Atari:
Surveying the Dead Minds: Historical-Psychological Text Analysis with Contextualized Construct Representation (CCR) for Classical Chinese. 2597-2615 - Fanqi Wan, Xinting Huang, Leyang Cui
, Xiaojun Quan, Wei Bi, Shuming Shi:
Knowledge Verification to Nip Hallucination in the Bud. 2616-2633 - Timo Pierre Schrader
, Lukas Lange, Simon Razniewski
, Annemarie Friedrich:
QUITE: Quantifying Uncertainty in Natural Language Text in Bayesian Reasoning Scenarios. 2634-2652 - Gregor Geigle, Radu Timofte, Goran Glavas:
African or European Swallow? Benchmarking Large Vision-Language Models for Fine-Grained Object Classification. 2653-2669 - Hongbang Yuan, Pengfei Cao, Zhuoran Jin, Yubo Chen, Daojian Zeng, Kang Liu, Jun Zhao:
Whispers that Shake Foundations: Analyzing and Mitigating False Premise Hallucinations in Large Language Models. 2670-2683 - Bastien Liétard, Pascal Denis, Mikaela Keller:
To Word Senses and Beyond: Inducing Concepts with Contextualized Language Models. 2684-2696 - Hao Wang, Hao Li, Minlie Huang, Lei Sha:
ASETF: A Novel Method for Jailbreak Attack on LLMs through Translate Suffix Embeddings. 2697-2711 - Xiutian Zhao, Ke Wang, Wei Peng:
An Electoral Approach to Diversify LLM-based Multi-Agent Collective Decision-Making. 2712-2727 - Gregor Geigle, Radu Timofte, Goran Glavas:
Does Object Grounding Really Reduce Hallucination of Large Vision-Language Models? 2728-2742 - Zhenyu Liu, Dongfang Li, Xinshuo Hu, Xinping Zhao, Yibin Chen, Baotian Hu, Min Zhang:
Take Off the Training Wheels! Progressive In-Context Learning for Effective Alignment. 2743-2757 - Yufei Ma, Zihan Liang, Huangyu Dai, Ben Chen, Dehong Gao, Zhuoran Ran, Zihan Wang, Linbo Jin, Wen Jiang, Guannan Zhang, Xiaoyan Cai, Libin Yang:
MoDULA: Mixture of Domain-Specific and Universal LoRA for Multi-Task Learning. 2758-2770 - Pinyi Zhang, Jingyang Chen, Junchen Shen, Zijie Zhai, Ping Li, Jie Zhang, Kai Zhang:
Message Passing on Semantic-Anchor-Graphs for Fine-grained Emotion Representation Learning and Classification. 2771-2783 - Yuqing Zhang, Baoyi He, Yihan Chen, Hangqi Li, Han Yue, Shengyu Zhang, Huaiyong Dou, Junchi Yan, Zemin Liu, Yongquan Zhang, Fei Wu:
PhiloGPT: A Philology-Oriented Large Language Model for Ancient Chinese Manuscripts with Dunhuang as Case Study. 2784-2801 - Quan Liu, Zhenhong Zhou, Longzhu He, Yi Liu, Wei Zhang, Sen Su:
Alignment-Enhanced Decoding: Defending Jailbreaks via Token-Level Adaptive Refining of Probability Distributions. 2802-2816 - Qiao Sun, Liujia Yang, Minghao Ma, Nanyang Ye, Qinying Gu:
MiniConGTS: A Near Ultimate Minimalist Contrastive Grid Tagging Scheme for Aspect Sentiment Triplet Extraction. 2817-2834 - Alessio Miaschi, Felice Dell'Orletta, Giulia Venturi
:
Evaluating Large Language Models via Linguistic Profiling. 2835-2848 - Tyler Loakman, Yucheng Li, Chenghua Lin:
With Ears to See and Eyes to Hear: Sound Symbolism Experiments with Multimodal Large Language Models. 2849-2867 - Jiajie Zhang, Shulin Cao, Linmei Hu, Ling Feng, Lei Hou, Juanzi Li:
KB-Plugin: A Plug-and-play Framework for Large Language Models to Induce Programs over Low-resourced Knowledge Bases. 2868-2882 - Momose Oyama, Hiroaki Yamagiwa, Hidetoshi Shimodaira
:
Understanding Higher-Order Correlations Among Semantic Components in Embeddings. 2883-2899 - Zhihong Zhu, Kefan Shen, Zhaorun Chen, Yunyan Zhang, Yuyan Chen, Xiaoqi Jiao, Zhongwei Wan, Shaorong Xie, Wei Liu, Xian Wu, Yefeng Zheng:
DGLF: A Dual Graph-based Learning Framework for Multi-modal Sarcasm Detection. 2900-2912 - Royi Rassin, Yaron Fairstein, Oren Kalinsky, Guy Kushilevitz, Nachshon Cohen, Alexander Libov, Yoav Goldberg:
Evaluating D-MERIT of Partial-annotation on Information Retrieval. 2913-2932 - Xin Quan
, Marco Valentino
, Louise A. Dennis
, André Freitas
:
Verification and Refinement of Natural Language Explanations through LLM-Symbolic Theorem Proving. 2933-2958 - Mozhi Zhang, Mianqiu Huang, Rundong Shi, Linsen Guo, Chong Peng, Peng Yan, Yaqian Zhou, Xipeng Qiu:
Calibrating the Confidence of Large Language Models by Eliciting Fidelity. 2959-2979 - Yanjun Chen, Dawei Zhu, Yirong Sun, Xinghao Chen, Wei Zhang, Xiaoyu Shen:
The Accuracy Paradox in RLHF: When Better Reward Models Don't Yield Better Language Models. 2980-2989 - Adrian Cosma, Stefan Ruseti, Mihai Dascalu, Cornelia Caragea:
How Hard is this Test Set? NLI Characterization by Exploiting Training Dynamics. 2990-3001 - Gaetan Latouche, Marc-André Carbonneau, Benjamin Swanson:
Zero-shot Cross-Lingual Transfer for Synthetic Data Generation in Grammatical Error Detection. 3002-3016 - Lukas Edman, Helmut Schmid, Alexander Fraser:
CUTE: Measuring LLMs' Understanding of Their Tokens. 3017-3026 - Xinping Zhao, Dongfang Li, Yan Zhong, Boren Hu, Yibin Chen, Baotian Hu, Min Zhang:
SEER: Self-Aligned Evidence Extraction for Retrieval-Augmented Generation. 3027-3041 - Andreas Opedal, Eleanor Chodroff, Ryan Cotterell, Ethan Wilcox:
On the Role of Context in Reading Time Prediction. 3042-3058 - Yuhang He, Jihai Zhang, Jianzhu Bao, Fangquan Lin, Cheng Yang, Bing Qin
, Ruifeng Xu, Wotao Yin:
BC-Prover: Backward Chaining Prover for Formal Theorem Proving. 3059-3077 - Marius Mosbach, Vagrant Gautam, Tomás Vergara Browne, Dietrich Klakow, Mor Geva:
From Insights to Actions: The Impact of Interpretability and Analysis Research on NLP. 3078-3105 - Yekun Chai, Qingyi Liu, Jingwu Xiao, Shuohuan Wang, Yu Sun, Hua Wu:
Autoregressive Pre-Training on Pixels and Texts. 3106-3125 - Yekun Chai, Qingyi Liu, Shuohuan Wang, Yu Sun, Qiwei Peng, Hua Wu:
On Training Data Influence of GPT Models. 3126-3150 - Arjun Subramonian, Vagrant Gautam, Dietrich Klakow, Zeerak Talat:
Understanding "Democratization" in NLP and ML Research. 3151-3166 - Sungnyun Kim, Haofu Liao, Srikar Appalaraju, Peng Tang, Zhuowen Tu, Ravi Kumar Satzoda, R. Manmatha, Vijay Mahadevan, Stefano Soatto:
DocKD: Knowledge Distillation from LLMs for Open-World Document Understanding Models. 3167-3193 - Seonjeong Hwang, Yunsu Kim, Gary Geunbae Lee:
Cross-lingual Transfer for Automatic Question Generation by Learning Interrogative Structures in Target Languages. 3194-3208 - Ruihang Li, Yixuan Wei, Miaosen Zhang, Nenghai Yu, Han Hu, Houwen Peng:
ScalingFilter: Assessing Data Quality through Inverse Utilization of Scaling Laws. 3209-3222 - Qiyu Wu, Masaaki Nagata, Zhongtao Miao, Yoshimasa Tsuruoka:
Word Alignment as Preference for Machine Translation. 3223-3239 - Yaxin Fan
, Peifeng Li, Qiaoming Zhu:
Improving Multi-party Dialogue Generation via Topic and Rhetorical Coherence. 3240-3253 - Jinghan He, Haiyun Guo, Kuan Zhu, Zihan Zhao, Ming Tang, Jinqiao Wang:
SEEKR: Selective Attention-Guided Knowledge Retention for Continual Learning of Large Language Models. 3254-3266 - Zeping Yu, Sophia Ananiadou:
Neuron-Level Knowledge Attribution in Large Language Models. 3267-3280 - Zeping Yu, Sophia Ananiadou:
How do Large Language Models Learn In-Context? Query and Key Matrices of In-Context Heads are Two Towers for Metric Learning. 3281-3292 - Zeping Yu, Sophia Ananiadou:
Interpreting Arithmetic Mechanism in Large Language Models through Comparative Neuron Analysis. 3293-3306 - Kushal Tatariya, Vladimir Araujo, Thomas Bauwens, Miryam de Lhoneux:
Pixology: Probing the Linguistic and Visual Capabilities of Pixel-based Language Models. 3307-3320 - Wei Fan, Haoran Li, Zheye Deng, Weiqi Wang, Yangqiu Song:
GoldCoin: Grounding Large Language Models in Privacy Laws via Contextual Integrity Theory. 3321-3343 - Ali Al-Laith, Daniel Hershcovich, Jens Bjerring-Hansen, Jakob Parby, Alexander Conroy, Timothy Tangherlini:
Noise, Novels, Numbers. A Framework for Detecting and Categorizing Noise in Danish and Norwegian Literature. 3344-3354 - Saleh Ashkboos, Ilia Markov, Elias Frantar, Tingxuan Zhong, Xincheng Wang, Jie Ren, Torsten Hoefler, Dan Alistarh:
QUIK: Towards End-to-end 4-Bit Inference on Generative Large Language Models. 3355-3371 - Omer Shubi, Yoav Meiri
, Cfir Avraham Hadar, Yevgeni Berzak
:
Fine-Grained Prediction of Reading Comprehension from Eye Movements. 3372-3391 - Ziyuan Zhuang, Zhiyang Zhang, Sitao Cheng, Fangkai Yang, Jia Liu, Shujian Huang, Qingwei Lin, Saravan Rajmohan, Dongmei Zhang, Qi Zhang:
EfficientRAG: Efficient Retriever for Multi-Hop Question Answering. 3392-3411 - Sumuk Shashidhar, Abhinav Chinta
, Vaibhav Sahai, Dilek Hakanni-Tür:
Unsupervised Human Preference Learning. 3412-3445 - Helena Bonaldi, Greta Damo, Nicolás Benjamín Ocampo
, Elena Cabrio, Serena Villata, Marco Guerini:
Is Safer Better? The Impact of Guardrails on the Argumentative Strength of LLMs in Hate Speech Countering. 3446-3463 - Byung-Doh Oh, William Schuler:
Leading Whitespaces of Language Models' Subword Vocabulary Pose a Confound for Calculating Word Probabilities. 3464-3472 - Hanzhuo Tan, Qi Luo, Jing Li, Yuqun Zhang:
LLM4Decompile: Decompiling Binary Code with Large Language Models. 3473-3487 - Jihao Gu, Zelin Wang, Yibo Zhang, Ziji Zhang, Ping Gong:
From Bottom to Top: Extending the Potential of Parameter Efficient Fine-Tuning. 3488-3500 - Yike Wu, Yi Huang, Nan Hu, Yuncheng Hua, Guilin Qi, Jiaoyan Chen, Jeff Z. Pan:
CoTKR: Chain-of-Thought Enhanced Knowledge Rewriting for Complex Knowledge Graph Question Answering. 3501-3520 - Wenlong Fei, Xiaohua Wang, Min Hu, Qingyu Zhang, Hongbo Li:
MTLS: Making Texts into Linguistic Symbols. 3521-3535 - Yifan Chen, Kuntao Li
, Weixing Mai, Qiaofeng Wu
, Yun Xue, Fenghuan Li:
D2R: Dual-Branch Dynamic Routing Network for Multimodal Sentiment Detection. 3536-3547 - Chang Tian, Matthew B. Blaschko
, Wenpeng Yin, Mingzhe Xing, Yinliang Yue, Marie-Francine Moens:
A Generic Method for Fine-grained Category Discovery in Natural Language Texts. 3548-3566 - Yang Trista Cao, Lovely-Frances Domingo, Sarah A. Gilbert, Michelle L. Mazurek, Katie Shilton, Hal Daumé III:
Toxicity Detection is NOT all you Need: Measuring the Gaps to Supporting Volunteer Content Moderators through a User-Centric Method. 3567-3587 - Jiayin Wang, Fengran Mo, Weizhi Ma, Peijie Sun, Min Zhang, Jian-Yun Nie:
A User-Centric Multi-Intent Benchmark for Evaluating Large Language Models. 3588-3612 - Qian Yang, Weixiang Yan, Aishwarya Agrawal:
Decompose and Compare Consistency: Measuring VLMs' Answer Reliability via Task-Decomposition Consistency Comparison. 3613-3627 - Lang Cao:
Learn to Refuse: Making Large Language Models More Controllable and Reliable through Knowledge Scope Limitation and Refusal Mechanism. 3628-3646 - Bocheng Zou, Mu Cai, Jianrui Zhang, Yong Jae Lee:
VGBench: Evaluating Large Language Models on Vector Graphics Understanding and Generation. 3647-3659 - Shenbin Qian, Archchana Sindhujan, Minnie Kabra, Diptesh Kanojia
, Constantin Orasan
, Tharindu Ranasinghe, Frédéric Blain
:
What do Large Language Models Need for Machine Translation Evaluation? 3660-3674 - Flavio Palo, Prateek Singhi, Bilal Fadlallah:
Performance-Guided LLM Knowledge Distillation for Efficient Text Classification at Scale. 3675-3687 - Debela Gemechu, Chris Reed
:
External Knowledge-Driven Argument Mining: Leveraging Attention-Enhanced Multi-Network Models. 3688-3709 - Maaz Bin Musa, Steven M. Winston, Garrison Allen, Jacob Schiller, Kevin Moore, Sean Quick, Johnathan Melvin, Padmini Srinivasan, Mihailis Diamantis, Rishab Nithyanand:
C3PA: An Open Dataset of Expert-Annotated and Regulation-Aware Privacy Policies to Enable Scalable Regulatory Compliance Audits. 3710-3722 - Taowen Wang, Yiyang Liu, James Liang, Junhan Zhao, Yiming Cui, Yuning Mao, Shaoliang Nie, Jiahao Liu, Fuli Feng, Zenglin Xu, Cheng Han
, Lifu Huang, Qifan Wang, Dongfang Liu:
M²PT: Multimodal Prompt Tuning for Zero-shot Instruction Learning. 3723-3740 - Letian Peng, Yi Gu
, Chengyu Dong, Zihan Wang, Jingbo Shang:
Text Grafting: Near-Distribution Weak Supervision for Minority Classes in Text Classification. 3741-3752 - Letian Peng, Zilong Wang, Jingbo Shang:
Incubating Text Classifiers Following User Instruction with Nothing but LLM. 3753-3766 - Ruilin Luo, Liyuan Wang, Binghuai Lin, Zicheng Lin, Yujiu Yang:
PTD-SQL: Partitioning and Targeted Drilling with LLMs in Text-to-SQL. 3767-3799 - Wesley H. Holliday, Matthew Mandelkern, Cedegao Zhang:
Conditional and Modal Reasoning in Large Language Models. 3800-3821 - Lei Huang, Xiaocheng Feng, Weitao Ma, Liang Zhao, Yuchun Fan, Weihong Zhong, Dongliang Xu, Qing Yang, Hongtao Liu, Bing Qin
:
Advancing Large Language Model Attribution through Self-Improving. 3822-3836 - Ziqi Liang, Haoxiang Shi, Hanhui Chen:
AlignCap: Aligning Speech Emotion Captioning to Human Preferences. 3837-3846 - Yihuai Hong, Aldo Lipani:
Interpretability-based Tailored Knowledge Editing in Transformers. 3847-3858 - Yongchao Chen, Jacob Arkin, Yilun Hao, Yang Zhang, Nicholas Roy, Chuchu Fan:
PRompt Optimization in Multi-Step Tasks (PROMST): Integrating Human Feedback and Heuristic-based Sampling. 3859-3920 - Chen Cai, Zheng Wang, Jianjun Gao, Wenyang Liu, Ye Lu, Runzhong Zhang, Kim-Hui Yap:
Empowering Large Language Model for Continual Video Question Answering with Collaborative Prompting. 3921-3932 - Yihuai Hong, Yuelin Zou, Lijie Hu, Ziqian Zeng, Di Wang
, Haiqin Yang:
Dissecting Fine-Tuning Unlearning in Large Language Models. 3933-3941 - Zhengxuan Wu, Yuhao Zhang, Peng Qi, Yumo Xu, Rujun Han, Yian Zhang, Jifan Chen, Bonan Min, Zhiheng Huang:
Dancing in Chains: Reconciling Instruction Following and Faithfulness in Language Models. 3942-3965 - Renato Lui Geh, Honghua Zhang, Kareem Ahmed, Benjie Wang, Guy Van den Broeck:
Where is the signal in tokenization space? 3966-3979 - Tianhao Huang
, Tao Yang, Ivan Habernal, Lijie Hu, Di Wang
:
Private Language Models via Truncated Laplacian Mechanism. 3980-3993 - Daniela Gottesman, Mor Geva:
Estimating Knowledge in Large Language Models Without Generating a Single Token. 3994-4019 - Lan Zhang
, Xin Quan
, André Freitas
:
Consistent Autoformalization for Constructing Mathematical Libraries. 4020-4033 - Yufei Tao, Adam Hiatt, Erik Haake, Antonie J. Jetter, Ameeta Agrawal:
When Context Leads but Parametric Memory Follows in Large Language Models. 4034-4058 - Aditya Yedetore, Najoung Kim:
Semantic Training Signals Promote Hierarchical Syntactic Generalization in Transformers. 4059-4073 - Tyler A. Chang, Catherine Arnett, Zhuowen Tu, Ben Bergen:
When Is Multilinguality a Curse? Language Modeling for 250 High- and Low-Resource Languages. 4074-4096 - Jiajun Xi, Yinong He, Jianing Yang, Yinpei Dai, Joyce Chai:
Teaching Embodied Reinforcement Learning Agents: Informativeness and Diversity of Language Use. 4097-4114 - Kevin Robinson, Sneha Kudugunta, Romina Stella, Sunipa Dev, Jasmijn Bastings:
MiTTenS: A Dataset for Evaluating Gender Mistranslation. 4115-4124 - Shangbin Feng, Weijia Shi, Yike Wang, Wenxuan Ding, Orevaoghene Ahia, Shuyue Stella Li, Vidhisha Balachandran, Sunayana Sitaram, Yulia Tsvetkov:
Teaching LLMs to Abstain across Languages via Multilingual Feedback. 4125-4150 - Shangbin Feng, Taylor Sorensen, Yuhan Liu, Jillian Fisher, Chan Young Park, Yejin Choi, Yulia Tsvetkov:
Modular Pluralism: Pluralistic Alignment via Multi-LLM Collaboration. 4151-4171 - Jillian Fisher, Skyler Hallinan, Ximing Lu, Mitchell L. Gordon, Zaïd Harchaoui, Yejin Choi:
StyleRemix: Interpretable Authorship Obfuscation via Distillation and Perturbation of Style Elements. 4172-4206 - Wenting Zhao, Ge Gao, Claire Cardie, Alexander M. Rush:
I Could've Asked That: Reformulating Unanswerable Questions. 4207-4220 - Robert Morabito, Sangmitra Madhusudan, Tyler McDonald, Ali Emami:
STOP! Benchmarking Large Language Models with Sensitivity Testing on Offensive Progressions. 4221-4243 - Yujin Potter, Shiyang Lai, Junsol Kim, James Evans, Dawn Song:
Hidden Persuaders: LLMs' Political Leaning and Their Influence on Voters. 4244-4275 - Jinghan Jia, Yihua Zhang, Yimeng Zhang, Jiancheng Liu, Bharat Runwal, James Diffenderfer, Bhavya Kailkhura, Sijia Liu:
SOUL: Unlocking the Power of Second-Order Optimization for LLM Unlearning. 4276-4292 - Yebowen Hu, Kaiqiang Song, Sangwoo Cho, Xiaoyang Wang, Wenlin Yao, Hassan Foroosh, Dong Yu, Fei Liu:
When Reasoning Meets Information Aggregation: A Case Study with Sports Narratives. 4293-4308 - Vu Trong Kim, Michael Krumdick, Varshini Reddy, Franck Dernoncourt, Viet Dac Lai:
An Analysis of Multilingual FActScore. 4309-4333 - Seungone Kim, Juyoung Suk, Shayne Longpre, Bill Yuchen Lin, Jamin Shin, Sean Welleck, Graham Neubig, Moontae Lee, Kyungjae Lee
, Minjoon Seo:
Prometheus 2: An Open Source Language Model Specialized in Evaluating Other Language Models. 4334-4353 - Rujun Han, Yuhao Zhang, Peng Qi, Yumo Xu, Jenyuan Wang, Lan Liu, William Yang Wang, Bonan Min, Vittorio Castelli:
RAG-QA Arena: Evaluating Domain Robustness for Long-form Retrieval Augmented Question Answering. 4354-4374 - Shengyao Zhuang, Xueguang Ma, Bevan Koopman, Jimmy Lin, Guido Zuccon
:
PromptReps: Prompting Large Language Models to Generate Dense and Sparse Representations for Zero-Shot Document Retrieval. 4375-4391 - Orevaoghene Ahia, Anuoluwapo Aremu, Diana Abagyan, Hila Gonen, David Ifeoluwa Adelani, Daud Abolade, Noah A. Smith, Yulia Tsvetkov:
Voices Unheard: NLP Resources and Models for Yorùbá Regional Dialects. 4392-4409 - Ju-Seung Byun, Jiyun Chun, Jihyung Kil, Andrew Perrault:
ARES: Alternating Reinforcement Learning and Supervised Fine-Tuning for Enhanced Multi-Modal Chain-of-Thought Reasoning Through Diverse AI Feedback. 4410-4430 - Rongting Zhang, Martín Bertrán, Aaron Roth:
Order of Magnitude Speedups for LLM Membership Inference. 4431-4443 - Yuwei Fang, Willi Menapace, Aliaksandr Siarohin, Tsai-Shien Chen, Kuan-Chieh Wang, Ivan Skorokhodov, Graham Neubig, Sergey Tulyakov:
VIMI: Grounding Video Generation through Multi-modal Instruction. 4444-4456 - Haiyang Wang, Yuchen Pan, Xin Song, Xuechen Zhao, Minghao Hu, Bin Zhou:
F²RL: Factuality and Faithfulness Reinforcement Learning Framework for Claim-Guided Evidence-Supported Counterspeech Generation. 4457-4470 - Chang Yang, Peng Zhang, Hui Gao, Jing Zhang:
Deciphering Rumors: A Multi-Task Learning Approach with Intent-aware Hierarchical Contrastive Learning. 4471-4483 - Qixuan Zhang, Zhifeng Wang
, Dylan Zhang, Wenjia Niu, Sabrina B. Caldwell
, Tom Gedeon, Yang Liu, Zhenyue Qin:
Visual Prompting in LLMs for Enhancing Emotion Recognition. 4484-4499 - Pengcheng Li, Xulong Zhang, Jing Xiao, Jianzong Wang:
IDEAW: Robust Neural Audio Watermarking with Invertible Dual-Embedding. 4500-4511 - Che-Wei Tsai, Yen-Hao Huang, Tsu-Keng Liao, Didier Estrada, Retnani Latifah, Yi-Shin Chen:
Leveraging Conflicts in Social Media Posts: Unintended Offense Dataset. 4512-4522 - Lingzi Hong
, Pengcheng Luo, Eduardo Blanco, Xiaoying Song:
Outcome-Constrained Large Language Models for Countering Hate Speech. 4523-4536 - Changbing Yang, Garrett Nicolai, Miikka Silfverberg:
Multiple Sources are Better Than One: Incorporating External Knowledge in Low-Resource Glossing. 4537-4552 - Ao Wang, Xinghao Yang, Chen Li, Baodi Liu, Weifeng Liu:
Adaptive Immune-based Sound-Shape Code Substitution for Adversarial Chinese Text Attacks. 4553-4565 - Yangyang Zhao, Ben Niu, Mehdi Dastani, Shihan Wang:
Bootstrapped Policy Learning for Task-oriented Dialogue through Goal Shaping. 4566-4580 - Huachuan Qiu, Lizhi Ma, Zhenzhong Lan:
PsyGUARD: An Automated System for Suicide Detection and Risk Assessment in Psychological Counseling. 4581-4607 - Jiacong Wang, Bohong Wu, Haiyong Jiang, Xun Zhou, Xin Xiao, Haoyuan Guo, Jun Xiao:
World to Code: Multi-modal Data Generation via Self-Instructed Compositional Captioning and Filtering. 4608-4623 - Jing Jin, Houfeng Wang, Hao Zhang, Xiaoguang Li, Zhijiang Guo:
DVD: Dynamic Contrastive Decoding for Knowledge Amplification in Multi-Document Question Answering. 4624-4637 - Long Li, Xuzheng He, Haozhe Wang
, Linlin Wang, Liang He:
How Do Humans Write Code? Large Models Do It the Same Way Too. 4638-4649 - Yufei Xiang, Yiqun Shen, Yeqin Zhang, Cam-Tu Nguyen:
Retrospex: Language Agent Meets Offline Reinforcement Learning Critic. 4650-4666 - Xinyu Liu, Runsong Zhao, Pengcheng Huang, Chunyang Xiao, Bei Li, Jingang Wang, Tong Xiao, JingBo Zhu:
Forgetting Curve: A Reliable Method for Evaluating Memorization Capability for Long-Context Models. 4667-4682 - Yuanjie Lyu
, Zihan Niu, Zheyong Xie, Chao Zhang, Tong Xu, Yang Wang, Enhong Chen:
Retrieve-Plan-Generation: An Iterative Planning and Answering Framework for Knowledge-Intensive LLM Generation. 4683-4702 - Renhao Li, Minghuan Tan, Derek F. Wong, Min Yang:
CoEvol: Constructing Better Responses for Instruction Finetuning through Multi-Agent Cooperation. 4703-4721 - Bowen Jiang, Yangxinyu Xie
, Zhuoqun Hao, Xiaomeng Wang, Tanwi Mallick, Weijie Su, Camillo J. Taylor, Dan Roth:
A Peek into Token Bias: Large Language Models Are Not Yet Genuine Reasoners. 4722-4756 - Yicheng Gao
, Gonghan Xu, Zhe Wang, Arman Cohan:
Bayesian Calibration of Win Rate Estimation with LLM Evaluators. 4757-4769 - Shuo Yin, Weihao You, Zhilong Ji, Guoqiang Zhong, Jinfeng Bai:
MuMath-Code: Combining Tool-Use Large Language Models with Multi-perspective Data Augmentation for Mathematical Reasoning. 4770-4785 - Weijun Li
, Qiongkai Xu
, Mark Dras
:
Seeing the Forest through the Trees: Data Leakage from Partial Transformer Gradients. 4786-4798 - Tiancheng Gu, Kaicheng Yang, Xiang An, Ziyong Feng, Dongnan Liu, Weidong Cai, Jiankang Deng:
RWKV-CLIP: A Robust Vision-Language Representation Learner. 4799-4812 - Mir Tafseer Nayeem, Davood Rafiei:
KidLM: Advancing Language Models for Children - Early Insights and Future Directions. 4813-4836 - Josh Barua, Sanjay Subramanian, Kayo Yin, Alane Suhr:
Using Language Models to Disambiguate Lexical Choices in Translation. 4837-4848 - Zhuoyan Li, Chen Liang, Jing Peng, Ming Yin:
How Does the Disclosure of AI Assistance Affect the Perceptions of Writing? 4849-4868 - Joakim Edin, Maria Maistro
, Lars Maaløe, Lasse Borgholt, Jakob D. Havtorn, Tuukka Ruotsalo
:
An Unsupervised Approach to Achieve Supervised-Level Explainability in Healthcare Records. 4869-4890 - Zheng Wang, Zhongyang Li, Zeren Jiang, Dandan Tu, Wei Shi:
Crafting Personalized Agents through Retrieval-Augmented Generation on Editable Memory Graphs. 4891-4906 - Jiateng Liu, Pengfei Yu, Yuji Zhang, Sha Li, Zixuan Zhang, Ruhi Sarikaya, Kevin Small, Heng Ji:
EVEDIT: Event-based Knowledge Editing for Deterministic Knowledge Propagation. 4907-4926 - Tatsuya Aoyama, Nathan Schneider:
Modeling Nonnative Sentence Processing with L2 Language Models. 4927-4940 - Chuanqi Cheng, Jian Guan, Wei Wu, Rui Yan:
From the Least to the Most: Building a Plug-and-Play Visual Reasoner via Data Synthesis. 4941-4957 - Shadi Iskander, Sofia Tolmach, Ori Shapira, Nachshon Cohen, Zohar Karnin:
Quality Matters: Evaluating Synthetic Data for Tool-Using LLMs. 4958-4976 - Yuang Li, Min Zhang, Mengxin Ren, Xiaosong Qiao, Miaomiao Ma, Daimeng Wei, Hao Yang:
Cross-Domain Audio Deepfake Detection: Dataset and Analysis. 4977-4983 - Ting Liu, Zunnan Xu, Yue Hu, Liangtao Shi, Zhiqiang Wang, Quanjun Yin:
MaPPER: Multimodal Prior-guided Parameter Efficient Tuning for Referring Expression Comprehension. 4984-4994 - Miyoung Ko, Sue Hyun Park, Joonsuk Park, Minjoon Seo:
Hierarchical Deconstruction of LLM Reasoning: A Graph-Based Framework for Analyzing Knowledge Utilization. 4995-5027 - Yichong Huang, Baohang Li, Xiaocheng Feng, Wenshuai Huo, Chengpeng Fu, Ting Liu, Bing Qin
:
Aligning Translation-Specific Understanding to General Understanding in Large Language Models. 5028-5041 - Mohamad Ballout, Anne Dedert, Nohayr Abdelmoneim, Ulf Krumnack, Gunther Heidemann, Kai-Uwe Kühnberger:
FOOL ME IF YOU CAN! An Adversarial Dataset to Investigate the Robustness of LMs in Word Sense Disambiguation. 5042-5059 - Jaewoo Lee, Boyang Li, Sung Ju Hwang:
Concept-skill Transferability-based Data Selection for Large Vision-Language Models. 5060-5080 - Jiangshu Du, Yibo Wang, Wenting Zhao, Zhongfen Deng, Shuaiqi Liu, Renze Lou, Henry Peng Zou, Pranav Narayanan Venkit, Nan Zhang, Mukund Srinath, Haoran Zhang, Vipul Gupta, Yinghui Li, Tao Li, Fei Wang, Qin Liu, Tianlin Liu, Pengzhi Gao, Congying Xia, Chen Xing, Cheng Jiayang, Zhaowei Wang, Ying Su, Raj Sanjay Shah, Ruohao Guo, Jing Gu, Haoran Li, Kangda Wei, Zihao Wang, Lu Cheng, Surangika Ranathunga, Meng Fang, Jie Fu, Fei Liu, Ruihong Huang, Eduardo Blanco, Yixin Cao, Rui Zhang, Philip S. Yu, Wenpeng Yin:
LLMs Assist NLP Researchers: Critique Paper (Meta-)Reviewing. 5081-5099 - Mark Dredze, Genta Indra Winata, Prabhanjan Kambadur, Shijie Wu, Ozan Irsoy, Steven Lu, Vadim Dabravolski, David S. Rosenberg, Sebastian Gehrmann:
Academics Can Contribute to Domain-Specialized Language Models. 5100-5110 - Keonwoong Noh, Seokjin Oh, Woohwan Jung:
Beyond Reference: Evaluating High Quality Translations Better than Human References. 5111-5127 - Pengwei Zhan, Zhen Xu, Qian Tan, Jie Song, Ru Xie:
Unveiling the Lexical Sensitivity of LLMs: Combinatorial Optimization for Prompt Enhancement. 5128-5154 - Holy Lovenia, Rahmad Mahendra, Salsabil Maulana Akbar, Lester James V. Miranda, Jennifer Santoso, Elyanah Aco, Akhdan Fadhilah, Jonibek Mansurov, Joseph Marvin Imperial, Onno Kampman, Joel Ruben Antony Moniz, Muhammad Ravi Shulthan Habibi, Frederikus Hudi, Jann Montalan, Ryan Hadiwijaya, Joanito Agili Lopo, William Nixon, Börje Karlsson, James Jaya, Ryandito Diandaru, Yuze Gao, Patrick Amadeus Irawan, Bin Wang, Jan Christian Blaise Cruz, Chenxi Whitehouse, Ivan Halim Parmonangan, Maria Khelli, Wenyu Zhang, Lucky Susanto, Reynard Adha Ryanda, Sonny Lazuardi Hermawan, Dan John Velasco, Muhammad Dehan Al Kautsar, Willy Fitra Hendria, Yasmin Moslem
, Noah Flynn, Muhammad Farid Adilazuarda, Haochen Li, Johanes Lee, R. Damanhuri, Shuo Sun, Muhammad Reza Qorib, Amirbek Djanibekov, Wei Qi Leong, Quyet V. Do, Niklas Muennighoff, Tanrada Pansuwan, Ilham Firdausi Putra, Yan Xu, Ngee Tai Chia, Ayu Purwarianti
, Sebastian Ruder, William-Chandra Tjhi, Peerat Limkonchotiwat, Alham Fikri Aji, Sedrick Keh, Genta Indra Winata, Ruochen Zhang, Fajri Koto, Zheng Xin Yong, Samuel Cahyawijaya:
SEACrowd: A Multilingual Multimodal Data Hub and Benchmark Suite for Southeast Asian Languages. 5155-5203 - Po-Chun Chen, Sheng-Lun Wei, Hen-Hsen Huang, Hsin-Hsi Chen:
Induct-Learn: Short Phrase Prompting with Instruction Induction. 5204-5231 - Shi Mingcong, Chunjiang Zhu, Detian Zhang, Shiting Wen, Qing Li:
Multi-Granularity History and Entity Similarity Learning for Temporal Knowledge Graph Reasoning. 5232-5243 - Caiqi Zhang, Fangyu Liu, Marco Basaldella, Nigel Collier:
LUQ: Long-text Uncertainty Quantification for LLMs. 5244-5262 - Weichao Zhang, Ruqing Zhang, Jiafeng Guo, Maarten de Rijke, Yixing Fan, Xueqi Cheng:
Pretraining Data Detection for Large Language Models: A Divergence-based Calibration Method. 5263-5274 - Damien Sileo:
Scaling Synthetic Logical Reasoning Datasets with Context-Sensitive Declarative Grammars. 5275-5283 - Maxime Poli
, Emmanuel Chemla, Emmanuel Dupoux:
Improving Spoken Language Modeling with Phoneme Classification: A Simple Fine-tuning Approach. 5284-5292 - Jiaying Zheng
, Hainan Zhang, Lingxiang Wang, Wangjie Qiu, Hong-Wei Zheng, Zhi Ming Zheng:
Safely Learning with Private Data: A Federated Learning Framework for Large Language Model. 5293-5306 - Jiahuan Li, Yiqing Cao, Shujian Huang, Jiajun Chen:
Formality is Favored: Unraveling the Learning Preferences of Large Language Models on Data with Conflicting Knowledge. 5307-5320 - Yang Luo, Zangwei Zheng, Zirui Zhu, Yang You:
How Does the Textual Information Affect the Retrieval of Multimodal In-Context Learning? 5321-5335 - Shirley Anugrah Hayati, Minhwa Lee, Dheeraj Rajagopal, Dongyeop Kang:
How Far Can We Extract Diverse Perspectives from Large Language Models? 5336-5366 - Kiran Purohit, Venktesh V, Raghuram Devalla, Krishna Yerragorla, Sourangshu Bhattacharya, Avishek Anand:
EXPLORA: Efficient Exemplar Subset Selection for Complex Reasoning. 5367-5388 - Lexin Zhou, Youmna Farag, Andreas Vlachos
:
An LLM Feature-based Framework for Dialogue Constructiveness Assessment. 5389-5409 - Zhanpeng Chen, Zhihong Zhu, Wanshi Xu, Xianwei Zhuang, Yuexian Zou:
Relevance Is a Guiding Light: Relevance-aware Adaptive Learning for End-to-end Task-oriented Dialogue System. 5410-5420 - Sergio Burdisso, Srikanth R. Madikeri, Petr Motlícek:
Dialog2Flow: Pre-training Soft-Contrastive Action-Driven Sentence Embeddings for Automatic Dialog Flow Extraction. 5421-5440 - Raphael Tang, Xinyu Zhang, Lixinyu Xu, Yao Lu, Wenyan Li, Pontus Stenetorp, Jimmy Lin, Ferhan Ture:
Words Worth a Thousand Pictures: Measuring and Understanding Perceptual Variability in Text-to-Image Generation. 5441-5454 - Ilias Chalkidis:
Investigating LLMs as Voting Assistants via Contextual Augmentation: A Case Study on the European Parliament Elections 2024. 5455-5467 - Mayi Xu, Yongqi Li, Ke Sun, Tieyun Qian:
Adaption-of-Thought: Learning Question Difficulty Improves Large Language Models for Reasoning. 5468-5495 - Shengda Fan, Yanting Wang, Shasha Mo, Jianwei Niu:
LogicST: A Logical Self-Training Framework for Document-Level Relation Extraction with Incomplete Annotations. 5496-5510 - Qiwei Peng, Anders Søgaard:
Concept Space Alignment in Multilingual LLMs. 5511-5526 - Chenhan Yuan, Fei Huang, Ru Peng, Keming Lu, Bowen Yu, Chang Zhou, Jingren Zhou:
Predicting Rewards Alongside Tokens: Non-disruptive Parameter Insertion for Efficient Inference Intervention in Large Language Model. 5527-5542 - Peng Liu, Lemei Zhang, Terje Nissen Farup, Even W. Lauvrak, Jon Espen Ingvaldsen, Simen Eide, Jon Atle Gulla, Zhirong Yang:
NLEBench+NorGLM: A Comprehensive Empirical Analysis and Benchmark Dataset for Generative Language Models in Norwegian. 5543-5560 - Yifan Wang, Vera Demberg:
RSA-Control: A Pragmatics-Grounded Lightweight Controllable Text Generation Framework. 5561-5582 - Siqi Wang, Zhengyu Chen, Bei Li, Keqing He, Min Zhang, Jingang Wang:
Scaling Laws Across Model Architectures: A Comparative Analysis of Dense and MoE Models in Large Language Models. 5583-5595 - Vishal Vivek Saley, Rocktim Jyoti Das, Dinesh Raghu, Mausam:
Synergizing In-context Learning with Hints for End-to-end Task-oriented Dialog Systems. 5596-5612 - Yuhao Wang, Ruiyang Ren, Junyi Li, Xin Zhao, Jing Liu, Ji-Rong Wen:
REAR: A Relevance-Aware Retrieval-Augmented Framework for Open-Domain Question Answering. 5613-5626 - Minzheng Wang, Longze Chen, Fu Cheng, Shengyi Liao, Xinghua Zhang, Bingli Wu, Haiyang Yu, Nan Xu, Lei Zhang, Run Luo, Yunshui Li, Min Yang, Fei Huang, Yongbin Li:
Leave No Document Behind: Benchmarking Long-Context LLMs with Extended Multi-Doc QA. 5627-5646 - Monorama Swain, Anna Zee, Anders Søgaard:
On Mitigating Performance Disparities in Multilingual Speech Recognition. 5647-5655 - Stephen Meisenbacher
, Florian Matthes:
Thinking Outside of the Differential Privacy Box: A Case Study in Text Privatization with Language Model Prompting. 5656-5665 - Junyan Lin, Haoran Chen, Dawei Zhu, Xiaoyu Shen:
To Preserve or To Compress: An In-Depth Study of Connector Selection in Multimodal Large Language Models. 5666-5680 - Esther Ploeger
, Wessel Poelman
, Miryam de Lhoneux, Johannes Bjerva
:
What is "Typological Diversity" in NLP? 5681-5700 - Xiaobo Guo, Neil Potnis, Melody Yu, Nabeel Gillani, Soroush Vosoughi:
The Computational Anatomy of Humility: Modeling Intellectual Humility in Online Public Discourse. 5701-5723 - Georgi Shopov, Stefan Gerdjikov:
Consistent Bidirectional Language Modelling: Expressive Power and Representational Conciseness. 5724-5768 - Shravan Nayak, Kanishk Jain, Rabiul Awal, Siva Reddy, Sjoerd van Steenkiste, Lisa Anne Hendricks, Karolina Stanczak, Aishwarya Agrawal:
Benchmarking Vision Language Models for Cultural Understanding. 5769-5790 - Olga Iakovenko
, Thomas Hain
:
Methods of Automatic Matrix Language Determination for Code-Switched Speech. 5791-5800 - Jaewook Lee, Yeajin Jang, Hongjin Kim, Woojin Lee, Harksoo Kim:
Analyzing Key Factors Influencing Emotion Prediction Performance of VLLMs in Conversational Contexts. 5801-5816 - Jerry Huang, Prasanna Parthasarathi, Mehdi Rezagholizadeh, Sarath Chandar:
Context-Aware Assistant Selection for Improved Inference Acceleration with Large Language Models. 5817-5830 - Tao Feng, Yicheng Li, Chenglin Li, Hao Chen, Fei Yu, Yin Zhang:
Teaching Small Language Models Reasoning through Counterfactual Distillation. 5831-5842 - Meet Doshi, Raj Dabre, Pushpak Bhattacharyya:
Pretraining Language Models Using Translationese. 5843-5862 - Kyle Buettner, Adriana Kovashka:
Quantifying the Gaps Between Translation and Native Perception in Training for Multimodal, Multilingual Retrieval. 5863-5870 - Qixi Lu, Endong Xun, Gongbo Tang:
MTA4DPR: Multi-Teaching-Assistants Based Iterative Knowledge Distillation for Dense Passage Retrieval. 5871-5883 - Aida Kostikova, Dominik Beese
, Benjamin Paassen, Ole Pütz
, Gregor Wiedemann, Steffen Eger:
Fine-Grained Detection of Solidarity for Women and Migrants in 155 Years of German Parliamentary Debates. 5884-5907 - Yu Bai, Xiyuan Zou, Heyan Huang, Sanxing Chen, Marc-Antoine Rondeau, Yang Gao, Jackie C. K. Cheung:
CItruS: Chunked Instruction-aware State Eviction for Long Sequence Modeling. 5908-5930 - Hans Ole Hatzel, Chris Biemann:
Story Embeddings - Narrative-Focused Representations of Fictional Stories. 5931-5943 - Kunting Li, Yong Hu, Liang He, Fandong Meng, Jie Zhou:
C-LLM: Learn to Check Chinese Spelling Errors Character by Character. 5944-5957 - Wenqiao Zhu, Chao Xu, Lulu Wang, Jun Wu:
PSC: Extending Context Window of Large Language Models via Phase Shift Calibration. 5958-5970 - Bin Lin, Yang Ye, Bin Zhu, Jiaxi Cui, Munan Ning, Peng Jin, Li Yuan:
Video-LLaVA: Learning United Visual Representation by Alignment Before Projection. 5971-5984 - Tianyang Xu, Shujin Wu, Shizhe Diao, Xiaoze Liu, Xingyao Wang, Yangyi Chen, Jing Gao:
SaySelf: Teaching LLMs to Express Confidence with Self-Reflective Rationales. 5985-5998 - Richard Diehl Martinez, Zebulon Goriely, Andrew Caines, Paula Buttery, Lisa Beinborn:
Mitigating Frequency Bias and Anisotropy in Language Model Pre-Training with Syntactic Smoothing. 5999-6011 - Yunze Xiao
, Yujia Hu, Kenny T. W. Choo
, Roy Ka-Wei Lee:
ToxiCloakCN: Evaluating Robustness of Offensive Language Detection in Chinese with Cloaking Perturbations. 6012-6025 - Siyu Yuan, Cheng Jiayang, Lin Qiu, Deqing Yang:
Boosting Scientific Concepts Understanding: Can Analogy from Teacher Models Empower Student Models? 6026-6036 - Jirui Qi
, Gabriele Sarti
, Raquel Fernández, Arianna Bisazza:
Model Internals-based Answer Attribution for Trustworthy Retrieval-Augmented Generation. 6037-6053 - Gabriele Prato, Jerry Huang, Prasanna Parthasarathi, Shagun Sodhani, Sarath Chandar:
Do Large Language Models Know How Much They Know? 6054-6070 - Somin Wadhwa, Silvio Amir, Byron C. Wallace:
Investigating Mysteries of CoT-Augmented Distillation. 6071-6086 - Zhiwen You
, Kanyao Han
, Haotian Zhu, Bertram Ludäscher, Jana Diesner:
SciPrompt: Knowledge-augmented Prompting for Fine-grained Categorization of Scientific Topics. 6087-6104 - Samyadeep Basu, Shell Xu Hu, Maziar Sanjabi, Daniela Massiceti, Soheil Feizi:
Distilling Knowledge from Text-to-Image Generative Models Improves Visio-Linguistic Reasoning in CLIP. 6105-6113 - Somin Wadhwa, Adit Krishnan, Runhui Wang, Byron C. Wallace, Luyang Kong:
Learning from Natural Language Explanations for Generalizable Entity Matching. 6114-6129 - Zhuohang Li, Jiaxin Zhang, Chao Yan, Kamalika Das, Kumar Sricharan, Murat Kantarcioglu, Bradley A. Malin:
Do You Know What You Are Talking About? Characterizing Query-Knowledge Relevance For Reliable Retrieval Augmented Generation. 6130-6151 - Jen-tse Huang, Wenxiang Jiao, Man Ho Lam, Eric John Li, Wenxuan Wang, Michael R. Lyu:
On the Reliability of Psychological Scales on Large Language Models. 6152-6173 - Abhishek Arora, Emily Silcock, Melissa Dell, Leander Heldring:
Contrastive Entity Coreference and Disambiguation for Historical Texts. 6174-6186 - Jeonghwan Kim, Heng Ji:
Finer: Investigating and Enhancing Fine-Grained Visual Concept Recognition in Large Vision Language Models. 6187-6207 - Sumit Asthana, Hannah Rashkin, Elizabeth Clark, Fantine Huot, Mirella Lapata:
Evaluating LLMs for Targeted Concept Simplification for Domain-Specific Texts. 6208-6226 - Lei Li, Zhihui Xie, Mukai Li, Shunian Chen, Peiyi Wang, Liang Chen, Yazheng Yang, Benyou Wang, Lingpeng Kong, Qi Liu:
VLFeedback: A Large-Scale AI Feedback Dataset for Large Vision-Language Models Alignment. 6227-6246 - Peiwen Yuan, Shaoxiong Feng, Yiwei Li, Xinglin Wang
, Yueqi Zhang, Chuyi Tan, Boyuan Pan, Heda Wang, Yao Hu, Kan Li:
Focused Large Language Models are Stable Many-Shot Learners. 6247-6261 - Garrett Tanzer, Maximus Shengelia, Ken Harrenstien, David Uthus:
Reconsidering Sentence-Level Sign Language Translation. 6262-6287 - Sreyan Ghosh, Sonal Kumar, Ashish Seth, Chandra Kiran Reddy Evuru, Utkarsh Tyagi, S. Sakshi, Oriol Nieto, Ramani Duraiswami, Dinesh Manocha:
GAMA: A Large Audio-Language Model with Advanced Audio Understanding and Complex Reasoning Abilities. 6288-6313 - Liviu P. Dinu, Ana Sabina Uban, Alina Maria Cristea, Ioan-Bogdan Iordache, Teodor-George Marchitan, Simona Georgescu, Laurentiu Zoicas:
Verba volant, scripta volant? Don't worry! There are computational solutions for protoword reconstruction. 6314-6326 - Victoria R. Li, Yida Chen, Naomi Saphra:
ChatGPT Doesn't Trust Chargers Fans: Guardrail Sensitivity in Context. 6327-6345 - Nitish Joshi, Javier Rando, Abulhair Saparov, Najoung Kim, He He:
Personas as a Way to Model Truthfulness in Language Models. 6346-6359 - Marko Sterbentz, Cameron Barrie, Shubham Shahi, Abhratanu Dutta, Donna Hooshmand, Harper Pack, Kristian J. Hammond:
Satyrn: A Platform for Analytics Augmented Generation. 6360-6385 - Ashish Seth, Ramaneswaran Selvakumar, S. Sakshi, Sonal Kumar, Sreyan Ghosh, Dinesh Manocha:
EH-MAM: Easy-to-Hard Masked Acoustic Modeling for Self-Supervised Speech Representation Learning. 6386-6400 - Qi Zhao, Haotian Fu, Chen Sun, George Konidaris:
EPO: Hierarchical LLM Agents with Environment Preference Optimization. 6401-6415 - Chantal Shaib, Yanai Elazar, Junyi Jessy Li, Byron C. Wallace:
Detection and Measurement of Syntactic Templates in Generated Text. 6416-6431 - Xinyu Pi, Mingyuan Wu, Jize Jiang, Haozhen Zheng, Beitong Tian, ChengXiang Zhai, Klara Nahrstedt, Zhiting Hu:
UOUO: Uncontextualized Uncommon Objects for Measuring Knowledge Horizons of Vision Language Models. 6432-6441 - Dominik Wagner, Seanie Lee, Ilja Baumann, Philipp Seeberger, Korbinian Riedhammer, Tobias Bocklet:
Optimized Speculative Sampling for GPU Hardware Accelerators. 6442-6458 - Zhaoxuan Tan, Zheyuan Liu, Meng Jiang:
Personalized Pieces: Efficient Personalized Large Language Models through Collaborative Efforts. 6459-6475 - Zhaoxuan Tan, Qingkai Zeng, Yijun Tian, Zheyuan Liu, Bing Yin, Meng Jiang:
Democratizing Large Language Models via Personalized Parameter-Efficient Fine-tuning. 6476-6491 - Xueguang Ma, Sheng-Chieh Lin, Minghan Li, Wenhu Chen, Jimmy Lin:
Unifying Multimodal Retrieval via Document Screenshot Embedding. 6492-6505 - Shaomu Tan, Di Wu, Christof Monz:
Neuron Specialization: Leveraging Intrinsic Task Modularity for Multilingual Machine Translation. 6506-6527 - Pranav Narayanan Venkit, Tatiana Chakravorti, Vipul Gupta, Heidi Biggs, Mukund Srinath, Koustava Goswami, Sarah Rajtmajer, Shomir Wilson:
An Audit on the Perspectives and Challenges of Hallucinations in NLP. 6528-6548 - Deniz Bayazit, Negar Foroutan, Zeming Chen, Gail Weiss, Antoine Bosselut:
Discovering Knowledge-Critical Subnetworks in Pretrained Language Models. 6549-6583 - Junjie Chu, Zeyang Sha, Michael Backes, Yang Zhang:
Reconstruct Your Previous Conversations! Comprehensively Investigating Privacy Leakage Risks in Conversations with GPT Models. 6584-6600 - Armin Toroghi, Willis Guo, Mohammad Mahdi Abdollah Pour, Scott Sanner:
Right for Right Reasons: Large Language Models for Verifiable Commonsense Knowledge Graph Question Answering. 6601-6633 - Armin Toroghi, Willis Guo, Ali Pesaranghader, Scott Sanner:
Verifiable, Debuggable, and Repairable Commonsense Logical Reasoning via LLM-based Theory Resolution. 6634-6652 - Kelly Marchisio, Wei-Yin Ko, Alexandre Berard, Théo Dehaze, Sebastian Ruder:
Understanding and Mitigating Language Confusion in LLMs. 6653-6677 - Gaël Gendron, Bao Trung Nguyen, Alex Yuxuan Peng, Michael J. Witbrock, Gillian Dobbie:
Can Large Language Models Learn Independent Causal Mechanisms? 6678-6701 - Sarfaroz Yunusov, Hamza Sidat, Ali Emami:
MirrorStories: Reflecting Diversity through Personalized Narrative Generation with Large Language Models. 6702-6717 - Ziyi Liu, Abhishek Anand, Pei Zhou, Jen-tse Huang, Jieyu Zhao:
InterIntent: Investigating Social Intelligence of LLMs via Intention Understanding in an Interactive Game Context. 6718-6746 - Farhan Samir, Chan Young Park, Anjalie Field
, Vered Shwartz, Yulia Tsvetkov:
Locating Information Gaps and Narrative Inconsistencies Across Languages: A Case Study of LGBT People Portrayals on Wikipedia. 6747-6762 - Mehar Bhatia, Sahithya Ravi, Aditya Chinchure, Eunjeong Hwang, Vered Shwartz:
From Local Concepts to Universals: Evaluating the Multicultural Understanding of Vision-Language Models. 6763-6782 - Karin de Langis, Ryan Koo, Dongyeop Kang:
Dynamic Multi-Reward Weighting for Multi-Style Controllable Generation. 6783-6800 - Jiahao Huo, Yibo Yan, Boren Hu, Yutao Yue, Xuming Hu:
MMNeuron: Discovering Neuron-Level Domain-Specific Interpretation in Multimodal Large Language Model. 6801-6816 - Haolun Wu, Ye Yuan, Liana Mikaelyan, Alexander Meulemans, Xue Liu, James Hensman, Bhaskar Mitra:
Learning to Extract Structured Entities Using Language Models. 6817-6834 - Adian Liusie, Vatsal Raina, Yassir Fathullah, Mark J. F. Gales:
Efficient LLM Comparative Assessment: A Product of Experts Framework for Pairwise Comparisons. 6835-6855 - Shira Wein, Juri Opitz:
A Survey of AMR Applications. 6856-6875 - Yiwu Zhong, Zi-Yuan Hu, Michael R. Lyu, Liwei Wang:
Beyond Embeddings: The Promise of Visual Table in Visual Reasoning. 6876-6911 - Shahla Farzana, Ivana Lucero, Vivian Villegas, Vera C. Kaelin, Mary A. Khetani, Natalie Parde
:
CareCorpus+: Expanding and Augmenting Caregiver Strategy Data to Support Pediatric Rehabilitation. 6912-6927 - Guanchu Wang, Yu-Neng Chuang, Ruixiang Tang, Shaochen Zhong, Jiayi Yuan, Hongye Jin, Zirui Liu, Vipin Chaudhary, Shuai Xu, James Caverlee, Xia Ben Hu:
Taylor Unswift: Secured Weight Release for Large Language Models via Taylor Expansion. 6928-6941 - Xinying Qian
, Ying Zhang, Yu Zhao, Baohang Zhou, Xuhui Sui, Li Zhang, Kehui Song:
TimeR⁴ : Time-aware Retrieval-Augmented Large Language Models for Temporal Knowledge Graph Question Answering. 6942-6952 - Xiangkun Hu, Dongyu Ru, Lin Qiu, Qipeng Guo, Tianhang Zhang, Yang Xu, Yun Luo, Pengfei Liu, Yue Zhang, Zheng Zhang:
Knowledge-Centric Hallucination Detection. 6953-6975 - Yongyu Mu, Peinan Feng, Zhiquan Cao, Yuzhang Wu, Bei Li, Chenglong Wang, Tong Xiao, Kai Song, Tongran Liu, Chunliang Zhang, JingBo Zhu:
Revealing the Parallel Multilingual Learning within Large Language Models. 6976-6997 - Weihao Zeng, Can Xu, Yingxiu Zhao, Jian-Guang Lou, Weizhu Chen:
Automatic Instruction Evolving for Large Language Models. 6998-7018 - Shuqian Sheng, Yi Xu, Tianhang Zhang, Zanwei Shen, Luoyi Fu, Jiaxin Ding, Lei Zhou, Xiaoying Gan, Xinbing Wang, Chenghu Zhou:
RepEval: Effective Text Evaluation with LLM Representation. 7019-7033 - Yuxin He, Buzhou Tang, Xiaoling Wang:
Generative Models for Automatic Medical Decision Rule Extraction from Text. 7034-7048 - Thong Nguyen, Zhiyuan Hu, Xiaobao Wu, Cong-Duy Nguyen, See-Kiong Ng, Anh Tuan Luu:
Encoding and Controlling Global Semantics for Long-form Video Question Answering. 7049-7066 - Yuping Lin, Pengfei He, Han Xu, Yue Xing, Makoto Yamada, Hui Liu, Jiliang Tang:
Towards Understanding Jailbreak Attacks in LLMs: A Representation Space Analysis. 7067-7085 - Cheng Gao, Chaojun Xiao, Zhenghao Liu, Huimin Chen, Zhiyuan Liu, Maosong Sun:
Enhancing Legal Case Retrieval via Scaling High-quality Synthetic Query-Candidate Pairs. 7086-7100 - Ran Song, Shizhu He, Shuting Jiang, Yantuan Xian, Shengxiang Gao, Kang Liu, Zhengtao Yu:
Does Large Language Model Contain Task-Specific Neurons? 7101-7113 - Philipp Mondorf, Barbara Plank:
Liar, Liar, Logical Mire: A Benchmark for Suppositional Reasoning in Large Language Models. 7114-7137 - Hongfu Liu, Hengguan Huang, Ye Wang:
Advancing Test-Time Adaptation in Wild Acoustic Test Settings. 7138-7155 - Yunmo Chen, Tongfei Chen, Harsh Jhamtani, Patrick Xia, Richard Shin, Jason Eisner, Benjamin Van Durme:
Learning to Retrieve Iteratively for In-Context Learning. 7156-7168 - SeongKu Kang
, Yunyi Zhang, Pengcheng Jiang, Dongha Lee, Jiawei Han, Hwanjo Yu:
Taxonomy-guided Semantic Indexing for Academic Paper Search. 7169-7184 - Xianzhen Luo, Qingfu Zhu, Zhiming Zhang, Libo Qin, Xuanyu Zhang, Qing Yang, Dongliang Xu, Wanxiang Che:
Python is Not Always the Best Choice: Embracing Multilingual Program of Thoughts. 7185-7212 - Hongfu Liu, Yuxi Xie, Ye Wang, Michael Shieh:
Advancing Adversarial Suffix Transfer Learning on Aligned Large Language Models. 7213-7224 - Zhiyu Cao, Peifeng Li, Yaxin Fan
, Qiaoming Zhu:
Incomplete Utterance Rewriting with Editing Operation Guidance and Utterance Augmentation. 7225-7238 - Yiyuan Li, Shichao Sun, Pengfei Liu:
FRoG: Evaluating Fuzzy Reasoning of Generalized Quantifiers in LLMs. 7239-7256 - Dominik Stammbach, Philine Widmer, Eunjung Cho, Caglar Gulcehre, Elliott Ash:
Aligning Large Language Models with Diverse Political Viewpoints. 7257-7267 - Huy Nghiem, John Prindle, Jieyu Zhao, Hal Daumé III:
"You Gotta be a Doctor, Lin" : An Investigation of Name-Based Bias of Large Language Models in Employment Recommendations. 7268-7287 - Yingsheng Wu, Yuxuan Gu
, Xiaocheng Feng, Weihong Zhong, Dongliang Xu, Qing Yang, Hongtao Liu, Bing Qin
:
Extending Context Window of Large Language Models from a Distributional Perspective. 7288-7301 - Hakyung Sung, Kristopher Kyle:
Leveraging pre-trained language models for linguistic analysis: A case of argument structure constructions. 7302-7314 - Lin Xu, Zhiyuan Hu, Daquan Zhou, Hongyu Ren, Zhen Dong, Kurt Keutzer, See-Kiong Ng, Jiashi Feng:
MAgIC: Investigation of Large Language Model Powered Multi-Agent in Cognition, Adaptability, Rationality and Collaboration. 7315-7332 - Zhiyuan He, Huiqiang Jiang, Zilong Wang, Yuqing Yang, Luna Qiu, Lili Qiu:
Position Engineering: Boosting Large Language Models through Positional Information Manipulation. 7333-7345 - Junying Chen, Chi Gui, Ruyi Ouyang, Anningzhe Gao, Shunian Chen, Guiming Chen, Xidong Wang, Zhenyang Cai, Ke Ji, Xiang Wan, Benyou Wang:
Towards Injecting Medical Visual Knowledge into Multimodal LLMs at Scale. 7346-7370 - Yunjia Qi, Hao Peng, Xiaozhi Wang, Bin Xu, Lei Hou, Juanzi Li:
ADELIE: Aligning Large Language Models on Information Extraction. 7371-7387 - Yifei Wang, Yuheng Chen, Wanting Wen, Yu Sheng, Linjing Li, Daniel Zeng:
Unveiling Factual Recall Behaviors of Large Language Models through Knowledge Neurons. 7388-7402 - Jindrich Libovický, Jindrich Helcl:
Lexically Grounded Subword Segmentation. 7403-7420 - Yuhui Li, Fangyun Wei, Chao Zhang, Hongyang Zhang:
EAGLE-2: Faster Inference of Language Models with Dynamic Draft Trees. 7421-7432 - Hy Nguyen, Xuefei He, Andrew Reeson, Cécile Paris, Josiah Poon, Jonathan K. Kummerfeld:
Do Text-to-Vis Benchmarks Test Real Use of Visualisations? 7433-7441 - Chengyuan Liu, Shihang Wang, Lizhi Qing, Kun Kuang, Yangyang Kang, Changlong Sun, Fei Wu:
Gold Panning in Vocabulary: An Adaptive Method for Vocabulary Expansion of Domain-Specific LLMs. 7442-7459 - Jingyu Hu, Weiru Liu, Mengnan Du:
Strategic Demonstration Selection for Improved Fairness in LLM In-Context Learning. 7460-7475 - Nguyen Dinh, Thanh Dang, Luan Thanh Nguyen, Kiet Van Nguyen:
Multi-Dialect Vietnamese: Task, Dataset, Baseline Models and Challenges. 7476-7498 - Vyas Raina, Adian Liusie, Mark J. F. Gales:
Is LLM-as-a-Judge Robust? Investigating Universal Adversarial Attacks on Zero-shot LLM Assessment. 7499-7517 - Zhicong Lu, Li Jin, Peiguang Li, Yu Tian, Linhao Zhang, Sirui Wang, Guangluan Xu, Changyuan Tian, Xunliang Cai:
Rethinking the Reversal Curse of LLMs: a Prescription from Human Knowledge Reversal. 7518-7530 - Chengyuan Liu, Yangyang Kang, Shihang Wang, Lizhi Qing, Fubang Zhao, Chao Wu, Changlong Sun, Kun Kuang, Fei Wu:
More Than Catastrophic Forgetting: Integrating General Capabilities For Domain-Specific LLMs. 7531-7548 - Vyas Raina, Rao Ma, Charles McGhee, Kate M. Knill, Mark J. F. Gales:
Muting Whisper: A Universal Acoustic Adversarial Attack on Speech Foundation Models. 7549-7565 - Georgios Katsimpras, Georgios Paliouras:
GENRA: Enhancing Zero-shot Retrieval with Rank Aggregation. 7566-7577 - Zichen Chen, Jianda Chen, Ambuj K. Singh, Misha Sra:
XplainLLM: A Knowledge-Augmented Dataset for Reliable Grounded Explanations in LLMs. 7578-7596 - Yuanpin Zhou, Huogen Wang:
Divide and Conquer Radiology Report Generation via Observation Level Fine-grained Pretraining and Prompt Tuning. 7597-7610 - Jiashuo Sun
, Jihai Zhang, Yucheng Zhou
, Zhaochen Su, Xiaoye Qu, Yu Cheng:
SURf: Teaching Large Vision-Language Models to Selectively Utilize Retrieved Information. 7611-7629 - Zhanyue Qin, Haochuan Wang, Deyuan Liu, Ziyang Song, Cunhang Fan, Zhao Lv, Jinlin Wu, Zhen Lei, Zhiying Tu, Dianhui Chu, Xiaoyan Yu, Dianbo Sui:
UNO Arena for Evaluating Sequential Decision-Making Capability of Large Language Models. 7630-7645 - Yu Gu, Yiheng Shu, Hao Yu, Xiao Liu, Yuxiao Dong, Jie Tang, Jayanth Srinivasa, Hugo Latapie, Yu Su:
Middleware for LLMs: Tools Are Instrumental for Language Agents in Complex Environments. 7646-7663 - Yihong Tang, Bo Wang, Dongming Zhao, Jinxiaojia Jinxiaojia, Zhangjijun Zhangjijun, Ruifang He, Yuexian Hou:
MORPHEUS: Modeling Role from Personalized Dialogue History by Exploring and Utilizing Latent Space. 7664-7676 - Wenhao Wang, Xiaoyu Liang, Rui Ye, Jingyi Chai, Siheng Chen, Yanfeng Wang:
KnowledgeSG: Privacy-Preserving Synthetic Text Generation with Knowledge Distillation from Server. 7677-7695 - Xuan Gong, Tianshi Ming, Xinpeng Wang, Zhihua Wei:
DAMRO: Dive into the Attention Mechanism of LVLM to Reduce Object Hallucination. 7696-7712 - Tianyi Men, Pengfei Cao, Zhuoran Jin, Yubo Chen, Kang Liu, Jun Zhao:
Unlocking the Future: Exploring Look-Ahead Planning Mechanistic Interpretability in Large Language Models. 7713-7724 - Wenzhen Zheng, Wenbo Pan, Xu Xu, Libo Qin, Li Yue, Ming Zhou:
Breaking Language Barriers: Cross-Lingual Continual Pre-Training at Scale. 7725-7738 - Patomporn Payoungkhamdee, Peerat Limkonchotiwat, Jinheon Baek, Potsawee Manakul
, Can Udomcharoenchaikit, Ekapol Chuangsuwanich, Sarana Nutanong:
An Empirical Study of Multilingual Reasoning Distillation for Question Answering. 7739-7751 - Gal Yona, Roee Aharoni, Mor Geva:
Can Large Language Models Faithfully Express Their Intrinsic Uncertainty in Words? 7752-7764 - Zorik Gekhman, Gal Yona, Roee Aharoni, Matan Eyal, Amir Feder, Roi Reichart, Jonathan Herzig:
Does Fine-Tuning LLMs on New Knowledge Encourage Hallucinations? 7765-7784 - Ming Shan Hee, Aditi Kumaresan, Roy Ka-Wei Lee:
Bridging Modalities: Enhancing Cross-Modality Hate Speech Detection with Few-Shot In-Context Learning. 7785-7799 - Baixuan Xu
, Weiqi Wang, Haochen Shi
, Wenxuan Ding, Huihao Jing
, Tianqing Fang, Jiaxin Bai, Xin Liu, Changlong Yu, Zheng Li, Chen Luo, Qingyu Yin, Bing Yin, Long Chen, Yangqiu Song:
MIND: Multimodal Shopping Intention Distillation from Large Vision-language Models for E-commerce Purchase Understanding. 7800-7815 - Cheng Jiayang, Chunkit Chan, Qianqian Zhuang, Lin Qiu, Tianhang Zhang, Tengxiao Liu, Yangqiu Song, Yue Zhang, Pengfei Liu, Zheng Zhang:
ECON: On the Detection and Resolution of Evidence Conflicts. 7816-7844 - Jonathan Tonglet, Marie-Francine Moens, Iryna Gurevych:
"Image, Tell me your story!" Predicting the original meta-context of visual misinformation. 7845-7864 - Zhili Shen, Pavlos Vougiouklis, Chenxin Diao, Kaustubh Vyas, Yuanyi Ji, Jeff Z. Pan:
Improving Retrieval-augmented Text-to-SQL with AST-based Ranking and Schema Pruning. 7865-7879 - Taiqiang Wu, Jiahao Wang, Zhe Zhao, Ngai Wong:
Mixture-of-Subspaces in Low-Rank Adaptation. 7880-7899 - Ishaan Watts, Varun Gumma, Aditya Yadavalli, Vivek Seshadri, Manohar Swaminathan, Sunayana Sitaram:
PARIKSHA: A Large-Scale Investigation of Human-LLM Evaluator Agreement on Multilingual and Multi-Cultural Data. 7900-7932 - Zhiwei Fei, Xiaoyu Shen, Dawei Zhu, Fengzhe Zhou, Zhuo Han, Alan Huang, Songyang Zhang, Kai Chen, Zhixin Yin, Zongwen Shen, Jidong Ge, Vincent Ng:
LawBench: Benchmarking Legal Knowledge of Large Language Models. 7933-7962 - Furkan Sahinuç, Thy Thy Tran
, Yulia Grishina, Yufang Hou, Bei Chen, Iryna Gurevych:
Efficient Performance Tracking: Leveraging Large Language Models for Automated Construction of Scientific Leaderboards. 7963-7977 - Adrian Bulat, Yassine Ouali, Ricardo Guerrero, Brais Martínez, Georgios Tzimiropoulos:
Efficient Vision-Language pre-training via domain-specific learning for human activities. 7978-8000 - Wenbo Li, Guohao Li, Zhibin Lan, Xue Xu, Wanru Zhuang, Jiachen Liu, Xinyan Xiao, Jinsong Su:
Empowering Backbone Models for Visual Text Generation with Input Granularity Control and Glyph-Aware Training. 8001-8014 - Xinfeng Yuan, Siyu Yuan, Yuhan Cui, Tianhe Lin, Xintao Wang, Rui Xu, Jiangjie Chen, Deqing Yang:
Evaluating Character Understanding of Large Language Models via Character Profiling from Fictional Works. 8015-8036 - Shimao Zhang, Changjiang Gao, Wenhao Zhu, Jiajun Chen, Xin Huang, Xue Han, Junlan Feng, Chao Deng, Shujian Huang:
Getting More from Less: Large Language Models are Good Spontaneous Multilingual Learners. 8037-8051 - Hao Sun, Jiayi Wu
, Hengyi Cai, Xiaochi Wei, Yue Feng, Bo Wang, Shuaiqiang Wang, Yan Zhang, Dawei Yin:
AdaSwitch: Adaptive Switching between Small and Large Agents for Effective Cloud-Local Collaborative Learning. 8052-8062 - Zi Gong, Hang Yu, Cong Liao, Bingchang Liu, Chaoyu Chen, Jianguo Li:
CoBa: Convergence Balancer for Multitask Finetuning of Large Language Models. 8063-8077 - Fei Wang, Wenxuan Zhou, James Y. Huang, Nan Xu, Sheng Zhang, Hoifung Poon, Muhao Chen:
mDPO: Conditional Preference Optimization for Multimodal Large Language Models. 8078-8088 - Fei Wang, Ninareh Mehrabi, Palash Goyal, Rahul Gupta, Kai-Wei Chang, Aram Galstyan:
Data Advisor: Dynamic Data Curation for Safety Alignment of Large Language Models. 8089-8100 - Kaj Bostrom, Harsh Jhamtani, Hao Fang, Sam Thomson, Richard Shin, Patrick Xia, Benjamin Van Durme, Jason Eisner, Jacob Andreas:
Language-to-Code Translation with a Single Labeled Example. 8101-8112 - Jan Buchmann, Xiao Liu, Iryna Gurevych:
Attribute or Abstain: Large Language Models as Long Document Assistants. 8113-8140 - Xiaochen Wang, Jiaqi Wang, Houping Xiao, Jinghui Chen, Fenglong Ma:
FEDKIM: Adaptive Federated Knowledge Injection into Medical Foundation Models. 8141-8154 - Hao Sun, Yong Jiang, Bo Wang, Yingyan Hou, Yan Zhang, Pengjun Xie, Fei Huang:
Retrieved In-Context Principles from Previous Mistakes. 8155-8169 - Haozhe Chen, Run Chen, Julia Hirschberg:
EmoKnob: Enhance Voice Cloning with Fine-Grained Emotion Control. 8170-8180 - Yifei Liu, Jicheng Wen, Yang Wang, Shengyu Ye, Li Lyna Zhang, Ting Cao, Cheng Li, Mao Yang:
VPTQ: Extreme Low-bit Vector Post-Training Quantization for Large Language Models. 8181-8196 - Clemente Pasti, Talu Karagöz, Franz Nowak, Anej Svete, Reda Boumasmoud, Ryan Cotterell:
An L* Algorithm for Deterministic Weighted Regular Languages. 8197-8210 - Hao Sun, Hengyi Cai, Bo Wang, Yingyan Hou, Xiaochi Wei, Shuaiqiang Wang, Yan Zhang, Dawei Yin:
Towards Verifiable Text Generation with Evolving Memory and Self-Reflection. 8211-8227 - Pritish Sahu, Karan Sikka, Ajay Divakaran:
Pelican: Correcting Hallucination in Vision-LLMs via Claim Decomposition and Program of Thought Verification. 8228-8248 - Yusuke Hirota, Jerone Theodore Alexander Andrews, Dora Zhao, Orestis Papakyriakopoulos, Apostolos Modas, Yuta Nakashima, Alice Xiang:
Resampled Datasets Are Not Enough: Mitigating Societal Bias Beyond Single Attributes. 8249-8267 - Di Cao, Yong Liao, Xiuwei Shang:
RealVul: Can We Detect Vulnerabilities in Web Applications with LLM? 8268-8282 - Brendan King, Jeffrey Flanigan
:
Unsupervised End-to-End Task-Oriented Dialogue with LLMs: The Power of the Noisy Channel. 8283-8300 - Guiming Chen, Shunian Chen, Ziche Liu, Feng Jiang, Benyou Wang:
Humans or LLMs as the Judge? A Study on Judgement Bias. 8301-8327 - Wenxuan Zhou, Ravi Agrawal, Shujian Zhang, Sathish Reddy Indurthi, Sanqiang Zhao, Kaiqiang Song, Silei Xu, Chenguang Zhu:
WPO: Enhancing RLHF with Weighted Preference Optimization. 8328-8340 - Rongwu Xu, Zi'an Zhou, Tianwei Zhang, Zehan Qi, Su Yao, Ke Xu, Wei Xu, Han Qiu:
Walking in Others' Shoes: How Perspective-Taking Guides Large Language Models in Reducing Toxicity and Bias. 8341-8368 - Priyanshu Gupta, Shashank Kirtania, Ananya Singha, Sumit Gulwani, Arjun Radhakrishna, Gustavo Soares, Sherry Shi:
MetaReflection: Learning Instructions for Language Agents using Past Reflections. 8369-8385 - Nico Daheim, Jakub Macina, Manu Kapur, Iryna Gurevych, Mrinmaya Sachan:
Stepwise Verification and Remediation of Student Reasoning Errors with Large Language Model Tutors. 8386-8411 - Yiran Wang, Masao Utiyama:
On Eliciting Syntax from Language Models via Hashing. 8412-8427 - Zetian Ouyang, Yishuai Qiu, Linlin Wang, Gerard de Melo, Ya Zhang, Yanfeng Wang, Liang He:
CliMedBench: A Large-Scale Chinese Benchmark for Evaluating Medical Large Language Models in Clinical Scenarios. 8428-8438 - Heng Yang, Ke Li:
The Best Defense is Attack: Repairing Semantics in Textual Adversarial Examples. 8439-8457 - Pretam Ray, Jivnesh Sandhan, Amrith Krishna, Pawan Goyal:
CSSL: Contrastive Self-Supervised Learning for Dependency Parsing on Relatively Free Word Ordered and Morphologically Rich Low Resource Languages. 8458-8466 - Catarina G. Belém
, Markelle Kelly, Mark Steyvers, Sameer Singh, Padhraic Smyth:
Perceptions of Linguistic Uncertainty by Language Models and Humans. 8467-8502 - Haw-Shiuan Chang, Nanyun Peng, Mohit Bansal, Anil Ramakrishna, Tagyoung Chung:
Explaining and Improving Contrastive Decoding by Extrapolating the Probabilities of a Huge and Hypothetical LM. 8503-8526 - Xiaoyu Dong, Yujie Feng, Zexin Lu, Guangyuan Shi, Xiao-Ming Wu:
Zero-shot Cross-domain Dialogue State Tracking via Context-aware Auto-prompting and Instruction-following Contrastive Decoding. 8527-8540 - Rongwu Xu, Zehan Qi, Zhijiang Guo, Cunxiang Wang, Hongru Wang, Yue Zhang, Wei Xu:
Knowledge Conflicts for LLMs: A Survey. 8541-8565 - Saadia Gabriel, Liang Lyu, James Siderius, Marzyeh Ghassemi, Jacob Andreas, Asuman E. Ozdaglar:
MisinfoEval: Generative AI in the Era of "Alternative Facts". 8566-8578 - Benjamin Irving, Annika Schoene:
MEANT: Multimodal Encoder for Antecedent Information. 8579-8600 - Chufan Shi, Haoran Yang, Deng Cai, Zhisong Zhang, Yifan Wang, Yujiu Yang, Wai Lam:
A Thorough Examination of Decoding Methods in the Era of LLMs. 8601-8629 - Revanth Gangi Reddy, Omar Attia, Yunyao Li, Heng Ji, Saloni Potdar:
AGRaME: Any-Granularity Ranking with Multi-Vector Embeddings. 8630-8641 - Revanth Gangi Reddy, JaeHyeok Doo, Yifei Xu, Md. Arafat Sultan, Deevya Swain, Avirup Sil, Heng Ji:
FIRST: Faster Improved Listwise Reranking with Single Token Decoding. 8642-8652 - Hongjin Kim, Jai-Eun Kim, Harksoo Kim:
Exploring Nested Named Entity Recognition with Large Language Models: Methods, Challenges, and Insights. 8653-8670 - Roy Xie, Junlin Wang, Ruomin Huang, Minxing Zhang, Rong Ge, Jian Pei
, Neil Gong, Bhuwan Dhingra:
ReCaLL: Membership Inference via Relative Conditional Log-Likelihoods. 8671-8689 - Karina Halevy, Anna Sotnikova, Badr AlKhamissi, Syrielle Montariol, Antoine Bosselut:
"Flex Tape Can't Fix That": Bias and Misinformation in Edited Language Models. 8690-8707 - Yujian Liu, Yang Zhang, Tommi S. Jaakkola, Shiyu Chang:
Revisiting Who's Harry Potter: Towards Targeted Unlearning from a Causal Intervention Perspective. 8708-8731 - Xiao Yu, Qingyang Wu, Yu Li, Zhou Yu:
LIONs: An Empirically Optimized Approach to Align Language Models. 8732-8753 - Haochen Zhang, Yuyang Dong, Chuan Xiao, Masafumi Oyamada:
Jellyfish: Instruction-Tuning Local Large Language Models for Data Preprocessing. 8754-8782 - Yu Zhang
, Xiusi Chen, Bowen Jin, Sheng Wang, Shuiwang Ji, Wei Wang, Jiawei Han:
A Comprehensive Survey of Scientific Large Language Models and Their Applications in Scientific Discovery. 8783-8817 - Liyan Tang, Philippe Laban, Greg Durrett:
MiniCheck: Efficient Fact-Checking of LLMs on Grounding Documents. 8818-8847 - John Wu, David Wu, Jimeng Sun:
Beyond Label Attention: Transparency in Language Models for Automated Medical Coding via Dictionary Learning. 8848-8871 - Bodun Hu, Le Xu, Jeongyoon Moon, Neeraja J. Yadwadkar, Aditya Akella:
MOSEL: Inference Serving Using Dynamic Modality Selection. 8872-8886 - Palak Jain, Livio Baldini Soares, Tom Kwiatkowski:
From RAG to Riches: Retrieval Interlaced with Sequence Generation. 8887-8904 - Hsuan Su, Hua Farn, Fan-Yun Sun, Shang-Tse Chen, Hung-yi Lee:
Task Arithmetic can Mitigate Synthetic-to-Real Gap in Automatic Speech Recognition. 8905-8915 - Jaehyung Kim, Dongyoung Kim, Yiming Yang:
Learning to Correct for QA Reasoning with Black-box LLMs. 8916-8937 - Ori Yoran, Samuel Joseph Amouyal, Chaitanya Malaviya, Ben Bogin, Ofir Press, Jonathan Berant:
AssistantBench: Can Web Agents Solve Realistic and Time-Consuming Tasks? 8938-8968 - Yapei Chang, Kalpesh Krishna, Amir Houmansadr, John Wieting, Mohit Iyyer:
PostMark: A Robust Blackbox Watermark for Large Language Models. 8969-8987 - Xiaoyu Shen, Rexhina Blloshmi, Dawei Zhu, Jiahuan Pei, Wei Zhang:
Assessing "Implicit" Retrieval Robustness of Large Language Models. 8988-9003 - Suyash Fulay, William Brannon
, Shrestha Mohanty, Cassandra Overney, Elinor Poole-Dayan, Deb Roy, Jad Kabbara:
On the Relationship between Truth and Political Bias in Language Models. 9004-9018 - Karan Taneja, Ashok K. Goel:
Can Active Label Correction Improve LLM-based Modular AI Systems? 9019-9031 - Andrea Vallebueno, Cassandra Handan-Nader, Christopher D. Manning, Daniel E. Ho:
Statistical Uncertainty in Word Embeddings: GloVe-V. 9032-9047 - Rajiv Movva, Pang Wei Koh, Emma Pierson:
Annotation alignment: Comparing LLM and human annotations of conversational safety. 9048-9062 - Nigel Fernandez, Alexander Scarlatos, Wanyong Feng, Simon Woodhead, Andrew S. Lan:
DiVERT: Distractor Generation with Variational Errors Represented as Text for Math Multiple-choice Questions. 9063-9081 - Yixin Wan, Di Wu, Haoran Wang, Kai-Wei Chang:
The Factuality Tax of Diversity-Intervened Text-to-Image Generation: Benchmark and Fact-Augmented Intervention. 9082-9100 - Yuetai Li, Zhangchen Xu, Fengqing Jiang, Luyao Niu, Dinuka Sahabandu, Bhaskar Ramasubramanian, Radha Poovendran:
CleanGen: Mitigating Backdoor Attacks for Generation Tasks in Large Language Models. 9101-9118 - Meng Cao, Lei Shu, Lei Yu, Yun Zhu, Nevan Wichers, Yinxiao Liu, Lei Meng:
Enhancing Reinforcement Learning with Dense Rewards from Language Model Critic. 9119-9138 - Layla Bouzoubaa, Elham Aghakhani, Rezvaneh Rezapour:
Words Matter: Reducing Stigma in Online Conversations about Substance Use with Large Language Models. 9139-9156 - Dingyang Chen, Qi Zhang, Yinglun Zhu:
Efficient Sequential Decision Making with Large Language Models. 9157-9170 - Zifan Jiang, Gerard Sant, Amit Moryossef, Mathias Müller, Rico Sennrich, Sarah Ebling:
SignCLIP: Connecting Text and Sign Language by Contrastive Learning. 9171-9193 - Yue Guo, Tal August, Gondy Leroy, Trevor Cohen, Lucy Lu Wang:
APPLS: Evaluating Evaluation Metrics for Plain Language Summarization. 9194-9211 - Nathaniel Weir, Ryan Thomas, Randolph D'Amore, Kellie Hill, Benjamin Van Durme, Harsh Jhamtani:
Ontologically Faithful Generation of Non-Player Character Dialogues. 9212-9242 - Luísa Shimabucoro, Sebastian Ruder, Julia Kreutzer, Marzieh Fadaee, Sara Hooker:
LLM See, LLM Do: Leveraging Active Inheritance to Target Non-Differentiable Objectives. 9243-9267 - Ekaterina Taktasheva, Maxim Bazhukov, Kirill Koncha, Alena Fenogenova, Ekaterina Artemova, Vladislav Mikhailov:
RuBLiMP: Russian Benchmark of Linguistic Minimal Pairs. 9268-9299 - Zheye Deng, Chunkit Chan, Weiqi Wang, Yuxi Sun, Wei Fan, Tianshi Zheng
, Yauwai Yim, Yangqiu Song:
Text-Tuple-Table: Towards Information Integration in Text-to-Table Generation via Global Tuple Extraction. 9300-9322 - Kate McCurdy, Paul Soulos, Paul Smolensky, Roland Fernandez, Jianfeng Gao:
Toward Compositional Behavior in Neural Models: A Survey of Current Views. 9323-9339 - Krista Opsahl-Ong, Michael J. Ryan
, Josh Purtell, David Broman, Christopher Potts, Matei Zaharia
, Omar Khattab:
Optimizing Instructions and Demonstrations for Multi-Stage Language Model Programs. 9340-9366 - Samuel Kiegeland, Ethan Wilcox, Afra Amini, David Robert Reich, Ryan Cotterell:
Reverse-Engineering the Reader. 9367-9389 - Di Wu, Jia-Chen Gu, Fan Yin, Nanyun Peng, Kai-Wei Chang:
Synchronous Faithfulness Monitoring for Trustworthy Retrieval-Augmented Generation. 9390-9406 - Kewei Cheng, Nesreen K. Ahmed, Theodore L. Willke, Yizhou Sun:
Structure Guided Prompt: Instructing Large Language Model in Multi-Step Reasoning by Exploring Graph Structure of the Text. 9407-9430 - David Schulte, Felix Hamborg, Alan Akbik:
Less is More: Parameter-Efficient Selection of Intermediate Tasks for Transfer Learning. 9431-9442 - So Lee, Mai Vu:
The effects of distance on NPI illusive effects in BERT. 9443-9457 - Nathaniel Weir, Kate Sanders, Orion Weller, Shreya Sharma, Dongwei Jiang, Zhengping Jiang, Bhavana Dalvi Mishra, Oyvind Tafjord, Peter A. Jansen, Peter Clark, Benjamin Van Durme:
Enhancing Systematic Decompositional Natural Language Inference Using Informal Logic. 9458-9482 - Christabel Acquaye, Haozhe An, Rachel Rudinger:
Susu Box or Piggy Bank: Assessing Cultural Commonsense Knowledge between Ghana and the US. 9483-9502 - Yue Fan, Lei Ding, Ching-Chen Kuo, Shan Jiang, Yang Zhao, Xinze Guan, Jie Yang, Yi Zhang, Xin Wang:
Read Anywhere Pointed: Layout-aware GUI Screen Reading with Tree-of-Lens Grounding. 9503-9522 - Samuel Pfrommer, Yatong Bai, Tanmay Gautam, Somayeh Sojoudi:
Ranking Manipulation for Conversational Search Engines. 9523-9552 - Adir Rahamim, Naomi Saphra, Sara Kangaslahti, Yonatan Belinkov:
Fast Forwarding Low-Rank Training. 9553-9562 - Riccardo Fogliato, Pratik Patil, Nil-Jana Akpinar, Mathew Monfort:
Precise Model Benchmarking with Only a Few Observations. 9563-9575 - Ian Berlot-Attwell, Kumar Krishna Agrawal, Annabelle Michael Carrell, Yash Sharma, Naomi Saphra:
Attribute Diversity Determines the Systematicity Gap in VQA. 9576-9611 - Benjamin Newman, Yoonjoo Lee, Aakanksha Naik, Pao Siangliulue, Raymond Fok, Juho Kim, Daniel S. Weld, Joseph Chee Chang, Kyle Lo:
ArxivDIGESTables: Synthesizing Scientific Literature into Tables using Language Models. 9612-9631 - Raj Sanjay Shah, Khushi Bhardwaj, Sashank Varma:
Development of Cognitive Intelligence in Pre-trained Language Models. 9632-9657 - Chong Zhang, Yi Tu, Yixi Zhao, Chenshu Yuan, Huan Chen, Yue Zhang, Mingxu Chai, Ya Guo, Huijia Zhu, Qi Zhang, Tao Gui:
Modeling Layout Reading Order as Ordering Relations for Visually-rich Document Understanding. 9658-9678 - Sam Blouir, Jimmy T. H. Smith, Antonios Anastasopoulos, Amarda Shehu:
Birdie: Advancing State Space Language Modeling with Dynamic Mixtures of Training Objectives. 9679-9705 - Pinzhen Chen, Simon Yu, Zhicheng Guo, Barry Haddow:
Is It Good Data for Multilingual Instruction Tuning or Just Bad Multilingual Evaluation for Large Language Models? 9706-9726 - Sheridan Feucht, David Atkinson, Byron C. Wallace, David Bau:
Token Erasure as a Footprint of Implicit Vocabulary Items in LLMs. 9727-9739 - Chuyi Shang, Amos You, Sanjay Subramanian, Trevor Darrell, Roei Herzig:
TraveLER: A Modular Multi-LMM Agent Framework for Video Question-Answering. 9740-9766 - Biswesh Mohapatra, Manav Nitin Kapadnis, Laurent Romary, Justine Cassell:
Evaluating the Effectiveness of Large Language Models in Establishing Conversational Grounding. 9767-9781 - Zhepeng Wang, Runxue Bao, Yawen Wu, Jackson Taylor, Cao Xiao, Feng Zheng, Weiwen Jiang, Shangqian Gao, Yanfu Zhang:
Unlocking Memorization in Large Language Models with Dynamic Soft Prompting. 9782-9796 - Reza Esfandiarpoor, Cristina Menghini, Stephen H. Bach:
If CLIP Could Talk: Understanding Vision-Language Model Representations Through Their Preferred Concept Descriptions. 9797-9819 - Bowen Zhang, Harold Soh:
Extract, Define, Canonicalize: An LLM-based Framework for Knowledge Graph Construction. 9820-9836 - Yang Liu, Huang Fang, Yunfeng Cai, Mingming Sun:
MQuinE: a Cure for "Z-paradox" in Knowledge Graph Embedding. 9837-9850 - Anej Svete, Nadav Borenstein, Mike Zhou, Isabelle Augenstein
, Ryan Cotterell:
Can Transformers Learn n-gram Language Models? 9851-9867 - Minchan Kwon, Gaeun Kim, Jongsuk Kim, Haeil Lee, Junmo Kim:
StablePrompt : Automatic Prompt Tuning using Reinforcement Learning for Large Language Model. 9868-9884 - Philippe Laban, Alexander R. Fabbri, Caiming Xiong, Chien-Sheng Wu:
Summary of a Haystack: A Challenge to Long-Context LLMs and RAG Systems. 9885-9903 - Xiaoying Wang, Lingling Mu, Jingyi Zhang, Hongfei Xu:
Multi-pass Decoding for Grammatical Error Correction. 9904-9916 - Yucheng Jiang, Yijia Shao, Dekun Ma, Sina J. Semnani, Monica S. Lam:
Into the Unknown Unknowns: Engaged Human Learning through Participation in Language Model Agent Conversations. 9917-9955 - Chenming Tang, Zhixiang Wang
, Yunfang Wu:
SCOI: Syntax-augmented Coverage-based In-context Example Selection for Machine Translation. 9956-9971 - Yuxuan Wang, Yueqian Wang, Pengfei Wu, Jianxin Liang, Dongyan Zhao, Yang Liu, Zilong Zheng:
Efficient Temporal Extrapolation of Multimodal Large Language Models with Temporal Grounding Bridge. 9972-9987 - Melanie Subbiah, Faisal Ladhak, Akankshya Mishra, Griffin Adams, Lydia B. Chilton, Kathleen R. McKeown:
STORYSUMM: Evaluating Faithfulness in Story Summarization. 9988-10005 - Haofei Yu, Zhengyang Qi, Lawrence Jang, Russ Salakhutdinov, Louis-Philippe Morency, Paul Pu Liang:
MMoE: Enhancing Multimodal Models with Mixtures of Multimodal Interaction Experts. 10006-10030 - Lu Zhang, Tiancheng Zhao, Heting Ying, Yibo Ma, Kyusong Lee:
OmAgent: A Multi-modal Agent Framework for Complex Video Understanding with Task Divide-and-Conquer. 10031-10045 - Lin Ai, Zheng Hui, Zizhou Liu, Julia Hirschberg:
Enhancing Pre-Trained Generative Language Models with Question Attended Span Extraction on Machine Reading Comprehension. 10046-10063 - Jun Rao, Xuebo Liu, Lian Lian, Shengjun Cheng, Yunjie Liao, Min Zhang:
CommonIT: Commonality-Aware Instruction Tuning for Large Language Models via Data Partitions. 10064-10083 - Yuzhe Gu, Enmao Diao:
ESC: Efficient Speech Coding with Cross-Scale Residual Vector Quantized Transformers. 10084-10096 - Jaeseong Lee, Seung-won Hwang, Wonpyo Park, Mingi Ji:
Breaking ReLU Barrier: Generalized MoEfication for Dense Pretrained Models. 10097-10107 - Yang Xu, Yu Wang, Hao An, Zhichen Liu, Yongyuan Li:
Detecting Subtle Differences between Human and Model Languages Using Spectrum of Relative Likelihood. 10108-10121 - Jiahui Li, Hanlin Zhang, Fengda Zhang, Tai-Wei Chang, Kun Kuang, Long Chen, Jun Zhou:
Optimizing Language Models with Fair and Stable Reward Composition in Reinforcement Learning. 10122-10140 - Xiaohua Feng, Chaochao Chen, Yuyuan Li, Zibin Lin:
Fine-grained Pluggable Gradient Ascent for Knowledge Unlearning in Language Models. 10141-10155 - Changchun Liu, Kai Zhang, Junzhe Jiang, Zirui Liu, Hanqing Tao, Min Gao, Enhong Chen:
ARM: An Alignment-and-Replacement Module for Chinese Spelling Check Based on LLMs. 10156-10168 - Zhongtao Jiang, Yuanzhe Zhang, Kun Luo, Xiaowei Yuan, Jun Zhao, Kang Liu:
On the In-context Generation of Language Models. 10169-10187 - Joe Stacey, Pasquale Minervini, Haim Dubossarsky, Oana-Maria Camburu, Marek Rei:
Atomic Inference for NLI with Generated Facts as Atoms. 10188-10204 - William Chen, Wangyou Zhang, Yifan Peng, Xinjian Li, Jinchuan Tian, Jiatong Shi, Xuankai Chang, Soumi Maiti, Karen Livescu, Shinji Watanabe:
Towards Robust Speech Representation Learning for Thousands of Languages. 10205-10224 - Xuan Ren, Biao Wu, Lingqiao Liu:
I Learn Better If You Speak My Language: Understanding the Superior Performance of Fine-Tuning Large Language Models with LLM-Generated Responses. 10225-10245 - Jiahuan Li, Shujian Huang, Aarron Ching, Xinyu Dai, Jiajun Chen:
PreAlign: Boosting Cross-Lingual Transfer by Early Establishment of Multilingual Alignment. 10246-10257 - Simran Khanuja, Sathyanarayanan Ramamoorthy, Yueqi Song, Graham Neubig:
An image speaks a thousand words, but can everyone listen? On image transcreation for cultural relevance. 10258-10279 - Ting-Yun Chang, Jesse Thomason, Robin Jia:
When Parts Are Greater Than Sums: Individual LLM Components Can Outperform Full Models. 10280-10299 - Jianxing Yu, Shiqi Wang, Han Yin, Zhenlong Sun, Ruobing Xie, Bo Zhang, Yanghui Rao:
Multimodal Clickbait Detection by De-confounding Biases Using Causal Representation Inference. 10300-10317 - Jinsung Yoon, Rajarishi Sinha, Sercan Ömer Arik, Tomas Pfister:
Matryoshka-Adaptor: Unsupervised and Supervised Tuning for Smaller Embedding Dimensions. 10318-10336 - Jianshang Kou, Benfeng Xu, Chiwei Zhu, Zhendong Mao:
KNN-Instruct: Automatic Instruction Construction with K Nearest Neighbor Deduction. 10337-10350 - Zhen Lin, Shubhendu Trivedi, Jimeng Sun:
Contextualized Sequence Likelihood: Enhanced Confidence Scores for Natural Language Generation. 10351-10368 - Fengyu Cai, Xinran Zhao, Tong Chen, Sihao Chen, Hongming Zhang, Iryna Gurevych, Heinz Koeppl:
MixGR: Enhancing Retriever Generalization for Scientific Domain through Complementary Granularity. 10369-10391 - Tuan Nguyen, Thanh Trung Huynh, Minh Hieu Phan, Quoc Viet Hung Nguyen, Phi Le Nguyen:
CARER - ClinicAl Reasoning-Enhanced Representation for Temporal Health Risk Prediction. 10392-10407 - Chuanqi Cheng, Quan Tu, Wei Wu, Shuo Shang, Cunli Mao, Zhengtao Yu, Rui Yan:
"In-Dialogues We Learn": Towards Personalized Dialogue Without Pre-defined Profiles through In-Dialogue Learning. 10408-10422 - Hanqi Yan, Yanzheng Xiang, Guangyi Chen, Yifei Wang, Lin Gui, Yulan He:
Encourage or Inhibit Monosemanticity? Revisit Monosemanticity from a Feature Decorrelation Perspective. 10423-10435 - Xin Liu, Farima Fatahi Bayat, Lu Wang:
Enhancing Language Model Factuality via Activation-Based Confidence Calibration and Guided Decoding. 10436-10448 - Esther Gan, Yiran Zhao, Liying Cheng, Yancan Mao, Anirudh Goyal, Kenji Kawaguchi, Min-Yen Kan, Michael Shieh:
Reasoning Robustness of LLMs to Adversarial Typographical Errors. 10449-10459 - Pengyu Wang, Dong Zhang,