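In recent years, natural-language processing (NLP) and computational linguistics, which were revolutionized by deep learning a decade ago, have been revolutionized again by large language models (LLMs). Unsurprisingly, work involving LLMs, whether as objects of study in their own right or as tools for other NLP applications, dominates this year's meeting of the North American Chapter of the Association for Computational Linguistics (NAACL). This guide sorts the organization's NAACL papers into those that explicitly involve LLMs and those that do not, although in many cases the latter present general techniques or datasets that could be used with LLMs as well as with more traditional models.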
LLM-related work
Agents
FLAP: Flow-adhering planning with constrained decoding in LLMs
Shamik Roy, Sailik Sengupta, Daniele Bonadiman, Saab Mansour, Arshit Gupta
Attribute value extraction
EIVEN: Efficient implicit attribute value extraction using multimodal LLM
Henry Peng Zou, Gavin Yu, Ziwei Fan, Dan Bu, Han Liu, Peng Dai, Dongmei Jia, Cornelia Caragea
Continual learning
Q-Tuning: Queue-based prompt tuning for lifelong few-shot language learning
Yanhui Guo, Shaoyuan Xu, Jinmiao Fu, Jia (Kevin) Liu, Chaosheng Dong, Bryan Wang
Dialogue
Leveraging LLMs for dialogue quality measurement
Jinghan Jia, Abi Komma, Timothy Leffel, Xujun Peng, Ajay Nagesh, Tamer Soliman, Aram Galstyan, Anoop Kumar
Hallucination mitigation
Less is more for improving automatic evaluation of factual consistency
Tong Wang, Ninad Kulkarni, Yanjun (Jane) Qi
TofuEval: Evaluating hallucinations of LLMs on topic-focused dialogue summarization
Liyan Tang, Igor Shalyminov, Amy Wong, Jon Burnsky, Jake Vincent, Yu’an Yang, Siffi Singh, Song Feng, Hwanjun Song, Hang Su, Justin Sun, Yi Zhang, Saab Mansour, Kathleen McKeown
Towards improved multi-source attribution for long-form answer generation
Nilay Patel, Shivashankar Subramanian, Siddhant Garg, Pratyay Banerjee, Amita Misra
Machine translation
A preference-driven paradigm for enhanced translation with large language models
Dawei Zhu, Sony Trenous, Xiaoyu Shen, Dietrich Klakow, Bill Byrne, Eva Hasler
Natural-language processing
Toward informal language processing: Knowledge of slang in large language models
Zhewei Sun, Qian Hu, Rahul Gupta, Richard Zemel, Yang Xu
Question answering
Bring your own KG: Self-supervised program synthesis for zero-shot KGQA
Dhruv Agarwal, Rajarshi (Raj) Das, Sopan Khosla, Rashmi Gangadharaiah
Reasoning
CoMM: Collaborative multi-agent, multi-reasoning-path prompting for complex problem solving
Pei Chen, Boran Han, Shuai Zhang
Recommender systems
RecMind: Large language model powered agent for recommendation
Yancheng Wang, Ziyan Jiang, Zheng Chen, Fan Yang, Yingxue Zhou, Eunah Cho, Xing Fan, Xiaojiang Huang, Yanbin Lu, Yingzhen Yang
Reinforcement learning from human feedback
RS-DPO: A hybrid rejection sampling and direct preference optimization method for alignment of large language models
Saeed Khaki, JinJin Li, Lan Ma, Liu Yang, Prathap Ramachandra
Responsible AI
ITERALIGN: Iterative constitutional alignment of large language models
Xiusi Chen, Hongzhi Wen, Sreyashi Nag, Chen Luo, Qingyu Yin, Ruirui Li, Zheng Li, Wei Wang
MICo: Preventative detoxification of large language models through inhibition control
Roy Siegelmann, Ninareh Mehrabi, Palash Goyal, Prasoon Goyal, Lisa Bauer, Jwala Dhamala, Aram Galstyan, Rahul Gupta, Reza Ghanadan
The steerability of large language models toward
General and classical techniques
Conversational agents
Leveraging interesting facts to enhance user engagement with conversational interfaces
Nikhita Vedula, Giuseppe Castellucci, Eugene Agichtein, Oleg Rokhlenko, Shervin Malmasi
Information extraction
Leveraging customer feedback for multi-modal insight extraction
Sandeep Sricharan Mukku, Abinesh Kanagarajan, Pushpendu Ghosh, Chetan Aggarwal
REXEL: An end-to-end model for document-level relation extraction and entity linking
Nacime Bouziani, Shubhi Tyagi, Joseph Fisher, Jens Lehmann, Andrea Pierleoni
Machine learning
DEED: Dynamic early exit on decoder for accelerating encoder-decoder transformer models
Peng Tang, Pengkai Zhu, Tian Li, Srikar Appalaraju, Vijay Mahadevan, R. Manmatha
Machine translation
How lexical is bilingual lexicon induction?
Harsh Kohli, Helian Feng, Nicholas Dronen, Calvin McCarter, Sina Moeini, Ali Kebarighotbi
M3T: A new benchmark dataset for multi-modal document-level machine translation
Benjamin Hsu, Xiaoyu Liu, Huayang Li, Yoshinari Fujinuma, Maria Nădejde, Xing Niu, Yair Kittenplon, Ron Litman, Raghavendra Pappagari
Responsible AI
Mitigating bias for question answering models by tracking bias influence
Mingyu Derek Ma, Jiun-Yu Kao, Arpit Gupta, Yu-Hsiang Lin, Wenbo Zhao, Tagyoung Chung, Wei Wang, Kai-Wei Chang, Nanyun Peng
Semantic retrieval
Extremely efficient online query encoding for dense retrieval
Nachshon Cohen, Yaron Fairstein, Guy Kushilevitz
Text summarization
CCSUM: A large-scale and high-quality dataset for abstractive news summarization
Xiang Jiang, Markus Dreyer
Semi-supervised dialogue abstractive summarization via high-quality pseudolabel selection
Jianfeng He, Hang Su, Jason Cai, Igor Shalyminov, Hwanjun Song, Saab Mansour
Visual question answering
Multiple-question multiple-answer text-VQA
Peng Tang, Srikar Appalaraju, R. Manmatha, Yusheng Xie, Vijay Mahadevan