Residential College | false |
Status | 已發表Published |
DINGO: Towards Diverse and Fine-Grained Instruction-Following Evaluation | |
Gu, Zihui1,2; Sun, Xingwu2,3; Lian, Fengzong2; Kang, Zhanhui2; Xu, Cheng Zhong3; Fan, Ju1 | |
2024-03-25 | |
Conference Name | 38th AAAI Conference on Artificial Intelligence, AAAI 2024 |
Source Publication | Proceedings of the AAAI Conference on Artificial Intelligence |
Volume | 38 |
Issue | 16 |
Pages | 18108-18116 |
Conference Date | 20 February 2024through 27 February 2024 |
Conference Place | Vancouver |
Abstract | Instruction-following is particularly crucial for large language models (LLMs) to support diverse user requests. While existing work has made progress in aligning LLMs with human preferences, evaluating their capabilities on instruction-following remains a challenge due to complexity and diversity of real-world user instructions. While existing evaluation methods focus on general skills, they suffer from two main shortcomings, i.e., lack of fine-grained task-level evaluation and reliance on singular instruction expression. To address these problems, this paper introduces DINGO, a fine-grained and diverse instruction-following evaluation dataset that has two main advantages: (1) DINGO is based on a manual-annotated, fine-grained and multi-level category tree with 130 nodes derived from real-world user requests; (2) DINGO includes diverse instructions, generated by both GPT-4 and human experts. Through extensive experiments, we demonstrate that DINGO can not only provide more challenging and comprehensive evaluation for LLMs, but also provide task-level fine-grained directions to further improve LLMs. |
DOI | 10.1609/aaai.v38i16.29768 |
URL | View the original |
Language | 英語English |
Scopus ID | 2-s2.0-85189621532 |
Fulltext Access | |
Citation statistics | |
Document Type | Conference paper |
Collection | Faculty of Science and Technology DEPARTMENT OF COMPUTER AND INFORMATION SCIENCE |
Affiliation | 1.Renmin Univerisity of China, China 2.Tencent Inc, China 3.University of Macau, Macao |
Recommended Citation GB/T 7714 | Gu, Zihui,Sun, Xingwu,Lian, Fengzong,et al. DINGO: Towards Diverse and Fine-Grained Instruction-Following Evaluation[C], 2024, 18108-18116. |
APA | Gu, Zihui., Sun, Xingwu., Lian, Fengzong., Kang, Zhanhui., Xu, Cheng Zhong., & Fan, Ju (2024). DINGO: Towards Diverse and Fine-Grained Instruction-Following Evaluation. Proceedings of the AAAI Conference on Artificial Intelligence, 38(16), 18108-18116. |
Files in This Item: | There are no files associated with this item. |
Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.
Edit Comment