DINGO: Towards Diverse and Fine-Grained Instruction-Following Evaluation

doi:10.1609/aaai.v38i16.29768

UM > Faculty of Science and Technology

Residential College	false
Status	已發表Published
	DINGO: Towards Diverse and Fine-Grained Instruction-Following Evaluation
	Gu, Zihui 1,2; Sun, Xingwu 2,3; Lian, Fengzong 2; Kang, Zhanhui 2; Xu, Cheng Zhong3 ; Fan, Ju 1
	2024-03-25
Conference Name	38th AAAI Conference on Artificial Intelligence, AAAI 2024
Source Publication	Proceedings of the AAAI Conference on Artificial Intelligence
Volume	38
Issue	16
Pages	18108-18116
Conference Date	20 February 2024through 27 February 2024
Conference Place	Vancouver
Abstract	Instruction-following is particularly crucial for large language models (LLMs) to support diverse user requests. While existing work has made progress in aligning LLMs with human preferences, evaluating their capabilities on instruction-following remains a challenge due to complexity and diversity of real-world user instructions. While existing evaluation methods focus on general skills, they suffer from two main shortcomings, i.e., lack of fine-grained task-level evaluation and reliance on singular instruction expression. To address these problems, this paper introduces DINGO, a fine-grained and diverse instruction-following evaluation dataset that has two main advantages: (1) DINGO is based on a manual-annotated, fine-grained and multi-level category tree with 130 nodes derived from real-world user requests; (2) DINGO includes diverse instructions, generated by both GPT-4 and human experts. Through extensive experiments, we demonstrate that DINGO can not only provide more challenging and comprehensive evaluation for LLMs, but also provide task-level fine-grained directions to further improve LLMs.
DOI	10.1609/aaai.v38i16.29768
URL	View the original
Language	英語English
Scopus ID	2-s2.0-85189621532
Fulltext Access	View Full-Text via DOI View Full-Text via Scopus
Citation statistics
Document Type	Conference paper
Collection	Faculty of Science and Technology DEPARTMENT OF COMPUTER AND INFORMATION SCIENCE
Affiliation	1.Renmin Univerisity of China, China 2.Tencent Inc, China 3.University of Macau, Macao
Recommended Citation GB/T 7714	Gu, Zihui,Sun, Xingwu,Lian, Fengzong,et al. DINGO: Towards Diverse and Fine-Grained Instruction-Following Evaluation[C], 2024, 18108-18116.
APA	Gu, Zihui., Sun, Xingwu., Lian, Fengzong., Kang, Zhanhui., Xu, Cheng Zhong., & Fan, Ju (2024). DINGO: Towards Diverse and Fine-Grained Instruction-Following Evaluation. Proceedings of the AAAI Conference on Artificial Intelligence, 38(16), 18108-18116.

Files in This Item:
There are no files associated with this item.

If you have any objections to this item, please fill out the form below and the administrator will contact you as soon as possible.
Content:
Email：	*
Affiliation No.
Verification Code:	Refresh

Any comments and suggestions are welcomed.
Title:	*
Content:
Email：	*
Verification Code:	Refresh