UM  > Faculty of Science and Technology
Residential Collegefalse
Status已發表Published
DINGO: Towards Diverse and Fine-Grained Instruction-Following Evaluation
Gu, Zihui1,2; Sun, Xingwu2,3; Lian, Fengzong2; Kang, Zhanhui2; Xu, Cheng Zhong3; Fan, Ju1
2024-03-25
Conference Name38th AAAI Conference on Artificial Intelligence, AAAI 2024
Source PublicationProceedings of the AAAI Conference on Artificial Intelligence
Volume38
Issue16
Pages18108-18116
Conference Date20 February 2024through 27 February 2024
Conference PlaceVancouver
Abstract

Instruction-following is particularly crucial for large language models (LLMs) to support diverse user requests. While existing work has made progress in aligning LLMs with human preferences, evaluating their capabilities on instruction-following remains a challenge due to complexity and diversity of real-world user instructions. While existing evaluation methods focus on general skills, they suffer from two main shortcomings, i.e., lack of fine-grained task-level evaluation and reliance on singular instruction expression. To address these problems, this paper introduces DINGO, a fine-grained and diverse instruction-following evaluation dataset that has two main advantages: (1) DINGO is based on a manual-annotated, fine-grained and multi-level category tree with 130 nodes derived from real-world user requests; (2) DINGO includes diverse instructions, generated by both GPT-4 and human experts. Through extensive experiments, we demonstrate that DINGO can not only provide more challenging and comprehensive evaluation for LLMs, but also provide task-level fine-grained directions to further improve LLMs.

DOI10.1609/aaai.v38i16.29768
URLView the original
Language英語English
Scopus ID2-s2.0-85189621532
Fulltext Access
Citation statistics
Document TypeConference paper
CollectionFaculty of Science and Technology
DEPARTMENT OF COMPUTER AND INFORMATION SCIENCE
Affiliation1.Renmin Univerisity of China, China
2.Tencent Inc, China
3.University of Macau, Macao
Recommended Citation
GB/T 7714
Gu, Zihui,Sun, Xingwu,Lian, Fengzong,et al. DINGO: Towards Diverse and Fine-Grained Instruction-Following Evaluation[C], 2024, 18108-18116.
APA Gu, Zihui., Sun, Xingwu., Lian, Fengzong., Kang, Zhanhui., Xu, Cheng Zhong., & Fan, Ju (2024). DINGO: Towards Diverse and Fine-Grained Instruction-Following Evaluation. Proceedings of the AAAI Conference on Artificial Intelligence, 38(16), 18108-18116.
Files in This Item:
There are no files associated with this item.
Related Services
Recommend this item
Bookmark
Usage statistics
Export to Endnote
Google Scholar
Similar articles in Google Scholar
[Gu, Zihui]'s Articles
[Sun, Xingwu]'s Articles
[Lian, Fengzong]'s Articles
Baidu academic
Similar articles in Baidu academic
[Gu, Zihui]'s Articles
[Sun, Xingwu]'s Articles
[Lian, Fengzong]'s Articles
Bing Scholar
Similar articles in Bing Scholar
[Gu, Zihui]'s Articles
[Sun, Xingwu]'s Articles
[Lian, Fengzong]'s Articles
Terms of Use
No data!
Social Bookmark/Share
All comments (0)
No comment.
 

Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.