Residential College | false |
Status | 已發表Published |
Benchmarking and Improving Long-Text Translation with Large Language Models | |
Longyue Wang1; Zefeng Du2; Wenxiang Jiao1; Chenyang Lyu1; Jianhui Pang2; Leyang Cui1; Kaiqiang Song1; Derek F. Wong2; Shuming Shi1; Zhaopeng Tu2 | |
2024 | |
Conference Name | The 62nd Annual Meeting of the Association for Computational Linguistics (ACL 2024) |
Source Publication | Findings of the 62nd Annual Meeting of the Association for Computational Linguistics (ACL 2024) |
Pages | 7175-7187 |
Conference Date | 11-16 August 2024 |
Conference Place | Bangkok |
Country | Thailand |
Publisher | Association for Computing Machinery |
Abstract | Recent studies have illuminated the promising capabilities of large language models (LLMs) in handling long texts. However, their performance in machine translation (MT) of long documents remains underexplored. This paper aims to shed light on how LLMs navigate this complex task, offering a comprehensive evaluation of their capabilities and limitations in long-text MT. First, we collect and construct an instruction-based benchmark dataset, specifically designed for the finetuning and evaluation of LLMs, encompassing multilingual, multi-domain, and document-level parallel data. Second, we conduct a comprehensive comparison between MT and LLM models concerning document-level translation. Our analysis uncovers that LLMs exhibit shortcomings in long-text domains, and their performance diminishes as document size escalates. By exploiting various extrapolation strategies, we enhance the capacity of LLMs to translate longer texts. We release data, code, and models at https://github.com/ longyuewangdcu/Document-MT-LLM. |
DOI | 10.18653/v1/2024.findings-acl.428 |
URL | View the original |
Language | 英語English |
Scopus ID | 2-s2.0-85198433516 |
Fulltext Access | |
Citation statistics | |
Document Type | Conference paper |
Collection | Faculty of Science and Technology DEPARTMENT OF COMPUTER AND INFORMATION SCIENCE |
Corresponding Author | Longyue Wang |
Affiliation | 1.Tencent AI Lab 2.University of Macau |
Recommended Citation GB/T 7714 | Longyue Wang,Zefeng Du,Wenxiang Jiao,et al. Benchmarking and Improving Long-Text Translation with Large Language Models[C]:Association for Computing Machinery, 2024, 7175-7187. |
APA | Longyue Wang., Zefeng Du., Wenxiang Jiao., Chenyang Lyu., Jianhui Pang., Leyang Cui., Kaiqiang Song., Derek F. Wong., Shuming Shi., & Zhaopeng Tu (2024). Benchmarking and Improving Long-Text Translation with Large Language Models. Findings of the 62nd Annual Meeting of the Association for Computational Linguistics (ACL 2024), 7175-7187. |
Files in This Item: | There are no files associated with this item. |
Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.
Edit Comment