Residential College | false |
Status | 已發表Published |
A Benchmark Dataset and Evaluation Methodology for Chinese Zero Pronoun Translation | |
Xu,Mingzhou1; Wang,Longyue2; Liu,Siyou3; Wong,Derek F.1; Shi,Shuming2; Tu,Zhaopeng2 | |
2023-06-10 | |
Source Publication | Language Resources and Evaluation |
ISSN | 1574-020X |
Volume | 57Pages:1263–1293 |
Abstract | The phenomenon of zero pronoun (ZP) has attracted increasing interest in the machine translation community due to its importance and difficulty. However, previous studies generally evaluate the quality of translating ZPs with BLEU score on MT testsets, which is not expressive or sensitive enough for accurate assessment. To bridge the data and evaluation gaps, we propose a benchmark testset and evaluation metric for target evaluation on Chinese ZP translation. The human-annotated testset covers five challenging genres, which reveal different characteristics of ZPs for comprehensive evaluation. We systematically revisit advanced models on ZP translation and identify current challenges for future exploration. We release data, code, and trained models, which we hope can significantly promote research in this field. |
Keyword | Benchmark Dataset Discourse Evaluation Metric Machine Translation Zero Pronoun |
DOI | 10.1007/s10579-023-09660-5 |
URL | View the original |
Indexed By | SCIE |
Language | 英語English |
WOS Research Area | Computer Science |
WOS Subject | Computer Science, Interdisciplinary Applications |
WOS ID | WOS:001004065600002 |
Publisher | SPRINGER, VAN GODEWIJCKSTRAAT 30, 3311 GZ DORDRECHT, NETHERLANDS |
Scopus ID | 2-s2.0-85161604473 |
Fulltext Access | |
Citation statistics | |
Document Type | Journal article |
Collection | DEPARTMENT OF COMPUTER AND INFORMATION SCIENCE |
Corresponding Author | Wang,Longyue; Wong,Derek F. |
Affiliation | 1.Department of Computer and Information Science,University of Macau,Macao 2.AI Lab,Tencent,Shenzhen,China 3.School of Languages and Translation,Macao Polytechnic Institute,Macao |
First Author Affilication | University of Macau |
Corresponding Author Affilication | University of Macau |
Recommended Citation GB/T 7714 | Xu,Mingzhou,Wang,Longyue,Liu,Siyou,et al. A Benchmark Dataset and Evaluation Methodology for Chinese Zero Pronoun Translation[J]. Language Resources and Evaluation, 2023, 57, 1263–1293. |
APA | Xu,Mingzhou., Wang,Longyue., Liu,Siyou., Wong,Derek F.., Shi,Shuming., & Tu,Zhaopeng (2023). A Benchmark Dataset and Evaluation Methodology for Chinese Zero Pronoun Translation. Language Resources and Evaluation, 57, 1263–1293. |
MLA | Xu,Mingzhou,et al."A Benchmark Dataset and Evaluation Methodology for Chinese Zero Pronoun Translation".Language Resources and Evaluation 57(2023):1263–1293. |
Files in This Item: | There are no files associated with this item. |
Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.
Edit Comment