UM  > Faculty of Science and Technology
Residential Collegefalse
Status已發表Published
Revisiting Commonsense Reasoning in Machine Translation: Training, Evaluation and Challenge
Xuebo Liu2; Yutong Wang2; Derek F. Wong1; Runzhe Zhan1; Liangxuan Yu1; Min Zhang2
2023-07
Conference NameThe 61st Annual Meeting of the Association for Computational Linguistics
Source PublicationProceedings of the 61st Annual Meeting of the Association for Computational Linguistics
Volume1: Long Papers
Pages15536-15550
Conference DateJULY 09-14, 2023
Conference PlaceToronto
CountryCandada
Author of SourceAnna Rogers ; Jordan Boyd-Graber ; Naoaki Okazaki
PublisherAssociation for Computational Linguistics (ACL)
Abstract

The ability of commonsense reasoning (CR) decides whether a neural machine translation (NMT) model can move beyond pattern recognition. Despite the rapid advancement of NMT and the use of pretraining to enhance NMT models, research on CR in NMT is still in its infancy, leaving much to be explored in terms of effectively training NMT models with high CR abilities and devising accurate automatic evaluation metrics. This paper presents a comprehensive study aimed at expanding the understanding of CR in NMT.For the training, we confirm the effectiveness of incorporating pretrained knowledge into NMT models and subsequently utilizing these models as robust testbeds for investigating CR in NMT. For the evaluation, we propose a novel entity-aware evaluation method that takes into account both the NMT candidate and important entities in the candidate, which is more aligned with human judgement. Based on the strong testbed and evaluation methods, we identify challenges in training NMT models with high CR abilities and suggest directions for further unlabeled data utilization and model design. We hope that our methods and findings will contribute to advancing the research of CR in NMT. Source data, code and scripts are freely available at https://github.com/YutongWang1216/CR-NMT.

DOI10.18653/v1/2023.acl-long.866
URLView the original
Indexed ByCPCI-S
WOS Research AreaComputer Science
WOS SubjectComputer Science, Artificial Intelligence ; Computer Science, Information Systems ; Computer Science, Theory & Methods
WOS IDWOS:001190962507019
Scopus ID2-s2.0-85174407342
Fulltext Access
Citation statistics
Document TypeConference paper
CollectionFaculty of Science and Technology
DEPARTMENT OF COMPUTER AND INFORMATION SCIENCE
Corresponding AuthorXuebo Liu; Derek F. Wong
Affiliation1.Department of Computer and Information Science, University of Macau
2.Institute of Computing and Intelligence, Harbin Institute of Technology, Shenzhen
Corresponding Author AffilicationUniversity of Macau
Recommended Citation
GB/T 7714
Xuebo Liu,Yutong Wang,Derek F. Wong,et al. Revisiting Commonsense Reasoning in Machine Translation: Training, Evaluation and Challenge[C]. Anna Rogers, Jordan Boyd-Graber, Naoaki Okazaki:Association for Computational Linguistics (ACL), 2023, 15536-15550.
APA Xuebo Liu., Yutong Wang., Derek F. Wong., Runzhe Zhan., Liangxuan Yu., & Min Zhang (2023). Revisiting Commonsense Reasoning in Machine Translation: Training, Evaluation and Challenge. Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics, 1: Long Papers, 15536-15550.
Files in This Item:
There are no files associated with this item.
Related Services
Recommend this item
Bookmark
Usage statistics
Export to Endnote
Google Scholar
Similar articles in Google Scholar
[Xuebo Liu]'s Articles
[Yutong Wang]'s Articles
[Derek F. Wong]'s Articles
Baidu academic
Similar articles in Baidu academic
[Xuebo Liu]'s Articles
[Yutong Wang]'s Articles
[Derek F. Wong]'s Articles
Bing Scholar
Similar articles in Bing Scholar
[Xuebo Liu]'s Articles
[Yutong Wang]'s Articles
[Derek F. Wong]'s Articles
Terms of Use
No data!
Social Bookmark/Share
All comments (0)
No comment.
 

Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.