Residential College | false |
Status | 已發表Published |
Unsupervised Word Sense Disambiguation and Rules Extraction using non-aligned bilingual corpus | |
Oliveera F.1; Wong F.1; Li Y.1; Zheng J.2 | |
2005-12-01 | |
Conference Name | International Conference on Natural Language Processing and Knowledge Engineering |
Source Publication | Proceedings of 2005 IEEE International Conference on Natural Language Processing and Knowledge Engineering, IEEE NLP-KE'05 |
Volume | 2005 |
Pages | 30-35 |
Conference Date | OCT 30-NOV 01, 2005 |
Conference Place | Wuhan, PEOPLES R CHINA |
Abstract | This paper presents a statistical Word Sense Disambiguation with application in Portuguese-Chinese Machine Translation systems. Due to the limited availability of Portuguese-Chinese resources in the form of digital corpora and annotated Treebank, an unsupervised learning and a non-aligned bilingual corpus are applied. The proposed method first identifies words related to each of the ambiguous words based on their surrounding words and relative distance. A mathematical model is then applied in the identification of the most suitable sense of an ambiguous word in terms of the related words. All the senses discovered are converted into a set of rules and stored in the Sense Knowledge base for later use in disambiguation and translation process. Preliminary experiment results show an improvement of 6% in assigning correctly the corresponding translation over the baseline method. © 2005 IEEE. |
Keyword | Machine Translation Natural Language Processing Word Sense Disambiguation |
DOI | 10.1109/NLPKE.2005.1598702 |
URL | View the original |
Indexed By | SCIE |
Language | 英語English |
WOS Research Area | Computer Science |
WOS Subject | Computer Science, Artificial Intelligence ; Computer Science, Information Systems |
WOS ID | WOS:000235577200007 |
Scopus ID | 2-s2.0-33847245900 |
Fulltext Access | |
Citation statistics | |
Document Type | Conference paper |
Collection | DEPARTMENT OF COMPUTER AND INFORMATION SCIENCE |
Affiliation | 1.Universidade de Macau 2.Tsinghua University |
First Author Affilication | University of Macau |
Recommended Citation GB/T 7714 | Oliveera F.,Wong F.,Li Y.,et al. Unsupervised Word Sense Disambiguation and Rules Extraction using non-aligned bilingual corpus[C], 2005, 30-35. |
APA | Oliveera F.., Wong F.., Li Y.., & Zheng J. (2005). Unsupervised Word Sense Disambiguation and Rules Extraction using non-aligned bilingual corpus. Proceedings of 2005 IEEE International Conference on Natural Language Processing and Knowledge Engineering, IEEE NLP-KE'05, 2005, 30-35. |
Files in This Item: | There are no files associated with this item. |
Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.
Edit Comment