Residential College | false |
Status | 已發表Published |
Neuron interaction based representation composition for neural machine translation | |
Li, Jian1,2; Wang, Xing3; Yang, Baosong4; Shi, Shuming3; Lyu, Michael R.1,2; Tu, Zhaopeng3 | |
2020 | |
Conference Name | 34th AAAI Conference on Artificial Intelligence, AAAI 2020 |
Source Publication | AAAI 2020 - 34th AAAI Conference on Artificial Intelligence |
Pages | 8204-8211 |
Conference Date | 7 February 2020 - 12 February 2020 |
Conference Place | New York |
Abstract | Recent NLP studies reveal that substantial linguistic information can be attributed to single neurons, i.e., individual dimensions of the representation vectors. We hypothesize that modeling strong interactions among neurons helps to better capture complex information by composing the linguistic properties embedded in individual neurons. Starting from this intuition, we propose a novel approach to compose representations learned by different components in neural machine translation (e.g., multi-layer networks or multi-head attention), based on modeling strong interactions among neurons in the representation vectors. Specifically, we leverage bilinear pooling to model pairwise multiplicative interactions among individual neurons, and a low-rank approximation to make the model computationally feasible. We further propose extended bilinear pooling to incorporate first-order representations. Experiments on WMT14 English-German and English-French translation tasks show that our model consistently improves performances over the SOTA TRANSFORMER baseline. Further analyses demonstrate that our approach indeed captures more syntactic and semantic information as expected. |
URL | View the original |
Indexed By | CPCI-S |
Language | 英語English |
WOS Research Area | Computer Science ; Education & Educational Research |
WOS Subject | Computer Science, Artificial Intelligence ; Computer Science, Interdisciplinary Applications ; Education, Scientific Disciplines |
WOS ID | WOS:000668126800071 |
Scopus ID | 2-s2.0-85106435054 |
Fulltext Access | |
Citation statistics | |
Document Type | Conference paper |
Collection | University of Macau |
Corresponding Author | Tu, Zhaopeng |
Affiliation | 1.Department of Computer Science and Engineering, The Chinese University of Hong Kong, Hong Kong 2.Shenzhen Research Institute, The Chinese University of Hong Kong, Hong Kong 3.Tencent AI Lab., 4.University of Macau, Macao |
Recommended Citation GB/T 7714 | Li, Jian,Wang, Xing,Yang, Baosong,et al. Neuron interaction based representation composition for neural machine translation[C], 2020, 8204-8211. |
APA | Li, Jian., Wang, Xing., Yang, Baosong., Shi, Shuming., Lyu, Michael R.., & Tu, Zhaopeng (2020). Neuron interaction based representation composition for neural machine translation. AAAI 2020 - 34th AAAI Conference on Artificial Intelligence, 8204-8211. |
Files in This Item: | There are no files associated with this item. |
Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.
Edit Comment