Residential College | false |
Status | 已發表Published |
Canonical Correlation Analysis Regularization: An Effective Deep Multi-View Learning Baseline for RGB-D Object Recognition | |
Lulu Tang1; Zhi-Xin Yang1; Kui Jia2 | |
2019-03 | |
Source Publication | IEEE Transactions on Cognitive and Developmental Systems |
ISSN | 2379-8920 |
Volume | 11Issue:1Pages:107 - 118 |
Abstract | Object recognition methods based on multimodal data, color plus depth (RGB-D), usually treat each modality separately in feature extraction, which neglects implicit relations between two views and preserves noise from any view to the final representation. To address these limitations, we propose a novel canonical correlation analysis (CCA)-based multiview convolutional neural network (CNNs) framework for RGB-D object representation. The RGB and depth streams process corresponding images, respectively, then are connected by CCA module leading to a common-correlated feature space. In addition, to embed CCA into deep CNNs in a supervised manner, two different schemes are explored. One considers CCA as a regularization (CCAR) term adding to the loss function. However, solving CCA optimization directly is neither computationally efficient nor compatible with the mini-batch-based stochastic optimization. Thus, we further propose an approximation method of CCAR, using the obtained CCA projection matrices to replace the weights of feature concatenation layer at regular intervals. Such a scheme enjoys benefits of full CCAR and is efficient by amortizing its cost over many training iterations. Experiments on benchmark RGB-D object recognition datasets have shown that the proposed methods outperform most existing methods using the very same of their network architectures. |
DOI | 10.1109/TCDS.2018.2866587 |
Indexed By | SCIE |
Language | 英語English |
WOS Research Area | Computer Science ; Robotics ; Neurosciences & Neurology |
WOS Subject | Computer Science, Artificial Intelligence ; Robotics ; Neurosciences |
WOS ID | WOS:000461254800010 |
Scopus ID | 2-s2.0-85052819248 |
Fulltext Access | |
Citation statistics | |
Document Type | Journal article |
Collection | DEPARTMENT OF ELECTROMECHANICAL ENGINEERING Faculty of Science and Technology |
Corresponding Author | Zhi-Xin Yang; Kui Jia |
Affiliation | 1.Department of Electromechanical Engineering, Faculty of Science and Technology, University of Macau, Macau, China 2.School of Electronic and Information Engineering, South China University of Technology, Guangzhou, China |
First Author Affilication | Faculty of Science and Technology |
Corresponding Author Affilication | Faculty of Science and Technology |
Recommended Citation GB/T 7714 | Lulu Tang,Zhi-Xin Yang,Kui Jia. Canonical Correlation Analysis Regularization: An Effective Deep Multi-View Learning Baseline for RGB-D Object Recognition[J]. IEEE Transactions on Cognitive and Developmental Systems, 2019, 11(1), 107 - 118. |
APA | Lulu Tang., Zhi-Xin Yang., & Kui Jia (2019). Canonical Correlation Analysis Regularization: An Effective Deep Multi-View Learning Baseline for RGB-D Object Recognition. IEEE Transactions on Cognitive and Developmental Systems, 11(1), 107 - 118. |
MLA | Lulu Tang,et al."Canonical Correlation Analysis Regularization: An Effective Deep Multi-View Learning Baseline for RGB-D Object Recognition".IEEE Transactions on Cognitive and Developmental Systems 11.1(2019):107 - 118. |
Files in This Item: | There are no files associated with this item. |
Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.
Edit Comment