Residential College | false |
Status | 已發表Published |
Deformable Template Network (DTN) for Object Detection | |
Wu, Shuai1; Xu, Yong1; Zhang, Bob2; Yang, Jian3; Zhang, David4 | |
2021-04-26 | |
Source Publication | IEEE Transactions on Multimedia |
ISSN | 1520-9210 |
Volume | 24Pages:2058-2068 |
Abstract | Objects often have different appearances because of viewpoint changes or part deformation. How to reasonably model these variations is still a big challenge for object detection. In this paper, we propose a novel Deformable Template Network (DTN), which exploits the pictorial structure to model possible variations of an object. DTN represents an object by virtue of a generated template in a deformable way. It has two key modules: The template generating module and the part matching module. The template generating module produces a template for a given object which defines the anchor positions of the $k{\times }k$ parts. Based on such a template, the part matching module aims to perform part alignment around the anchor positions. In terms of each part, the matching process makes a trade-off between maximizing the detection score and minimizing the deformation cost relative to the anchor position. Moreover, DTN is a fully convolutional network which means it is competitive in terms of detection efficiency. We evaluate DTN on both the PASCAL VOC and MSCOCO datasets, achieving the state-of-The-Art results, an accuracy of 82.7% for PASCAL VOC and of 44.9% for MSCOCO. |
Keyword | Deformable Template Deformation Cost Object Detection Part Matching |
DOI | 10.1109/TMM.2021.3075323 |
URL | View the original |
Indexed By | SCIE |
Language | 英語English |
WOS Research Area | Computer Science ; Telecommunications |
WOS Subject | Computer Science, Information Systems ; Computer Science, Software Engineering ; Telecommunications |
WOS ID | WOS:000778959200022 |
Publisher | Institute of Electrical and Electronics Engineers Inc. |
Scopus ID | 2-s2.0-85105042832 |
Fulltext Access | |
Citation statistics | |
Document Type | Journal article |
Collection | DEPARTMENT OF COMPUTER AND INFORMATION SCIENCE Faculty of Science and Technology |
Corresponding Author | Xu, Yong |
Affiliation | 1.Bio-Computing Research Center, Harbin Institute of Technology, Shen zhen, 518055, China 2.Pami Research Group, University of Macau, Taipa, 999078, Macao 3.College of Computer Science and Technology, Nanjing University of Science and Technology, Nanjing, 210094, China 4.School of Science and Engineering, Chinese University of Hong Kong at Shen Zhen, Shenzhen, 518172, China |
Recommended Citation GB/T 7714 | Wu, Shuai,Xu, Yong,Zhang, Bob,et al. Deformable Template Network (DTN) for Object Detection[J]. IEEE Transactions on Multimedia, 2021, 24, 2058-2068. |
APA | Wu, Shuai., Xu, Yong., Zhang, Bob., Yang, Jian., & Zhang, David (2021). Deformable Template Network (DTN) for Object Detection. IEEE Transactions on Multimedia, 24, 2058-2068. |
MLA | Wu, Shuai,et al."Deformable Template Network (DTN) for Object Detection".IEEE Transactions on Multimedia 24(2021):2058-2068. |
Files in This Item: | There are no files associated with this item. |
Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.
Edit Comment