UM  > Faculty of Science and Technology
Residential Collegefalse
Status已發表Published
Active Perception for Visual-Language Navigation
Wang, Hanqing1; Wang, Wenguan2; Liang, Wei1; Hoi, Steven C.H.3; Shen, Jianbing4; Gool, Luc Van5
2022-12-03
Source PublicationInternational Journal of Computer Vision
ISSN0920-5691
Volume131Issue:3Pages:607-625
Abstract

Visual-language navigation (VLN) is the task of entailing an agent to carry out navigational instructions inside photo-realistic environments. One of the key challenges in VLN is how to conduct robust navigation by mitigating the uncertainty caused by ambiguous instructions and insufficient observation of the environment. Agents trained by current approaches typically suffer from this and would consequently struggle to take navigation actions at every step. In contrast, when humans face such a challenge, we can still maintain robust navigation by actively exploring the surroundings to gather more information and thus make a more confident navigation decision. This work draws inspiration from human navigation behavior and endows an agent with an active perception ability for more intelligent navigation. To achieve this, we propose an end-to-end framework for learning an exploration policy that decides (i) when and where to explore, (ii) what information is worth gathering during exploration, and (iii) how to adjust the navigation decision after the exploration. In this way, the agent is able to turn its past experiences as well as new explored knowledge to contexts for more confident navigation decision making. In addition, an external memory is used to explicitly store the visited visual environments and thus allows the agent to adopt a late action-taking strategy to avoid duplicate exploration and navigation movements. Our experimental results on two standard benchmark datasets show promising exploration strategies emerged from training, which leads to significant boost in navigation performance.

KeywordActive Perception Curriculum Reinforcement Learning Visual-language Navigation
DOI10.1007/s11263-022-01721-6
URLView the original
Indexed BySCIE
Language英語English
WOS Research AreaComputer Science
WOS SubjectComputer Science, Artificial Intelligence
WOS IDWOS:000912641700001
PublisherSPRINGER, VAN GODEWIJCKSTRAAT 30, 3311 GZ DORDRECHT, NETHERLANDS
Scopus ID2-s2.0-85143213809
Fulltext Access
Citation statistics
Document TypeJournal article
CollectionFaculty of Science and Technology
THE STATE KEY LABORATORY OF INTERNET OF THINGS FOR SMART CITY (UNIVERSITY OF MACAU)
DEPARTMENT OF COMPUTER AND INFORMATION SCIENCE
Corresponding AuthorShen, Jianbing
Affiliation1.Beijing Institute of Technology, Beijing, China
2.ReLER Lab, Australian Artificial Intelligence Institute, University of Technology Sydney, Sydney, Australia
3.Salesforce Research Asia, Singapore, Singapore
4.The State Key Laboratory of Internet of Things for Smart City, Department of Computer and Information Science, University of Macau, Macao
5.ETH Zurich, Zurich, Switzerland
Corresponding Author AffilicationUniversity of Macau
Recommended Citation
GB/T 7714
Wang, Hanqing,Wang, Wenguan,Liang, Wei,et al. Active Perception for Visual-Language Navigation[J]. International Journal of Computer Vision, 2022, 131(3), 607-625.
APA Wang, Hanqing., Wang, Wenguan., Liang, Wei., Hoi, Steven C.H.., Shen, Jianbing., & Gool, Luc Van (2022). Active Perception for Visual-Language Navigation. International Journal of Computer Vision, 131(3), 607-625.
MLA Wang, Hanqing,et al."Active Perception for Visual-Language Navigation".International Journal of Computer Vision 131.3(2022):607-625.
Files in This Item:
There are no files associated with this item.
Related Services
Recommend this item
Bookmark
Usage statistics
Export to Endnote
Google Scholar
Similar articles in Google Scholar
[Wang, Hanqing]'s Articles
[Wang, Wenguan]'s Articles
[Liang, Wei]'s Articles
Baidu academic
Similar articles in Baidu academic
[Wang, Hanqing]'s Articles
[Wang, Wenguan]'s Articles
[Liang, Wei]'s Articles
Bing Scholar
Similar articles in Bing Scholar
[Wang, Hanqing]'s Articles
[Wang, Wenguan]'s Articles
[Liang, Wei]'s Articles
Terms of Use
No data!
Social Bookmark/Share
All comments (0)
No comment.
 

Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.