UM  > Faculty of Science and Technology
Residential Collegefalse
Status已發表Published
Explicit Visual Prompting for Low-Level Structure Segmentations
Weihuang Liu1; Xi Shen2; Chi-Man Pun1; Xiaodong Cun2
2023-06
Conference Name2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Source PublicationProceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition
Volume2023-June
Pages19434-19445
Conference DateJUN 17-24, 2023
Conference PlaceVancouver, BC, Canada
CountryCanada
Publication PlaceUSA
PublisherIEEE Computer Society
Abstract

We consider the generic problem of detecting low-level structures in images, which includes segmenting the manipulated parts, identifying out-of-focus pixels, separating shadow regions, and detecting concealed objects. Whereas each such topic has been typically addressed with a domainspecific solution, we show that a unified approach performs well across all of them. We take inspiration from the widelyused pre-training and then prompt tuning protocols in NLP and propose a new visual prompting model, named Explicit Visual Prompting (EVP). Different from the previous visual prompting which is typically a dataset-level implicit embedding, our key insight is to enforce the tunable parameters focusing on the explicit visual content from each individual image, i.e., the features from frozen patch embeddings and the input’s high-frequency components. The proposed EVP significantly outperforms other parameter-efficient tuning protocols under the same amount of tunable parameters (5.7% extra trainable parameters of each task). EVP also achieves state-of-the-art performances on diverse lowlevel structure segmentation tasks compared to task-specific solutions. Our code is available at: https://github.com/NiFangBaAGe/Explicit-Visual-Prompt.

DOI10.1109/CVPR52729.2023.01862
URLView the original
Indexed ByCPCI-S
Language英語English
WOS Research AreaComputer Science
WOS SubjectComputer Science, Artificial Intelligence
WOS IDWOS:001062531303071
Scopus ID2-s2.0-85165567435
Fulltext Access
Citation statistics
Document TypeConference paper
CollectionFaculty of Science and Technology
DEPARTMENT OF COMPUTER AND INFORMATION SCIENCE
Corresponding AuthorChi-Man Pun; Xiaodong Cun
Affiliation1.University of Macau
2.Tencent AI Lab
First Author AffilicationUniversity of Macau
Corresponding Author AffilicationUniversity of Macau
Recommended Citation
GB/T 7714
Weihuang Liu,Xi Shen,Chi-Man Pun,et al. Explicit Visual Prompting for Low-Level Structure Segmentations[C], USA:IEEE Computer Society, 2023, 19434-19445.
APA Weihuang Liu., Xi Shen., Chi-Man Pun., & Xiaodong Cun (2023). Explicit Visual Prompting for Low-Level Structure Segmentations. Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2023-June, 19434-19445.
Files in This Item:
There are no files associated with this item.
Related Services
Recommend this item
Bookmark
Usage statistics
Export to Endnote
Google Scholar
Similar articles in Google Scholar
[Weihuang Liu]'s Articles
[Xi Shen]'s Articles
[Chi-Man Pun]'s Articles
Baidu academic
Similar articles in Baidu academic
[Weihuang Liu]'s Articles
[Xi Shen]'s Articles
[Chi-Man Pun]'s Articles
Bing Scholar
Similar articles in Bing Scholar
[Weihuang Liu]'s Articles
[Xi Shen]'s Articles
[Chi-Man Pun]'s Articles
Terms of Use
No data!
Social Bookmark/Share
All comments (0)
No comment.
 

Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.