UM  > Faculty of Science and Technology
Residential Collegefalse
Status已發表Published
Residual-based Language Models are Free Boosters for Biomedical Imaging Tasks
Lai, Zhixin1; Wu, Jing2; Chen, Suiyao3; Zhou, Yucheng4; Hovakimyan, Naira2
2024-09
Conference Name2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW)
Source PublicationIEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops
Pages5086-5096
Conference Date17-18 June 2024
Conference PlaceSeattle, WA, USA
CountryUSA
PublisherIEEE Computer Society
Abstract

In this study, we uncover the unexpected efficacy of residual-based large language models (LLMs) as part of encoders for biomedical imaging tasks, a domain traditionally devoid of language or textual data. The approach diverges from established methodologies by utilizing a frozen transformer block, extracted from pre-trained LLMs, as an innovative encoder layer for the direct processing of visual tokens. This strategy represents a significant departure from the standard multi-modal vision-language frameworks, which typically hinge on language-driven prompts and inputs. We found that these LLMs could boost performance across a spectrum of biomedical imaging applications, including both 2D and 3D visual classification tasks, serving as plug-and-play boosters. More interestingly, as a byproduct, we found that the proposed framework achieved superior performance, setting new state-of-the-art results on extensive, standardized datasets in MedMNIST-2D and 3D. Through this work, we aim to open new avenues for employing LLMs in biomedical imaging and enriching the understanding of their potential in this specialized domain. The code is available at https://github.com/ZhixinLai/LLMBoostMedical

KeywordVisualization Three-dimensional Displays Large Language Models Conferences Fasteners Transformers Pattern Recognition Llm Biomedical Imaging
DOI10.1109/CVPRW63382.2024.00515
URLView the original
Language英語English
Scopus ID2-s2.0-85202595241
Fulltext Access
Citation statistics
Document TypeConference paper
CollectionFaculty of Science and Technology
Affiliation1.Cornel University, United States
2.University of Illinois, Urbana-Champaign, United States
3.University of South Florida, United States
4.University of Macau, Macao
Recommended Citation
GB/T 7714
Lai, Zhixin,Wu, Jing,Chen, Suiyao,et al. Residual-based Language Models are Free Boosters for Biomedical Imaging Tasks[C]:IEEE Computer Society, 2024, 5086-5096.
APA Lai, Zhixin., Wu, Jing., Chen, Suiyao., Zhou, Yucheng., & Hovakimyan, Naira (2024). Residual-based Language Models are Free Boosters for Biomedical Imaging Tasks. IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops, 5086-5096.
Files in This Item:
There are no files associated with this item.
Related Services
Recommend this item
Bookmark
Usage statistics
Export to Endnote
Google Scholar
Similar articles in Google Scholar
[Lai, Zhixin]'s Articles
[Wu, Jing]'s Articles
[Chen, Suiyao]'s Articles
Baidu academic
Similar articles in Baidu academic
[Lai, Zhixin]'s Articles
[Wu, Jing]'s Articles
[Chen, Suiyao]'s Articles
Bing Scholar
Similar articles in Bing Scholar
[Lai, Zhixin]'s Articles
[Wu, Jing]'s Articles
[Chen, Suiyao]'s Articles
Terms of Use
No data!
Social Bookmark/Share
All comments (0)
No comment.
 

Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.