UM

Browse/Search Results:  1-10 of 26 Help

Selected(0)Clear Items/Page:    Sort:
Multi-Modal Inductive Framework for Text-Video Retrieval Conference paper
Li, Qian, Zhou, Yucheng, Ji, Cheng, Lu, Feihong, Gong, Jianian, Wang, Shangguang, Li, Jianxin. Multi-Modal Inductive Framework for Text-Video Retrieval[C], New York, NY, USA:Association for Computing Machinery, Inc, 2024, 2389-2398.
Authors:  Li, Qian;  Zhou, Yucheng;  Ji, Cheng;  Lu, Feihong;  Gong, Jianian; et al.
Favorite | TC[Scopus]:0 | Submit date:2024/12/05
Fine-tuning Llm  Multi-modal Inductive  Text-video Retrieval  
SparseInteraction: Sparse Semantic Guidance for Radar and Camera 3D Object Detection Conference paper
Xu, Shaoqing, Jiang, Shengyin, Li, Fang, Liu, Li, Song, Ziying, Yang, Bo, Yang, Zhi Xin. SparseInteraction: Sparse Semantic Guidance for Radar and Camera 3D Object Detection[C]:Association for Computing Machinery, Inc, 2024, 9224-9233.
Authors:  Xu, Shaoqing;  Jiang, Shengyin;  Li, Fang;  Liu, Li;  Song, Ziying; et al.
Favorite | TC[Scopus]:2 | Submit date:2024/12/05
3d Object Detection  Autonomous Driving  Multi-modal  
Driving Fatigue Detection Based on Hybrid Electroencephalography and Eye Tracking Journal article
Lian, Zequan, Xu, Tao, Yuan, Zhen, Li, Junhua, Thakor, Nitish, Wang, Hongtao. Driving Fatigue Detection Based on Hybrid Electroencephalography and Eye Tracking[J]. IEEE Journal of Biomedical and Health Informatics, 2024, 28(11), 6568-6580.
Authors:  Lian, Zequan;  Xu, Tao;  Yuan, Zhen;  Li, Junhua;  Thakor, Nitish; et al.
Favorite | TC[WOS]:3 TC[Scopus]:4  IF:6.7/7.1 | Submit date:2024/09/03
Cross-modal Alignment  Electroencephalograph  Eye Tracking  Fatigue Detection  Multi-modality  
Reason-and-Execute Prompting: Enhancing Multi-Modal Large Language Models for Solving Geometry Questions Conference paper
Duan, Xiuliang, Tan, Dating, Fang, Liangda, Zhou, Yuyu, He, Chaobo, Chen, Ziliang, Wu, Lusheng, Chen, Guanliang, Gong, Zhiguo, Luo, Weiqi, Guan, Quanlong. Reason-and-Execute Prompting: Enhancing Multi-Modal Large Language Models for Solving Geometry Questions[C], New York, NY, USA:Association for Computing Machinery, Inc, 2024, 6959-6968.
Authors:  Duan, Xiuliang;  Tan, Dating;  Fang, Liangda;  Zhou, Yuyu;  He, Chaobo; et al.
Favorite | TC[Scopus]:1 | Submit date:2024/12/05
Geometry Questions  Multi-modal Large Language Models  Prompting Method  
Collaborative group: Composed image retrieval via consensus learning from noisy annotations Journal article
Zhang, Xu, Zheng, Zhedong, Zhu, Linchao, Yang, Yi. Collaborative group: Composed image retrieval via consensus learning from noisy annotations[J]. Knowledge-Based Systems, 2024, 300, 112135.
Authors:  Zhang, Xu;  Zheng, Zhedong;  Zhu, Linchao;  Yang, Yi
Favorite | TC[WOS]:0 TC[Scopus]:0  IF:7.2/7.4 | Submit date:2024/07/04
Compositional Image Retrieval  Data Ambiguity  Image Retrieval With Text Feedback  Multi-modal Retrieval  Noisy Annotation  
Voluntary forward-looking disclosures and default risk pricing Journal article
Chen, C., Wei, M.H., Zhang, H., Yan, J.J.. Voluntary forward-looking disclosures and default risk pricing[J]. Accounting and Business Research, 2024.
Authors:  Chen, C.;  Wei, M.H.;  Zhang, H.;  Yan, J.J.
Favorite | TC[WOS]:0 TC[Scopus]:0  IF:2.0/2.8 | Submit date:2023/12/17
Management Forecast Reports  Multi-modal Information  Default Risk Pricing  Credit Default Swap  
Multimodal Fusion with Optimized Embedding Strength for Consumptive Medical Image Protection Journal article
Wang, Xingrun, Tian, Jinyu, Li, Jianqing, Wang, Binze, Tang, Yuanyan. Multimodal Fusion with Optimized Embedding Strength for Consumptive Medical Image Protection[J]. IEEE Transactions on Consumer Electronics, 2024, 70(3), 6000-6010.
Authors:  Wang, Xingrun;  Tian, Jinyu;  Li, Jianqing;  Wang, Binze;  Tang, Yuanyan
Favorite | TC[WOS]:0 TC[Scopus]:1  IF:4.3/3.9 | Submit date:2024/12/26
Derivable Function  Fusion Strength  Multi-modal Fusion  Multiple Watermarks  Optimization Method  
COMMA: Co-articulated Multi-Modal Learning Conference paper
Hu, Lianyu, Gao, Liqing, Liu, Zekang, Pun, Chi Man, Feng, Wei. COMMA: Co-articulated Multi-Modal Learning[C]:Association for the Advancement of Artificial Intelligence, 2024, 2238-2246.
Authors:  Hu, Lianyu;  Gao, Liqing;  Liu, Zekang;  Pun, Chi Man;  Feng, Wei
Favorite | TC[Scopus]:0 | Submit date:2024/05/16
Cv: Language And Vision  Cv: Large Vision Models  Cv: Multi-modal Vision  Cv: Video Understanding & Activity Analysis  
T2I-Adapter: Learning Adapters to Dig Out More Controllable Ability for Text-to-Image Diffusion Models Conference paper
Mou, Chong, Wang, Xintao, Xie, Liangbin, Wu, Yanze, Zhang, Jian, Qi, Zhongang, Shan, Ying. T2I-Adapter: Learning Adapters to Dig Out More Controllable Ability for Text-to-Image Diffusion Models[C]:Association for the Advancement of Artificial Intelligence, 2024, 4296-4304.
Authors:  Mou, Chong;  Wang, Xintao;  Xie, Liangbin;  Wu, Yanze;  Zhang, Jian; et al.
Favorite | TC[WOS]:30 TC[Scopus]:52 | Submit date:2024/05/16
Cv: Computational Photography  Image & Video Synthesis  Cv: Multi-modal Vision  
Quaternion Cross-Modality Spatial Learning for Multi-Modal Medical Image Segmentation Journal article
Chen, Junyang, Huang, Guoheng, Yuan, Xiaochen, Zhong, Guo, Zheng, Zewen, Pun, Chi Man, Zhu, Jian, Huang, Zhixin. Quaternion Cross-Modality Spatial Learning for Multi-Modal Medical Image Segmentation[J]. IEEE Journal of Biomedical and Health Informatics, 2024, 28(3), 1412-1423.
Authors:  Chen, Junyang;  Huang, Guoheng;  Yuan, Xiaochen;  Zhong, Guo;  Zheng, Zewen; et al.
Favorite | TC[WOS]:2 TC[Scopus]:2  IF:6.7/7.1 | Submit date:2024/04/02
Quaternions  Convolution  Three-dimensional Displays  Biomedical Imaging  Image Segmentation  Feature Extraction  Lesions  Multi-modal Medical Image  Quaternion  Spatial Dependency  Cross-modality