UM

Browse/Search Results:  1-10 of 203 Help

Selected(0)Clear Items/Page:    Sort:
Instance Tracking in 3D Scenes from Egocentric Videos Conference paper
Zhao, Yunhan, Haoyu Ma, Shu Kong, Charless Fowlkes. Instance Tracking in 3D Scenes from Egocentric Videos[C]:IEEE Computer Society, 2024, 21933-21944.
Authors:  Zhao, Yunhan;  Haoyu Ma;  Shu Kong;  Charless Fowlkes
Adobe PDF | Favorite | TC[Scopus]:0 | Submit date:2024/06/20
Three-dimensional Displays  Protocols  Annotations  Benchmark Testing  Cameras  Sensors  Pattern Recognition  Egocentric Videos  Instance TrackIng In 3d  
DeIL: Direct-and-Inverse CLIP for Open-World Few-Shot Learning Conference paper
Shao, Shuai, Bai, Yu, Wang, Yan, Liu, Baodi, Zhou, Yicong. DeIL: Direct-and-Inverse CLIP for Open-World Few-Shot Learning[C]:IEEE Computer Society, 2024, 28505-28514.
Authors:  Shao, Shuai;  Bai, Yu;  Wang, Yan;  Liu, Baodi;  Zhou, Yicong
Favorite | TC[Scopus]:1 | Submit date:2024/11/05
Computer Vision  Filtering  Noise Reduction  Transforms  Pattern Recognition  Few Shot Learning  Clip  Open-world Few-shot Learning  
Boosting Image Restoration via Priors from Pre-Trained Models Conference paper
Xu, Xiaogang, Kong, Shu, Hu, Tao, Liu, Zhe, Bao, Hujun. Boosting Image Restoration via Priors from Pre-Trained Models[C]:IEEE Computer Society, 2024, 2900-2909.
Authors:  Xu, Xiaogang;  Kong, Shu;  Hu, Tao;  Liu, Zhe;  Bao, Hujun
Favorite | TC[Scopus]:2 | Submit date:2024/11/05
Computer Vision  Shape  Computational Modeling  Noise Reduction  Training Data  Boosting  Data Models  Pre-trained Models  Image Restoration  Spatial-varying Enhancement  Channel-spatial Attention  
ParameterNet: Parameters are All You Need for Large-Scale Visual Pretraining of Mobile Networks Conference paper
Han, Kai, Wang, Yunhe, Guo, Jianyuan, Wu, Enhua. ParameterNet: Parameters are All You Need for Large-Scale Visual Pretraining of Mobile Networks[C]:IEEE Computer Society, 2024, 15751-15761.
Authors:  Han, Kai;  Wang, Yunhe;  Guo, Jianyuan;  Wu, Enhua
Favorite | TC[Scopus]:6 | Submit date:2024/11/05
Convolutional Codes  Visualization  Computer Vision  Accuracy  Transformers  Pattern Recognition  
SmartEdit: Exploring Complex Instruction-Based Image Editing with Multimodal Large Language Models Conference paper
Huang, Yuzhou, Xie, Liangbin, Wang, Xintao, Yuan, Ziyang, Cun, Xiaodong, Ge, Yixiao, Zhou, Jiantao, Dong, Chao, Huang, Rui, Zhang, Ruimao, Shan, Ying. SmartEdit: Exploring Complex Instruction-Based Image Editing with Multimodal Large Language Models[C]:IEEE Computer Society, 2024, 8362-8371.
Authors:  Huang, Yuzhou;  Xie, Liangbin;  Wang, Xintao;  Yuan, Ziyang;  Cun, Xiaodong; et al.
Favorite | TC[Scopus]:0 | Submit date:2024/11/05
Training  Visualization  Computer Vision  Large Language Models  Diffusion Models  Cognition  Pattern Recognition  Instruction-based Image Editing  Multimodal Large Language Models  
IS-Fusion: Instance-Scene Collaborative Fusion for Multimodal 3D Object Detection Conference paper
Yin, Junbo, Shen, Jianbing, Chen, Runnan, Li, Wei, Yang, Ruigang, Frossard, Pascal, Wang, Wenguan. IS-Fusion: Instance-Scene Collaborative Fusion for Multimodal 3D Object Detection[C]:IEEE Computer Society, 2024, 14905-14915.
Authors:  Yin, Junbo;  Shen, Jianbing;  Chen, Runnan;  Li, Wei;  Yang, Ruigang; et al.
Favorite | TC[Scopus]:3 | Submit date:2024/11/05
Point Cloud Compression  Computer Vision  Three-dimensional Displays  Collaboration  Object Detection  Benchmark Testing  Transformers  Multimodal  Object Detection  Point Cloud  Autonomous Driving  
Hierarchical Local Temporal Feature Enhancing for Transformer-Based 3D Human Pose Estimation Conference paper
X. Yan, PUN CHI MAN, H. Li, M. Liu, H. Gao. Hierarchical Local Temporal Feature Enhancing for Transformer-Based 3D Human Pose Estimation[C]:IEEE Computer Society, 2024, 203042.
Authors:  X. Yan;  PUN CHI MAN;  H. Li, M. Liu, H. Gao
Favorite | TC[Scopus]:0 | Submit date:2024/08/26
Deep Unfolding  Hyperspectral Snapshot Compressive Imaging  Non-local Mechanism  Transformer  
Depth-aware Test-Time Training for Zero-shot Video Object Segmentation Conference paper
W. Liu, X. Shen, H. Li, X. Bi, B. Liu, C.-M. Pun, X. Cun. Depth-aware Test-Time Training for Zero-shot Video Object Segmentation[C], 2024.
Authors:  W. Liu, X. Shen, H. Li, X. Bi, B. Liu;  C.-M. Pun;  X. Cun
Favorite |  | Submit date:2024/08/26
Gas Sensors Based on 1 & 2-Dof Piezoelectric Baw MEMS Resonators with Coated ZIF-8 Conference paper
Wang, Linlin, Wang, Chen, Tietze, Max, Verstreken, Margot, Madeira, Bernardo P., Wang, Yuan, Chanut, Nicolas, Wang, Chenxi, Quan, Aojie, Ameloot, Rob, Kraft, Michael. Gas Sensors Based on 1 & 2-Dof Piezoelectric Baw MEMS Resonators with Coated ZIF-8[C]:IEEE345 E 47TH ST, NEW YORK, NY 10017 USA, 2024, 871-874.
Authors:  Wang, Linlin;  Wang, Chen;  Tietze, Max;  Verstreken, Margot;  Madeira, Bernardo P.; et al.
Favorite | TC[WOS]:0 TC[Scopus]:0 | Submit date:2024/05/16
1 And 2-dof Bulk Acoustic Wave (Baw) Resonators  Ethanol Vapor Detection  Gas Sensor  Zeolitic Imidazolate Framework-8 (Zif-8)  
The Neglected Tails in Vision-Language Models Conference paper
Parashar, Shubham, Lin, Zhiqiu, Liu, Tian, Dong, Xiangjue, Li, Yanan, Ramanan, Deva, Caverlee, James, Kong, Shu. The Neglected Tails in Vision-Language Models[C]:IEEE Computer Society, 2024, 12988-12997.
Authors:  Parashar, Shubham;  Lin, Zhiqiu;  Liu, Tian;  Dong, Xiangjue;  Li, Yanan; et al.
Favorite | TC[Scopus]:1 | Submit date:2024/11/05
Long Tailed Recognition  Vision-language Models  Zero-shot Recognition