UM

Browse/Search Results:  1-6 of 6 Help

Selected(0)Clear Items/Page:    Sort:
Query-Efficient Adversarial Attack With Low Perturbation Against End-to-End Speech Recognition Systems Journal article
Wang, Shen, Zhang, Zhaoyang, Zhu, Guopu, Zhang, Xinpeng, Zhou, Yicong, Huang, Jiwu. Query-Efficient Adversarial Attack With Low Perturbation Against End-to-End Speech Recognition Systems[J]. IEEE Transactions on Information Forensics and Security, 2023, 18, 351 - 364.
Authors:  Wang, Shen;  Zhang, Zhaoyang;  Zhu, Guopu;  Zhang, Xinpeng;  Zhou, Yicong; et al.
Favorite | TC[WOS]:15 TC[Scopus]:18  IF:6.3/7.3 | Submit date:2023/01/30
Adversarial Example  Automatic Speech Recognition  Black-box Attack  Monte Carlo Tree Search  
Temporal Relation Inference Network for Multi-modal Speech Emotion Recognition Journal article
Dong, Guan Nan, Pun, Chi Man, Zhang, Zheng. Temporal Relation Inference Network for Multi-modal Speech Emotion Recognition[J]. IEEE Transactions on Circuits and Systems for Video Technology, 2022, 32(9), 6472-6485.
Authors:  Dong, Guan Nan;  Pun, Chi Man;  Zhang, Zheng
Favorite | TC[WOS]:12 TC[Scopus]:18  IF:8.3/7.1 | Submit date:2022/05/17
Cognition  Correlation  Emotion Recognition  Feature Extraction  Hidden Markov Models  Multi-modal Learning  Relation Inference Network  Speech Emotion Recognition  Speech Recognition  Task Analysis  Temporal Learning  
Applying speech-to-text recognition and computer-aided translation for supporting multi-lingual communications in cross-cultural learning project Conference paper
Rustam Shadiev, Barry Lee Reynolds, Yueh-Min Huang, Narzikul Shadiev, Wei Wang, Rai Laxmisha, Wanwisa Wannapipat. Applying speech-to-text recognition and computer-aided translation for supporting multi-lingual communications in cross-cultural learning project[C], 345 E 47TH ST, NEW YORK, NY 10017 USA:IEEE, 2017, 182-183.
Authors:  Rustam Shadiev;  Barry Lee Reynolds;  Yueh-Min Huang;  Narzikul Shadiev;  Wei Wang; et al.
Favorite | TC[WOS]:0 TC[Scopus]:3 | Submit date:2018/10/30
Speech-to-text Recognition  Computer-aided Translation  Cross-cultural Learning  Accuracy Rate  
Feature mapping of multiple beamformed sources for robust overlapping speech recognition using a microphone array Journal article
Li Weifeng, Wang L., Zhou Yicong, Dines J., Magimai-Doss M., Bourlard H., Liao Q.. Feature mapping of multiple beamformed sources for robust overlapping speech recognition using a microphone array[J]. IEEE/ACM Transactions on Audio Speech and Language Processing, 2014, 22(12), 2244-2255.
Authors:  Li Weifeng;  Wang L.;  Zhou Yicong;  Dines J.;  Magimai-Doss M.; et al.
Favorite | TC[WOS]:9 TC[Scopus]:10 | Submit date:2018/12/21
Beamforming  Microphone Array  Neural Network  Speech Recognition  Speech Separation  
Feature denoising using joint sparse representation for in-car speech recognition Journal article
Li Weifeng, Zhou Yicong, Poh Norman, Zhou Fei, Liao Qingmin. Feature denoising using joint sparse representation for in-car speech recognition[J]. IEEE Signal Processing Letters, 2013, 20(7), 681-684.
Authors:  Li Weifeng;  Zhou Yicong;  Poh Norman;  Zhou Fei;  Liao Qingmin
Favorite | TC[WOS]:28 TC[Scopus]:35  IF:3.2/3.4 | Submit date:2018/12/21
Dictionary Training  In-car Speech Recognition  Log Mel-filter Bank (Mfb) Outputs  Sparse Representation  
Robust log-energy estimation and its dynamic change enhancement for in-car speech recognition Journal article
Li Weifeng, Wang Longbiao, Zhou Yicong, Bourlard Hervé, Liao Qingmin. Robust log-energy estimation and its dynamic change enhancement for in-car speech recognition[J]. IEEE Transactions on Audio, Speech and Language Processing, 2013, 21(8), 1689-1698.
Authors:  Li Weifeng;  Wang Longbiao;  Zhou Yicong;  Bourlard Hervé;  Liao Qingmin
Favorite | TC[WOS]:3 TC[Scopus]:3 | Submit date:2018/12/21
Dynamic Change Enhancement  In-car Speech Recognition  Log-energy  Mel-filterbank (Mfb)  Mel-frequency Cepstral Coefficients (Mfccs)