UM  > INSTITUTE OF MICROELECTRONICS
Residential Collegefalse
Status已發表Published
Cross-Subject Classification of Spoken Mandarin Vowels and Tones with EEG Signals: A Study of End-to-End CNN with Fine-Tuning
Wang, Xinyu1; Li, Mingtao1,2; Li, Hao1; Pun, Sio Hang2; Chen, Fei1
2023
Conference Name2023 Asia Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2023
Source Publication2023 Asia Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2023
Pages535-539
Conference Date2023/10/31-2023/11/03
Conference PlaceTaipei
Abstract

Direct speech brain-computer interface (DS-BCI) is an ideal way for speech communication by decoding signals collected from the brain. Electroencephalogram (EEG) has gained widespread use in DS-BCI studies due to its simplicity of operation and high temporal resolution. However, as human brain exhibits considerable inter-individual variability, classification models trained on the basis of data from one subject may not generalise well to other individuals, which is a major challenge in existing EEG signal classification studies. In this paper, the cross-subject classification performance of spoken Mandarin speech with EEG signals was investigated by using an end-to-end convolutional neural network (CNN) model pre-trained on the source data and fine-tuned on the target data. The raw EEG signals were directly used as the input to the model without using extracted features. In addition, adding Gaussian noise was used as the data augmentation method in order to deal with the unbalanced dataset. The proposed method was tested on a collected EEG dataset of spoken Mandarin speech, including vowel classification and tone classification tasks. The average classification accuracies of four vowels and four tones were 63.1% and 51.7% respectively. The average accuracy of tone classification was significantly improved compared with the machine learning and subject-dependent methods. The results of this work showed the potential of the fine-tuning based CNN model in the cross-subject studies of EEG decoding.

DOI10.1109/APSIPAASC58517.2023.10317100
URLView the original
Indexed ByCPCI-S
Language英語English
WOS Research AreaComputer Science
WOS SubjectComputer Science, Artificial Intelligence ; Computer Science, Information Systems ; Computer Science, Theory & Methods
WOS IDWOS:001108741800085
Scopus ID2-s2.0-85180013657
Fulltext Access
Citation statistics
Document TypeConference paper
CollectionINSTITUTE OF MICROELECTRONICS
Affiliation1.Southern University of Science and Technology, Shenzhen, China
2.University of Macau, Macao
Recommended Citation
GB/T 7714
Wang, Xinyu,Li, Mingtao,Li, Hao,et al. Cross-Subject Classification of Spoken Mandarin Vowels and Tones with EEG Signals: A Study of End-to-End CNN with Fine-Tuning[C], 2023, 535-539.
APA Wang, Xinyu., Li, Mingtao., Li, Hao., Pun, Sio Hang., & Chen, Fei (2023). Cross-Subject Classification of Spoken Mandarin Vowels and Tones with EEG Signals: A Study of End-to-End CNN with Fine-Tuning. 2023 Asia Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2023, 535-539.
Files in This Item:
There are no files associated with this item.
Related Services
Recommend this item
Bookmark
Usage statistics
Export to Endnote
Google Scholar
Similar articles in Google Scholar
[Wang, Xinyu]'s Articles
[Li, Mingtao]'s Articles
[Li, Hao]'s Articles
Baidu academic
Similar articles in Baidu academic
[Wang, Xinyu]'s Articles
[Li, Mingtao]'s Articles
[Li, Hao]'s Articles
Bing Scholar
Similar articles in Bing Scholar
[Wang, Xinyu]'s Articles
[Li, Mingtao]'s Articles
[Li, Hao]'s Articles
Terms of Use
No data!
Social Bookmark/Share
All comments (0)
No comment.
 

Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.