Residential College | false |
Status | 已發表Published |
A 108-nW 0.8-mm 2 Analog Voice Activity Detector Featuring a Time-Domain CNN With Sparsity-Aware Computation and Sparsified Quantization in 28-nm CMOS | |
Chen, Feifei; Un, Ka Fai; Yu, Wei Han; Mak, Pui In; Martins, Rui P. | |
2022-07 | |
Source Publication | IEEE JOURNAL OF SOLID-STATE CIRCUITS |
ISSN | 0018-9200 |
Volume | 57Issue:11Pages:3288 - 3297 |
Abstract | This article reports a passive analog feature extractor for realizing an area-and-power-efficient voice activity detector (VAD) for voice-control edge devices. It features a switched-capacitor circuit as the time-domain convolutional neural network (TD-CNN) that extracts the 1-bit features for the subsequent binarized neural network (BNN) classifier. TD-CNN also allows area savings and low latency by evaluating the features temporally. The applied sparsity-aware computation (SAC) and sparsified quantization (SQ) aid in enlarging the output swing and reducing the model size without sacrificing the classification accuracy. With these techniques, the diversified output also aids in desensitizing the 1-bit quantizer from the offset and noise. The TD-CNN and BNN are trained as a single network to improve the VAD reconfigurability. Benchmarking with the prior art, our VAD in 28-nm CMOS scores a 90% (94%) speech (non-speech) hit rate on the TIMIT dataset with small power (108 nW) and area (0.8 mm 2) . We can configure the TD-CNN as a feature extractor for keyword spotting (KWS). It achieves a 93.5% KWS accuracy with the Google speech command dataset (two keywords). With two TD-CNNs operating simultaneously to extract more features, the KWS accuracy is 94.3%. |
Keyword | Approximate Computing Convolutional Neural Network (Cnn) Feature Extraction Keyword Spotting (Kws) Quantization Reconfigurable Sparsity Switched-capacitor Circuits Voice Activity Detection (Vad) |
DOI | 10.1109/JSSC.2022.3191008 |
URL | View the original |
Indexed By | SCIE |
Language | 英語English |
WOS Research Area | Engineering |
WOS Subject | Engineering, Electrical & Electronic |
WOS ID | WOS:000829074600001 |
Scopus ID | 2-s2.0-85135222660 |
Fulltext Access | |
Citation statistics | |
Document Type | Journal article |
Collection | THE STATE KEY LABORATORY OF ANALOG AND MIXED-SIGNAL VLSI (UNIVERSITY OF MACAU) Faculty of Science and Technology INSTITUTE OF MICROELECTRONICS DEPARTMENT OF ELECTRICAL AND COMPUTER ENGINEERING |
Corresponding Author | Un, Ka Fai |
Affiliation | University of Macau |
First Author Affilication | University of Macau |
Corresponding Author Affilication | University of Macau |
Recommended Citation GB/T 7714 | Chen, Feifei,Un, Ka Fai,Yu, Wei Han,et al. A 108-nW 0.8-mm 2 Analog Voice Activity Detector Featuring a Time-Domain CNN With Sparsity-Aware Computation and Sparsified Quantization in 28-nm CMOS[J]. IEEE JOURNAL OF SOLID-STATE CIRCUITS, 2022, 57(11), 3288 - 3297. |
APA | Chen, Feifei., Un, Ka Fai., Yu, Wei Han., Mak, Pui In., & Martins, Rui P. (2022). A 108-nW 0.8-mm 2 Analog Voice Activity Detector Featuring a Time-Domain CNN With Sparsity-Aware Computation and Sparsified Quantization in 28-nm CMOS. IEEE JOURNAL OF SOLID-STATE CIRCUITS, 57(11), 3288 - 3297. |
MLA | Chen, Feifei,et al."A 108-nW 0.8-mm 2 Analog Voice Activity Detector Featuring a Time-Domain CNN With Sparsity-Aware Computation and Sparsified Quantization in 28-nm CMOS".IEEE JOURNAL OF SOLID-STATE CIRCUITS 57.11(2022):3288 - 3297. |
Files in This Item: | Download All | |||||
File Name/Size | Publications | Version | Access | License | ||
FINAL VERSION.PDF(1724KB) | 期刊论文 | 作者接受稿 | 开放获取 | CC BY-NC-SA | View Download |
Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.
Edit Comment