TY - GEN
T1 - A 50μW 4-channel 83dBA-SNDR Speech Recognition Front-End with Adaptive Beamforming and Feature Extraction
AU - Kang, Taewook
AU - Lee, Seungjong
AU - Haghigat, Mohammad
AU - Abramson, Darren
AU - Flynn, Michael
N1 - Publisher Copyright:
© 2021 IEEE.
PY - 2021/4
Y1 - 2021/4
N2 - Beamforming is essential for accurate Automatic Speech Recognition (ASR) in noisy environments. Commercial products, such as Google Home, Amazon Alexa, and Apple Airpods, beamform with multiple microphones. Delay-and-sum beamforming is amenable to low-power IC implementation (e.g. [1]) but can only suppress noise from a fixed direction making it ineffective for real situations with multiple constantly-changing noise sources. Adaptive beamforming (ABF) solves this limitation by automatically and adaptively suppressing noise from multiple, varying sources (Fig. 1). However, the high DSP power and large die area needed for conventional ABF DSP prevent practical implementation. Another challenge is that high angle accuracy is crucial for ABF to avoid severe distortion of the desired signal [2]. We combine the robust generalized sidelobe canceller (RGSC) algorithm [2] with bitstream processing for accurate and low-power ABF. We further reduce power and area through hardware sharing and optimized DSP clock rate. Our system combines multi-channel digitization, beamforming, automatic noise suppression, and feature extraction for a robust sub-mW single-chip speech-processing frontend.
AB - Beamforming is essential for accurate Automatic Speech Recognition (ASR) in noisy environments. Commercial products, such as Google Home, Amazon Alexa, and Apple Airpods, beamform with multiple microphones. Delay-and-sum beamforming is amenable to low-power IC implementation (e.g. [1]) but can only suppress noise from a fixed direction making it ineffective for real situations with multiple constantly-changing noise sources. Adaptive beamforming (ABF) solves this limitation by automatically and adaptively suppressing noise from multiple, varying sources (Fig. 1). However, the high DSP power and large die area needed for conventional ABF DSP prevent practical implementation. Another challenge is that high angle accuracy is crucial for ABF to avoid severe distortion of the desired signal [2]. We combine the robust generalized sidelobe canceller (RGSC) algorithm [2] with bitstream processing for accurate and low-power ABF. We further reduce power and area through hardware sharing and optimized DSP clock rate. Our system combines multi-channel digitization, beamforming, automatic noise suppression, and feature extraction for a robust sub-mW single-chip speech-processing frontend.
UR - https://www.scopus.com/pages/publications/85107191430
U2 - 10.1109/CICC51472.2021.9431579
DO - 10.1109/CICC51472.2021.9431579
M3 - Conference contribution
AN - SCOPUS:85107191430
T3 - Proceedings of the Custom Integrated Circuits Conference
BT - 2021 IEEE Custom Integrated Circuits Conference, CICC 2021 - Proceedings
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2021 IEEE Custom Integrated Circuits Conference, CICC 2021
Y2 - 25 April 2021 through 30 April 2021
ER -