TY - GEN
T1 - Application of Adversarial Domain Adaptation to Voice Activity Detection
AU - Kim, Tae Soo
AU - Ko, Jong Hwan
N1 - Publisher Copyright:
© 2022, The Author(s), under exclusive license to Springer Nature Switzerland AG.
PY - 2022
Y1 - 2022
N2 - Voice Activity Detection (VAD) is becoming an essential front-end component in various speech processing systems. As those systems are commonly deployed in environments with diverse noise types and low signal-to-noise ratios (SNRs), an effective VAD method should perform robust detection of speech region out of noisy background signals. In this paper, we propose applying an adversarial domain adaptation technique to VAD. The proposed method trains DNN models for a VAD task in a supervised manner, simultaneously mitigating the problem of area mismatch between noisy and clean audio stream in a unsupervised manner. The experimental results show that the proposed method improves robust detection performance in noisy environments compared to other DNN-based model learned with hand-crafted acoustic feature.
AB - Voice Activity Detection (VAD) is becoming an essential front-end component in various speech processing systems. As those systems are commonly deployed in environments with diverse noise types and low signal-to-noise ratios (SNRs), an effective VAD method should perform robust detection of speech region out of noisy background signals. In this paper, we propose applying an adversarial domain adaptation technique to VAD. The proposed method trains DNN models for a VAD task in a supervised manner, simultaneously mitigating the problem of area mismatch between noisy and clean audio stream in a unsupervised manner. The experimental results show that the proposed method improves robust detection performance in noisy environments compared to other DNN-based model learned with hand-crafted acoustic feature.
KW - Domain adversarial adaptation
KW - Generative adversarial network
KW - VAD
KW - Voice activity detection
UR - https://www.scopus.com/pages/publications/85113756600
U2 - 10.1007/978-3-030-82199-9_55
DO - 10.1007/978-3-030-82199-9_55
M3 - Conference contribution
AN - SCOPUS:85113756600
SN - 9783030821982
T3 - Lecture Notes in Networks and Systems
SP - 823
EP - 829
BT - Intelligent Systems and Applications - Proceedings of the 2021 Intelligent Systems Conference, IntelliSys
A2 - Arai, Kohei
PB - Springer Science and Business Media Deutschland GmbH
T2 - Intelligent Systems Conference, IntelliSys 2021
Y2 - 2 September 2021 through 3 September 2021
ER -