TY - GEN
T1 - Dempster-Shafer Fusion Based Gender Recognition for Speech Analysis Applications
AU - Ahmad, Jamil
AU - Muhammad, Khan
AU - Kwon, Soon Il
AU - Baik, Sung Wook
AU - Rho, Seungmin
N1 - Publisher Copyright:
© 2016 IEEE.
PY - 2016/4/19
Y1 - 2016/4/19
N2 - Speech signals carry valuable information about the speaker including age, gender, and emotional state. Gender information can act as a vital preprocessing ingredient for enhancing speech analysis applications like adaptive human-machine interfaces, multi-modal security applications, and sophisticated intent and context analysis based forensic systems. In uncontrolled environments like telephone speech applications, the gender recognition system should be adaptive, accurate, and robust to noisy environments. This paper presents a reasoning method governed by Dempster-Shafer theory of evidence for automatic gender recognition from telephone speech. The proposed method uses mel-frequency cepstral coefficients with a support vector machine to generate the initial prediction results for individual speech segments. The reasoning scheme collects and validates results from support vector machine and treats convincing predictions as valid evidence. It is argued that the consideration of valid evidence in the reasoning process improves recognition performance by avoiding unconvincing classification results. Experiments conducted on large speech datasets reveal the superiority of the proposed gender recognition scheme for speech analysis applications.
AB - Speech signals carry valuable information about the speaker including age, gender, and emotional state. Gender information can act as a vital preprocessing ingredient for enhancing speech analysis applications like adaptive human-machine interfaces, multi-modal security applications, and sophisticated intent and context analysis based forensic systems. In uncontrolled environments like telephone speech applications, the gender recognition system should be adaptive, accurate, and robust to noisy environments. This paper presents a reasoning method governed by Dempster-Shafer theory of evidence for automatic gender recognition from telephone speech. The proposed method uses mel-frequency cepstral coefficients with a support vector machine to generate the initial prediction results for individual speech segments. The reasoning scheme collects and validates results from support vector machine and treats convincing predictions as valid evidence. It is argued that the consideration of valid evidence in the reasoning process improves recognition performance by avoiding unconvincing classification results. Experiments conducted on large speech datasets reveal the superiority of the proposed gender recognition scheme for speech analysis applications.
KW - Dempster-Shafer theory
KW - evidence fusion
KW - gender recognition
KW - speech forensics applications
UR - https://www.scopus.com/pages/publications/84968586030
U2 - 10.1109/PlatCon.2016.7456788
DO - 10.1109/PlatCon.2016.7456788
M3 - Conference contribution
AN - SCOPUS:84968586030
T3 - 2016 International Conference on Platform Technology and Service, PlatCon 2016 - Proceedings
BT - 2016 International Conference on Platform Technology and Service, PlatCon 2016 - Proceedings
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 3rd International Conference on Platform Technology and Service, PlatCon 2016
Y2 - 15 February 2016 through 17 February 2016
ER -