Audio Spectrogram Transformer-based Audio Classification using Voice data of Dementia Patients

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Globally, aging is ongoing, and the probability of senile disease and dementia patients continue to increase as human lifespan and the proportion of the elderly increase. Dementia is a disease that affects the depression and quality of life of the elderly, and it is a socially and economically dangerous disease that cannot live independently. Currently, the only treatment for dementia is through training if the disease is delayed or detected early. Diagnosing dementia early is the most important step at this point, and studies are currently being conducted non-face-To-face or through voice. This study aims to predict dementia early through human voice. The study was conducted with the aim of optimizing and lightening the characteristics of dementia patients by applying the Audio Spectrogram Transformer analysis model as a voice classification model. The model used in this paper classifies dementia with an accuracy of 89%. In future research, we will try a study that combines Explainable AI (XAI), an interpretable artificial intelligence of deep learning.

Original languageEnglish
Title of host publication2023 IEEE/ACIS 8th International Conference on Big Data, Cloud Computing, and Data Science, BCD 2023
EditorsJongwoo Park, Ngo Thi Phuong Lan, Sungtaek Lee, Tran Anh Tien, Jongbae Kim
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages114-119
Number of pages6
ISBN (Electronic)9798350373615
DOIs
StatePublished - 2023
Event8th IEEE/ACIS International Conference on Big Data, Cloud Computing, and Data Science, BCD 2023 - Ho Chi Minh City, Viet Nam
Duration: 14 Dec 202316 Dec 2023

Publication series

Name2023 IEEE/ACIS 8th International Conference on Big Data, Cloud Computing, and Data Science, BCD 2023

Conference

Conference8th IEEE/ACIS International Conference on Big Data, Cloud Computing, and Data Science, BCD 2023
Country/TerritoryViet Nam
CityHo Chi Minh City
Period14/12/2316/12/23

Keywords

  • Audio Classification
  • Audio Processing
  • Audio Transformer
  • Dementia
  • Mel-Spectrogram

Fingerprint

Dive into the research topics of 'Audio Spectrogram Transformer-based Audio Classification using Voice data of Dementia Patients'. Together they form a unique fingerprint.

Cite this