TY - GEN
T1 - Knowledge augmented significant language model-based chatbot for explainable diabetes mellitus prediction
AU - Elfayoumi, Mazen
AU - AbouElazm, Mohamed
AU - Mohamed, Omar
AU - Abuhmed, Tamer
AU - El-Sappagh, Shaker
N1 - Publisher Copyright:
© 2025 IEEE.
PY - 2025
Y1 - 2025
N2 - This study proposes an innovative diabetes prediction chatbot that utilizes large language models (LLMs) to determine the likelihood of diabetes based on specific patient inputs. Unlike conventional machine learning models and in addition to providing precise, individualized, robust prediction of diabetes augmented by the percentage of its confidence, this chatbot provides detailed text-based explanations for its individual predictions and enhances user’s understanding by retrieving and presenting similar cases using case-based reasoning techniques. Utilizing key health indicators such as hypertension status, heart disease presence, smoking history, BMI, HbA1c level, and blood glucose levels, the chatbot not only predicts the presence of diabetes but also educates users about the underlying reasons for each prediction. The system's explanation module promotes transparency and trust in the predictive process. The proposed architecture integrates the retrieval augmented generation (RAG) technique with prompt engineering across multiple LLMs, including Llama 3.1, GPT-3.5, and Gemma2. This approach augments the chatbot’s responses with contextually relevant data from similar past cases, thereby enhancing both relevance and accuracy. Furthermore, RAG minimizes LLM hallucinations while enhancing them with up-to-date, precise, personalized medical information. The objective of this chatbot is to offer healthcare professionals and patients a dependable, educational instrument for early detection of diabetes supported by transparent AI principles. This will have a positive effect on preventative healthcare measures.
AB - This study proposes an innovative diabetes prediction chatbot that utilizes large language models (LLMs) to determine the likelihood of diabetes based on specific patient inputs. Unlike conventional machine learning models and in addition to providing precise, individualized, robust prediction of diabetes augmented by the percentage of its confidence, this chatbot provides detailed text-based explanations for its individual predictions and enhances user’s understanding by retrieving and presenting similar cases using case-based reasoning techniques. Utilizing key health indicators such as hypertension status, heart disease presence, smoking history, BMI, HbA1c level, and blood glucose levels, the chatbot not only predicts the presence of diabetes but also educates users about the underlying reasons for each prediction. The system's explanation module promotes transparency and trust in the predictive process. The proposed architecture integrates the retrieval augmented generation (RAG) technique with prompt engineering across multiple LLMs, including Llama 3.1, GPT-3.5, and Gemma2. This approach augments the chatbot’s responses with contextually relevant data from similar past cases, thereby enhancing both relevance and accuracy. Furthermore, RAG minimizes LLM hallucinations while enhancing them with up-to-date, precise, personalized medical information. The objective of this chatbot is to offer healthcare professionals and patients a dependable, educational instrument for early detection of diabetes supported by transparent AI principles. This will have a positive effect on preventative healthcare measures.
KW - Chatbot
KW - Clinical Decision Support Systems
KW - Diabetes Prediction
KW - Explainable AI
KW - Large Language Models
KW - Retrieval Augmented Generation
UR - https://www.scopus.com/pages/publications/85218144058
U2 - 10.1109/IMCOM64595.2025.10857525
DO - 10.1109/IMCOM64595.2025.10857525
M3 - Conference contribution
AN - SCOPUS:85218144058
T3 - Proceedings of the 2025 19th International Conference on Ubiquitous Information Management and Communication, IMCOM 2025
BT - Proceedings of the 2025 19th International Conference on Ubiquitous Information Management and Communication, IMCOM 2025
A2 - Lee, Sukhan
A2 - Choo, Hyunseung
A2 - Ismail, Roslan
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 19th International Conference on Ubiquitous Information Management and Communication, IMCOM 2025
Y2 - 3 January 2025 through 5 January 2025
ER -