TY - GEN
T1 - Developing a GPT-Based text Extraction Model for Cancer Information
AU - Yi, Yong Jeong
AU - Jo, Jaemin
AU - Bae, Beom Jun
AU - Moon, Hyunwoo
AU - Yoon, June
AU - Lee, Sanghyuk
N1 - Publisher Copyright:
© 2024 IEEE.
PY - 2024
Y1 - 2024
N2 - By employing Aristotle's rhetoric as the theoretical framework, the present study aims to develop a model that automatically extracts the three key components of persuasive strategies-ethos (authority), pathos (emotional appeal), and logos (logic)-from answers to pertinent cancer questions on Quora, a social question and answer platform. Furthermore, we apply the model to discrete groups of the most upvoted and random (non-upvoted) answers to compare differences in the three persuasive components. The dataset consists of a total of 103 questions and their corresponding answers, including both upvoted and random answers. It was employed for preliminary findings, comprising a total of 33 questions and answers, with answers to 19 questions used as training data and answers to 14 questions used as test data. We annotated sentences in the answers according to the three types of rhetoric employed. We then fine-tuned models based on Generative Pretrained Transformers (GPT) to classify the phrases, achieving an average F1 score of 0.84. Paired sample t-tests confirmed our research hypotheses regarding ethos and logos, while our hypothesis about pathos was not confirmed. Results suggest that ethos and logos are effective in communicating cancer information to consumers, but that pathos is not.
AB - By employing Aristotle's rhetoric as the theoretical framework, the present study aims to develop a model that automatically extracts the three key components of persuasive strategies-ethos (authority), pathos (emotional appeal), and logos (logic)-from answers to pertinent cancer questions on Quora, a social question and answer platform. Furthermore, we apply the model to discrete groups of the most upvoted and random (non-upvoted) answers to compare differences in the three persuasive components. The dataset consists of a total of 103 questions and their corresponding answers, including both upvoted and random answers. It was employed for preliminary findings, comprising a total of 33 questions and answers, with answers to 19 questions used as training data and answers to 14 questions used as test data. We annotated sentences in the answers according to the three types of rhetoric employed. We then fine-tuned models based on Generative Pretrained Transformers (GPT) to classify the phrases, achieving an average F1 score of 0.84. Paired sample t-tests confirmed our research hypotheses regarding ethos and logos, while our hypothesis about pathos was not confirmed. Results suggest that ethos and logos are effective in communicating cancer information to consumers, but that pathos is not.
KW - Aristotle's rhetoric
KW - ChatGPT
KW - artificial intelligence
KW - cancer information
KW - machine learning
KW - persuasion
KW - social Q&A
UR - https://www.scopus.com/pages/publications/85190269833
U2 - 10.1109/Confluence60223.2024.10463424
DO - 10.1109/Confluence60223.2024.10463424
M3 - Conference contribution
AN - SCOPUS:85190269833
T3 - Proceedings of the 14th International Conference on Cloud Computing, Data Science and Engineering, Confluence 2024
SP - 165
EP - 169
BT - Proceedings of the 14th International Conference on Cloud Computing, Data Science and Engineering, Confluence 2024
A2 - Thakur, Sanjeev
A2 - Garg, Rakesh
A2 - Singhal, Abhishek
A2 - Kumar, Sumit
A2 - Kumar, Sumit
A2 - Arora, Renuka
A2 - Sehgal Kaushik, Rajni
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 14th International Conference on Cloud Computing, Data Science and Engineering, Confluence 2024
Y2 - 18 January 2024 through 19 January 2024
ER -