TY - GEN
T1 - BlindFilter
T2 - 42nd International Symposium on Reliable Distributed Systems, SRDS 2023
AU - Lee, Dongwon
AU - Ahn, Myeonghwan
AU - Kwak, Hyesun
AU - Hong, Jin B.
AU - Kim, Hyoungshick
N1 - Publisher Copyright:
© 2023 IEEE.
PY - 2023
Y1 - 2023
N2 - Spam filtering services typically operate via cloud outsourcing, which exposes sensitive and private email content to the cloud server spam filter. Homomorphic encryption (HE) can address this issue by ensuring that user emails remain encrypted throughout all stages of the spam detection process on the cloud server. However, existing HE-based approaches are computationally infeasible due to the nature of HE operations. This paper proposes BlindFilter, a distributed, lightweight, HE-based spam email detection approach that consists of clients and servers collaborating to perform spam detection operations securely. BlindFilter employs WordPiece encoding and a modified Naive Bayes classifier, mitigating the need for multiplications and comparisons that would be prohibitive in terms of computation when applied with HE. Our experimental results demonstrate the efficacy of BlindFilter, with F1 scores exceeding 97% across two public email datasets. Furthermore, BlindFilter proves to be efficient as it can process an email in an average of 482.78 milliseconds. Our analysis also reveals that BlindFilter is robust against model extraction attacks, in which malicious users attempt to deduce the features of BlindFilter from query-response pairs.
AB - Spam filtering services typically operate via cloud outsourcing, which exposes sensitive and private email content to the cloud server spam filter. Homomorphic encryption (HE) can address this issue by ensuring that user emails remain encrypted throughout all stages of the spam detection process on the cloud server. However, existing HE-based approaches are computationally infeasible due to the nature of HE operations. This paper proposes BlindFilter, a distributed, lightweight, HE-based spam email detection approach that consists of clients and servers collaborating to perform spam detection operations securely. BlindFilter employs WordPiece encoding and a modified Naive Bayes classifier, mitigating the need for multiplications and comparisons that would be prohibitive in terms of computation when applied with HE. Our experimental results demonstrate the efficacy of BlindFilter, with F1 scores exceeding 97% across two public email datasets. Furthermore, BlindFilter proves to be efficient as it can process an email in an average of 482.78 milliseconds. Our analysis also reveals that BlindFilter is robust against model extraction attacks, in which malicious users attempt to deduce the features of BlindFilter from query-response pairs.
KW - Homomorphic encryption
KW - Spam detection
UR - https://www.scopus.com/pages/publications/85182399704
U2 - 10.1109/SRDS60354.2023.00014
DO - 10.1109/SRDS60354.2023.00014
M3 - Conference contribution
AN - SCOPUS:85182399704
T3 - Proceedings of the IEEE Symposium on Reliable Distributed Systems
SP - 35
EP - 45
BT - Proceedings - 2023 42nd International Symposium on Reliable Distributed Systems, SRDS 2023
PB - IEEE Computer Society
Y2 - 25 September 2023 through 29 September 2023
ER -