Phishing detection with popular search engines: Simple and effective

Jun Ho Huh, Hyoungshick Kim

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

47 Scopus citations

Abstract

We propose a new phishing detection heuristic based on the search results returned from popular web search engines such as Google, Bing and Yahoo. The full URL of a website a user intends to access is used as the search string, and the number of results returned and ranking of the website are used for classification. Most of the time, legitimate websites get back large number of results and are ranked first, whereas phishing websites get back no result and/or are not ranked at all. To demonstrate the effectiveness of our approach, we experimented with four well-known classification algorithms - Linear Discriminant Analysis, Naïve Bayesian, K-Nearest Neighbour, and Support Vector Machine - and observed their performance. The K-Nearest Neighbour algorithm performed best, achieving true positive rate of 98% and false positive and false negative rates of 2%. We used new legitimate websites and phishing websites as our dataset to show that our approach works well even on newly launched websites/webpages - such websites are often misclassified in existing blacklisting and whitelisting approaches.

Original languageEnglish
Title of host publicationFoundations and Practice of Security - 4th Canada-France MITACS Workshop, FPS 2011, Revised Selected Papers
Pages194-207
Number of pages14
DOIs
StatePublished - 2012
Externally publishedYes
Event4th Canada-France MITACS Workshop on Foundations and Practice of Security, FPS 2011 - Paris, France
Duration: 12 May 201113 May 2011

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume6888 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference4th Canada-France MITACS Workshop on Foundations and Practice of Security, FPS 2011
Country/TerritoryFrance
CityParis
Period12/05/1113/05/11

Keywords

  • Classification
  • Phishing detection
  • URL Reputation

Fingerprint

Dive into the research topics of 'Phishing detection with popular search engines: Simple and effective'. Together they form a unique fingerprint.

Cite this