Architecture and algorithm for web phishing detection
2010 (English)In: Journal of Southeast University (English Edition), ISSN 1003-7985, Vol. 26, no 1, p. 43-47Article in journal (Refereed) Published
Abstract [en]
A phishing detection system, which comprises client-side filtering plug-in, analysis center and protected sites, is proposed. An image-based similarity detection algorithm is conceived to calculate the similarity of two web pages. The web pages are first converted into images, and then divided into sub-images with iterated dividing and shrinking. After that, the attributes of sub-images including color histograms, gray histograms and size parameters are computed to construct the attributed relational graph (ARG) of each page. In order to match two ARGs, the inner earth mover's distances (EMD) between every two nodes coming from each ARG respectively are first computed, and then the similarity of web pages by the outer EMD between two ARGs is worked out to detect phishing web pages. The experimental results show that the proposed architecture and algorithm has good robustness along with scalability, and can effectively detect phishing.
Place, publisher, year, edition, pages
2010. Vol. 26, no 1, p. 43-47
Keywords [en]
Attributed relational graph, Image similarity, Inner EMD, Outer EMD, Phishing detection, Color histogram, Detection system, Earth mover's distance, Gray histogram, Image-based, Phishing, Plug-ins, Proposed architectures, Similarity detection, Size parameters, Subimages, Web page, Algorithms, Graphic methods, World Wide Web
National Category
Other Environmental Engineering
Identifiers
URN: urn:nbn:se:kth:diva-149421Scopus ID: 2-s2.0-77952121835OAI: oai:DiVA.org:kth-149421DiVA, id: diva2:739414
Note
QC 20140821
2014-08-212014-08-212022-06-23Bibliographically approved