Project: Web Document Analysis and its Application to Antiphishing

The Web is evolving at an amazing speed and is now the greatest repository of information and knowledge. The large volume of accumulated web documents demands automated processing and analysis for intelligent applications. In this dissertation, we investigate web document analysis techniques and develop an application to antiphishing. For web document analysis, we propose and implement a page segmentation method based on visual factors. Based on the W3C DOM model of HTML, the method first decomposes the whole web page into many separate salient blocks, each of which is visually and semantically consistent internally but distinguishable from adjacent blocks. Next, the method aggregates these salient blocks into semantically meaningful blocks according to their positions and visual cues on the page. In this bottom-up way, the technique ultimately builds a hierarchical tree of segmented blocks. We then apply our web page segmentation to the antiphishing problem. Phishing webpages generally present visual styles and structures similar to those of their targets. Based on web page segmentation, we propose three metrics (block-level similarity, layout similarity, and overall style similarity) to measure the visual similarity between a phishing page and its target. If any of them exceeds a certain threshold, a phishing alert is issued. We have built a prototype system to demonstrate the business model of our antiphishing mechanism, and we believe the technique can be deployed as an enterprise antiphishing solution.
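The detection rule described above (three similarity scores, with an alert if any one exceeds its threshold) can be illustrated with a minimal Python sketch. The abstract does not define how the metrics are computed, so everything below is an assumption made for illustration only: the `Block` fields, the exact-match and Jaccard formulas, the position tolerance `tol`, and the threshold values are all hypothetical stand-ins, not the thesis's actual definitions.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Block:
    """One segmented page block (hypothetical representation)."""
    x: float      # left edge, as a fraction of page width
    y: float      # top edge, as a fraction of page height
    w: float      # width, as a fraction of page width
    h: float      # height, as a fraction of page height
    style: str    # coarse style signature, e.g. "arial|blue"
    content: str  # content fingerprint, e.g. a text or image hash


def block_similarity(suspect, target):
    """Assumed metric: fraction of suspect blocks whose content occurs in the target."""
    if not suspect:
        return 0.0
    target_contents = {b.content for b in target}
    return sum(b.content in target_contents for b in suspect) / len(suspect)


def layout_similarity(suspect, target, tol=0.05):
    """Assumed metric: fraction of suspect blocks with a target block at nearly the same box."""
    if not suspect:
        return 0.0

    def close(a, b):
        pairs = ((a.x, b.x), (a.y, b.y), (a.w, b.w), (a.h, b.h))
        return all(abs(p - q) <= tol for p, q in pairs)

    return sum(any(close(s, t) for t in target) for s in suspect) / len(suspect)


def style_similarity(suspect, target):
    """Assumed metric: Jaccard overlap of the style signatures used on each page."""
    a = {b.style for b in suspect}
    c = {b.style for b in target}
    union = a | c
    return len(a & c) / len(union) if union else 0.0


def is_phishing(suspect, target, thresholds=(0.8, 0.8, 0.8)):
    """Raise an alert if ANY of the three scores exceeds its (illustrative) threshold."""
    scores = (
        block_similarity(suspect, target),
        layout_similarity(suspect, target),
        style_similarity(suspect, target),
    )
    return any(s > t for s, t in zip(scores, thresholds)), scores


# Hypothetical target page and a near-copy with one altered block.
target_page = [
    Block(0.0, 0.0, 1.0, 0.1, "arial|blue", "logo"),
    Block(0.3, 0.3, 0.4, 0.3, "arial|white", "login-form"),
    Block(0.0, 0.9, 1.0, 0.1, "arial|gray", "footer"),
]
suspect_page = [
    Block(0.0, 0.0, 1.0, 0.1, "arial|blue", "logo"),
    Block(0.31, 0.3, 0.4, 0.3, "arial|white", "login-form"),
    Block(0.0, 0.9, 1.0, 0.1, "arial|gray", "footer-altered"),
]
alert, scores = is_phishing(suspect_page, target_page)
```

In this toy example only two of the three blocks match on content, so the block-level score alone would not trigger an alert, but the layout and style scores do. That is the practical effect of the "any metric exceeds its threshold" rule: a page that copies the look of its target is flagged even when some of its content differs.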

Contents: Antiphishing and Web Document Analysis

Chapter 1 Introduction
1.1 Problem Description
1.1.1 Webpage Segmentation
1.1.2 Antiphishing
1.2 Motivation
1.3 Contributions
1.4 Thesis Organization

Chapter 2 Literature Review
2.1 Web Document Analysis
2.2 Webpage Segmentation
2.3 Phishing Webpage Detection

Chapter 3 Web Document Analysis
3.1 Related Works
3.2 Salient Block Decomposition
3.3 Block Clustering
3.3.1 Location and Appearance clues
3.3.2 Semantics clues
3.4 Experiments
3.4.1 Prototype System
3.4.2 Evaluation Results
3.5 Conclusions on Web Document Analysis

Chapter 4 Phishing Webpage Detection
4.1 Related Works
4.2 The AntiPhishing Approach….

Source: City University of Hong Kong