Phishing

The document discusses a project focused on developing machine learning models for detecting URL-based phishing attacks, which pose significant cybersecurity threats. It outlines objectives such as accurate identification of phishing URLs, real-time detection, and adaptability to evolving attacks, while addressing challenges like imbalanced datasets. The proposed methodology includes feature engineering, ensemble learning, and transfer learning to improve detection accuracy.

Uploaded by

honuleritesh603


Dr. M. S. Sheshgiri College of Engineering & Technology
Belagavi Campus

URL-BASED PHISHING DETECTION
By
02FE23MCA027: Ritesh Honule
02FE23MCA040: Rohan Patil
02FE23MCA045: Anish Vernekar
02FE23MCA057: Aditya Sankpal

GUIDE :-

Introduction
• The Internet is essential to daily life, but it also enables phishing.
Phishers use fake websites and social engineering to steal credentials,
and they constantly evolve their techniques to bypass detection.
Machine learning can identify phishing by recognizing common attack
patterns, which helps differentiate legitimate websites from malicious
ones.

Literature Review
[1] Machine Learning-Based Phishing Detection Using URL Features (Feature Engineering)
• Extracts URL-based features such as domain reputation, length, special characters, and domain age to classify URLs as phishing or legitimate.
[3] Deep Learning for Phishing URL Detection (Deep Learning Techniques)
• Uses CNNs and transformers to learn phishing patterns automatically, improving detection rates.
[4] Handling Imbalanced Datasets in Phishing Detection (Data Balancing Techniques)
• Addresses imbalanced datasets using oversampling, undersampling, and SMOTE to enhance classification accuracy.
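
The simplest of the balancing techniques listed above, random oversampling, can be sketched in plain Python. This is only an illustration under stated assumptions: the function name `random_oversample`, the toy hostnames, and the labels (1 = phishing, 0 = legitimate) are hypothetical and not taken from the cited work. SMOTE differs in that it synthesizes new minority samples by interpolation rather than duplicating existing ones.

```python
import random
from collections import Counter


def random_oversample(samples, labels, seed=0):
    """Duplicate randomly chosen samples of the smaller classes until
    every class has as many samples as the largest class."""
    rng = random.Random(seed)

    # Group samples by their class label.
    by_class = {}
    for sample, label in zip(samples, labels):
        by_class.setdefault(label, []).append(sample)

    target = max(len(group) for group in by_class.values())

    out_samples, out_labels = [], []
    for label, group in by_class.items():
        # Draw extra copies (with replacement) to reach the target size.
        extra = [rng.choice(group) for _ in range(target - len(group))]
        for sample in group + extra:
            out_samples.append(sample)
            out_labels.append(label)
    return out_samples, out_labels


# Hypothetical toy data: four legitimate hosts (0) and one phishing host (1).
urls = ["a.example", "b.example", "c.example", "d.example", "login.example.test"]
labels = [0, 0, 0, 0, 1]

balanced_urls, balanced_labels = random_oversample(urls, labels)
print(Counter(balanced_labels))
```

Oversampling is normally applied to the training split only, so that duplicated samples do not leak into the evaluation data.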

Objectives of the Project
• Accurate Identification of Phishing URLs – Develop ML models that effectively differentiate between legitimate and phishing URLs using lexical, host-based, and content-based features.
• Real-Time Detection – Ensure the system can quickly analyze and classify URLs to prevent users from accessing malicious websites.
• Adaptability to Evolving Attacks – Improve models so that they detect new phishing techniques and withstand adversarial attacks by continuously learning from updated datasets.
• Minimizing False Positives & False Negatives – Optimize the detection system to reduce incorrect classifications, ensuring reliability and user trust.
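
As a rough illustration of the lexical features mentioned in the first objective, the sketch below derives a few of them from a URL string using only the Python standard library. The feature names, the `SUSPICIOUS_KEYWORDS` list, and the function `extract_lexical_features` are assumptions made for this example; host-based features such as domain age would need external lookups and are not covered.

```python
import re
from urllib.parse import urlparse

# Hypothetical keyword list, chosen only for illustration.
SUSPICIOUS_KEYWORDS = ("login", "verify", "secure", "account", "update")


def extract_lexical_features(url):
    """Return a dict of simple lexical features computed from the URL text."""
    parsed = urlparse(url)
    host = parsed.hostname or ""
    lowered = url.lower()
    return {
        "url_length": len(url),
        "host_length": len(host),
        "num_dots": host.count("."),
        "num_hyphens": host.count("-"),
        "num_digits": sum(ch.isdigit() for ch in url),
        "has_at_symbol": int("@" in url),
        # 1 if the host is a dotted-decimal IPv4 address instead of a name.
        "has_ip_host": int(re.fullmatch(r"\d{1,3}(\.\d{1,3}){3}", host) is not None),
        "uses_https": int(parsed.scheme == "https"),
        # Number of distinct suspicious keywords that occur in the URL.
        "num_keywords": sum(kw in lowered for kw in SUSPICIOUS_KEYWORDS),
    }


features = extract_lexical_features("http://192.168.0.1/login-verify")
print(features)
```

A dictionary like this can be turned into one row of a feature table before a classifier is trained on it.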

Final Problem Definition
Phishing attacks use deceptive URLs to steal sensitive
information, posing a major cybersecurity threat. Traditional
detection methods struggle as attackers constantly evolve
techniques to bypass them. Machine learning can effectively
identify phishing URLs by analyzing patterns and features.
However, challenges like imbalanced datasets, real-time
detection, and accuracy persist. This project aims to develop an
ML-based model for accurate and efficient phishing URL
detection.
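
The interaction between imbalanced datasets and accuracy noted above can be made concrete with a small calculation. The sketch below assumes a hypothetical test set of 95 legitimate URLs (label 0) and 5 phishing URLs (label 1), and a trivial classifier that labels everything legitimate. The helper names `confusion_counts` and `summarize` are illustrative only.

```python
def confusion_counts(y_true, y_pred, positive=1):
    """Count true/false positives and negatives for a binary task."""
    tp = fp = tn = fn = 0
    for truth, pred in zip(y_true, y_pred):
        if pred == positive and truth == positive:
            tp += 1
        elif pred == positive:
            fp += 1          # legitimate URL flagged as phishing
        elif truth == positive:
            fn += 1          # phishing URL missed
        else:
            tn += 1
    return tp, fp, tn, fn


def summarize(y_true, y_pred):
    """Return accuracy, precision, and recall for the phishing class."""
    tp, fp, tn, fn = confusion_counts(y_true, y_pred)
    total = tp + fp + tn + fn
    return {
        "accuracy": (tp + tn) / total,
        "precision": tp / (tp + fp) if (tp + fp) else 0.0,
        "recall": tp / (tp + fn) if (tp + fn) else 0.0,
    }


# Hypothetical imbalanced test set and a classifier that never flags anything.
y_true = [0] * 95 + [1] * 5
always_legitimate = [0] * 100

print(summarize(y_true, always_legitimate))
```

Here accuracy is 0.95 even though recall on the phishing class is 0.0, which is why false negatives and false positives are worth tracking separately from overall accuracy.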

Software & Hardware Requirements
Software Requirements:
1. Operating System: Windows 10/11, Linux (Ubuntu), or macOS
2. Programming Language: Python (Version 3.7 or above)
3. Development Environment: Anaconda Navigator (for managing dependencies); Jupyter Notebook or VS Code (for development and testing)

Hardware Requirements:
1. Processor: Intel Core i5/i7 (or AMD equivalent) – Minimum 2.5 GHz
2. RAM: 8 GB (Minimum), 16 GB (Recommended for large datasets)
3. Storage: 50 GB free space (for datasets, model training, and logs)
4. GPU (Optional): NVIDIA GPU (for deep learning models, if required)

Library Requirements:
1. NumPy, pandas – Data manipulation
2. scikit-learn – Machine learning models
3. matplotlib, seaborn – Data visualization
4. SciPy – Scientific computing
5. pickle-mixin – Model serialization
6. Flask – Web application for deployment

Proposed Methodology
• Feature Engineering: Extract key URL features such as domain reputation, length, keywords, and domain age to help ML models differentiate phishing and legitimate URLs.
• Ensemble Learning: Combine models like Random Forest, Gradient Boosting, and Decision Tree to improve detection accuracy and reduce individual weaknesses.
• Imbalanced Data Handling: Use techniques like oversampling, undersampling, or SMOTE to balance phishing and legitimate URL data.
• Transfer Learning: Fine-tune pre-trained models on phishing-specific data to enhance detection.
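
The combining step of the ensemble idea above can be sketched as hard (majority) voting. In this minimal example the three voters are hand-written rules standing in for trained Random Forest, Gradient Boosting, and Decision Tree models; the rule names, thresholds, and feature keys are all hypothetical. scikit-learn offers the same idea for real estimators through its `VotingClassifier`.

```python
from collections import Counter


def majority_vote(classifiers, features):
    """Hard voting: return the label predicted by most classifiers."""
    votes = [clf(features) for clf in classifiers]
    return Counter(votes).most_common(1)[0][0]


# Hypothetical stand-ins for trained models. Each maps a feature dict
# to 1 (phishing) or 0 (legitimate).
def long_url_rule(f):
    return int(f["url_length"] > 75)


def ip_host_rule(f):
    return int(f["has_ip_host"] == 1)


def keyword_rule(f):
    return int(f["num_keywords"] >= 2)


voters = [long_url_rule, ip_host_rule, keyword_rule]

# Short URL, but hosted on a raw IP address and containing two keywords.
sample = {"url_length": 40, "has_ip_host": 1, "num_keywords": 2}
print(majority_vote(voters, sample))  # prints 1 (two of three voters flag it)
```

Using an odd number of voters on a binary task avoids ties; a single rule that misfires (here the length rule) is outvoted by the other two.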

References
1. Machine Learning-Based Phishing Detection Using URL Features - Published: 02 October 2023
   Authors: Asif Uz Zaman Asif, Hossein Shirazi, Indrakshi Ray
2. Machine Learning based URL Analysis for Phishing Detection - Date of Conference: 3-4 March 2023, Publisher: IEEE

CONCLUSION: Both studies underscore the efficacy of machine learning techniques in detecting phishing URLs through the analysis of URL features. The integration of lexical and host-based features, coupled with the application of robust machine learning algorithms, significantly enhances detection accuracy. However, challenges such as feature selection, dataset quality, and the adaptability of models to evolving phishing tactics remain critical areas for ongoing research and

Thank You
