Robust Phishing Detection Using Random Forest with Isolation Forest Based Outlier Filtering

Authors

DOI:

https://doi.org/10.71129/ijaci.v1i2.pp99-105

Keywords:

Phishing Detection, Random Forest, Machine Learning, Cybersecurity, Classification, Outlier Removal

Abstract

Phishing remains a prevalent cybersecurity threat that exploits deceptive websites to extract confidential user information, resulting in significant financial and reputational damages. Conventional detection methods, such as rule-based systems and signature matching, often struggle with noisy data and fail to adapt swiftly to evolving attack strategies, limiting their effectiveness in real-world scenarios. To address these challenges, this study introduces a robust phishing detection framework that integrates outlier removal through the Isolation Forest algorithm with classification using the Random Forest model. The proposed approach aims to improve data quality by eliminating anomalous instances that could negatively impact model training and generalization. The experiments utilized a comprehensive dataset comprising 11,430 instances and 87 features, which was meticulously pre-processed to enhance model performance. The results demonstrate that the proposed framework achieves high accuracy (96.78%), precision (96.09%), recall (97.20%), and an F1-score of 96.64%, outperforming many existing approaches. The results indicate its strong ability to detect phishing websites while maintaining interpretability. This approach offers an effective and interpretable method for phishing detection, emphasizing clarity and robustness in machine learning-based security solutions. The findings underscore the potential of combining anomaly detection with ensemble classification techniques to develop resilient, accurate, and transparent phishing detection systems.

Downloads

Published

2025-07-28

Abstract

    126 views

PDF Download

    71 times

Similar Articles

1-10 of 15

You may also start an advanced similarity search for this article.