KnowPhish: Large Language Models Meet Multimodal Knowledge Graphs for Enhancing Reference-Based Phishing Detection

Read original: arXiv:2403.02253 - Published 6/18/2024 by Yuexin Li, Chengyu Huang, Shumin Deng, Mei Lin Lock, Tri Cao, Nay Oo, Hoon Wei Lim, Bryan Hooi

KnowPhish: Large Language Models Meet Multimodal Knowledge Graphs for Enhancing Reference-Based Phishing Detection

Overview

This paper, titled "KnowPhish: Large Language Models Meet Multimodal Knowledge Graphs for Enhancing Reference-Based Phishing Detection", explores a novel approach to enhancing phishing detection by combining large language models with multimodal knowledge graphs.
The researchers address the challenges of existing phishing detection methods, which often struggle with evolving phishing techniques and the need for comprehensive, up-to-date knowledge.
The proposed KnowPhish system aims to leverage the power of large language models and multimodal knowledge graphs to improve the accuracy and robustness of phishing detection.

Plain English Explanation

The paper presents a new way to detect phishing attempts, which are fraudulent emails or websites that try to trick people into revealing sensitive information or downloading malware. Current phishing detection methods can have trouble keeping up with the constantly changing tactics used by cybercriminals. The researchers behind KnowPhish have come up with a solution that combines two powerful technologies: large language models and multimodal knowledge graphs.

Large language models are AI systems that have been trained on huge amounts of text data, allowing them to understand and generate human-like language. Multimodal knowledge graphs are databases that store information not just from text, but also from images, videos, and other media. By bringing these two technologies together, the KnowPhish system can more accurately identify phishing attempts by drawing on a vast, up-to-date knowledge base that covers a wide range of information relevant to phishing.

For example, if a user receives an email that claims to be from their bank, KnowPhish could quickly check the email against its knowledge graph to see if the email address, website links, and other details match what it knows about the user's real bank. This allows KnowPhish to catch phishing attempts that might slip through other detection methods.

Technical Explanation

The researchers propose the KnowPhish system, which combines large language models (LLMs) and multimodal knowledge graphs (MMKGs) to enhance reference-based phishing detection. LLMs, such as BERT and DeBERTa, are used to extract semantic and contextual information from phishing-related content. MMKGs, which store multimodal data (text, images, URLs, etc.) related to phishing, provide a comprehensive knowledge base for cross-referencing and validation.

The KnowPhish architecture consists of three main components: a Phishing Knowledge Extractor, a Phishing Knowledge Graph Builder, and a Phishing Detection Engine. The Phishing Knowledge Extractor uses LLMs to analyze phishing-related text, images, and URLs, extracting relevant features and entities. The Phishing Knowledge Graph Builder then ingests this information into a multimodal knowledge graph, enhancing it with additional data from external sources.

Finally, the Phishing Detection Engine leverages the MMKG to perform reference-based phishing detection. By cross-referencing the characteristics of a potentially malicious email or website against the comprehensive knowledge graph, the system can identify discrepancies and inconsistencies that are indicative of phishing attempts.

The researchers evaluate KnowPhish on several phishing datasets and compare its performance to PhishGuard, a state-of-the-art phishing detection model. The results show that KnowPhish outperforms PhishGuard in terms of accuracy, F1-score, and other key metrics, demonstrating the effectiveness of the combined LLM and MMKG approach.

Critical Analysis

The paper presents a novel and promising approach to enhancing phishing detection, addressing the limitations of existing methods that often struggle to keep up with the evolving tactics of cybercriminals. The incorporation of multimodal knowledge graphs, which can store and cross-reference a wide range of information relevant to phishing, is a key strength of the KnowPhish system.

However, the paper does not provide extensive details on the specific knowledge graph construction process or the methods used to integrate the LLM and MMKG components. Additionally, the evaluation is limited to a few phishing datasets, and further testing on a wider range of real-world scenarios would be valuable to fully assess the system's capabilities and limitations.

Furthermore, the paper does not address potential privacy and ethical concerns related to the collection and storage of sensitive user data within the knowledge graph. Careful consideration of these issues, as well as the development of robust security measures, would be important for the practical deployment of KnowPhish.

Overall, the KnowPhish approach represents an interesting and potentially impactful advancement in the field of phishing detection. However, further research and development, along with a thorough examination of the system's real-world performance and ethical implications, would be necessary to fully evaluate its effectiveness and viability.

Conclusion

The KnowPhish system proposed in this paper offers a novel approach to enhancing phishing detection by integrating large language models and multimodal knowledge graphs. By leveraging the complementary strengths of these technologies, KnowPhish aims to improve the accuracy and robustness of phishing detection, addressing the challenges posed by the constantly evolving tactics of cybercriminals.

The key innovation of KnowPhish is its ability to draw on a comprehensive, multimodal knowledge base to cross-reference and validate the characteristics of potentially malicious emails or websites. This approach shows promise in improving upon the limitations of existing phishing detection methods.

While the paper presents promising results, further research and development are needed to fully assess the system's capabilities, address potential privacy and ethical concerns, and explore its practical deployment in real-world scenarios. Nonetheless, the KnowPhish concept represents an exciting step forward in the ongoing effort to protect individuals and organizations from the growing threat of phishing attacks.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Follow @aimodelsfyi on 𝕏 →

Related Papers

KnowPhish: Large Language Models Meet Multimodal Knowledge Graphs for Enhancing Reference-Based Phishing Detection

Yuexin Li, Chengyu Huang, Shumin Deng, Mei Lin Lock, Tri Cao, Nay Oo, Hoon Wei Lim, Bryan Hooi

Phishing attacks have inflicted substantial losses on individuals and businesses alike, necessitating the development of robust and efficient automated phishing detection approaches. Reference-based phishing detectors (RBPDs), which compare the logos on a target webpage to a known set of logos, have emerged as the state-of-the-art approach. However, a major limitation of existing RBPDs is that they rely on a manually constructed brand knowledge base, making it infeasible to scale to a large number of brands, which results in false negative errors due to the insufficient brand coverage of the knowledge base. To address this issue, we propose an automated knowledge collection pipeline, using which we collect a large-scale multimodal brand knowledge base, KnowPhish, containing 20k brands with rich information about each brand. KnowPhish can be used to boost the performance of existing RBPDs in a plug-and-play manner. A second limitation of existing RBPDs is that they solely rely on the image modality, ignoring useful textual information present in the webpage HTML. To utilize this textual information, we propose a Large Language Model (LLM)-based approach to extract brand information of webpages from text. Our resulting multimodal phishing detection approach, KnowPhish Detector (KPD), can detect phishing webpages with or without logos. We evaluate KnowPhish and KPD on a manually validated dataset, and a field study under Singapore's local context, showing substantial improvements in effectiveness and efficiency compared to state-of-the-art baselines.

6/18/2024

💬

Multimodal Large Language Models for Phishing Webpage Detection and Identification

Jehyun Lee, Peiyuan Lim, Bryan Hooi, Dinil Mon Divakaran

To address the challenging problem of detecting phishing webpages, researchers have developed numerous solutions, in particular those based on machine learning (ML) algorithms. Among these, brand-based phishing detection that uses models from Computer Vision to detect if a given webpage is imitating a well-known brand has received widespread attention. However, such models are costly and difficult to maintain, as they need to be retrained with labeled dataset that has to be regularly and continuously collected. Besides, they also need to maintain a good reference list of well-known websites and related meta-data for effective performance. In this work, we take steps to study the efficacy of large language models (LLMs), in particular the multimodal LLMs, in detecting phishing webpages. Given that the LLMs are pretrained on a large corpus of data, we aim to make use of their understanding of different aspects of a webpage (logo, theme, favicon, etc.) to identify the brand of a given webpage and compare the identified brand with the domain name in the URL to detect a phishing attack. We propose a two-phase system employing LLMs in both phases: the first phase focuses on brand identification, while the second verifies the domain. We carry out comprehensive evaluations on a newly collected dataset. Our experiments show that the LLM-based system achieves a high detection rate at high precision; importantly, it also provides interpretable evidence for the decisions. Our system also performs significantly better than a state-of-the-art brand-based phishing detection system while demonstrating robustness against two known adversarial attacks.

8/13/2024

Utilizing Large Language Models to Optimize the Detection and Explainability of Phishing Websites

Sayak Saha Roy, Shirin Nilizadeh

In this paper, we introduce PhishLang, an open-source, lightweight language model specifically designed for phishing website detection through contextual analysis of the website. Unlike traditional heuristic or machine learning models that rely on static features and struggle to adapt to new threats, and deep learning models that are computationally intensive, our model leverages MobileBERT, a fast and memory-efficient variant of the BERT architecture, to learn granular features characteristic of phishing attacks. PhishLang operates with minimal data preprocessing and offers performance comparable to leading deep learning anti-phishing tools, while being significantly faster and less resource-intensive. Over a 3.5-month testing period, PhishLang successfully identified 25,796 phishing URLs, many of which were undetected by popular antiphishing blocklists, thus demonstrating its potential to enhance current detection measures. Capitalizing on PhishLang's resource efficiency, we release the first open-source fully client-side Chromium browser extension that provides inference locally without requiring to consult an online blocklist and can be run on low-end systems with no impact on inference times. Our implementation not only outperforms prevalent (server-side) phishing tools, but is significantly more effective than the limited commercial client-side measures available. Furthermore, we study how PhishLang can be integrated with GPT-3.5 Turbo to create explainable blocklisting -- which, upon detection of a website, provides users with detailed contextual information about the features that led to a website being marked as phishing.

9/11/2024

💬

Large Language Models Spot Phishing Emails with Surprising Accuracy: A Comparative Analysis of Performance

Het Patel, Umair Rehman, Farkhund Iqbal

Phishing, a prevalent cybercrime tactic for decades, remains a significant threat in today's digital world. By leveraging clever social engineering elements and modern technology, cybercrime targets many individuals, businesses, and organizations to exploit trust and security. These cyber-attackers are often disguised in many trustworthy forms to appear as legitimate sources. By cleverly using psychological elements like urgency, fear, social proof, and other manipulative strategies, phishers can lure individuals into revealing sensitive and personalized information. Building on this pervasive issue within modern technology, this paper aims to analyze the effectiveness of 15 Large Language Models (LLMs) in detecting phishing attempts, specifically focusing on a randomized set of 419 Scam emails. The objective is to determine which LLMs can accurately detect phishing emails by analyzing a text file containing email metadata based on predefined criteria. The experiment concluded that the following models, ChatGPT 3.5, GPT-3.5-Turbo-Instruct, and ChatGPT, were the most effective in detecting phishing emails.

6/10/2024