This repository contains two Python utilities for email security analysis and processing:
- Phishing Email Detection System
- Email Processing Utility
## Phishing Email Detection System

A machine learning-based system that analyzes emails to detect potential phishing attempts using a GGUF model.
### Features

- Analyzes email content, subject, sender, and return path
- Classifies emails into three categories:
- Malicious (score > 0.49)
- Suspicious (score between 0.3 and 0.49)
- Benign (score < 0.3)
- Provides detailed analysis including:
- Classification result
- Confidence percentage
- Brief explanation
- Key reasons for classification
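The classification thresholds above can be sketched as a simple mapping. This is a minimal illustration of the scoring logic; `classify_score` is a hypothetical helper, not part of the actual module:

```python
def classify_score(score: float) -> str:
    """Map a model confidence score to one of the three categories.

    Thresholds follow the README: > 0.49 is Malicious,
    0.3 to 0.49 is Suspicious, and below 0.3 is Benign.
    """
    if score > 0.49:
        return "Malicious"
    if score >= 0.3:
        return "Suspicious"
    return "Benign"


print(classify_score(0.75))  # Malicious
print(classify_score(0.35))  # Suspicious
print(classify_score(0.10))  # Benign
```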
### Requirements

- Python 3.x
- llama-cpp-python
- BeautifulSoup4
- email (standard library)
### Usage

```python
from phishingtest_gguf_model import process_email, process_llm

# Process an email file
email_data = process_email("path/to/email.eml")

# Analyze the email
result = process_llm(email_data)
```

## Email Processing Utility

A utility for processing and cleaning email files, particularly useful for preparing emails for analysis.
### Features

- Removes HTML tags from email content
- Handles multiple email encodings (UTF-8, Latin-1, CP1252, ISO-8859-1)
- Properly unfolds email headers according to RFC 5322
- Removes X-headers
- Extracts email components:
- Subject
- Body
- Sender
- Return-Path
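A minimal sketch of the component-extraction step using only the `email` standard library (the real `get_email_body_from_string()` also strips HTML and X-headers; the sample message here is illustrative):

```python
import email
from email import policy

# A small illustrative raw email (not from the repository's test data)
RAW = (
    "Subject: Account notice\r\n"
    "From: Security <alerts@example.com>\r\n"
    "Return-Path: <bounce@example.com>\r\n"
    "\r\n"
    "Please verify your account."
)

# policy.default unfolds RFC 5322 folded headers automatically
msg = email.message_from_string(RAW, policy=policy.default)

subject = str(msg["Subject"])
sender = str(msg["From"])
return_path = str(msg["Return-Path"])
body = msg.get_content()

print(subject)      # Account notice
print(return_path)  # <bounce@example.com>
```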
### Key Functions

- `remove_html_tags()`: Cleans HTML content from the email body
- `unfold_headers()`: Properly unfolds email headers according to RFC 5322
- `remove_x_headers()`: Removes X-header fields
- `get_email_body_from_string()`: Extracts email components
- `truncate_text()`: Truncates text while preserving word boundaries
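Word-boundary truncation like `truncate_text()` could look like the following. This is an illustrative reimplementation, not the module's actual code:

```python
def truncate_at_word(text: str, max_len: int) -> str:
    """Truncate text to at most max_len characters without splitting a word."""
    if len(text) <= max_len:
        return text
    cut = text[:max_len]
    # Drop the trailing partial word, if any
    if " " in cut:
        cut = cut.rsplit(" ", 1)[0]
    return cut


print(truncate_at_word("phishing emails often spoof senders", 20))  # phishing emails
```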
### Usage

```python
from phishingtest_gguf_model import get_email_body_from_string

# Process a raw email string
subject, body, sender, return_path = get_email_body_from_string(raw_email_string)
```

## Installation

- Clone the repository
- Install required packages:
  ```shell
  pip install llama-cpp-python beautifulsoup4
  ```

## Model Setup

The system uses a GGUF model file named `phishingmodel.gguf`. Make sure to:
- Place the model file in the project directory
- Ensure the model file is compatible with llama-cpp-python
- Verify the model has been trained for phishing detection tasks
## Output Format

The system outputs results in JSON format:

```json
{
  "classification": "Malicious|Suspicious|Benign",
  "percentage": "0.0-1.0",
  "explanation": "Brief explanation",
  "reasons": ["reason1", "reason2", "reason3"]
}
```

## License

[Add your license information here]
## Contributing

[Add contribution guidelines here]