IEEE-CIS-Fraud-detection

A fraud detection model built with XGBoost on the IEEE‑CIS dataset, using unique cardholder identifiers and feature engineering. The pipeline normalizes temporal features, applies frequency encoding and group aggregations, and produces a submission file for the Kaggle competition.

Project Overview

The core of this solution is the identification of unique cardholders (UIDs) and the aggregation of their transaction behavior over time. By normalizing temporal features and analyzing transaction patterns, the model can effectively distinguish between legitimate users and fraudulent actors.
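As a rough illustration of the idea, the sketch below derives a cardholder UID on a toy DataFrame. The column choice (`card1`, `addr1`, `D1`, `TransactionDT`) and the helper name `add_uid` are assumptions made for this example. The exact features combined in xgb_magic_model.py are not listed in this README and may differ.

```python
import numpy as np
import pandas as pd

SECONDS_PER_DAY = 86_400  # TransactionDT is a time offset in seconds


def add_uid(df: pd.DataFrame) -> pd.DataFrame:
    """Return a copy of df with a normalized D1 column and a cardholder UID.

    Hypothetical helper: the columns used here are an assumption,
    not necessarily what the training script does.
    """
    out = df.copy()
    # Convert the transaction timestamp to a day index.
    day = out["TransactionDT"] / SECONDS_PER_DAY
    # D1 is a delta in days relative to the transaction; subtracting it from
    # the transaction day gives a reference day that stays fixed for a card.
    out["D1n"] = np.floor(day - out["D1"])
    # Concatenate card, address, and the reference day into one string key.
    # Missing values become the string "nan" and are grouped together.
    out["uid"] = (
        out["card1"].astype(str)
        + "_" + out["addr1"].astype(str)
        + "_" + out["D1n"].astype(str)
    )
    return out


toy = pd.DataFrame({
    "TransactionDT": [86_400 * 10, 86_400 * 15, 86_400 * 15],
    "D1": [3.0, 8.0, 0.0],
    "card1": [1234, 1234, 1234],
    "addr1": [100.0, 100.0, 100.0],
})
result = add_uid(toy)
print(result[["D1n", "uid"]])
```

In this toy data the first two rows share a UID because both point back to the same reference day, while the third row has the same card and address but a different reference day, so it is treated as a different cardholder.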

Key Features

- D-Column Normalization: Converts the D columns, which hold time deltas relative to each transaction, into absolute points in time so that the values stay stable across a card's transactions.
- Cardholder UID Creation: Combines multiple card and address features to track individual credit cards.
- Advanced Encodings:
  - Frequency Encoding for high-cardinality features.
  - Group Aggregations (Mean, Std, Nunique) based on cardholder UIDs.
- Optimized Pipeline: Adds new feature columns with pd.concat instead of inserting them one at a time, which avoids DataFrame fragmentation and improves performance.
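The encodings above can be sketched as follows. This is a minimal example on toy data, not the code from xgb_magic_model.py: the helper names `frequency_encode` and `group_aggregate`, the output column naming, and the use of relative frequencies rather than raw counts are all assumptions.

```python
import pandas as pd


def frequency_encode(df: pd.DataFrame, cols: list) -> pd.DataFrame:
    """Map each value to its relative frequency in the column (hypothetical helper)."""
    encoded = {
        # Missing values are left as NaN in the encoded column.
        f"{col}_FE": df[col].map(df[col].value_counts(normalize=True))
        for col in cols
    }
    return pd.DataFrame(encoded, index=df.index)


def group_aggregate(df, value_cols, group_col, aggs=("mean", "std", "nunique")):
    """Broadcast per-group statistics back onto every row (hypothetical helper)."""
    new_cols = {}
    for col in value_cols:
        grouped = df.groupby(group_col)[col]
        for agg in aggs:
            new_cols[f"{col}_{group_col}_{agg}"] = grouped.transform(agg)
    return pd.DataFrame(new_cols, index=df.index)


toy = pd.DataFrame({
    "uid": ["a", "a", "a", "b"],
    "TransactionAmt": [10.0, 20.0, 30.0, 50.0],
    "P_emaildomain": ["x.com", "x.com", "y.com", "x.com"],
})

# Build all new columns first, then attach them in a single concat.
# Assigning many columns one by one can trigger pandas fragmentation warnings.
features = pd.concat(
    [
        toy,
        frequency_encode(toy, ["P_emaildomain"]),
        group_aggregate(toy, ["TransactionAmt"], "uid"),
    ],
    axis=1,
)
print(features)
```

Note that the standard deviation is NaN for a UID with a single transaction (`"b"` here). XGBoost accepts missing values natively, so such gaps can be left in place.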

Kaggle Results

The model's result on the Kaggle leaderboard is shown in result.png.

Getting Started

Prerequisites

Python 3.x
pandas
numpy
xgboost
scikit-learn

Usage

Place the competition datasets (train_transaction.csv, train_identity.csv, test_transaction.csv, test_identity.csv) in the root directory.
Run the training script:
```
python xgb_magic_model.py
```
The script will generate a submission_xgb_magic.csv file ready for Kaggle submission.
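For orientation, the sketch below shows how the transaction and identity tables relate and what the submission file contains. It uses small in-memory DataFrames instead of the real CSV files, and the helper names `merge_identity` and `build_submission` are hypothetical. The script's actual loading code may be organized differently.

```python
import pandas as pd


def merge_identity(transactions: pd.DataFrame, identity: pd.DataFrame) -> pd.DataFrame:
    """Attach identity columns to transactions (hypothetical helper).

    A left join keeps every transaction; identity columns are NaN for
    transactions that have no identity record.
    """
    return transactions.merge(identity, on="TransactionID", how="left")


def build_submission(transaction_ids, fraud_scores) -> pd.DataFrame:
    """Assemble the two-column frame the competition expects (hypothetical helper)."""
    return pd.DataFrame({
        "TransactionID": list(transaction_ids),
        "isFraud": list(fraud_scores),  # predicted fraud probability
    })


# Toy stand-ins for test_transaction.csv and test_identity.csv.
transactions = pd.DataFrame({
    "TransactionID": [1, 2, 3],
    "TransactionAmt": [10.0, 25.5, 7.0],
})
identity = pd.DataFrame({"TransactionID": [2], "DeviceType": ["mobile"]})

merged = merge_identity(transactions, identity)
# Placeholder scores; in the real pipeline these come from the trained model.
submission = build_submission(merged["TransactionID"], [0.01, 0.80, 0.05])
print(submission)
# With real data, the frame would be saved with:
# submission.to_csv("submission_xgb_magic.csv", index=False)
```

With the real files, the toy frames would be replaced by `pd.read_csv("test_transaction.csv")` and `pd.read_csv("test_identity.csv")`.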

Model Report

A detailed technical report of the model architecture and feature engineering process can be found in XGBoost_Model_Report.md.
