This project is an Intrusion Detection System (IDS) using machine learning (ML) and deep learning (DL) to detect network intrusions. It leverages the CICIDS2018 dataset to classify traffic as normal or malicious. Key features include data preprocessing, model training, hyperparameter tuning, and Docker containerization for scalable deployment.


Intrusion Detection System (IDS): The "Accuracy Paradox" Case Study

Python · TensorFlow · Docker · License: MIT


🚀 MAJOR UPDATE – November 2025

This repository now hosts the code for our paper:
"A Reproducible and Explainable Intrusion Detection System: A Case Study on the 'Accuracy Paradox'".

🔑 Key Findings

  • XGBoost outperforms CNN+LSTM by 19.5× in throughput
  • Accuracy Paradox: 98.43% Accuracy but 0% Recall on critical attacks like Infiltration
  • Full Reproducibility: The entire pipeline now runs from end to end with a single script

Stay tuned!


Table of Contents

  • Overview
  • System Architecture
  • Project Structure
  • Key Features
  • Performance & The Accuracy Paradox
  • Installation
  • Usage & Reproducibility
  • Docker Setup
  • Future Work
  • Contributing
  • License

Overview

This project implements an end-to-end, reproducible MLOps pipeline for Network Intrusion Detection.
We benchmark two models on the CICIDS2018 dataset:

  • XGBoost (tabular ML, CPU)
  • CNN+LSTM (deep learning, GPU)

Unlike typical IDS implementations, this work focuses on:

  • Reproducibility
  • Explainability

We integrate SHAP (Shapley Additive Explanations) to diagnose how models fail on minority classes even when reporting 98%+ accuracy.
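
To make the XAI workflow concrete, the snippet below sketches how SHAP values can be computed for a tree model. It is a minimal, self-contained example on synthetic data, not the repository's run_xai.py; the toy dataset, model hyperparameters, and plot choice are placeholders.

# Minimal SHAP sketch on synthetic data (illustrative only; run_xai.py is authoritative).
import shap
import xgboost as xgb
from sklearn.datasets import make_classification

# Toy imbalanced dataset standing in for preprocessed flow features.
X, y = make_classification(n_samples=2000, n_features=10, weights=[0.95, 0.05], random_state=42)

model = xgb.XGBClassifier(n_estimators=100, max_depth=4)
model.fit(X, y)

# TreeExplainer is efficient for tree ensembles such as XGBoost.
explainer = shap.TreeExplainer(model)
shap_values = explainer.shap_values(X)

# Summary plot: global feature importance and the direction of each feature's effect.
shap.summary_plot(shap_values, X, show=False)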

System Architecture


Project Structure

Note: Data and trained models are generated automatically via the pipeline and are not stored in the repository.

├── Dockerfile                    # Reproducible container environment
├── requirements.txt              # Dependencies
├── setup.py                      # Package installer
├── create_master_dataset.py      # Generates stratified CICIDS2018 sample (CRITICAL)
├── run_benchmarks.py             # Latency & Throughput benchmarking
├── run_xai.py                    # SHAP explainability plots
├── Fig1_XGBoost_Matrix_NEW.png   # Results image
├── FigF2_CNN_Matrix_NEW.png      # Results image
│
├── src/
│   ├── components/
│   │   ├── data_ingestion.py         # Train/Test split
│   │   ├── data_transformation.py    # Scaling, Encoding, LDA Feature Selection
│   │   ├── model_trainer.py          # XGBoost + CNN-LSTM training
│   │   └── optuna_tuner.py           # Hyperparameter optimization
│   ├── utils.py
│   ├── logger.py
│   └── exception.py
│
├── artifacts/                   # Generated models (*.pkl, *.keras)
├── dataset/                     # Generated sampled CSV
└── logs/                        # Runtime logs

Key Features

  • Statistically Valid Sampling
    create_master_dataset.py generates a memory-safe, stratified sample of the full CICIDS2018 dataset (a minimal sketch follows this list).

  • Comparative Benchmarking
    run_benchmarks.py automates latency and throughput comparisons for CPU vs GPU models.

  • Explainable AI (XAI)
    Integrated SHAP plots reveal feature-level reasoning and model bias.

  • Strict MLOps Principles
    Modular code, typed exceptions, logging, and full Dockerization.
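
The stratified sampling mentioned in the first feature can be sketched with plain pandas as shown below. The input path, the "Label" column name, and the 10% fraction are assumptions for illustration; the real logic lives in create_master_dataset.py.

# Stratified-sampling sketch (illustrative; create_master_dataset.py is authoritative).
import pandas as pd

df = pd.read_csv("dataset/cicids2018_merged.csv")  # hypothetical merged input CSV

# Sample the same fraction from every class so rare attacks keep their relative frequency.
sampled = df.groupby("Label").sample(frac=0.10, random_state=42)

sampled.to_csv("dataset/train_data_SAMPLED.csv", index=False)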


Performance & The Accuracy Paradox

Our "Golden Run" results highlight the risk of using accuracy as the primary IDS metric.


1. XGBoost Performance – Champion Model

Metric              Value
------------------  --------------------
Accuracy            98.43%
F1 Score            97.96%
Balanced Accuracy   78.11%
Throughput          185,680 samples/sec

⚠️ The Accuracy Paradox:
Despite 98.43% overall accuracy, the model achieves 0% Recall on the critical Infiltration attack class.
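
A tiny synthetic example makes the arithmetic of the paradox concrete. The numbers below are made up for illustration and are not the paper's results.

# Synthetic illustration of the accuracy paradox: a classifier that never predicts the
# rare class still scores ~98% accuracy. Numbers are illustrative only.
import numpy as np
from sklearn.metrics import accuracy_score, balanced_accuracy_score, recall_score

y_true = np.array([0] * 9800 + [1] * 200)   # 2% minority class (an Infiltration-like attack)
y_pred = np.zeros_like(y_true)              # "predict Benign for everything"

print(accuracy_score(y_true, y_pred))             # 0.98
print(recall_score(y_true, y_pred, pos_label=1))  # 0.0 -> the attack is never detected
print(balanced_accuracy_score(y_true, y_pred))    # 0.5 -> exposes the failure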

Fig 1: XGBoost confusion matrix – complete failure on minority attacks.


2. CNN+LSTM Performance – Baseline

  • Accuracy: 96.31%
  • Balanced Accuracy: 55.16% (catastrophic)
  • Throughput: 9,522 samples/sec (19.5× slower than XGBoost)

Fig 2: CNN+LSTM confusion matrix – collapses into "predict Benign for everything."


Installation

1. Clone the Repository

git clone https://github.com/MohammedSaim-Quadri/Intrusion_Detection-System.git
cd Intrusion_Detection-System

2. Create a Virtual Environment

python -m venv venv

# Windows
venv\Scripts\activate

# Mac/Linux
source venv/bin/activate

3. Install Dependencies

pip install -r requirements.txt

Usage & Reproducibility

To reproduce the Golden Run, execute the pipeline in this order:


Step 1 – Generate the Master Dataset

Download all CICIDS2018 CSV files and run:

python create_master_dataset.py

Outputs:
dataset/train_data_SAMPLED.csv


Step 2 – Data Ingestion

python src/components/data_ingestion.py

Outputs:
artifacts/train.csv
artifacts/test.csv
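
For orientation, a stratified split that would produce files like these could look as follows. The 80/20 ratio and the "Label" column name are assumptions; src/components/data_ingestion.py remains the source of truth.

# Illustrative train/test split (assumed 80/20 ratio and "Label" column).
import os
import pandas as pd
from sklearn.model_selection import train_test_split

df = pd.read_csv("dataset/train_data_SAMPLED.csv")
train_df, test_df = train_test_split(df, test_size=0.2, stratify=df["Label"], random_state=42)

os.makedirs("artifacts", exist_ok=True)
train_df.to_csv("artifacts/train.csv", index=False)
test_df.to_csv("artifacts/test.csv", index=False)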


Step 3 – Transform and Train

python src/components/data_transformation.py
python src/components/model_trainer.py

Outputs:
artifacts/model_trained.pkl (XGBoost)
artifacts/model_trained.keras (CNN+LSTM)
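
As a rough sketch of the XGBoost half of this step, the snippet below trains and pickles a model on synthetic data. The hyperparameters are placeholders, and the CNN+LSTM branch and Optuna tuning are omitted; model_trainer.py and optuna_tuner.py contain the real logic.

# Minimal XGBoost training-and-saving sketch (illustrative only).
import os
import pickle
import xgboost as xgb
from sklearn.datasets import make_classification

X_train, y_train = make_classification(n_samples=5000, n_features=20, random_state=42)

model = xgb.XGBClassifier(n_estimators=300, max_depth=6, learning_rate=0.1)
model.fit(X_train, y_train)

os.makedirs("artifacts", exist_ok=True)
with open("artifacts/model_trained.pkl", "wb") as f:
    pickle.dump(model, f)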


Step 4 – Benchmarks & XAI

python run_benchmarks.py
python run_xai.py
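
The reported latency and throughput figures come from run_benchmarks.py; the snippet below only sketches the measurement idea on a toy model, with synthetic data and arbitrary model settings.

# Throughput/latency measurement sketch (illustrative; run_benchmarks.py is authoritative).
import time
import xgboost as xgb
from sklearn.datasets import make_classification

X, y = make_classification(n_samples=50000, n_features=20, random_state=42)
model = xgb.XGBClassifier(n_estimators=100).fit(X, y)

start = time.perf_counter()
model.predict(X)
elapsed = time.perf_counter() - start

print(f"Mean latency: {elapsed / len(X) * 1e6:.2f} microseconds/sample")
print(f"Throughput:   {len(X) / elapsed:,.0f} samples/sec")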

Docker Setup

Build the Docker Image

docker build -t ids-system .

Run the Container

docker run --rm ids-system

This executes the complete pipeline inside an isolated environment.


Future Work

  • Address Class Imbalance
    Implement SMOTE/GAN-based synthetic oversampling to recover minority-class recall (see the sketch after this list).

  • Real-Time Deployment
    Connect prediction to live packet capture with CICFlowMeter.

  • Ensemble Stacking
    Combine XGBoost + CNN to capture complementary patterns.
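
A possible starting point for the class-imbalance item above is SMOTE from imbalanced-learn. The sketch below uses synthetic data and is not part of the current pipeline.

# SMOTE oversampling sketch with imbalanced-learn (a possible future direction, not in the repo).
# Oversample only the training split so synthetic samples never leak into evaluation.
import numpy as np
from imblearn.over_sampling import SMOTE
from sklearn.datasets import make_classification

X_train, y_train = make_classification(n_samples=10000, n_features=20,
                                        weights=[0.98, 0.02], random_state=42)

X_res, y_res = SMOTE(random_state=42).fit_resample(X_train, y_train)

print(np.bincount(y_train))  # heavily imbalanced
print(np.bincount(y_res))    # classes balanced after oversampling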


Contributing

Contributions are welcome!
Please fork the repository and submit a pull request.

git checkout -b feature/MyFeature
git commit -m "Add MyFeature"
git push origin feature/MyFeature

License

This project is licensed under the MIT License.
See the LICENSE file for details.
