Performance of Machine Learning Classifiers for Anomaly Detection in Cyber Security Applications

Code for submitted paper for the 2025 International Conference on Advances in Computing Research (ACR’25) which will take place in Nice, France, July 7-9, 2025.

Markus Haug*, Gissel Velarde
IU International University of Applied Science, Erfurt, 99084, Germany
markus.haug@iu-study.org, gissel.velarde@iu.org

Instructions

Clone the repository
Create a new virtual environment

python3 -m venv venv
source venv/bin/activate

Install the dependencies

pip install -r requirements.txt

Open the Jupyter notebooks

Cite this work

If you use this code or reference the results in your research, please cite the following publication:

@inproceedings{10.1007/978-3-031-87647-9_25,
  abstract = {This work empirically evaluates machine learning models on two imbalanced public datasets (KDDCUP99 and Credit Card Fraud 2013). The method includes data preparation, model training, and evaluation, using an 80/20 (train/test) split. Models tested include eXtreme Gradient Boosting (XGB), Multi Layer Perceptron (MLP), Generative Adversarial Network (GAN), Variational Autoencoder (VAE), and Multiple-Objective Generative Adversarial Active Learning (MO-GAAL), with XGB and MLP further combined with Random-Over-Sampling (ROS) and Self-Paced-Ensemble (SPE). Evaluation involves 5-fold cross-validation and imputation techniques (mean, median, and IterativeImputer) with 10, 20, 30, and 50 {\%} missing data. Findings show XGB and MLP outperform generative models. IterativeImputer results are comparable to mean and median, but not recommended for large datasets due to increased complexity and execution time. The code used is publicly available on GitHub (github.com/markushaug/acr-25).},
  address = {Cham},
  author = {Haug, Markus and Velarde, Gissel},
  booktitle = {Proceedings of the Third International Conference on Advances in Computing Research (ACR'25)},
  editor = {Daimi, Kevin and Al Sadoon, Abeer},
  isbn = {978-3-031-87647-9},
  pages = {285--294},
  publisher = {Springer Nature Switzerland},
  title = {Performance of Machine Learning Classifiers for Anomaly Detection in Cyber Security Applications},
  year = {2025}
}

Name		Name	Last commit message	Last commit date
Latest commit History 24 Commits
.vsconfig		.vsconfig
data/creditcard		data/creditcard
models		models
.gitignore		.gitignore
README.md		README.md
cm_kdd_vae.png		cm_kdd_vae.png
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Performance of Machine Learning Classifiers for Anomaly Detection in Cyber Security Applications

Instructions

Cite this work

About

Languages

markushaug/acr-25

Folders and files

Latest commit

History

Repository files navigation

Performance of Machine Learning Classifiers for Anomaly Detection in Cyber Security Applications

Instructions

Cite this work

About

Topics

Resources

Stars

Watchers

Forks

Languages