Data Generative Model to Detect the Anomalies for IDS Imbalance CICIDS2017 Dataset Cover Image

Data Generative Model to Detect the Anomalies for IDS Imbalance CICIDS2017 Dataset
Data Generative Model to Detect the Anomalies for IDS Imbalance CICIDS2017 Dataset

Author(s): Azhari Shouni Barkah, Siti Rahayu Selamat, Zaheera Zainal Abidin, Rizki Wahyudi
Subject(s): Electronic information storage and retrieval, ICT Information and Communications Technologies
Published by: UIKTEN - Association for Information Communication Technology Education and Science
Keywords: DGM; CGAN; imbalance data; IDS dataset

Summary/Abstract: The system of intrusion detection dataset enables machine learning to recognize attack activity in the network. The intrusion, however, is naturally imbalanced, most of the traffic is normal traffic. Moreover, a certain attack is more popular than others. Therefore, the number of cases is highly imbalanced with the majority of attacks dominated by Distributed Denial of Services (DDoS), Denial of Service Hulk (DoS_Hulk), and PortScan more than 4.5% of attacks data. While the minority attack such as DoS_goldeneye, DoS_slowloris, DoS_slowhttptest, Web Attacks, Infiltration, Bot, and Heartbleed was only recorded in less than 1% of attack data. We propose data generative model (DGM) using the Conditional Generative Adversarial Network (CGAN) to improve the class of minorities of the IDS dataset. In this study, we tested the performance of the Data Generative Model based on CGAN in the CICIDS2017 dataset. There are new attacks in this dataset, including Bot, Web_attacks, Infiltration and Heartbleed. According to our experiments, the model successfully detect new attacks and improves the weighted f1-score by 99,92% compared to that of achievers by existing methods using the CICIDS2017 dataset.

  • Issue Year: 12/2023
  • Issue No: 1
  • Page Range: 80-89
  • Page Count: 10
  • Language: English
Toggle Accessibility Mode