Loughborough University
Leicestershire, UK
LE11 3TU
+44 (0)1509 263171
Loughborough University

Loughborough University Institutional Repository

Please use this identifier to cite or link to this item: https://dspace.lboro.ac.uk/2134/17479

Title: Automatic dataset labelling and feature selection for intrusion detection systems
Authors: Aparicio-Navarro, Francisco J.
Kyriakopoulos, Konstantinos G.
Parish, David J.
Keywords: Automatic labelling
Feature selection
Genetic algorithm
Network traffic labelling
Unsupervised anomaly IDS
Issue Date: 2014
Publisher: © IEEE
Citation: APARICIO-NAVARRO, F.J., KYRIAKOPOULOS, K.G. and PARISH, D.J., 2014. Automatic dataset labelling and feature selection for intrusion detection systems. IN: Proceedings of the IEEE Military Communications Conference MILCOM 2014, pp. 46 - 51.
Abstract: Correctly labelled datasets are commonly required. Three particular scenarios are highlighted, which showcase this need. When using supervised Intrusion Detection Systems (IDSs), these systems need labelled datasets to be trained. Also, the real nature of the analysed datasets must be known when evaluating the efficiency of the IDSs when detecting intrusions. Another scenario is the use of feature selection that works only if the processed datasets are labelled. In normal conditions, collecting labelled datasets from real networks is impossible. Currently, datasets are mainly labelled by implementing off-line forensic analysis, which is impractical because it does not allow real-time implementation. We have developed a novel approach to automatically generate labelled network traffic datasets using an unsupervised anomaly based IDS. The resulting labelled datasets are subsets of the original unlabelled datasets. The labelled dataset is then processed using a Genetic Algorithm (GA) based approach, which performs the task of feature selection. The GA has been implemented to automatically provide the set of metrics that generate the most appropriate intrusion detection results.
Description: © 2014 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.
Sponsor: This work was supported by the Engineering and Physical Sciences Research Council (EPSRC) Grant number EP/K014307/1 and the MOD University Research Collaboration in Signal Processing.
Version: Accepted for publication
DOI: 10.1109/MILCOM.2014.17
URI: https://dspace.lboro.ac.uk/2134/17479
Publisher Link: http://dx.doi.org/10.1109/MILCOM.2014.17
ISBN: 9781479967704
Appears in Collections:Published Articles (Mechanical, Electrical and Manufacturing Engineering)

Files associated with this item:

File Description SizeFormat
Automatic Dataset Labelling and Feature Selection for Intrusion Detection Systems.pdfAccepted version618.41 kBAdobe PDFView/Open


SFX Query

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.