ArXiv | 2021

Active Learning for Network Traffic Classification: A Technical Survey

 
 
 
 

Abstract


[Note: This work has been submitted to the IEEE Transactions on Cognitive Communications and Networking journal for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible ] Abstract—Network Traffic Classification (NTC) has become an important component in a wide variety of network management operations, e.g., Quality of Service (QoS) provisioning and security purposes. Machine Learning (ML) algorithms as a common approach for NTC methods can achieve reasonable accuracy and handle encrypted traffic. However, ML-based NTC techniques suffer from the shortage of labeled traffic data which is the case in many real-world applications. This study investigates the applicability of an active form of ML, called Active Learning (AL), which reduces the need for a high number of labeled examples by actively choosing the instances that should be labeled. The study first provides an overview of NTC and its fundamental challenges along with surveying the literature in the field of using ML techniques in NTC. Then, it introduces the concepts of AL, discusses it in the context of NTC, and review the literature in this field. Further, challenges and open issues in the use of AL for NTC are discussed. Additionally, as a technical survey, some experiments are conducted to show the broad applicability of AL in NTC. The simulation results show that AL can achieve high accuracy with a small amount of data.

Volume abs/2106.06933
Pages None
DOI 10.1109/TCCN.2021.3119062
Language English
Journal ArXiv

Full Text