Publication | Open Access
ET-BERT: A Contextualized Datagram Representation with Pre-training Transformers for Encrypted Traffic Classification
377
Citations
36
References
2022
Year
Convolutional Neural NetworkInternet Traffic AnalysisEncrypted Traffic ClassificationMachine LearningEngineeringEncrypted TrafficInformation SecurityPre-training TransformersData SciencePattern RecognitionTraffic PredictionContextualized Datagram RepresentationVideo TransformerComputer ScienceDeep LearningTraffic MonitoringTraffic ClassificationData SecurityCryptographyRobust Traffic RepresentationNetwork Traffic Measurement
Encrypted traffic classification requires discriminative and robust traffic representation captured from content-invisible and imbalanced traffic data for accurate classification, which is challenging but indispensable to achieve network security and network management. The major limitation of existing solutions is that they highly rely on the deep features, which are overly dependent on data size and hard to generalize on unseen data. How to leverage the open-domain unlabeled traffic data to learn representation with strong generalization ability remains a key challenge. In this paper, we propose a new traffic representation model called Encrypted Traffic Bidirectional Encoder Representations from Transformer (ET-BERT), which pre-trains deep contextualized datagram-level representation from large-scale unlabeled data. The pre-trained model can be fine-tuned on a small number of task-specific labeled data and achieves state-of-the-art performance across five encrypted traffic classification tasks, remarkably pushing the F1 of ISCX-VPN-Service to 98.9% (5.2%↑), Cross-Platform (Android) to 92.5% (5.4%↑), CSTNET-TLS 1.3 to 97.4% (10.0%↑). Notably, we provide explanation of the empirically powerful pre-training model by analyzing the randomness of ciphers. It gives us insights in understanding the boundary of classification ability over encrypted traffic. The code is available at: https://github.com/linwhitehat/ET-BERT.
| Year | Citations | |
|---|---|---|
Page 1
Page 1