An Architecture for Payload Based Network Traffic Classification Via Super Resolution and Transfer Learning
نام عام مواد
[Thesis]
نام نخستين پديدآور
Muhammad, Waqar
نام ساير پديدآوران
Esposito, Flavio
وضعیت نشر و پخش و غیره
نام ناشر، پخش کننده و غيره
Saint Louis University
تاریخ نشرو بخش و غیره
2020
يادداشت کلی
متن يادداشت
48 p.
یادداشتهای مربوط به پایان نامه ها
جزئيات پايان نامه و نوع درجه آن
M.S.
کسي که مدرک را اعطا کرده
Saint Louis University
امتياز متن
2020
یادداشتهای مربوط به خلاصه یا چکیده
متن يادداشت
In modern data centers, a single high-speed computer network generateshundreds of gigabytes of flow statistics per day. These flows are often sampled and analyzed for many network management purposes, such as network debugging, traffic classification, and anomaly detection. Several of these DevOps operations are automated with modern machine learning and deep learning algorithms for faster and more accurate decision making. Regardless of the problem being solved, training any machine learning or deep learning algorithm over a large amount of data may consume a massive amount of resources. In computer network management, it is unfeasible to create these models from scratch for every network topology (the dynamic graph representing the servers and connections) since such topologies are different, and using data or model collected on one network is not always useful for another network. To overcome the problem of re-training, we propose a software architecture to avoid the need for training a machine learning model from scratch for every network topology but instead utilizing the models already created on a massive amount of data and having good performance using transfer learning. In particular, we design architecture to solve the problem of network traffic classification with transfer learning efficiently. Our architecture also includes a super-resolution component to increase the created model's performance by converting the low-resolution input of a small network topology into high-resolution. To test our architecture, we created a traffic matrix from our own generated dataset based on a topology of 50 nodes through the Mininet network emulator. Each router sends a data packet to every other router in the topology. We classified and assigned a label (low-payload) 0 or (high-payload) 1 to each router-to-router communication based on a flow duration at a certain threshold. Our study results indicate that applying transfer learning can increase the accuracy performance of the base model. Still, it can also lead to a severely over-fitted model if we do not have enough training data available.
اصطلاحهای موضوعی کنترل نشده
اصطلاح موضوعی
Computer science
اصطلاح موضوعی
Educational technology
نام شخص به منزله سر شناسه - (مسئولیت معنوی درجه اول )