Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • harvard1
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
A clustering-based analysis of DPI-labeled video flow characteristics in cellular networks
Karlstad University, Faculty of Health, Science and Technology (starting 2013), Department of Mathematics and Computer Science (from 2013). (DIstributed systems and communication (DISCO))ORCID iD: 0000-0003-3461-7079
2017 (English)In: Proceedings of the IM 2017 - 2017 IFIP/IEEE International Symposium on Integrated Network Management, New York: IEEE, 2017, p. 1-4Conference paper, Published paper (Refereed)
Abstract [en]

Using a specially instrumented deep packet inspection (DPI) appliance placed inside the core network of a commercial cellular operator we collect data from almost four million flows produced by a `heavy-hitter' subset of the customer base. The data contains per packet information for the first 100 packets in each flow, along with the classification done by the DPI engine. The data is used with unsupervised learning to obtain clusters of typical video flow behaviors, with the intent to quantify the number of such clusters and examine their characteristics. Among the flows identified as belonging to video applications by the DPI engine, a subset are actually video application signaling flows or other flows not carrying actual transfers of video data. Given that DPI-labeled data can be used to train supervised machine learning models to identify flows carrying video transfers in encrypted traffic, the potential presence and structure of such `noise' flows in the ground truth is important to examine. In this study K-means and DBSCAN is used to cluster the flows marked by the DPI engine as being from a video application. The clustering techniques identify a set of 4 to 6 clusters with archetypal flow behaviors, and a subset of these clusters are found to represent flows that are not actually transferring video data.

Place, publisher, year, edition, pages
New York: IEEE, 2017. p. 1-4
National Category
Computer Sciences
Research subject
Computer Science
Identifiers
URN: urn:nbn:se:kau:diva-64540DOI: 10.23919/INM.2017.7987420ISBN: 978-3-901882-89-0 (electronic)OAI: oai:DiVA.org:kau-64540DiVA, id: diva2:1149177
Conference
Integrated Network and Service Management (IM), 2017 IFIP/IEEE Symposium 8-12 May 2017, Lisbon, Portugal
Projects
HITS
Funder
Knowledge Foundation, 4707Available from: 2017-10-13 Created: 2017-10-13 Last updated: 2018-08-14Bibliographically approved

Open Access in DiVA

fulltext(1200 kB)6 downloads
File information
File name FULLTEXT01.pdfFile size 1200 kBChecksum SHA-512
d8e9fad558fcc8c312aa07dd9130e56799b86ae56d699e277de6ba882752301150b0d855aaa8d25a10a706f46cd23fc660c911560a731f8410e2588975dd705f
Type fulltextMimetype application/pdf

Other links

Publisher's full text

Search in DiVA

By author/editor
Garcia, Johan
By organisation
Department of Mathematics and Computer Science (from 2013)
Computer Sciences

Search outside of DiVA

GoogleGoogle Scholar
Total: 6 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

doi
isbn
urn-nbn

Altmetric score

doi
isbn
urn-nbn
Total: 116 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • harvard1
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf