Using Multinomial Mixture Models to Cluster Internet Traffic |
| |
Authors: | Murray Jorgensen |
| |
Institution: | Dept of Statistics, University of Waikato, Hamilton, New Zealand |
| |
Abstract: | The paper considers the clustering of two large sets of Internet traffic data consisting of information measured from headers of transmission control protocol packets collected on a busy arc of a university network connecting with the Internet. Packets are grouped into 'flows' thought to correspond to particular movements of information between one computer and another. The clustering is based on representing the flows as each sampled from one of a finite number of multinomial distributions and seeks to identify clusters of flows containing similar packet‐length distributions. The clustering uses the EM algorithm, and the data‐analytic and computational details are given. |
| |
Keywords: | EM algorithm Internet traffic data mixture model multinomial distribution packet-length distribution |
|
|