Publication | Closed Access
A NEW TRAFFIC MODEL FOR CURRENT USER WEB BROWSING BEHAVIOR
Citations: 39 | References: 4 | Year: 2007
Unknown Venue
Access Log Analysis, Internet Traffic Analysis, Engineering, Information Retrieval, Network Traffic Patterns, Data Science, Squid Proxy Log, Web Performance, HTTP Traffic Models, Web Trend, Internet Modeling, Computer Science, Network Traffic Measurement, Web Analytics
HTTP traffic models are widely used to model user web browsing behaviour, so it is important that such a model be representative of a large variety of traffic and be continually updated to reflect the constantly evolving nature of web content and the exponential growth in the number of users. In this paper, we analyzed an extensive set of proxy web server logs to understand changes in network traffic patterns. We found significant gaps in the previously proposed methods, the major one being that it is almost impossible to distinguish a web request generated by a user click from one generated by embedded scripts and frames. As a result, we modified the definition of a web request boundary. Because of the large numbers of embedded objects from several different off-site sources, which cannot be traced back to the original request by following the source addresses in TCP/IP headers alone, newer heuristics need to be devised. We present our methodology for analyzing the Squid proxy log in a way that preserves user privacy, and propose a new HTTP traffic model and traffic generator to represent current user web browsing behaviour. Comparison of independent statistics from the trace and the model shows a fair match.
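The abstract does not spell out the paper's boundary definition or its anonymization scheme, so the following is only a minimal sketch of the general idea, not the authors' method. It assumes Squid's native access.log layout (timestamp, elapsed, client, action/code, size, method, URL, ident, hierarchy, content type), replaces each client address with a keyed hash so that raw IPs never leave the parser, and groups a client's entries into one "web request" whenever consecutive entries are separated by less than an idle gap. The names `anonymize`, `parse_line`, `group_web_requests` and the 1-second `IDLE_GAP_S` threshold are hypothetical choices made for illustration.

```python
import hashlib
import hmac
from collections import defaultdict

# Hypothetical threshold: entries closer together than this are treated as
# belonging to the same web request (a page plus its embedded objects).
IDLE_GAP_S = 1.0


def anonymize(client_ip: str, key: bytes) -> str:
    """Replace a client address with a keyed hash (a pseudonym)."""
    return hmac.new(key, client_ip.encode(), hashlib.sha256).hexdigest()[:12]


def parse_line(line: str):
    """Parse one line in Squid's native log layout.

    Returns (timestamp, client_ip, url), or None if the line is too short.
    """
    fields = line.split()
    if len(fields) < 10:
        return None
    return float(fields[0]), fields[2], fields[6]


def group_web_requests(lines, key: bytes, idle_gap: float = IDLE_GAP_S):
    """Group each pseudonymous client's log entries by idle time.

    Returns {pseudonym: [[(timestamp, url), ...], ...]}, where each inner
    list is one candidate web request.
    """
    per_client = defaultdict(list)
    for line in lines:
        parsed = parse_line(line)
        if parsed is None:
            continue
        ts, ip, url = parsed
        per_client[anonymize(ip, key)].append((ts, url))

    grouped = {}
    for client, events in per_client.items():
        events.sort()  # order by timestamp
        groups = [[events[0]]]
        for prev, cur in zip(events, events[1:]):
            if cur[0] - prev[0] > idle_gap:
                groups.append([cur])    # long pause: start a new web request
            else:
                groups[-1].append(cur)  # short pause: same web request
        grouped[client] = groups
    return grouped
```

Calling `group_web_requests(open("access.log"), key=b"secret")` would return one list of candidate web requests per pseudonymous client. A pure idle-time rule is exactly the kind of heuristic that script-generated and off-site requests can defeat, which is why the threshold here should be read as a placeholder rather than a recommended value.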