Publication | Closed Access
A Methodology for Sampling the World Wide Web
24
Citations
3
References
2001
Year
EngineeringSemantic WebWeb AnalyticsTimely StatisticsComputational Social ScienceInformation RetrievalData ScienceIp AddressesDescriptive StatisticsLanguage StudiesContent AnalysisStatisticsKnowledge DiscoveryWebometricsWeb ScienceWeb MiningWeb PerformanceWeb Information SystemSurvey Methodology
Abstract The rapid growth in the number of libraries providing Web access services has created a need for reliable, timely statistics characterizing the content of Web-accessible information. The size of the Web makes it impractical to develop descriptive statistics based on an exhaustive survey. An alternative approach is to collect a representative sample of Web pages. This report describes a methodology for sampling the content of the Web through the use of randomly generated IP addresses.
| Year | Citations | |
|---|---|---|
Page 1
Page 1