Concepedia

Publication | Closed Access

SVMs for the Blogosphere: Blog Identification and Splog Detection

154

Citations

9

References

2006

Year

Abstract

Weblogs, or blogs have become an important new way to publish information, engage in discussions and form communities. The increasing popularity of blogs has given rise to search and analysis engines focusing on the “blogosphere”. A key requirement of such systems is to identify blogs as they crawl the Web. While this ensures that only blogs are indexed, blog search engines are also often overwhelmed by spam blogs (splogs). Splogs not only incur computational overheads but also reduce user satisfaction. In this paper we first describe experimental results of blog identification using Support Vector Ma-chines (SVM). We compare results of using different feature sets and introduce new features for blog iden-tification. We then report preliminary results on splog detection and identify future work.

References

YearCitations

Page 1