Publication | Closed Access
Blog search and mining in the business domain
32
Citations
13
References
2007
Year
Unknown Venue
EngineeringBusiness IntelligenceBlog SearchSemantic WebJournalismText MiningNatural Language ProcessingInformation RetrievalData ScienceData MiningContent AnalysisSocial Medium MiningSearch TechnologyKnowledge DiscoveryTerminology ExtractionLatent Semantic AnalysisSearch Engine DesignWeb MiningBusiness BlogsTopic ModelKeyword ExtractionEnterprise SearchArts
Weblogs, or blogs, have rapidly gained in popularity over the past few years. In particular, the growth of business blogs written by or providing commentary on businesses and companies opens up new opportunities for developing blog-specific search and mining techniques. In this paper, we propose probabilistic models for blog search and mining using two machine learning techniques, Latent Semantic Analysis (LSA) and Probabilistic Latent Semantic Analysis (PLSA). We implement the models in our database of business blogs, with the aim of achieving higher precision and recall. The probabilistic model is able to segment the business blogs into separate topic areas, which is useful for keywords detection on the blogosphere. Various term-weighting schemes and factor values were also studied in detail, which reveal interesting patterns in our database of business blogs. From our study, we can uncover domain-driven data mining techniques that can better strengthen business intelligence in complex enterprise applications.
| Year | Citations | |
|---|---|---|
Page 1
Page 1