Publication | Closed Access
Parallel algorithms for mining large-scale rich-media data
41
Citations
7
References
2009
Year
Unknown Venue
Cluster ComputingEngineeringMachine LearningUnsupervised Machine LearningParallel AlgorithmsText MiningInformation RetrievalData ScienceData MiningPattern RecognitionOnline PhotosSupport Vector MachinesContent AnalysisMultimedia MiningDocument ClusteringKnowledge DiscoveryComputer ScienceBig Data SearchLarge Scale ProblemsSimilarity SearchMassive Data ProcessingBig Data
The amount of online photos and videos is now at the scale of tens of billions. To organize, index, and retrieve these large-scale rich-media data, a system must employ scalable data management and mining algorithms. The research community needs to consider solving large scale problems rather than solving problems with small datasets that do not reflect real life scenarios. This tutorial introduces key challenges in large-scale rich-media data mining, and presents parallel algorithms for tackling such challenges. We present our parallel implementations of Spectral Clustering (PSC), FP-Growth (PFP), Latent Dirichlet Allocation (PLDA), and Support Vector Machines (PSVM).
| Year | Citations | |
|---|---|---|
Page 1
Page 1