Publication | Closed Access
DiFacto
53
Citations
20
References
2016
Year
Unknown Venue
Frequency Adaptive RegularizationDistributed SubgradientsEngineeringInformation RetrievalData ScienceData MiningMachine LearningFactorization MachinesSparse Neural NetworkMatrix FactorizationKnowledge DiscoveryMixture Of ExpertCold-start ProblemComputer ScienceDeep LearningCollaborative Filtering
Factorization Machines offer good performance and useful embeddings of data. However, they are costly to scale to large amounts of data and large numbers of features. In this paper we describe DiFacto, which uses a refined Factorization Machine model with sparse memory adaptive constraints and frequency adaptive regularization. We show how to distribute DiFacto over multiple machines using the Parameter Server framework by computing distributed subgradients on minibatches asynchronously. We analyze its convergence and demonstrate its efficiency in computational advertising datasets with billions examples and features.
| Year | Citations | |
|---|---|---|
Page 1
Page 1