DiFacto

Mu Li, Ziqi Liu, Alexander J. Smola, Yu-Xiang Wang
2016 Proceedings of the Ninth ACM International Conference on Web Search and Data Mining - WSDM '16  
Factorization Machines o↵er good performance and useful embeddings of data. However, they are costly to scale to large amounts of data and large numbers of features. In this paper we describe DiFacto, which uses a refined Factorization Machine model with sparse memory adaptive constraints and frequency adaptive regularization. We show how to distribute DiFacto over multiple machines using the Parameter Server framework by computing distributed subgradients on minibatches asynchronously. We
more » ... ze its convergence and demonstrate its efficiency in computational advertising datasets with billions examples and features.
doi:10.1145/2835776.2835781 dblp:conf/wsdm/LiLSW16 fatcat:c7mguxqqq5gfbgukg7msbsbcpu