Regularization in directable environments with application to Tetris

Jan Malte Lichtenberg, Özgür Şimşek
2019 International Conference on Machine Learning  
Learning from small data sets is difficult in the absence of specific domain knowledge. We present a regularized linear model called STEW, which benefits from a generic and prevalent form of prior knowledge: feature directions. STEW shrinks weights toward each other, converging to an equal-weights solution in the limit of infinite regularization. We provide theoretical results on the equal-weights solution that explain how STEW can productively trade off bias and variance. Across a wide range of learning problems, including Tetris, STEW outperformed existing linear models, including ridge regression, the Lasso, and the non-negative Lasso, when feature directions were known. The model proved to be robust to unreliable (or absent) feature directions, outperforming alternative models under diverse conditions. Our results in Tetris were obtained by using a novel approach to learning in sequential decision environments based on multinomial logistic regression.
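The abstract does not state STEW's penalty, but the behavior it describes (weights shrinking toward each other, with equal weights in the limit of infinite regularization) is consistent with a pairwise squared-difference penalty. The following Python sketch illustrates that assumed form; the function name `stew_fit` and the closed-form solution are illustrative, not taken from the paper.

```python
import numpy as np

def stew_fit(X, y, lam):
    """Illustrative sketch under an assumed penalty (not from the paper):
    minimize ||y - X b||^2 + lam * sum_{i<j} (b_i - b_j)^2.
    The penalty equals b^T D b with D = p*I - 1 1^T, giving a
    ridge-like closed-form solution."""
    n, p = X.shape
    D = p * np.eye(p) - np.ones((p, p))
    return np.linalg.solve(X.T @ X + lam * D, X.T @ y)

rng = np.random.default_rng(0)
X = rng.normal(size=(200, 4))
y = X @ np.array([1.0, 0.8, 1.2, 0.9]) + rng.normal(scale=0.5, size=200)
print(stew_fit(X, y, lam=0.0))   # ordinary least squares
print(stew_fit(X, y, lam=1e6))   # coefficients nearly equal
```

Under this assumed penalty, increasing lam forces the solution into the null space of D, which is spanned by the all-ones vector; that reproduces the equal-weights limit described in the abstract.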
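The abstract only names the multinomial logistic regression approach without detailing it. One plausible reading, standard in discrete-choice modeling, is a conditional logit over the candidate actions in each state: each candidate move gets a linear score from its features, and a softmax turns scores into choice probabilities. The sketch below is a generic illustration of that construct, not the paper's implementation; all names are hypothetical.

```python
import numpy as np

def action_probs(w, Phi):
    """Phi: (K, p) array, one feature row per candidate action.
    Softmax over the linear scores Phi @ w gives choice probabilities."""
    z = Phi @ w
    z = z - z.max()          # stabilize the exponentials
    e = np.exp(z)
    return e / e.sum()

def nll_grad(w, Phi, chosen):
    """Gradient of -log P(chosen): expected features under the
    model minus the features of the chosen action."""
    return Phi.T @ action_probs(w, Phi) - Phi[chosen]

def fit(decisions, p, lr=0.1, steps=100):
    """Plain gradient descent on hypothetical (Phi, chosen) records."""
    w = np.zeros(p)
    for _ in range(steps):
        for Phi, chosen in decisions:
            w -= lr * nll_grad(w, Phi, chosen)
    return w

# Toy usage on synthetic decision records.
rng = np.random.default_rng(1)
decisions = [(rng.normal(size=(5, 3)), 0) for _ in range(20)]
print(fit(decisions, p=3))
```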