Transductive De-Noising and Dimensionality Reduction using Total Bregman Regression [chapter]

Sreangsu Acharyya
2006 Proceedings of the 2006 SIAM International Conference on Data Mining  
Our goal on one hand is to use labels or other forms of ground truth data to guide the tasks of de-noising and dimensionality reduction and balance the objectives of better prediction and better data summarization, on the other hand it is to explicitly model the noise in the feature values. We use a generalization of L 2 loss, on which PCA and K-Means are based, to the Bregman family which, as a consequence widens the applicability of the proposed algorithms to cases where the data may be
more » ... ained to lie on sets of integers or sets of labels rather than R d as in PCA or K-means formulations. This makes it possible to handle different prediction tasks such as classification and regression in an unified way. Two tasks are formulated (i) Transductive Total Bregman Regression (ii) Transductive Bregman PCA. 1 we do model the joint distribution of the output Y and input X, but through parametric assumptions on P (Y |X) rather than on P (X|Y ) that requires the handling of the partition function
doi:10.1137/1.9781611972764.51 dblp:conf/sdm/Acharyya06 fatcat:hwjkpqxmozdsvjtultizmnspfe