A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2022.
In this paper, we derive a new method for determining shared features of datasets by employing joint non-negative matrix factorization and analyzing the resulting factorizations. Our approach uses the joint factorization of two dataset matrices X_1, X_2 into non-negative matrices X_1 = AS_1, X_2 = AS_2 to derive a similarity measure that determines how well a shared basis for X_1, X_2 approximates each dataset. We also propose a dataset distance measure built upon this method and the learned

arXiv:2207.05112v1 fatcat:lby6oztm2jhcdppxxmn6b77pge
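The joint factorization described in the abstract can be illustrated with a minimal sketch. It assumes that X_1 and X_2 have the same number of rows (features) with data points stored as columns, so that a shared basis A can be fitted by factorizing the column-wise concatenation [X_1 X_2] = A [S_1 S_2]. The function names `joint_nmf` and `relative_error`, the choice of Lee-Seung multiplicative updates, and the use of relative Frobenius reconstruction error as a per-dataset fit score are all assumptions made for illustration; the excerpt does not specify the paper's algorithm or its exact similarity measure.

```python
import numpy as np


def joint_nmf(X1, X2, rank, n_iter=500, eps=1e-10, seed=0):
    """Fit X1 ~ A @ S1 and X2 ~ A @ S2 with one shared non-negative basis A.

    Hypothetical helper: uses Lee-Seung multiplicative updates on the
    column-wise concatenation of X1 and X2 (an assumption, not the paper's
    stated algorithm).
    """
    rng = np.random.default_rng(seed)
    X = np.hstack([X1, X2])          # shared rows, columns from both datasets
    n1 = X1.shape[1]
    A = rng.random((X.shape[0], rank)) + eps
    S = rng.random((rank, X.shape[1])) + eps
    for _ in range(n_iter):
        # Multiplicative updates keep A and S non-negative.
        S *= (A.T @ X) / (A.T @ A @ S + eps)
        A *= (X @ S.T) / (A @ (S @ S.T) + eps)
    return A, S[:, :n1], S[:, n1:]


def relative_error(X, A, S):
    """Relative Frobenius error of approximating X by A @ S (illustrative score)."""
    return np.linalg.norm(X - A @ S) / np.linalg.norm(X)


# Synthetic example: both datasets are generated from the same 3-column basis.
rng = np.random.default_rng(1)
A_true = rng.random((20, 3))
X1 = A_true @ rng.random((3, 30))
X2 = A_true @ rng.random((3, 40))

A, S1, S2 = joint_nmf(X1, X2, rank=3)
print("relative error on X1:", relative_error(X1, A, S1))
print("relative error on X2:", relative_error(X2, A, S2))
```

A low error on both datasets suggests the shared basis describes each of them well, while a large gap between the two errors suggests the basis favors one dataset. This is only a proxy for the similarity and distance measures the abstract refers to.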