Axiomatic geometries for text documents [chapter]

G. Lebanon, Paolo Gibilisco, Eva Riccomagno, Maria Piera Rogantin, Henry P. Wynn
Algebraic and Geometric Methods in Statistics  
High-dimensional structured data such as text and images is often poorly understood and misrepresented in statistical modelling. Typical approaches to modelling such data involve, either explicitly or implicitly, arbitrary geometric assumptions. In this chapter, we consider statistical modelling of non-Euclidean data whose geometry is obtained by embedding the data in a statistical manifold. The resulting models perform better than their Euclidean counterparts on real world data and draw an
more » ... resting connection betweenČencov and Campbell's axiomatic characterisation of the Fisher information and the recently proposed diffusion kernels and square root embedding.
doi:10.1017/cbo9780511642401.018 fatcat:jnzwrmylkfg3xmmyi63idodng4