Reading the city through its neighbourhoods: Deep text embeddings of Yelp reviews as a basis for determining similarity and change [post]

Alex Olson, Fernando Calderon-Figueroa, Olimpia Bidian, Daniel Silver, Scott Sanner
2020 unpublished
This paper develops novel methods for using Yelp reviews as a window into the collective representations of a city and its neighbourhoods. Basing analysis on social media data such as Yelp is a challenging task because review data is highly sparse and direct analysis may fail to uncover hidden trends. To this end, we propose a deep autoencoder approach for embedding the language of neighbourhood-based business reviews into a reduced dimensional space that facilitates similarity comparison of
more » ... ghbourhoods and their change over time. Our model improves performance in distinguishing real and fake neighbourhood descriptions derived from real reviews, increasing performance in the task from an average accuracy of 0.46 to 0.77. This improvement in performance indicates that this novel application of embedded language analysis permits us to uncover comparative trends in neighbourhood change through the lens of their venues' reviews, providing a computational methodology for reading a city through its neighbourhoods. The resulting toolkit makes it possible to examine a city's current sociological trends in terms of its neighbourhoods' collective identities.
doi:10.31235/osf.io/8jbvg fatcat:mndii3ucvfdtpcedj4jqr7scg4