A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2017; you can also visit the original URL.
The file type is application/pdf
.
BASIL: Effective Near-Duplicate Image Detection Using Gene Sequence Alignment
[chapter]
2010
Lecture Notes in Computer Science
In the dominance of social networks era, vast information is created and shared across the world each day. The uniqueness and the prevalence of these user-generated content present both challenges and opportunities. In this thesis, in particular, we study several tasks on mining the user-generated content with regard to textual content and link-based content. First, we study the home location estimation for Twitter users from their shared textual content. We employ Gaussian Mixture Model to
doi:10.1007/978-3-642-12275-0_22
fatcat:ou4wo4a6efdabkipzbkaxd5cyi