Search Facets and Ranking in Geospatial Dataset Search

Thomas Hervey, Sara Lafia, Werner Kuhn, Judith A. Verstegen, Krzysztof Janowicz
2020 International Conference Geographic Information Science  
This study surveys the state of search on open geospatial data portals. We seek to understand 1) what users are able to control when searching for geospatial data, 2) how these portals process and interpret a user's query, and 3) if and how user query reformulations alter search results. We find that most users initiate a search using a text input and several pre-created facets (such as a filter for tags or format). Some portals supply a map-view of data or topic explorers. To process and interpret queries, most portals use a vertical full-text search engine like Apache Solr to query data from a content-management system like CKAN. When processing queries, most portals initially filter results and then rank the remaining results using a common keyword frequency relevance metric (e.g., TF-IDF). Some portals use query expansion. We identify and discuss several recurring usability constraints across portals. For example, users are typically only given text lists to interact with search results. Furthermore, ranking is rarely extended beyond syntactic comparison of keyword similarity. We discuss several avenues for improving search for geospatial data including alternative interfaces and query processing pipelines.
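The filter-then-rank pattern described in the abstract can be illustrated with a minimal Python sketch. It is not taken from the paper or from any portal's code: the toy catalog, the field names (`format`, `tags`), and the functions `filter_by_facets`, `rank_tfidf`, and `search` are all hypothetical, and the scoring uses a plain textbook TF-IDF (raw term count times log(N/df)) rather than the tuned relevance functions of an engine such as Apache Solr.

```python
import math
from collections import Counter

# Hypothetical toy catalog; records and field names are illustrative only.
DATASETS = [
    {"id": "a", "title": "City bike lanes",
     "description": "Bike lanes and bike paths in the city",
     "format": "shp", "tags": ["transport"]},
    {"id": "b", "title": "County road network",
     "description": "Road centerlines with bike lanes attribute",
     "format": "geojson", "tags": ["transport"]},
    {"id": "c", "title": "Park boundaries",
     "description": "Boundaries of public parks",
     "format": "shp", "tags": ["recreation"]},
]


def tokenize(text):
    """Lowercase and split on whitespace (no stemming or stop words)."""
    return text.lower().split()


def filter_by_facets(datasets, fmt=None, tag=None):
    """Step 1: keep only records matching the selected facet values."""
    return [
        d for d in datasets
        if (fmt is None or d["format"] == fmt)
        and (tag is None or tag in d["tags"])
    ]


def rank_tfidf(query, candidates, corpus):
    """Step 2: score candidates by summed TF-IDF of the query terms."""
    docs = {d["id"]: tokenize(d["title"] + " " + d["description"])
            for d in corpus}
    n_docs = len(docs)
    # Document frequency: number of records containing each term.
    df = Counter(t for tokens in docs.values() for t in set(tokens))
    scored = []
    for d in candidates:
        tf = Counter(docs[d["id"]])
        score = sum(
            tf[t] * math.log(n_docs / df[t])
            for t in tokenize(query) if df[t] > 0
        )
        if score > 0:  # drop records with no keyword overlap
            scored.append((score, d))
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [d for _, d in scored]


def search(query, fmt=None, tag=None):
    """Filter by facets first, then rank what remains by keyword relevance."""
    return rank_tfidf(query, filter_by_facets(DATASETS, fmt, tag), DATASETS)


if __name__ == "__main__":
    for hit in search("bike lanes", tag="transport"):
        print(hit["id"], hit["title"])
```

Because the score depends only on keyword overlap, a record that repeats the query terms ranks above one that merely mentions them, regardless of spatial extent or meaning. This is the purely syntactic ranking that the abstract identifies as a recurring constraint.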
doi:10.4230/lipics.giscience.2021.i.5 dblp:conf/giscience/HerveyLK21 fatcat:2fnxysem55dd7lzfdukv6d5bgy