A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2019.
Advanced Topics in Information Retrieval
This chapter introduces the basic types of data sources, as well as the specific datasets and resources, that we will be working with in later chapters of the book. These may be placed on a spectrum of varying degrees of structure, from unstructured to structured data, as shown in Fig. 2.1.

Fig. 2.1 The data spectrum

On the unstructured end of the spectrum we have plain text. Typically, these are documents written in natural language.

1 As a matter of fact, almost any type of data can be converted

doi:10.1007/978-3-319-93935-3_2
fatcat:mijb4wvy2zgxvkkvbafojxp2r4