Metadata for mixed-media access

Francine Chen, Marti Hearst, Julian Kupiec, Jan Pedersen, Lynn Wilcox
1994 SIGMOD record  
In this chapter, we discuss mixed-media access, an information access paradigm for multimedia data in which the media type of a query may di er from that of the data. This allows a single query to be used to retrieve information from data consisting of multiple types of media. In addition, multiple queries formulated in di erent media types can be used to more accurately specify the data to be retrieved. The types of media considered in this paper are speech, images of text, and full-length
more » ... and full-length text. Some examples of metadata for mixed-media access are locations of keywords in speech and images, identi cation of speakers, locations of emphasized regions in speech, and locations of topic boundaries in text. Algorithms for automatically generating this metadata are described, including word spotting, speaker segmentation, emphatic speech detection, and subtopic boundary location. We illustrate the use of mixedmedia access with an example of information access from multimedia data surrounding a formal presentation.
doi:10.1145/190627.190646 fatcat:d3hvu2ynq5arfg7iky5x6ajypu