2,021 Hits in 6.7 sec

Accessing Media Via an Audio-only Communication Channel: A Log Analysis

Johanne R. Trippas, Damiano Spina, Mark Sanderson, Lawrence Cavedon
2021 CUI 2021 - 3rd Conference on Conversational User Interfaces  
Studies of interaction log analysis are a common tool to investigate behavioural data and contribute to insights into users' interaction patterns with a system [11, 18] .  ...  The results are twofold, we highlight the (i) implications for the design of future voice-enabled systems such as "infinite-reading" mode, enhanced interaction management enabling file navigation or time-compression  ...  Our analyses suggest that audio-only interactions systems are still in the early stages of their development, as reflected in the need for improvement in navigational commands, query intent recognition  ... 
doi:10.1145/3469595.3469623 fatcat:ctkcefh5urcdpckdnmh32hcqhu

Multi-modal Conversational Search for People with Intellectual Disability: An Exploratory Study

Sirinthip Roomkham, Shannon Terris, Laurianne Sitbon
2022 CHI Conference on Human Factors in Computing Systems Extended Abstracts  
Like us all, people with intellectual disability are self-motivated learning and are enthusiastic information seekers.  ...  To conduct an exploratory study, we developed a Wizard of Oz conversational multi-modal system which records user activities, including touch, text and verbal interactions.  ...  Associate Professor Laurianne Sitbon is the recipient of an Australian Research Council Australian Future Fellowship (project number FT190100855) funded by the Australian Government.  ... 
doi:10.1145/3491101.3519821 fatcat:323kwy7du5fmflkaplb5fcpz4u

User interfaces for voice applications

C. Kamm
1995 Proceedings of the National Academy of Sciences of the United States of America  
With further technological improvements, the primary role of the user interface will gradually shift from a focus on adapting the user's input to fit the limitations of the technology to facilitating interactive  ...  user interfaces that are particularly critical in successful voice applications.  ...  As a general principle, user interfaces should also allow the user to initiate a graceful exit from the application at any stage in the interaction.  ... 
doi:10.1073/pnas.92.22.10031 pmid:7479721 pmcid:PMC40730 fatcat:fvosajggyze4rb6iuvzyay3rzm

Multimodal interfaces for dynamic interactive maps

Sharon Oviatt
1996 Proceedings of the SIGCHI conference on Human factors in computing systems common ground - CHI '96  
In the present research, interfaces supporting spoken, pen-based, and multimodal input were analyze for their potential effectiveness in interacting with this new generation of map systems.  ...  The error-proneness and unacceptability of speech-only input to maps was attributed in large part to people's difficulty generating spoken descriptions of spatial location.  ...  input streams are still by and large in the planning stages.  ... 
doi:10.1145/238386.238438 dblp:conf/chi/Oviatt96 fatcat:ila7oulginfz3i7xjd2tlgeclu

Browsing with Alexa: Interrogating the impact of voice assistants as web interfaces

Simone Natale, Henry Cooke
2020 Media Culture and Society  
Less attention, however, has been given to the fact that voice assistants are also web interfaces that might impact on how the web is accessed, understood and employed by users.  ...  access, the relationship between production and consumption online, and the role of affect in informing engagement with web resources.  ...  In the early stages of trying to wrap our mind around the concept of what it is to communicate with a computer, these moments of specificity help give people something to acclimate to" (Young, 2019: 117  ... 
doi:10.1177/0163443720983295 fatcat:x772sq6f6rfj3lj26ih7nkofpe

Productive Sounds [chapter]

Axel Volmar
2019 The Democratization of Artificial Intelligence  
(IPAs), belong to a class of software agents that can answer queries and perform tasks for users based on verbal commands and inquiries when equipped with a voice user interface (VUI).  ...  How are these emergent forms of voice-based cooperation structured and how does voice control change our relationship with and critical assessment of software technology?  ...  Hochheiser and Melissa Wasson of the AT&T Archives and History Center (Warren, NJ) for their generous support. Bibliography  ... 
doi:10.14361/9783839447192-004 fatcat:hmu4mrv6tfganpwxmr5lumm6ay

From multimedia retrieval to knowledge management

P.J. Moreno, J.-M. Van Thong, B. Logan, G.J.F. Jones
2002 Computer  
SpeechBot uses word transcriptions to provide a catalog of audio and video documents that feed the user interface a list of documents matching user queries.  ...  The search uses word transcriptions to provide a catalog of audio and video documents that feeds the user interface a list of documents matching user queries.  ... 
doi:10.1109/mc.2002.993772 fatcat:4o5oz2x6gvcb3b4cqpqs2w4aty

Spoken dialogue technology: enabling the conversational user interface

Michael F. McTear
2002 ACM Computing Surveys  
Voice portals, which provide a speech-based interface between a telephone user and Web-based services, are the most recent application of spoken dialogue technology.  ...  The origins of spoken dialogue systems can be traced back to Artificial Intelligence research in the 1950s concerned with developing conversational interfaces.  ...  Ronnie Smith, David James, and Ian O'Neill, and from the anonymous reviewers of the paper.  ... 
doi:10.1145/505282.505285 fatcat:56666shnuja5xiy3kju3v2kgbq

The Use and Promise of Conversational Agents in Digital Health

Tilman Dingler, Dominika Kwasnicka, Jing Wei, Enying Gong, Brian Oldenburg
2021 IMIA Yearbook of Medical Informatics  
Results: By responding to written and spoken language, conversational agents present a versatile, natural user interface and have the potential to make their services and applications more widely accessible  ...  We present our work on context-aware voice assistants capable of proactively engaging users and delivering health information and services.  ...  They found that conversational agents in healthcare are still in the early stage compared to other fields [31] .  ... 
doi:10.1055/s-0041-1726510 pmid:34479391 fatcat:6xdzz7yrmbfhrh27wu45ept2dq

Analyzing Deaf and Hard-of-Hearing Users' Behavior, Usage, and Interaction with a Personal Assistant Device that Understands Sign-Language Input

Abraham Glasser, Matthew Watkins, Kira Hart, Sooyeon Lee, Matt Huenerfauth
2022 CHI Conference on Human Factors in Computing Systems  
are emerging for many Deaf and Hard of Hearing (DHH) users.  ...  As voice-based personal assistant technologies proliferate, e.g., smart speakers in homes, and more generally as voice-control of technology becomes increasingly ubiquitous, new accessibility barriers  ...  that the proliferation of voice-controlled interfaces are posing for DHH users.  ... 
doi:10.1145/3491102.3501987 fatcat:xm63jrhn2nax7jryl2ujhs6scu

Speech-gesture driven multimodal interfaces for crisis management

R. Sharma, M. Yeasin, N. Krahntoever, I. Rauschert, Guoray Cai, I. Brewer, A.M. Maceachren, K. Sengupta
2003 Proceedings of the IEEE  
The first part discusses the needs of CM that can be potentially met by the development of appropriate interfaces.  ...  The second part discusses the issues related to the design and development of multimodal interfaces in the context of CM.  ...  To be useful and usable, the interface technologies must be human-centered, designed with input from practicing crisis management personnel at all stages of development.  ... 
doi:10.1109/jproc.2003.817145 fatcat:flbaisvreresla7wufztzpnvfq

Natural Language Processing: A Human-Computer Interaction Perspective [chapter]

Bill Manaris
1998 Advances in Computers  
As computers continue to become more affordable and accessible, the importance of user interfaces that are effective, robust, unobtrusive, and user-friendly -regardless of user expertise or impediments  ...  Keywords: natural language processing, human-computer interaction, speech recognition, speech understanding, natural language widgets, multimodal user interfaces, user interface development, user interface  ...  outline -especially on the phases of NLP evolution; István Berkeley for discussions on philosophical and connectionist issues; and Eleni Efthimiou for providing several important references.  ... 
doi:10.1016/s0065-2458(08)60665-8 fatcat:qkvovunv6bdqrn4wwuutmtwd6m

An overview of end-to-end language understanding and dialog management for personal digital assistants

R. Sarikaya, P. A. Crook, A. Marin, M. Jeong, J.P. Robichaud, A. Celikyilmaz, Y.B. Kim, A. Rochette, O. Z. Khan, X. Liu, D. Boies, T. Anastasakos (+6 others)
2016 2016 IEEE Spoken Language Technology Workshop (SLT)  
Spoken language understanding and dialog management have emerged as key technologies in interacting with personal digital assistants (PDAs).  ...  We describe how the quality of user experiences are measured end-to-end and also discuss open issues.  ...  Typically, for voice input, the system also generates a natural language response, which can be synthesized into speech with a text-to-speech (TTS) synthesis engine.  ... 
doi:10.1109/slt.2016.7846294 dblp:conf/slt/SarikayaCMJRCKR16 fatcat:jx2hxzpzrncrzddbq3vuw2u6ge

Activity Recognition and Personalized Feedback Solution for Active and Healthy Ageing

Thanos G. Stavropoulos, Georgios Meditskos, Stefanos Vrochidis, Ioannis Kompatsiaris
2018 International Joint Conference on Autonomous Agents & Multiagent Systems  
multimedia, and personalized spoken feedback based on context-sensing and user input.  ...  , and improvement in several neuropsychological areas, such as mood, physical functional and cognitive condition of elders.  ...  The second layer of intelligence capitalizes this information as context, which combined with spoken user input, can lead to further personalized feedback.  ... 
dblp:conf/atal/StavropoulosMVK18 fatcat:lkzeu7tmfbdqbffjiq435lvpmq

Automatic Summarization

Martha Larson
2012 Foundations and Trends in Information Retrieval  
Spoken content retrieval (SCR) requires the combination of audio and speech processing technologies with methods from information retrieval (IR).  ...  This survey provides an overview of the field of SCR encompassing component technologies, the relationship of SCR to text IR and automatic speech recognition and user interaction issues.  ...  The Query depicted on the left represents the user input to the system.  ... 
doi:10.1561/1500000020 fatcat:o424mjxnp5abbexhjsobtom2ry
« Previous Showing results 1 — 15 out of 2,021 results