Filters








13,372 Hits in 4.6 sec

An Empirical Study on Users' Acceptance of Speech Recognition Errors in Text-Messaging [chapter]

Shuang Xu, Santosh Basapur, Mark Ahlenius, Deborah Matteo
Human-Computer Interaction. HCI Intelligent Multimodal Interaction Environments  
This research is aimed to investigate users' acceptance of speech recognition errors in mobile text messaging.  ...  Our results show that even though the audio presentation of the text messages does help users understand the speech recognition errors, users indicate low satisfaction when sending or receiving text messages  ...  Conclusions This study investigated users' acceptance of speech recognition errors in text messaging.  ... 
doi:10.1007/978-3-540-73110-8_25 dblp:conf/hci/XuBAM07 fatcat:lljw4cy3ojhtpoygwsunun2wxa

Understanding users' perception of speech recognition errors in mobile communication

Shuang Xu
2009 International Journal of Mobile Learning and Organisation  
We reported that different error types had an impact on users' acceptance of recognition errors in messaging.  ...  speech recognition errors in our study.  ... 
doi:10.1504/ijmlo.2009.026315 fatcat:4ueb5kal6jeqffw6ztwqlkddv4

Comparing Speech and Keyboard Text Entry for Short Messages in Two Languages on Touchscreen Phones

Sherry Ruan, Jacob O. Wobbrock, Kenny Liou, Andrew Ng, James A. Landay
2018 Proceedings of the ACM on Interactive Mobile Wearable and Ubiquitous Technologies  
With the ubiquity of mobile touchscreen devices like smartphones, two widely used text entry methods have emerged: small touch-based keyboards and speech recognition.  ...  Furthermore, although speech made fewer errors during entry (5.30% vs. 11.22% corrected error rate), it left slightly more errors in the final transcribed text (1.30% vs. 0.79% uncorrected error rate).  ...  ACKNOWLEDGMENTS The authors thank He Dang of Baidu's Speech Technology group, who performed a preliminary study that inspired this research project.  ... 
doi:10.1145/3161187 fatcat:gj5fh3vubjctlixxqr4hhfjswq

Evaluating Multi-modal Input Modes in a Wizard-of-Oz Study for the Domain of Web Search [chapter]

Alexandra Klein, Ingrid Schwank, Michel Généreux, Harald Trost
2001 People and Computers XV—Interaction without Frontiers  
In this paper, we are going to describe a Wizard-of-Oz (WoZ) study which has shown that users tend to prefer multi-modal input over traditional input modes in search tasks concerning German newspaper texts  ...  It means that free typed or spoken utterances can be used as means of access to documents on the Web, in addition to mouse clicks and typed strings as they are used to fill out forms, e.g. in search engines  ...  The authors would like to thank the users who participated in the study as well as the anonymous reviewers for their comments.  ... 
doi:10.1007/978-1-4471-0353-0_29 dblp:conf/bcshci/KleinSGT01 fatcat:bmmcrc3ds5egvh6hsnvmjdttrm

An Empirical Approach for the Evaluation of Voice User Interfaces [chapter]

Valeria Farinazzo, Martins Salvador, Andre Luiz S. Kawamoto, Joao Soares de Oliveira Neto
2010 User Interfaces  
is working in order to provide an answer (messages like "please, wait a moment" delivered to the user).  ...  VUI is perhaps the most critical factor in the success of any automated speech recognition (ASR) system, determining whether the user experience will be satisfying or frustrating, or even whether the customer  ...  The main features of this application are the recognition of words that are spoken by the user, and the text to speech conversion.  ... 
doi:10.5772/9490 fatcat:fk7nqcq3ybemhfzxkavmx24evq

Interaction techniques for ambiguity resolution in recognition-based interfaces

Jennifer Mankoff, Scott E. Hudson, Gregory D. Abowd
2000 Proceedings of the 13th annual ACM symposium on User interface software and technology - UIST '00  
We present a survey of existing error correction techniques in the user interface. These mediation techniques most commonly fall into one of two strategies, repetition and choice.  ...  Figure 1 : An n-best list from the ViaVoice™ speech system [3] . Note that it provides a text entry area for mediation by repetition. Illustration reprinted with permission from IBM Corporation.  ...  Many thanks to Chris Long for the use of his recognizer [18] , and to the reviewers.  ... 
doi:10.1145/354401.354407 dblp:conf/uist/MankoffHA00 fatcat:qwjtjlxqerbmbpdmdr4pu5k3ia

Interaction techniques for ambiguity resolution in recognition-based interfaces

Jennifer Mankoff, Scott E. Hudson, Gregory D. Abowd
2007 ACM SIGGRAPH 2007 courses on - SIGGRAPH '07  
We present a survey of existing error correction techniques in the user interface. These mediation techniques most commonly fall into one of two strategies, repetition and choice.  ...  Figure 1 : An n-best list from the ViaVoice™ speech system [3] . Note that it provides a text entry area for mediation by repetition. Illustration reprinted with permission from IBM Corporation.  ...  Many thanks to Chris Long for the use of his recognizer [18] , and to the reviewers.  ... 
doi:10.1145/1281500.1281522 dblp:conf/siggraph/MankoffHA07 fatcat:earjq2jrsnhudpzpfcsle7z5yu

Interaction techniques for ambiguity resolution in recognition-based interfaces

Jennifer Mankoff, Scott E. Hudson, Gregory D. Abowd
2006 ACM SIGGRAPH 2006 Courses on - SIGGRAPH '06  
We present a survey of existing error correction techniques in the user interface. These mediation techniques most commonly fall into one of two strategies, repetition and choice.  ...  Figure 1 : An n-best list from the ViaVoice™ speech system [3] . Note that it provides a text entry area for mediation by repetition. Illustration reprinted with permission from IBM Corporation.  ...  Many thanks to Chris Long for the use of his recognizer [18] , and to the reviewers.  ... 
doi:10.1145/1185657.1185767 dblp:conf/siggraph/MankoffHA06 fatcat:pg3ljacbrfdahigs5zv2rgezgq

Using non-speech sounds to increase web image accessibility for screen-reader users

Ratan Bahadur Thapa, Mexhid Ferati, G. Anthony Giannoumis
2017 Proceedings of the 35th ACM International Conference on the Design of Communication - SIGDOC '17  
This paper, poses an alternative approach using an experiment that compares audemes, a type of non-speech sounds, with alternative text delivered using synthetic speech.  ...  Screen-reader users access images on the Web using alternative text delivered via synthetic speech.  ...  Another study reports that alt text should be meaningful and easy to perceive in order to convey the appropriate message to the user [8] .  ... 
doi:10.1145/3121113.3121231 dblp:conf/sigdoc/ThapaFG17 fatcat:4i6xif2i2nbydfosrpup5ldtkm

A voice and ink XML multimodal architecture for mobile e-commerce systems

Zouheir Trabelsi, Sung-Hyuk Cha, Darshan Desai, Charles Tappert
2002 Proceedings of the 2nd international workshop on Mobile commerce - WMC '02  
By providing mutual disambiguation of input signals and superior error handling this architecture should broaden the spectrum of users to the general population, including permanently and temporarily disabled  ...  users.  ...  First, empirical studies have demonstrated that users select the input mode (speech, ink, or touch-tone input) they judge to be less error prone, and this leads to fewer errors.  ... 
doi:10.1145/570705.570724 dblp:conf/wmc/TrabelsiCDT02 fatcat:d7tdmkk7pjcjbhuqmnse7gcv34

A voice and ink XML multimodal architecture for mobile e-commerce systems

Zouheir Trabelsi, Sung-Hyuk Cha, Darshan Desai, Charles Tappert
2002 Proceedings of the 2nd international workshop on Mobile commerce - WMC '02  
By providing mutual disambiguation of input signals and superior error handling this architecture should broaden the spectrum of users to the general population, including permanently and temporarily disabled  ...  users.  ...  First, empirical studies have demonstrated that users select the input mode (speech, ink, or touch-tone input) they judge to be less error prone, and this leads to fewer errors.  ... 
doi:10.1145/570719.570724 fatcat:clna4mvox5b3lkqm5s6ebla4ly

Voice Versus Keyboard and Mouse for Text Creation on Arabic User Interfaces

Khalid Majrashi
2022 ˜The œinternational Arab journal of information technology  
Voice User Interfaces (VUIs) are increasingly popular owing to improvements in automatic speech recognition.  ...  Hence, this research compared user performance, learnability, and satisfaction when using voice and keyboard-and-mouse input modalities for text creation on Arabic user interfaces.  ...  Therefore, we conducted this study to empirically investigate the usability of voice input for text entry on an Arabic VUI that is powered by a modern ASR.  ... 
doi:10.34028/iajit/19/1/15 fatcat:gzn6mhvlk5gmrol33p44l5pzne

Technical Report - Systematic Mapping Study

Guilherme Guerino
2020 figshare.com  
This report presents the execution of a Systematic Literature Mapping of technologies being used to evaluate the Usability and/or User Experience in Conversational Systems.  ...  Besides, an analysis of the resulting papers is performed.  ...  However, if the number of errors increases with the reference text, the impact of the errors will be offset (note the subtraction of a term log (n) in the denominator).  ... 
doi:10.6084/m9.figshare.12840773.v4 fatcat:bwlrlfs5yzefvm3aepc2yp4pri

Towards robust person recognition on handheld devices using face and speaker identification technologies

Timothy J. Hazen, Eugene Weinstein, Alex Park
2003 Proceedings of the 5th international conference on Multimodal interfaces - ICMI '03  
In this study, we investigate the application of existing face and speaker identification techniques to a person identification task on a handheld device.  ...  However, the use of these technologies in variable environments and with the help of the inexpensive sound and image capture hardware present in mobile devices presents an additional challenge.  ...  The authors wish to thank Dave Dopson and Ken Steele, who helped in the development of the application, and Bernd Heisele, who has provided the face identification algorithms and on-going assistance.  ... 
doi:10.1145/958468.958485 fatcat:2u4uea5wh5dfla6euysrqz5xje

Towards robust person recognition on handheld devices using face and speaker identification technologies

Timothy J. Hazen, Eugene Weinstein, Alex Park
2003 Proceedings of the 5th international conference on Multimodal interfaces - ICMI '03  
In this study, we investigate the application of existing face and speaker identification techniques to a person identification task on a handheld device.  ...  However, the use of these technologies in variable environments and with the help of the inexpensive sound and image capture hardware present in mobile devices presents an additional challenge.  ...  The authors wish to thank Dave Dopson and Ken Steele, who helped in the development of the application, and Bernd Heisele, who has provided the face identification algorithms and on-going assistance.  ... 
doi:10.1145/958432.958485 dblp:conf/icmi/HazenWP03 fatcat:2e2takyauzctrnwjay7kzeogqi
« Previous Showing results 1 — 15 out of 13,372 results