The challenge of spoken language systems: Research directions for the nineties

R. Cole, L. Hirschman, L. Atlas, M. Beckman, A. Biermann, M. Bush, M. Clements, L. Cohen, O. Garcia, B. Hanson, H. Hermansky, S. Levinson (+12 others)
1995 IEEE Transactions on Speech and Audio Processing  
; 5) natural language response generation; 6) speech synthesis and speech generation; 7) multilingual systems; and 8) interactive multimodal systems.  ...  A spoken language system combines speech recognition, natural language processing and human interface technology.  ...  authors into a cohesive final document.  ... 
doi:10.1109/89.365385 fatcat:ogivf5rdovajrhcmez4c6hynne

FreeWalk: a social interaction platform for group behaviour in a virtual space

Hideyuki Nakanishi
2004 International Journal of Human-Computer Studies  
When the agent notes an awkward pause in a conversation, it approaches those involved in the conversation with a suggestion for a new topic to talk about.  ...  Many systems use a 3D virtual space as a multiuser environment (Sugawara et al.). Those systems usually aim to construct brilliant virtual worlds.  ...  I received a lot of support in the preparation of the facilities used in the cross-cultural experiment from NTT GEMNet project, Fumio Hattori, Tomoaki Tanaka, Takeshi Oguro, Kaoru Hiramatsu, and Jun-ichi  ... 
doi:10.1016/j.ijhcs.2003.11.003 fatcat:xlgkovr7hvanlaw64g5v7m3zfm

Conversational agents in healthcare: a systematic review

Liliana Laranjo, Adam G Dunn, Huong Ly Tong, Ahmet Baki Kocaballi, Jessica Chen, Rabia Bashir, Didi Surian, Blanca Gallego, Farah Magrabi, Annie Y S Lau, Enrico Coiera
2018 JAMIA Journal of the American Medical Informatics Association  
Objective: Our objective was to review the characteristics, current applications, and evaluation measures of conversational agents with unconstrained natural language input capabilities used for health-related  ...  Methods: We searched PubMed, Embase, CINAHL, PsycInfo, and ACM Digital using a predefined search strategy.  ...  ACKNOWLEDGEMENTS We thank Mohamed Khalifa for the help with screening reference lists and extracting information on funding sources and conflict of interests.  ... 
doi:10.1093/jamia/ocy072 pmid:30010941 fatcat:6zdhay4jenal3ogiegkc7v5m6y

Complex Cepstrum Based Voice Conversion Using Radial Basis Function

Jagannath Nirmal, Suprava Patnaik, Mukesh Zaveri, Pramod Kachare
2014 ISRN Signal Processing  
The evaluation measures reveal that the proposed complex cepstrum based voice conversion system approximates the converted speech signal with better accuracy than the model based on the Mel cepstrum envelope  ...  A comparison of the proposed complex cepstrum based model has been made with the state-of-the-art Mel Cepstrum Envelope based voice conversion model with objective and subjective evaluations.  ...  Application of VC includes the personification of text to speech, design of multispeaker based speech synthesis system, audio dubbing, karaoke applications, security related system, the design of speaking  ... 
doi:10.1155/2014/357048 fatcat:7r7aeiqfubbujdavcaroy5s66u

Design and evaluation of the asynchronous voice meeting system AVM

Takuya Nishimoto, Hidehiro Yuki, Takehiko Kawahara, Masahiro Araki, Yasuhisa Niimi
2002 Systems and Computers in Japan  
A voice communication system for asynchronous meetings held in non-real time will attract a broad range of users because it is convenient and easy to use with mobile devices.  ...  This study reports the design and evaluation of a new interface for effective asynchronous voice meetings, the AVM (Asynchronous Voice Meeting System), a client-server type meeting system that uses overlapping  ...  A ready-to-use application system with existing technology enlightens system users and opens a new market of voice systems.  ... 
doi:10.1002/scj.1164 fatcat:jqywvpvl2jcbbd3etprov5x3gq

Emotion-awareness for intelligent vehicle assistants

Hans-Jörg Vögel, Raphaël Troncy, Benoit Huet, Melek Önen, Adlen Ksentini, Jörg Conradt, Asaf Adi, Alexander Zadorojniy, Jacques Terken, Jonas Beskow, Ann Morrison, Christian Süß (+11 others)
2018 Proceedings of the 1st International Workshop on Software Engineering for AI in Autonomous Systems - SEFAIS '18  
EVA requires a multi-disciplinary approach, combining a number of critical building blocks into a cybernetics systems/software architecture: emotion aware systems and algorithms, multimodal interaction  ...  While (embodied) conversational agents have long been considered a promising ally for the vehicular context, from an interaction perspective such conversational agents raise serious concerns along a number  ... 
doi:10.1145/3194085.3194094 fatcat:stdxabm2jjevbijg4vboixszze

Emotional speech: Towards a new generation of databases

Ellen Douglas-Cowie, Nick Campbell, Roddy Cowie, Peter Roach
2003 Speech Communication  
Research on speech and emotion is moving from a period of exploratory research into one where there is a prospect of substantial applications, notably in human-computer interaction.  ...  direction in which future research should be oriented.  ...  of casual conversational speech.  ... 
doi:10.1016/s0167-6393(02)00070-5 fatcat:wxgzrps6mjggtkbrh47g4j3pcu

MARS: A Statistical Semantic Parsing and Generation-Based Multilingual Automatic tRanslation System

Yuqing Gao, Bowen Zhou, Zijian Diao, Jeffrey Sorensen, Michael Picheny
2002 Machine Translation  
We present MARS (Multilingual Automatic tRanslation System), a research prototype speech-to-speech translation system.  ...  MARS is aimed at two-way conversational spoken language translation between English and Mandarin Chinese for limited domains, such as air travel reservations.  ...  The authors also thank the Machine Translation Special Issue editors and two reviewers for their careful review and useful suggestions.  ... 
doi:10.1023/b:coat.0000010802.38267.29 fatcat:6wjmgrzi7famlojqzy3vuogncm

Productive Sounds [chapter]

Axel Volmar
2019 The Democratization of Artificial Intelligence  
(IPAs), belong to a class of software agents that can answer queries and perform tasks for users based on verbal commands and inquiries when equipped with a voice user interface (VUI).  ...  However, it is not necessary to understand how they work algorithmically in every detail to understand their politics; it is sufficient to study what they are used for and how they are marketed to different  ...  I would like to express my gratitude to Kyle Stine for critical remarks and suggestions and Thomas Bjørnsten for valuable input to the paper. I would also like to thank Sheldon H.  ... 
doi:10.14361/9783839447192-004 fatcat:hmu4mrv6tfganpwxmr5lumm6ay

The role of voice input for human-machine communication

P. R. Cohen, S. L. Oviatt
1995 Proceedings of the National Academy of Sciences of the United States of America  
System prototypes have recently been built that demonstrate speaker-independent real-time speech recogni-  ...  Optimism is growing that the near future will witness rapid growth in human-computer interaction using voice.  ...  Such systems are currently designed to incorporate speech recognition, machine translation, and speech synthesis subsystems, and to interpret one sentence at a time.  ... 
doi:10.1073/pnas.92.22.9921 pmid:7479803 pmcid:PMC40712 fatcat:js3j6ovthzcujhfhumwke5tqfi

HMM-based speech synthesis with various degrees of articulation: A perceptual study

Benjamin Picart, Thomas Drugman, Thierry Dutoit
2014 Neurocomputing  
HMM-based speech synthesis is very convenient for creating a synthesizer whose speaker characteristics and speaking styles can be easily modified.  ...  This can be obtained by adapting a source speaker's model to a target speaker's model, using intra-speaker voice adaptation techniques.  ...  Hypo/hyperarticulated speech synthesis has many applications: expressive voice conversion (e.g. for embedded systems and video games), "reading speed" control for visually impaired people (i.e. fast speech  ... 
doi:10.1016/j.neucom.2012.10.040 fatcat:mjp3lcawajh7zexnghedzz2czy

The roles of language processing in a spoken language interface

L. Hirschman
1995 Proceedings of the National Academy of Sciences of the United States of America  
This kind of interaction requires the system to have both input and output capabilities, that is, for speech, both recognition and synthesis, and for language, both understanding and generation.  ...  This is a serious problem-it is clearly desirable and useful to have a static set of data, with answers, so that experiments can be run repeatedly, either for optimization purposes or simply to experiment  ... 
doi:10.1073/pnas.92.22.9970 pmid:7479811 pmcid:PMC40720 fatcat:qlaxn6zs45gr3mc76xfvpppaeu

Field evaluation with cognitively-impaired older adults of attention management in the Embodied Conversational Agent Louise

Pierre Wargnier, Giovanni Carletti, Yann Laurent-Corniquet, Samuel Benveniste, Pierre Jouvelot, Anne-Sophie Rigaud
2016 2016 IEEE International Conference on Serious Games and Applications for Health (SeGAH)  
Finally, to gain further insights on conversation management and provide evidence-based suggestions for future work, we performed an anthropological analysis of the whole experiment.  ...  Louise is a new, semi-automatic prototype of an Embodied Conversational Agent (ECA), a virtual character interacting with users through social-like communication, adapted to the special needs of older  ...  ACKNOWLEDGMENT The authors would like to thank Région Île-de-France for funding part of this project.  ... 
doi:10.1109/segah.2016.7586282 dblp:conf/segah/WargnierCLBJR16 fatcat:a7gfjuehs5gulbhhdyzywpz5za

Personal computing

Jim Warren
1977 Proceedings of the June 13-16, 1977, national computer conference on - AFIPS '77  
Characteristics of current hardware, software, and systems configurations are discussed. Mention is made of a variety of activities for which these systems are currently being used.  ...  Emphasis is placed on those features and uses that are unique to personal and hobby computing. Differences are noted between personal computing and professional computing.  ...  The hardware that was available in a consumer's price range was too complex for the casual user who was primarily interested in serious personal applications.  ... 
doi:10.1145/1499402.1499491 dblp:conf/afips/Warren77 fatcat:3pu7plt75jaipcqns2nh7exjnq

The five Is: Key principles for interpretable and safe conversational AI [article]

Mattias Wahde, Marco Virgolin
2021 arXiv   pre-print
for use.  ...  In an effort to initiate a discussion on possible alternatives, we outline and exemplify how our five principles enable the development of conversational AI systems that are transparent and thus safer  ...  Those applications involve the use of conversational agents, which are systems intended for natural, multimodal interaction with human users, using text, speech, touch, gestures, and so on.  ... 
arXiv:2108.13766v1 fatcat:c2wxvq6okrdyvnyiv3messalxm