A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2017; you can also visit the original URL.
The file type is application/pdf
.
Filters
The challenge of spoken language systems: Research directions for the nineties
1995
IEEE Transactions on Speech and Audio Processing
; 5) natural language response generation; 6) speech synthesis and speech generation; 7) multilingual systems; and 8) interactive multimodal systems. ...
A spoken language system combines speech recognition, natural language processing and h h a n interface technology. ...
authors into a cohesive final document. ...
doi:10.1109/89.365385
fatcat:ogivf5rdovajrhcmez4c6hynne
FreeWalk: a social interaction platform for group behaviour in a virtual space
2004
International Journal of Human-Computer Studies
When the agent notes an awkward pause in a conversation, it approaches those involved in the conversation with a suggestion for a new topic to talk about. ...
Many systems use a 3D virtual space as a multiuser environment (Sugawara et al.. Those systems usually aim to construct brilliant virtual worlds. ...
I received a lot of support in the preparation of the facilities used in the cross-cultural experiment from NTT GEMNet project, Fumio Hattori, Tomoaki Tanaka, Takeshi Oguro, Kaoru Hiramatsu, and Jun-ichi ...
doi:10.1016/j.ijhcs.2003.11.003
fatcat:xlgkovr7hvanlaw64g5v7m3zfm
Conversational agents in healthcare: a systematic review
2018
JAMIA Journal of the American Medical Informatics Association
Objective: Our objective was to review the characteristics, current applications, and evaluation measures of conversational agents with unconstrained natural language input capabilities used for health-related ...
Methods: We searched PubMed, Embase, CINAHL, PsycInfo, and ACM Digital using a predefined search strategy. ...
ACKNOWLEDGEMENTS We thank Mohamed Khalifa for the help with screening reference lists and extracting information on funding sources and conflict of interests. ...
doi:10.1093/jamia/ocy072
pmid:30010941
fatcat:6zdhay4jenal3ogiegkc7v5m6y
Complex Cepstrum Based Voice Conversion Using Radial Basis Function
2014
ISRN Signal Processing
The evaluation measures reveal that the proposed complex cepstrum based voice conversion system approximate the converted speech signal with better accuracy than the model based on the Mel cepstrum envelope ...
A comparison of the proposed complex cepstrum based model has been made with the state-of-the-art Mel Cepstrum Envelope based voice conversion model with objective and subjective evaluations. ...
Application of VC includes the personification of text to speech, design of multispeaker based speech synthesis system, audio dubbing, karaoke applications, security related system, the design of speaking ...
doi:10.1155/2014/357048
fatcat:7r7aeiqfubbujdavcaroy5s66u
Design and evaluation of the asynchronous voice meeting system AVM
2002
Systems and Computers in Japan
A voice communication system for asynchronous meetings held in non-real time will attract a broad range of users because it is convenient and easy to use with mobile devices. ...
This study reports the design and evaluation of a new interface for effective asynchronous voice meetings, the AVM (Asynchronous Voice Meeting System), a client-server type meeting system that uses overlapping ...
A ready-to-use application system with existing technology enlightens system users and opens a new market of voice systems. ...
doi:10.1002/scj.1164
fatcat:jqywvpvl2jcbbd3etprov5x3gq
Emotion-awareness for intelligent vehicle assistants
2018
Proceedings of the 1st International Workshop on Software Engineering for AI in Autonomous Systems - SEFAIS '18
EVA requires a multi-disciplinary approach, combining a number of critical building blocks into a cybernetics systems/software architecture: emotion aware systems and algorithms, multimodal interaction ...
EVA requires a multi-disciplinary approach, combining a number of critical building blocks into a cybernetics systems/software architecture: emotion aware systems and algorithms, multimodal interaction ...
While (embodied) conversational agents have long been considered a promising ally for the vehicular context, from an interaction perspective such conversational agents raise serious concerns along a number ...
doi:10.1145/3194085.3194094
fatcat:stdxabm2jjevbijg4vboixszze
Emotional speech: Towards a new generation of databases
2003
Speech Communication
Research on speech and emotion is moving from a period of exploratory research into one where there is a prospect of substantial applications, notably in human-computer interaction. ...
direction dans laquelle devraient sÕorienter les recherches a a venir. : S 0 1 6 7 -6 3 9 3 ( 0 2 ) 0 0 0 7 0 -5 Speech Communication 40 (2003) 33-60 www.elsevier.com/locate/specom 34 E. ...
of casual conversational speech. ...
doi:10.1016/s0167-6393(02)00070-5
fatcat:wxgzrps6mjggtkbrh47g4j3pcu
MARS: A Statistical Semantic Parsing and Generation-Based Multilingual Automatic tRanslation System
2002
Machine Translation
We present MARS (Multilingual Automatic tRanslation System), a research prototype speech-to-speech translation system. ...
MARS is aimed at two-way conversational spoken language translation between English and Mandarin Chinese for limited domains, such as air travel reservations. ...
The authors also thank the Machine Translation Special Issue editors and two reviewers for their careful review and useful suggestions. ...
doi:10.1023/b:coat.0000010802.38267.29
fatcat:6wjmgrzi7famlojqzy3vuogncm
Productive Sounds
[chapter]
2019
The Democratization of Artificial Intelligence
(IPAs), belong to a class of software agents that can answer queries and perform tasks for users based on verbal commands and inquiries when equipped with a voice user interface (VUI). ...
However, it is not necessary to understand how they work algorithmically in every detail to understand their politics; it is sufficient to study what they are used for and how they are marketed to different ...
I would like to express my gratitude to Kyle Stine for critical remarks and suggestions and Thomas Bjørnsten for valuable input to the paper. I would also like to thank Sheldon H. ...
doi:10.14361/9783839447192-004
fatcat:hmu4mrv6tfganpwxmr5lumm6ay
The role of voice input for human-machine communication
1995
Proceedings of the National Academy of Sciences of the United States of America
System prototypes have recently been built that demonstrate speaker-independent real-time speech recogni- ...
Optimism is growing that the near future will witness rapid growth in human-computer interaction using voice. ...
Such systems are currently designed to incorporate speech recognition, machine translation, and speech synthesis subsystems, and to interpret one sentence at a time. ...
doi:10.1073/pnas.92.22.9921
pmid:7479803
pmcid:PMC40712
fatcat:js3j6ovthzcujhfhumwke5tqfi
HMM-based speech synthesis with various degrees of articulation: A perceptual study
2014
Neurocomputing
HMM-based speech synthesis is very convenient for creating a synthesizer whose speaker characteristics and speaking styles can be easily modified. ...
This can be obtained by adapting a source speaker's model to a target speaker's model, using intra-speaker voice adaptation techniques. ...
Hypo/hyperarticulated speech synthesis has many applications: expressive voice conversion (e.g. for embedded systems and video games), "reading speed" control for visually impaired people (i.e. fast speech ...
doi:10.1016/j.neucom.2012.10.040
fatcat:mjp3lcawajh7zexnghedzz2czy
The roles of language processing in a spoken language interface
1995
Proceedings of the National Academy of Sciences of the United States of America
This kind of interaction requires the system to have both input and output capabilities, that is, for speech, both recognition and synthesis, and for language, both understanding and generation. ...
This is a serious problem-it is clearly desirable and useful to have a static set of data, with answers, so that experiments can be run repeatedly, either for optimization purposes or simply to experiment ...
doi:10.1073/pnas.92.22.9970
pmid:7479811
pmcid:PMC40720
fatcat:qlaxn6zs45gr3mc76xfvpppaeu
Field evaluation with cognitively-impaired older adults of attention management in the Embodied Conversational Agent Louise
2016
2016 IEEE International Conference on Serious Games and Applications for Health (SeGAH)
Finally, to gain further insights on conversation management and provide evidence-based suggestions for future work, we performed an anthropological analysis of the whole experiment. ...
Louise is a new, semi-automatic prototype of an Embodied Conversational Agent (ECA), a virtual character interacting with users through social-like communication, adapted to the special needs of older ...
ACKNOWLEDGMENT The authors would like to thank RégionÎle-de-France for funding part of this project. ...
doi:10.1109/segah.2016.7586282
dblp:conf/segah/WargnierCLBJR16
fatcat:a7gfjuehs5gulbhhdyzywpz5za
Personal computing
1977
Proceedings of the June 13-16, 1977, national computer conference on - AFIPS '77
Characteristics of current hardware, software, and systems configurations are discussed. Mention is made of a variety of activities for which these systems are currently being used. ...
Emphasis is placed on those features and uses that are unique to personal and hobby computing. Differences are noted between personal computing and professional computing. ...
The hardware that was available in a consumer's price range was too complex l for the casual user who was primarily interested in serious personal applications. ...
doi:10.1145/1499402.1499491
dblp:conf/afips/Warren77
fatcat:3pu7plt75jaipcqns2nh7exjnq
The five Is: Key principles for interpretable and safe conversational AI
[article]
2021
arXiv
pre-print
for use. ...
In an effort to initiate a discussion on possible alternatives, we outline and exemplify how our five principles enable the development of conversational AI systems that are transparent and thus safer ...
Those applications involve the use of conversational agents, which are systems intended for natural, multimodal interaction with human users, using text, speech, touch, gestures, and so on. ...
arXiv:2108.13766v1
fatcat:c2wxvq6okrdyvnyiv3messalxm
« Previous
Showing results 1 — 15 out of 5,637 results