Filters








32 Hits in 2.4 sec

Retrieval of broadcast news documents with the THISL system

D. Abberley, S. Renals, G. Cook
Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '98 (Cat. No.98CH36181)  
This paper describes the THISL system that participated in the TREC-7 evaluation, Spoken Document Retrieval (SDR) Track, and presents the results obtained, together with some analysis.  ...  The THISL system is based on the ABBOT speech recognition system and the thislIR text retrieval system.  ...  INTRODUCTION THISL is an ESPRIT Long Term Research project that is investigating the development of a news-on-demand system using speech recognition, natural language processing and text retrieval.  ... 
doi:10.1109/icassp.1998.679707 dblp:conf/icassp/AbberleyRC98 fatcat:2o2ohirf5jboxnucy7lwcklk3u

The Thisl SDR System at TREC-9

Steve Renals, Dave Abberley
2000 Text Retrieval Conference  
The THISL SDR system consists of a realtime version of a hybrid connectionist/HMM large vocabulary speech recognition system and a probabilistic text retrieval system.  ...  This paper describes our participation in the TREC-9 Spoken Document Retrieval (SDR) track.  ...  ACKNOWLEDGEMENTS Dan Ellis and Tony Robinson worked on the system for TREC-8 SDR and the work described here uses the fruits of their labour.  ... 
dblp:conf/trec/RenalsA00 fatcat:ffas3d2qt5adjgts3bh7b743uu

Robust spoken document retrieval methods for misrecognition and out-of-vocabulary keywords

Hiromitsu Nishizaki, Seiichi Nakagawa
2004 Systems and Computers in Japan  
This paper describes a Japanese spoken document retrieval (SDR) system that is robust for Out-of-Vocabulary (OOV) words.  ...  Evaluation results show that the proposed technique is quite effective in robustly retrieving spoken documents.  ...  Re- trieval of broadcast news documents with the THISL system. In Proc. of ICASSP'98, pp. 3781-3784, 1998. [3] Tony Robinson, Dave Abberley, David Kirby, and Steve Renals.  ... 
doi:10.1002/scj.10697 fatcat:liynm3th55hkpmexm7ovomr7qi

Automated Metadata Enrichment of Large Speech Radio Archives

Y. Raimond, C. Lowis, R. Hodgson, D. Tinley
2014 SMPTE Motion Imaging Journal  
We evaluate this framework against manually applied tags, and compare it with related work. We find that this framework is good enough to bootstrap the interlinking process of archived content.  ...  However, given the very large number of programmes available in the archive, most of them having very little metadata attached to them, we need a way of automatically assigning tags to programmes.  ...  The THISL system [1] applies an automated speech recognition system (ABBOT) on BBC news broadcasts and uses a bag-of-words model on the resulting transcripts for programme retrieval.  ... 
doi:10.5594/j18370xy fatcat:g5dql24le5c5noeifruiikiyoe

Indexing and retrieval of broadcast news

Steve Renals, Dave Abberley, David Kirby, Tony Robinson
2000 Speech Communication  
This paper describes a spoken document retrieval (SDR) system for British and North American Broadcast News.  ...  We discuss the development of a realtime Broadcast News speech recognizer, and its integration into an SDR system.  ...  Acknowledgments This work was supported by ESPRIT Long Term Research Project THISL (EP23495). Thanks to Gary Cook for assistance with North American English broadcast speech  ... 
doi:10.1016/s0167-6393(00)00020-0 fatcat:wdj3zvhtkvb2zbbzcryqoqh2vy

Statistical Language Modelling [chapter]

Yoshihiko Gotoh, Steve Renals
2003 Lecture Notes in Computer Science  
In a speech recognition system the role of the language model is to assign probabilities to word sequences.  ...  occurrence rate of a word is not uniform, but varies between documents.  ...  The Text Retrieval Conference (TREC) has been a forum for the evaluation of text retrieval systems for a variety of tasks including routing, filtering and spoken document retrieval.  ... 
doi:10.1007/978-3-540-45115-0_4 fatcat:tyxdr6nv7jbhxkxtndi7f7qkra

A speech interface for open-domain question-answering

Edward Schofield, Zhiping Zheng
2003 Proceedings of the 41st Annual Meeting on Association for Computational Linguistics - ACL '03  
We describe a small evaluation of the effect of recognition errors on the precision of the answers returned and make some concrete recommendations for modifying a question-answering system for improving  ...  of errors, and an open-domain question-answering system, AnswerBus, which is freely available on the Web.  ...  Acknowledgements The authors would like to thank Stefan Rüger for his suggestions and moral support. Ed Schofield's research is supported by a Marie Curie Fellowship of the European Commission.  ... 
doi:10.3115/1075178.1075210 dblp:conf/acl/SchofieldZ03 fatcat:hfbcjhjroraf7djzsurkctxc54

Web-assisted annotation, semantic indexing and search of television and radio news

Mike Dowman, Valentin Tablan, Hamish Cunningham, Borislav Popov
2005 Proceedings of the 14th international conference on World Wide Web - WWW '05  
The Rich News system, that can automatically annotate radio and television news with the aid of resources retrieved from the World Wide Web, is described.  ...  The semantic annotations can form a part of the Semantic Web, and an evaluation shows that the system operates with high precision, and with a moderate level of recall.  ...  ACKNOWLEDGMENTS The research for this paper was conducted as part of the European Union Sixth Framework Program projects PrestoSpace (FP6-507336) and SEKT (EU IST IP 2003-506826).  ... 
doi:10.1145/1060745.1060781 dblp:conf/www/DowmanTCP05 fatcat:s3u5pphtundlhd7towk6b5uxbi

Content-based access to spoken audio

K. Koumpis, S. Renals
2005 IEEE Signal Processing Magazine  
These systems have addressed diverse tasks including indexing and retrieving voicemail messages, searching for broadcast news, and extracting information from recordings of meetings and lectures.  ...  One way to facilitate retrieval is by classifying content into categories. The last and perhaps the least explored phase deals with the delivery of the retrieved content to users.  ...  Koumpis and an MSc and a PhD from the University of Edinburgh. His research is in the areas of speech recognition, information access from spoken audio and models for multimodal data.  ... 
doi:10.1109/msp.2005.1511824 fatcat:a7p7ay3lmfen5brbmvqwoi6ete

The THISL SDR system at TREC-8

Dave Abberley, Steve Renals, Daniel P. W. Ellis, Tony Robinson
2017
This paper describes the participation of the THISL group at the TREC-8 Spoken Document Retrieval (SDR) track.  ...  The THISL SDR system consists of the realtime version of the ABBOT large vocabulary speech recognition system and the THISLIR text retrieval system.  ...  The THISL 1 spoken document retrieval system consists of the 'real time' version of the ABBOT large vocabulary continuous speech recognizer [2] and the THISLIR text retrieval system [3] .  ... 
doi:10.7916/d8fj2s36 fatcat:crz5utpn6nhq3g7gtc33dyzkga

Speaker change detection and tracking in real-time news broadcasting analysis

Lie Lu, Hong-Jiang Zhang
2002 Proceedings of the tenth ACM international conference on Multimedia - MULTIMEDIA '02  
This paper addresses the problem of real time speaker change detection and speaker tracking in broadcasted news video analysis.  ...  The algorithm has low complexity and runs in real-time with a very limited delay in analysis. Our experiments show that the algorithms produce very satisfactory results.  ...  Though this database is originally designed for Spoken Document Retrieval, it is more suitable for our intended application: speaker tracking for news broadcasting.  ... 
doi:10.1145/641007.641127 dblp:conf/mm/LuZ02 fatcat:s52emjd2u5eghgdzoulgw2gk3i

The THISL spoken document retrieval project

S. Renals
Proceedings IEEE International Conference on Multimedia Computing and Systems  
THISL is an ESPRIT Long Term Research Project focused the development and construction of a system to items from an archive of television and radio news broadcasts.  ...  The system has been evaluated as part of the TREC-6 and TREC-7 spoken document retrieval evaluations and we report on the results of the TREC-7 evaluation based on a document collection of 100 hours of  ...  This work has benefited from collaboration with the partners of the THISL and SPRACH projects, in particular Tony Robinson (Cambridge University and SoftSound) and Gary Cook (Cambridge University).  ... 
doi:10.1109/mmcs.1999.778655 dblp:conf/icmcs/Renals99 fatcat:w2kj4jma2vdkfgx5hr3xlj6htm

Speaker change detection and tracking in real-time news broadcasting analysis

Lie Lu, Hong-Jiang Zhang
2002 Proceedings of the tenth ACM international conference on Multimedia - MULTIMEDIA '02  
This paper addresses the problem of real time speaker change detection and speaker tracking in broadcasted news video analysis.  ...  The algorithm has low complexity and runs in real-time with a very limited delay in analysis. Our experiments show that the algorithms produce very satisfactory results.  ...  Though this database is originally designed for Spoken Document Retrieval, it is more suitable for our intended application: speaker tracking for news broadcasting.  ... 
doi:10.1145/641124.641127 fatcat:wjyaydsmqvgq7ptp5oed25pvbi

Unsupervised speaker segmentation and tracking in real-time audio content analysis

Lie Lu, Hong-Jiang Zhang
2005 Multimedia Systems  
Experiments show that the proposed algorithm can recall 89% of speaker change boundaries with 15% false alarms, and 76% of speakers can be unsupervised identified with 20% false alarms.  ...  Compared with previous works, the algorithm also has low computation complexity and can perform in 15% of real time with a very limited delay in analysis.  ...  Though this database was originally designed for spoken document retrieval, it is also suitable for our intended application: speaker segmentation and tracking for news broadcasting.  ... 
doi:10.1007/s00530-004-0160-5 fatcat:3ccwowz66ng6jk5muwy73nx67u

A unified language model for large vocabulary continuous speech recognition of Turkish

Ebru Arısoy, Helin Dutağacı, Levent M. Arslan
2006 Signal Processing  
The proposed model resulted in letter error rates (LER's) of approximately 28% for a speaker independent system and 20% for a speaker dependent system.  ...  We have designed a Turkish dictation system for newspaper content transcription application. Turkish is an agglutinative language with free word order.  ...  This research is supported by SIMILAR Network of Excellence and TÜ BİTAK BDP (Unified Doctorate Program of the Scientific and Technological Research Council of Turkey).  ... 
doi:10.1016/j.sigpro.2005.12.002 fatcat:kvh2k6pr5feg3nkoenuxnnifra
« Previous Showing results 1 — 15 out of 32 results