1,192 Hits in 6.3 sec

Collaborative Speech Data Acquisition for Under Resourced Languages through Crowdsourcing

Sunita Arora, Karunesh Kumar Arora, Mukund Kumar Roy, S.S. Agrawal, B.K. Murthy
2016 Procedia Computer Science  
Scarcity of resources in under resourced languages may leave these languages behind in race of development of data driven NLP systems.  ...  Crowdsourcing has come up as a technique to bridge this gap, as it offers approach for collecting such resources in collaborative manner.  ...  Speech data collection through crowdsourcing would save cost and effort in collecting such data.  ... 
doi:10.1016/j.procs.2016.04.027 fatcat:xpezpsoqkjaq7edzd7wh6woqju

Crowdsourcing research opportunities

Marta Sabou, Kalina Bontcheva, Arno Scharl
2012 Proceedings of the 12th International Conference on Knowledge Management and Knowledge Technologies - i-KNOW '12  
We address this lack of awareness, firstly by highlighting the positive impacts that crowdsourcing has had on Natural Language Processing research.  ...  We conclude with future trends and opportunities of crowdsourcing for science, including its potential for disseminating results, making science more accessible, and enriching educational programs.  ...  of dialog systems [38] thus lowering the traditionally high acquisition barrier for speech based resources.  ... 
doi:10.1145/2362456.2362479 dblp:conf/iknow/SabouBS12 fatcat:rt4sgpjzabh2jdozi42w5n6b3a

Crowdsourcing for Language Resource Development: Criticisms About Amazon Mechanical Turk Overpowering Use [chapter]

Karën Fort, Gilles Adda, Benoît Sagot, Joseph Mariani, Alain Couillault
2014 Lecture Notes in Computer Science  
resources development, while limiting the risks of ethical and legal issues without letting go price or quality, 4-to introduce an Ethics and Big Data Charter for the documentation of language resources  ...  Crowdsourcing refers to the fact that the job is outsourced via the web and done by many people (paid or not).  ...  This work was partly realized as part of the Quaero Programme, funded by oseo, French State agency for innovation, as well as part of the French anr project edylex (anr-09-cord-008) and of the Network  ... 
doi:10.1007/978-3-319-08958-4_25 fatcat:tcfzsgvl2raspf44f34yd5tsey

Introduction to the special issue

Laurette Pretorius, Claudia Soria
2017 Language Resources and Evaluation  
Yet, these languages stand to benefit most from emergent collaborative approaches and technologies for language resource development.  ...  Indeed, these approaches seem particularly well-suited for collecting the data needed for the development of language technology applications & Laurette Pretorius  ...  transfer when linguistically annotated data is scarce, as is the case for many under-resourced languages.  ... 
doi:10.1007/s10579-017-9405-8 fatcat:hufsahibvfakhd6jpgk2dsyoly

COLLECTIVE INTELLIGENCE (CROWD SOURCING) ON THE INTERNET: a collaborative approach in information and knowledge management

Celeste Aida Jannuzzi, Orandi Mina Falsarella, Larissa Moraes de Oliveira
2019 International journal for innovation education and research  
their use of the virtual environment in the process of knowledge generation for their own benefit.  ...  The present work proposes to investigate the practice of collective intelligence (crowdsourcing) on the Internet by scientific institutions that develop or not sustainable actions, in order to characterize  ...  through specific criteria; and have predilection for knowledge acquisition in the form of identification of diverse contents, which is in line with the initial proposal of crowdsourcing: to mobilize participation  ... 
doi:10.31686/ijier.vol7.iss4.1406 fatcat:llw6na4ypbet3iaagi2ezdfkfq

Construction and Application of a Human-Computer Collaborative Multimodal Practice Teaching Model for Preschool Education

Meimei Tuo, Baoxin Long, Gengxin Sun
2022 Computational Intelligence and Neuroscience  
designed to combine the basic lesson types of preschool classroom teaching and the secondary objectives of the English curriculum standards, including "reading text–reading aloud evaluation," "playing speech–sound  ...  Combined with Gagne's nine teaching events, a model of the English teaching process based on human-computer collaboration was constructed.  ...  Crowdsourcing markup acquisition is usually divided into two phases: worker labeling and markup aggregation.  ... 
doi:10.1155/2022/2973954 pmid:35785056 pmcid:PMC9249456 fatcat:sv4dyc7fendgplihxarqqy75by

An Open Linguistic Infrastructure for Annotated Corpora [chapter]

Nancy Ide
2013 The People's Web Meets NLP  
layers from part of speech through discourse structure.  ...  , making data acquisition the major issue for ANC-OLI development.  ... 
doi:10.1007/978-3-642-35085-6_10 dblp:series/tanlp/Ide13 fatcat:np45jttao5avvnckrvrcuruzby

Humanities Crowdsourcing

Laurence Favier
2016 Zagadnienia informacji naukowej  
In knowledge acquisition for computing related fields (artificial intelligence, Semantic Web, machine learning, natural language processing, speech processing) Crowdflower, a MLab platform, has been used  ...  Collaboration in humanities is not based on a single data model (Favier, 2015) .  ... 
doi:10.36702/zin.300 fatcat:x6aq7ami4re6rewxwb3qhsiege

Multimodality, interactivity, and crowdsourcing for document transcription

Emilio Granell, Verónica Romero, Carlos D. Martínez-Hinarejos
2018 Computational intelligence  
In this case, when collaborators employ mobile devices, speech dictation can be used as transcription source, and speech and handwritten text recognition can be fused to provide a better draft transcription  ...  The novel contributions presented in this work include the study of the data fusion on a multimodal crowdsourcing framework and its integration with an interactive system.  ...  As previously said, crowdsourcing approaches for HTR are very useful. Speech recog-nition is other field where crowdsourcing approaches can be applied.  ... 
doi:10.1111/coin.12169 fatcat:ytgrih32zzdhpa5xgjjopch5wu

Eliciting Big Data From Small, Young, or Non-standard Languages: 10 Experimental Challenges

Evelina Leivada, Roberta D'Alessandro, Kleanthes K. Grohmann
2019 Frontiers in Psychology  
Our list of challenges involves static characteristics (e.g., absence of orthographic conventions and how it affects data collection), dynamic processes (e.g., speed of language change in small languages  ...  for addressing it effectively.  ...  However, the acquisition of such data may pose more pronounced challenges for the linguist working with small, young, and/or non-standard languages.  ... 
doi:10.3389/fpsyg.2019.00313 pmid:30837922 pmcid:PMC6382742 fatcat:ypgkorqupndvbcpotovz3pxs6y

Directions for the future of technology in pronunciation research and teaching

Mary Grantham O'Brien, Tracey M. Derwing, Catia Cucchiarini, Debra M. Hardison, Hansjörg Mixdorff, Ron I. Thomson, Helmer Strik, John M. Levis, Murray J. Munro, Jennifer A. Foote, Greta Muller Levis
2018 Journal of Second Language Pronunciation  
Next, we discuss the nature of data in pronunciation research, pointing to ways in which future work can build on advances in corpus research and crowdsourcing.  ...  Finally, we consider how these insights pave the way for researchers and developers working to create research-informed, computer-assisted pronunciation teaching resources.  ...  Acknowledgements The authors thank Scott Jarvis, Executive Director of Language Learning, and the sub-committee of the Language Learning Board of Directors, for funding the Roundtable event at PSLLT 2016  ... 
doi:10.1075/jslp.17001.obr fatcat:rdgxk3woxzhxxfgvcpbicjnyre

Creating Expert Knowledge by Relying on Language Learners: a Generic Approach for Mass-Producing Language Resources by Combining Implicit Crowdsourcing and Language Learning

Lionel Nicolas, Verena Lyding, Claudia Borg, Corina Forascu, Karën Fort, Katerina Zdravkova, Iztok Kosem, Jaka Cibej, Spela Arhar Holdt, Alice Millour, Alexander König, Christos T. Rodosthenous (+6 others)
2020 International Conference on Language Resources and Evaluation  
We introduce in this paper a generic approach to combine implicit crowdsourcing and language learning in order to mass-produce language resources (LRs) for any language for which a crowd of language learners  ...  We then present an international network called the European Network for Combining Language Learning with Crowdsourcing Techniques (enetCollect) that provides the context to accelerate the implementation  ...  for the enhancement of NLP resources.  ... 
dblp:conf/lrec/NicolasLBFFZKCH20 fatcat:yp4g6ynrhrgpjmgatmln4qt3c4

Crawl and crowd to bring machine translation to under-resourced languages

Antonio Toral, Miquel Esplá-Gomis, Filip Klubička, Nikola Ljubešić, Vassilis Papavassiliou, Prokopis Prokopidis, Raphael Rubino, Andy Way
2016 Language Resources and Evaluation  
We present a widely applicable methodology to bring machine translation (MT) to under-resourced languages in a cost-effective and rapid manner.  ...  Our proposal relies on web crawling to automatically acquire parallel data to train statistical MT systems if any such data can be found for the language pair and domain of interest.  ...  In this work we built MT systems between an under-resourced language (Croatian) and, arguably, the best-resourced language (English).  ... 
doi:10.1007/s10579-016-9363-6 fatcat:kl7gpyhu6ncuphi4pdk3yf55qy

CrowdHeritage: Crowdsourcing for Improving the Quality of Cultural Heritage Metadata

Eirini Kaldeli, Orfeas Menis-Mastromichalakis, Spyros Bekiaris, Maria Ralli, Vassilis Tzouvaras, Giorgos Stamou
2021 Information  
In this context, metadata enrichment services through automated analysis and feature extraction along with crowdsourcing annotation services can offer a great opportunity for improving the metadata quality  ...  To address this need, we propose the CrowdHeritage open end-to-end enrichment and crowdsourcing ecosystem, which supports an end-to-end workflow for the improvement of cultural heritage metadata by employing  ...  The availability of human-annotated data can produce a considerable improvement in accuracy, however, the acquisition of appropriate labeled data is a costly process.  ... 
doi:10.3390/info12020064 fatcat:bvq34ojiojbtphnv5j46m3pxca

Designing, Realizing, Running, and Evaluating Virtual Museum – a Survey on Innovative Concepts and Technologies

Nelson Baloian, Daniel Biella, Wolfram Luther, José Pino, Daniel Sacher
2021 Journal of universal computer science (Online)  
As a result, this survey identifies different approaches and advocates for stakeholders’ collaboration throughout the life cycle in determining the ViM's direction and evolution, its concepts  ...  Based on their categories and features, we distinguish between content-, communication- and collaboration-centric museums with a special focus on learning and co-curation.  ...  Acknowledgements We would like to express our gratitude to all those involved in the aforementioned museum projects and to thank our anonymous reviewers for their helpful comments and suggestions, which  ... 
doi:10.3897/jucs.77153 fatcat:3fhjracmurhvxagsyvchjjyz74
« Previous Showing results 1 — 15 out of 1,192 results