A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2019; you can also visit the original URL.
The file type is application/pdf
.
Filters
Collaborative Speech Data Acquisition for Under Resourced Languages through Crowdsourcing
2016
Procedia Computer Science
Scarcity of resources in under resourced languages may leave these languages behind in race of development of data driven NLP systems. ...
Crowdsourcing has come up as a technique to bridge this gap, as it offers approach for collecting such resources in collaborative manner. ...
Speech data collection through crowdsourcing would save cost and effort in collecting such data. ...
doi:10.1016/j.procs.2016.04.027
fatcat:xpezpsoqkjaq7edzd7wh6woqju
Crowdsourcing research opportunities
2012
Proceedings of the 12th International Conference on Knowledge Management and Knowledge Technologies - i-KNOW '12
We address this lack of awareness, firstly by highlighting the positive impacts that crowdsourcing has had on Natural Language Processing research. ...
We conclude with future trends and opportunities of crowdsourcing for science, including its potential for disseminating results, making science more accessible, and enriching educational programs. ...
of dialog systems [38] thus lowering the traditionally high acquisition barrier for speech based resources. ...
doi:10.1145/2362456.2362479
dblp:conf/iknow/SabouBS12
fatcat:rt4sgpjzabh2jdozi42w5n6b3a
Crowdsourcing for Language Resource Development: Criticisms About Amazon Mechanical Turk Overpowering Use
[chapter]
2014
Lecture Notes in Computer Science
resources development, while limiting the risks of ethical and legal issues without letting go price or quality, 4-to introduce an Ethics and Big Data Charter for the documentation of language resources ...
Crowdsourcing refers to the fact that the job is outsourced via the web and done by many people (paid or not). ...
This work was partly realized as part of the Quaero Programme, funded by oseo, French State agency for innovation, as well as part of the French anr project edylex (anr-09-cord-008) and of the Network ...
doi:10.1007/978-3-319-08958-4_25
fatcat:tcfzsgvl2raspf44f34yd5tsey
Introduction to the special issue
2017
Language Resources and Evaluation
Yet, these languages stand to benefit most from emergent collaborative approaches and technologies for language resource development. ...
Indeed, these approaches seem particularly well-suited for collecting the data needed for the development of language technology applications & Laurette Pretorius ...
transfer when linguistically annotated data is scarce, as is the case for many under-resourced languages. ...
doi:10.1007/s10579-017-9405-8
fatcat:hufsahibvfakhd6jpgk2dsyoly
COLLECTIVE INTELLIGENCE (CROWD SOURCING) ON THE INTERNET: a collaborative approach in information and knowledge management
2019
International journal for innovation education and research
their use of the virtual environment in the process of knowledge generation for their own benefit. ...
The present work proposes to investigate the practice of collective intelligence (crowdsourcing) on the Internet by scientific institutions that develop or not sustainable actions, in order to characterize ...
through specific criteria; and have predilection for knowledge acquisition in the form of identification of diverse contents, which is in line with the initial proposal of crowdsourcing: to mobilize participation ...
doi:10.31686/ijier.vol7.iss4.1406
fatcat:llw6na4ypbet3iaagi2ezdfkfq
Construction and Application of a Human-Computer Collaborative Multimodal Practice Teaching Model for Preschool Education
2022
Computational Intelligence and Neuroscience
designed to combine the basic lesson types of preschool classroom teaching and the secondary objectives of the English curriculum standards, including "reading text–reading aloud evaluation," "playing speech–sound ...
Combined with Gagne's nine teaching events, a model of the English teaching process based on human-computer collaboration was constructed. ...
Crowdsourcing markup acquisition is usually divided into two phases: worker labeling and markup aggregation. ...
doi:10.1155/2022/2973954
pmid:35785056
pmcid:PMC9249456
fatcat:sv4dyc7fendgplihxarqqy75by
An Open Linguistic Infrastructure for Annotated Corpora
[chapter]
2013
The People's Web Meets NLP
layers from part of speech through discourse structure. ...
, making data acquisition the major issue for ANC-OLI development. ...
doi:10.1007/978-3-642-35085-6_10
dblp:series/tanlp/Ide13
fatcat:np45jttao5avvnckrvrcuruzby
Humanities Crowdsourcing
2016
Zagadnienia informacji naukowej
In knowledge acquisition for computing related fields (artificial intelligence, Semantic Web, machine learning, natural language processing, speech processing) Crowdflower, a MLab platform, has been used ...
Collaboration in humanities is not based on a single data model (Favier, 2015) . ...
doi:10.36702/zin.300
fatcat:x6aq7ami4re6rewxwb3qhsiege
Multimodality, interactivity, and crowdsourcing for document transcription
2018
Computational intelligence
In this case, when collaborators employ mobile devices, speech dictation can be used as transcription source, and speech and handwritten text recognition can be fused to provide a better draft transcription ...
The novel contributions presented in this work include the study of the data fusion on a multimodal crowdsourcing framework and its integration with an interactive system. ...
As previously said, crowdsourcing approaches for HTR are very useful. Speech recog-nition is other field where crowdsourcing approaches can be applied. ...
doi:10.1111/coin.12169
fatcat:ytgrih32zzdhpa5xgjjopch5wu
Eliciting Big Data From Small, Young, or Non-standard Languages: 10 Experimental Challenges
2019
Frontiers in Psychology
Our list of challenges involves static characteristics (e.g., absence of orthographic conventions and how it affects data collection), dynamic processes (e.g., speed of language change in small languages ...
for addressing it effectively. ...
However, the acquisition of such data may pose more pronounced challenges for the linguist working with small, young, and/or non-standard languages. ...
doi:10.3389/fpsyg.2019.00313
pmid:30837922
pmcid:PMC6382742
fatcat:ypgkorqupndvbcpotovz3pxs6y
Directions for the future of technology in pronunciation research and teaching
2018
Journal of Second Language Pronunciation
Next, we discuss the nature of data in pronunciation research, pointing to ways in which future work can build on advances in corpus research and crowdsourcing. ...
Finally, we consider how these insights pave the way for researchers and developers working to create research-informed, computer-assisted pronunciation teaching resources. ...
Acknowledgements The authors thank Scott Jarvis, Executive Director of Language Learning, and the sub-committee of the Language Learning Board of Directors, for funding the Roundtable event at PSLLT 2016 ...
doi:10.1075/jslp.17001.obr
fatcat:rdgxk3woxzhxxfgvcpbicjnyre
Creating Expert Knowledge by Relying on Language Learners: a Generic Approach for Mass-Producing Language Resources by Combining Implicit Crowdsourcing and Language Learning
2020
International Conference on Language Resources and Evaluation
We introduce in this paper a generic approach to combine implicit crowdsourcing and language learning in order to mass-produce language resources (LRs) for any language for which a crowd of language learners ...
We then present an international network called the European Network for Combining Language Learning with Crowdsourcing Techniques (enetCollect) that provides the context to accelerate the implementation ...
for the enhancement of NLP resources. ...
dblp:conf/lrec/NicolasLBFFZKCH20
fatcat:yp4g6ynrhrgpjmgatmln4qt3c4
Crawl and crowd to bring machine translation to under-resourced languages
2016
Language Resources and Evaluation
We present a widely applicable methodology to bring machine translation (MT) to under-resourced languages in a cost-effective and rapid manner. ...
Our proposal relies on web crawling to automatically acquire parallel data to train statistical MT systems if any such data can be found for the language pair and domain of interest. ...
In this work we built MT systems between an under-resourced language (Croatian) and, arguably, the best-resourced language (English). ...
doi:10.1007/s10579-016-9363-6
fatcat:kl7gpyhu6ncuphi4pdk3yf55qy
CrowdHeritage: Crowdsourcing for Improving the Quality of Cultural Heritage Metadata
2021
Information
In this context, metadata enrichment services through automated analysis and feature extraction along with crowdsourcing annotation services can offer a great opportunity for improving the metadata quality ...
To address this need, we propose the CrowdHeritage open end-to-end enrichment and crowdsourcing ecosystem, which supports an end-to-end workflow for the improvement of cultural heritage metadata by employing ...
The availability of human-annotated data can produce a considerable improvement in accuracy, however, the acquisition of appropriate labeled data is a costly process. ...
doi:10.3390/info12020064
fatcat:bvq34ojiojbtphnv5j46m3pxca
Designing, Realizing, Running, and Evaluating Virtual Museum – a Survey on Innovative Concepts and Technologies
2021
Journal of universal computer science (Online)
As a result, this survey identifies different approaches and advocates for stakeholders’ collaboration throughout the life cycle in determining the ViM's direction and evolution, its concepts ...
Based on their categories and features, we distinguish between content-, communication- and collaboration-centric museums with a special focus on learning and co-curation. ...
Acknowledgements We would like to express our gratitude to all those involved in the aforementioned museum projects and to thank our anonymous reviewers for their helpful comments and suggestions, which ...
doi:10.3897/jucs.77153
fatcat:3fhjracmurhvxagsyvchjjyz74
« Previous
Showing results 1 — 15 out of 1,192 results