A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2020.
Text, Speech and Language Technology
In preparation for publication, shared corpora are generally associated with metadata and documented to indicate the authors and annotators of the data, the volume and types of raw material included, the ... Data planning depends upon the purpose of the project, the linguistic resources needed, the internal and external limitations on acquiring them, availability of data, bandwidth and distribution requirements ... Each year new communities embrace the practice of sharing language resources. ...
doi:10.1007/978-1-4020-5817-2_8 fatcat:fby3bpfg7ngojhqzx7cqhcs7ly