A curated and evolving linguistic linked dataset

Emanuele Di Buccio, Giorgio Maria Di Nunzio, Gianmaria Silvello
2013 Semantic Web Journal  
Both the ASIt linguistic linked dataset and the Resource Description Framework Schema (RDF/S) on which it is based are publicly available and released with a Creative Commons license (CC BY-NC-SA 3.0).  ...  This paper describes the Atlante Sintattico d'Italia, Syntactic Atlas of Italy (ASIt) linguistic linked dataset.  ...  Acknowledgments The authors wish to thank Maristella Agosti for her support and contribution in the design and development of the ASIt Digital Library.  ... 
doi:10.3233/sw-2012-0083 fatcat:akiejrvzrfhaljvde6qrlbxt3e

Cross-Linguistic Data Formats, advancing data sharing and re-use in comparative linguistics

Robert Forkel, Johann-Mattis List, Simon J Greenhill, Christoph Rzymski, Sebastian Bank, Michael Cysouw, Harald Hammarström, Martin Haspelmath, Gereon A Kaiping, Russell D Gray
2018 Scientific Data  
The new specification for cross-linguistic data formats comes along with a software package for validation and manipulation, a basic ontology which links to more general frameworks, and usage examples  ...  The Cross-Linguistic Data Formats initiative proposes new standards for two basic types of data in historical and typological language comparison (word lists, structural datasets) and a framework to incorporate  ...  Acknowledgements This research would not have been possible without the generous support by many institutes and funding agencies.  ... 
doi:10.1038/sdata.2018.205 pmid:30325347 pmcid:PMC6190742 fatcat:u2ojddtwbbah7h4sbeeqlqeyv4

Glottocodes: Identifiers linking families, languages and dialects to comprehensive reference information

Robert Forkel, Harald Hammarström, Julia Bosque-Gil, Milan Dojchinovski, Philipp Cimiano
2022 Semantic Web Journal  
As such the glottocode-system responds to an important challenge in the realm of Linguistic Linked Data with numerous NLP applications.  ...  In this paper, we summarize the motivation and history behind the system of glottocodes and describe the principles and practices of data curation, technical infrastructure and update/version-tracking  ...  Thus, collaboration and curation workflows can make use of git, a distributed version control system to track  ... 
doi:10.3233/sw-212843 fatcat:aqiht6utdjfwzgzey5ouj7zhbq

Using Data Curation Profiles to Design the Datastar Dataset Registry

Sarah J. Wright, Wendy A. Kozlowski, Dianne Dietrich, Huda J. Khan, Gail S. Steinhart, Leslie McIntosh
2013 D-Lib Magazine  
Linguistics. Peter Buneman, S. Khanna, and W. C. Tan, "Why and Where: A Characterization of Data Provenance".  ...  , monitoring and linking of publications (e.g.  ... 
doi:10.1045/july2013-wright fatcat:pjkhq4buebcynj4zhrkq3v7mje

LACLICHEV: Exploring the History of Climate Change in Latin America within Newspapers Digital Collections [article]

Genoveva Vargas-Solar, José-Luis Zechinelli-Martini, Javier A. Espinosa-Oviedo, Luis M. Vilches-Blázquez
2021 arXiv   pre-print
This environment provides tools for curating, exploring and analyzing historical newspapers articles, their description and location, and the vocabularies used for referring to meteorological events.  ...  The objective being to understand the content of newspapers and identifying possible patterns and models that can build a view of the history of climate change in the Latin American region.  ...  Curating historical newspapers articles The objective of curating historical newspapers articles is to build a dataset of documents reporting meteorological events and associating them with meta-data,  ... 
arXiv:2105.00792v1 fatcat:ok5lb5sd7zdclireezcjjkqnem

The Ancestry Of Sino-Tibetan Populations And Languages

Mei-Shin Wu, Yunfan Lai, Johann-Mattis List
2018 Zenodo  
This is my oral presentation handout for "The 51st International Conference on Sino-Tibetan Languages and Linguistics" (2018/09/25-2018/09/28, Kyoto, Japan)  ...  code, and also submit a draft version to a stable archive to guarantee that the data will be long-term archived.  ...  Hugo Reyes-Centeno from the DFG Center for Advanced Studies "Words, Bones, Genes, Tools: Tracking Linguistic, Cultural and Biological Trajectories of the Human Past" (University of Tübingen), and Dr.  ... 
doi:10.5281/zenodo.1306622 fatcat:4idlqq3usrfsxobmr24lin7h6e

Linking norms, ratings, and relations of words and concepts across multiple language varieties

Annika Tjuka, Robert Forkel, Johann-Mattis List
2021 Behavior Research Methods  
The database is curated with the help of manual, automated, semi-automated workflows and uses a software API to control and access the data.  ...  Building on a reference catalog that offers standardization of concepts used in historical and typological language comparison, it integrates data from psychology and linguistics, collected from 98 data  ...  Under the tab Datasets a list appears and each data set shows up with a label for the data type (norms, ratings, or relations) and the language.  ... 
doi:10.3758/s13428-021-01650-1 pmid:34357536 pmcid:PMC9046307 fatcat:ot4c4tpwqzf5ve256twfcra7gi

Knowledge Graphs and Knowledge Networks: The Story in Brief [article]

Amit Sheth, Swati Padhee, Amelie Gyrard
2020 arXiv   pre-print
there is a need to represent the changing nodes, attributes, and edges over time.  ...  KGs are significantly contributing to various AI applications from link prediction, entity relations prediction, node classification to recommendation and question answering systems.  ...  Some of the endeavors in KG curation include: Linked Open Data (LOD) provides diverse sources of knowledge to populate and enrich KGs. By March 2019, it covered 1,239 datasets with 16,147 links.  ... 
arXiv:2003.03623v1 fatcat:zle7g626mzeqrkmobgrr6xtfae

SSHOC D7.1 System Specification - SSH Open Marketplace

Laure Barbot, Yoan Moranville, Frank Fischer, Clara Petitfils, Matej Ďurčo, Klaus Illmayer, Tomasz Parkoła, Philipp Wieder, Sotiris Karampatakis
2019 Zenodo  
The Social Sciences & Humanities communities are in an urgent need for a place to gather and exchange information about their tools, services, and datasets.  ...  Over the course of the agile development of the Marketplace, the system specification will also be evolving and contributing to a growing number of SSHOC outcomes.  ...  Concretely, "instead of being just a list of links or database of resources, [the Marketplace] will contextualise and interlink tools, services and datasets offered, with screenshots, tutorials and links  ... 
doi:10.5281/zenodo.4558302 fatcat:4gbaohzm6rdaxl5hlocc4horvu

D7.1 System Specification - SSH Open Marketplace

Laure Barbot, Yoan Moranville, Frank Fischer, Clara Petitfils, Matej Ďurčo, Klaus Illmayer, Tomasz Parkoła, Philipp Wieder, Sotiris Karampatakis
2019 Zenodo  
The Social Sciences & Humanities communities are in an urgent need for a place to gather and exchange information about their tools, services, and datasets.  ...  Over the course of the agile development of the Marketplace, the system specification will also be evolving and contributing to a growing number of SSHOC outcomes.  ...  Concretely, "instead of being just a list of links or database of resources, [the Marketplace] will contextualise and interlink tools, services and datasets offered, with screenshots, tutorials and links  ... 
doi:10.5281/zenodo.3547648 fatcat:wohrsro2fvcfpb4lfs4wmvwvmi

"The Naming of Cats": Automated Genre Classification

Yunhyong Kim, Seamus Ross
2008 International Journal of Digital Curation  
as an object linked to previously classified objects and other external sources) and have examined visual and language model features.  ...  We have previously proposed dividing features of a document into five types (features for visual layout, language model features, stylometric features, features for semantic structure, and contextual features  ...  or absence and location of headers, delimiters, images, or links.  ... 
doi:10.2218/ijdc.v2i1.13 fatcat:pln55d4hxjdk3puknlwm3sspea

Seeing is Correcting: curating lexical resources using social interfaces

Livy Real, Fabricio Chalub, Valeria dePaiva, Claudia Freitas, Alexandre Rademaker
2015 Proceedings of the 4th Workshop on Linked Data in Linguistics: Resources and Applications  
But to get there, there is a need for user interfaces that allow ordinary users and (not only computational) linguists to help in the checking and cleaning up of the quality of the resource.  ...  This showcases the use and importance of its linked data features, to keep track of information provenance during the whole life-cycle of the RDF resource.  ...  We are still investigating the benefits of using a lexical model such as lemon (Chiarcos et al., 2011a) and of a possible alignment with it.  ... 
doi:10.18653/v1/w15-4203 dblp:conf/acl-ldl/RealCPFR15 fatcat:hfjshy3lzncbdkdn4mcg33tno4

Requirements Analysis for an Open Research Knowledge Graph [article]

Arthur Brack, Anett Hoppe, Markus Stocker, Sören Auer, Ralph Ewerth
2020 arXiv   pre-print
As a result, we map necessary and desirable requirements for successful KG-based science communication, derive implications and outline possible solutions.  ...  , (b) establishing their consequential requirements for a KG-based system, (c) identifying overlaps and specificities, and their coverage in current solutions.  ...  In particular, we propose a framework with lightweight ontologies that can evolve by community curation.  ... 
arXiv:2005.10334v1 fatcat:v267cmxzefhfvcqgo5qwuu5ihq

Psychometric Analysis and Coupling of Emotions Between State Bulletins and Twitter in India during COVID-19 Infodemic [article]

Baani Leen Kaur Jolly, Palash Aggrawal, Amogh Gulati, Amarjit Singh Sethi, Ponnurangam Kumaraguru, Tavpritesh Sethi
2020 arXiv   pre-print
We look at these two sources with a psycho-linguistic lens of emotions and quantified the extent and coupling between the two.  ...  During the COVID-19 crisis, Twitter alone has seen a sharp 45% increase in the usage of its curated events page, and a 30% increase in its direct messaging usage, since March 6th 2020.  ...  To this end, we have curated dataset of more than 5.6 million tweets and retweets, specific to India.  ... 
arXiv:2005.05513v2 fatcat:gmrz4nmrvbc7pmh53qa5bskmly

'HypothesisFinder:' A Strategy for the Detection of Speculative Statements in Scientific Text

Ashutosh Malhotra, Erfan Younesi, Harsha Gurulingappa, Martin Hofmann-Apitius, Andrey Rzhetsky
2013 PLoS Computational Biology  
Subsequent exploration of derived hypothetical knowledge leads to generation of a coherent overview on emerging knowledge niches, and can thus provide added value to ongoing research activities.  ...  To demonstrate the practical utility of our approach, we applied it to the domain of Alzheimer's disease and showed that our automated approach captures a wide spectrum of scientific speculations on Alzheimer's  ...  Acknowledgments The authors would like to thank Theo Mevissen and Bernd Müller for technical support, Dr. Juliane Fluck and Dr. Roman Klinger for their fruitful discussions, and Dr.  ... 
doi:10.1371/journal.pcbi.1003117 pmid:23935466 pmcid:PMC3723489 fatcat:e4iao2k5gzayjluzppzxo4o4am