A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2021; you can also visit the original URL.
The file type is application/pdf
.
Filters
On Creating Benchmark Dataset for Aerial Image Interpretation: Reviews, Guidances and Million-AID
[article]
2021
arXiv
pre-print
We do hope this paper will provide the RS community an overall perspective on constructing large-scale and practical image datasets for further research, especially data-driven ones. ...
Following the presented guidances, we also provide an example on building RS image dataset, i.e., Million-AID, a new large-scale benchmark dataset containing a million instances for RS image scene classification ...
On the one hand, the size and number of images are important properties concerning the scale of a RS image dataset. ...
arXiv:2006.12485v2
fatcat:g54goum7wzhcllh46vn6plr5ra
On Creating Benchmark Dataset for Aerial Image Interpretation: Reviews, Guidances and Million-AID
2021
IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing
We do hope this paper will provide the RS community an overall perspective on constructing large-scale and practical image datasets for further research, especially datadriven ones. ...
Following the presented guidances, we also provide an example on building RS image dataset, i.e., Million-AID 1 , a new large-scale benchmark dataset containing a million instances for RS image scene classification ...
On the one hand, the size and number of images are important properties concerning the scale of a RS image dataset. ...
doi:10.1109/jstars.2021.3070368
fatcat:or7fzkzdnzasldvjxs4l3fmzte
PrivOnto: A semantic framework for the analysis of privacy policies
2018
Semantic Web Journal
PrivOnto has been used to analyze a corpus of over 23,000 annotated data practices, extracted from 115 privacy policies of US-based companies. ...
researchers and regulators in the analysis of privacy policies at scale. ...
contributions to the design and validation of the annotation scheme, as well as the corpus creation. ...
doi:10.3233/sw-170283
fatcat:s2xxw6i6g5ax7ittoltji63lji
A Pragmatic Approach to Semantic Annotation for Search of Legal Texts – An Experiment on GDPR
[chapter]
2021
Frontiers in Artificial Intelligence and Applications
This new approach is illustrated on a proof-of-concept experiment that consisted in semantically annotating a significant part of the French version of the GDPR. ...
The aim is that legal texts can be enriched on a large scale at a reasonable cost, paving the way for new search capabilities that will facilitate mining of legal sources. ...
that large portions of text can be
ted quickly, without the need for legal experts.14 This suggests that the proposed
ch opens the way to a large-scale semantic search for legal texts.
ere is too ...
doi:10.3233/faia210313
fatcat:azao2s6mbrfirb223nnkxtbqsq
Semantic Role Labeling for Learner Chinese: the Importance of Syntactic Parsing and L2-L1 Parallel Data
[article]
2018
arXiv
pre-print
Finally, the paper introduces a new agreement-based model to explore the semantic coherency information in the large-scale L2-L1 parallel data. ...
We first manually annotate the semantic roles for a set of learner texts to derive a gold standard for automatic SRL. ...
Acknowledgement This work was supported by the National Natural Science Foundation of China (61772036, 61331011) and the Key Laboratory of Science, Technology and Standard in Press Industry (Key Laboratory ...
arXiv:1808.09409v2
fatcat:wtyiabpamnasvbw7ryqx27dodm
Semantic Role Labeling for Learner Chinese: the Importance of Syntactic Parsing and L2-L1 Parallel Data
2018
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing
Finally, the paper introduces a new agreement-based model to explore the semantic coherency information in the large-scale L2-L1 parallel data. ...
We first manually annotate the semantic roles for a set of learner texts to derive a gold standard for automatic SRL. ...
Acknowledgement This work was supported by the National Natural Science Foundation of China (61772036, 61331011) and the Key Laboratory of Science, Technology and Standard in Press Industry (Key Laboratory ...
doi:10.18653/v1/d18-1414
dblp:conf/emnlp/LinDZS018
fatcat:aj3ahe2hdzgd7pcgatomnv5wju
Emotional modelling and classification of a large-scale collection of scene images in a cluster environment
2018
PLoS ONE
OPEN ACCESS Citation: Cao J, Li Y, Tian Y (2018) Emotional modelling and classification of a large-scale collection of scene images in a cluster environment. PLoS ONE 13(1): e0191064. https:// ...
Thus, the experiments achieved a good classification effect and can lay a foundation for classification in terms of additional types of emotional image semantics, thereby demonstrating the practical significance ...
Acknowledgments This study was supported by the Natural Science Foundation of Shanxi Province The funders played no role in the study design, data collection and analysis, decision to publish, or preparation ...
doi:10.1371/journal.pone.0191064
pmid:29320579
pmcid:PMC5761962
fatcat:syiyh3hfdjhknhv736zvsfosxe
Teaching Parallelism without Programming: A Data Science Curriculum for Non-CS Students
2014
2014 Workshop on Education for High Performance Computing
A key aspect of our work is the use of workflows to illustrate key concepts and to allow the students to practice. ...
The course is designed to cover major concepts that are useful to understand the benefits of parallel and distributed programming while not relying on a programming background. ...
ACKNOWLEDGMENT We gratefully acknowledge support from the National Science Foundation (NSF) with award ACI-1355475. ...
doi:10.1109/eduhpc.2014.12
dblp:conf/sc/Gil14
fatcat:govj2asp4reblkdodiw5fwzcny
Exploring Data-Efficient 3D Scene Understanding with Contrastive Scene Contexts
[article]
2021
arXiv
pre-print
The rapid progress in 3D scene understanding has come with growing demand for data; however, collecting and annotating 3D scenes (e.g. point clouds) are notoriously hard. ...
Our method achieves state-of-the-art results on a suite of benchmarks where training data or labels are scarce. ...
The authors would like to thank Norman Müller, Manuel Dahnert, Yawar Siddiqui and Angela Dai and anonymous reviewers for their constructive feedback. ...
arXiv:2012.09165v3
fatcat:oadjpimlvfgvljr737yxh2oijy
Data management strategies for multinational large-scale systems biology projects
2012
Briefings in Bioinformatics
Here, we give an overview of a selection of open-source data management systems proved to be employed successfully in large-scale projects. ...
The Guardian 2012]. By the use of high-throughput methods in many research areas from physics to systems biology, large data collections are increasingly important as raw material for research. ...
The authors acknowledge support from the German Federal Ministry of Education and Research (BMBF GRANT 0315717A), which is a partner of the ERASysBioþ initiative supported under the EU ERA-NET Plus scheme ...
doi:10.1093/bib/bbs064
pmid:23047157
pmcid:PMC3896927
fatcat:ivmmnprxzzg4tckczjkthgdeoi
Semantics Centric Solutions for Application and Data Portability in Cloud Computing
2010
2010 IEEE Second International Conference on Cloud Computing Technology and Science
The issues the Cloud computing community is facing now with respect to portability of data and application logic are exactly the same issue the Semantic Web community has been trying to address for some ...
Significant work has been done in the areas of data modeling, matching, and transformations. ...
Greenfield argues that the inability of the OOP 1 http://www.rememberthemilk.com/ Apart from these major undertakings, there has been a large number of small scale developments based on DSLs both in academia ...
doi:10.1109/cloudcom.2010.48
dblp:conf/cloudcom/RanabahuS10
fatcat:z2gndybrtratdj3wj45z2fyrwq
HPC and Grid Computing for Integrative Biomedical Research
2009
The international journal of high performance computing applications
Integrative biomedical research projects query, analyze, and integrate many different data types and make use of datasets obtained from measurements or simulations of structure and function at multiple ...
biological scales. ...
-0130437, #CNS-0615155, #CNS-0406386, the Ohio Board of Regents through grants BRTTC #BRTT02-0003 and AGMT TECH-04049, and the NIH through grants #R24 HL085343, #R01 LM009239, and #79077CBS10. ...
doi:10.1177/1094342009106192
pmid:20107625
pmcid:PMC2811341
fatcat:5p26dv5uwnbqtjjhn3lnugmqly
Time-Aware Ancient Chinese Text Translation and Inference
[article]
2021
arXiv
pre-print
%We show experimentally the efficacy of our framework in producing quality translation outputs and also validate our framework on a collected task-specific parallel corpus. ...
We validate our framework on a parallel corpus annotated with chronology information and show experimentally its efficacy in producing quality translation outputs. ...
Acknowledgements This research was funded in part by the German Research Foundation (DFG) as part of SFB 248 "Foundations of Perspicuous Software Systems". ...
arXiv:2107.03179v1
fatcat:mmzydyiohvdnvgly63n36v66tq
From the knowledge acquisition bottleneck to the knowledge acquisition overflow: A brief French history of knowledge acquisition
2013
International Journal of Human-Computer Studies
As a consequence of the huge amount of available data in the web, a paradigm shift occurred in the domain, from knowledge-intensive problem solving to large-scale data acquisition and management. ...
In particular, it reports the most significant steps in the parallel evolution of the web and the knowledge acquisition paradigm, which finally converged with the project of a semantic web. ...
As the web grew in size and diversity, the challenge to turn it into a semantic web became more complex and included the management of large data-sets, as well as the access to text and document content ...
doi:10.1016/j.ijhcs.2012.10.009
fatcat:gkwaq2sevrd47kc3kvwaeix4pq
Extracting Semantics from Multimedia Content: Challenges and Solutions
[chapter]
2008
Signals and Communication Technology
In this chapter, we present a review on extracting semantics from a large amount of multimedia data as a statistical learning problem. ...
The lack of effective indexes to describe the content of multimedia data is a main hurdle to multimedia search, and extracting semantics from multimedia content is the bottleneck for multimedia indexing ...
: rare semantics, sparseness of labels in an abundance of unlabeled data, scaling to large datasets and large sets of semantics; accounting for the the natural dependencies in data with structured input ...
doi:10.1007/978-0-387-76569-3_2
fatcat:jul6fw7esfaurct6erjnvpcq6q
« Previous
Showing results 1 — 15 out of 16,020 results