Filters








16,020 Hits in 7.1 sec

On Creating Benchmark Dataset for Aerial Image Interpretation: Reviews, Guidances and Million-AID [article]

Yang Long, Gui-Song Xia, Shengyang Li, Wen Yang, Michael Ying Yang, Xiao Xiang Zhu, Liangpei Zhang, Deren Li
2021 arXiv   pre-print
We do hope this paper will provide the RS community an overall perspective on constructing large-scale and practical image datasets for further research, especially data-driven ones.  ...  Following the presented guidances, we also provide an example on building RS image dataset, i.e., Million-AID, a new large-scale benchmark dataset containing a million instances for RS image scene classification  ...  On the one hand, the size and number of images are important properties concerning the scale of a RS image dataset.  ... 
arXiv:2006.12485v2 fatcat:g54goum7wzhcllh46vn6plr5ra

On Creating Benchmark Dataset for Aerial Image Interpretation: Reviews, Guidances and Million-AID

Yang Long, Gui-Song Xia, Shengyang Li, Wen Yang, Michael Yang, Xiaoxiang Zhu, Liangpei Zhang, Deren Li
2021 IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing  
We do hope this paper will provide the RS community an overall perspective on constructing large-scale and practical image datasets for further research, especially datadriven ones.  ...  Following the presented guidances, we also provide an example on building RS image dataset, i.e., Million-AID 1 , a new large-scale benchmark dataset containing a million instances for RS image scene classification  ...  On the one hand, the size and number of images are important properties concerning the scale of a RS image dataset.  ... 
doi:10.1109/jstars.2021.3070368 fatcat:or7fzkzdnzasldvjxs4l3fmzte

PrivOnto: A semantic framework for the analysis of privacy policies

Alessandro Oltramari, Dhivya Piraviperumal, Florian Schaub, Shomir Wilson, Sushain Cherivirala, Thomas B. Norton, N. Cameron Russell, Peter Story, Joel Reidenberg, Norman Sadeh, Mathieu d'Aquin, Sabrina Kirrane (+4 others)
2018 Semantic Web Journal  
PrivOnto has been used to analyze a corpus of over 23,000 annotated data practices, extracted from 115 privacy policies of US-based companies.  ...  researchers and regulators in the analysis of privacy policies at scale.  ...  contributions to the design and validation of the annotation scheme, as well as the corpus creation.  ... 
doi:10.3233/sw-170283 fatcat:s2xxw6i6g5ax7ittoltji63lji

A Pragmatic Approach to Semantic Annotation for Search of Legal Texts – An Experiment on GDPR [chapter]

Adeline Nazarenko, François Lévy, Adam Wyner
2021 Frontiers in Artificial Intelligence and Applications  
This new approach is illustrated on a proof-of-concept experiment that consisted in semantically annotating a significant part of the French version of the GDPR.  ...  The aim is that legal texts can be enriched on a large scale at a reasonable cost, paving the way for new search capabilities that will facilitate mining of legal sources.  ...  that large portions of text can be ted quickly, without the need for legal experts.14 This suggests that the proposed ch opens the way to a large-scale semantic search for legal texts. ere is too  ... 
doi:10.3233/faia210313 fatcat:azao2s6mbrfirb223nnkxtbqsq

Semantic Role Labeling for Learner Chinese: the Importance of Syntactic Parsing and L2-L1 Parallel Data [article]

Zi Lin, Yuguang Duan, Yuanyuan Zhao, Weiwei Sun, Xiaojun Wan
2018 arXiv   pre-print
Finally, the paper introduces a new agreement-based model to explore the semantic coherency information in the large-scale L2-L1 parallel data.  ...  We first manually annotate the semantic roles for a set of learner texts to derive a gold standard for automatic SRL.  ...  Acknowledgement This work was supported by the National Natural Science Foundation of China (61772036, 61331011) and the Key Laboratory of Science, Technology and Standard in Press Industry (Key Laboratory  ... 
arXiv:1808.09409v2 fatcat:wtyiabpamnasvbw7ryqx27dodm

Semantic Role Labeling for Learner Chinese: the Importance of Syntactic Parsing and L2-L1 Parallel Data

Zi Lin, Yuguang Duan, Yuanyuan Zhao, Weiwei Sun, Xiaojun Wan
2018 Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing  
Finally, the paper introduces a new agreement-based model to explore the semantic coherency information in the large-scale L2-L1 parallel data.  ...  We first manually annotate the semantic roles for a set of learner texts to derive a gold standard for automatic SRL.  ...  Acknowledgement This work was supported by the National Natural Science Foundation of China (61772036, 61331011) and the Key Laboratory of Science, Technology and Standard in Press Industry (Key Laboratory  ... 
doi:10.18653/v1/d18-1414 dblp:conf/emnlp/LinDZS018 fatcat:aj3ahe2hdzgd7pcgatomnv5wju

Emotional modelling and classification of a large-scale collection of scene images in a cluster environment

Jianfang Cao, Yanfei Li, Yun Tian, Pratyoosh Shukla
2018 PLoS ONE  
OPEN ACCESS Citation: Cao J, Li Y, Tian Y (2018) Emotional modelling and classification of a large-scale collection of scene images in a cluster environment. PLoS ONE 13(1): e0191064. https://  ...  Thus, the experiments achieved a good classification effect and can lay a foundation for classification in terms of additional types of emotional image semantics, thereby demonstrating the practical significance  ...  Acknowledgments This study was supported by the Natural Science Foundation of Shanxi Province The funders played no role in the study design, data collection and analysis, decision to publish, or preparation  ... 
doi:10.1371/journal.pone.0191064 pmid:29320579 pmcid:PMC5761962 fatcat:syiyh3hfdjhknhv736zvsfosxe

Teaching Parallelism without Programming: A Data Science Curriculum for Non-CS Students

Yolanda Gil
2014 2014 Workshop on Education for High Performance Computing  
A key aspect of our work is the use of workflows to illustrate key concepts and to allow the students to practice.  ...  The course is designed to cover major concepts that are useful to understand the benefits of parallel and distributed programming while not relying on a programming background.  ...  ACKNOWLEDGMENT We gratefully acknowledge support from the National Science Foundation (NSF) with award ACI-1355475.  ... 
doi:10.1109/eduhpc.2014.12 dblp:conf/sc/Gil14 fatcat:govj2asp4reblkdodiw5fwzcny

Exploring Data-Efficient 3D Scene Understanding with Contrastive Scene Contexts [article]

Ji Hou, Benjamin Graham, Matthias Nießner, Saining Xie
2021 arXiv   pre-print
The rapid progress in 3D scene understanding has come with growing demand for data; however, collecting and annotating 3D scenes (e.g. point clouds) are notoriously hard.  ...  Our method achieves state-of-the-art results on a suite of benchmarks where training data or labels are scarce.  ...  The authors would like to thank Norman Müller, Manuel Dahnert, Yawar Siddiqui and Angela Dai and anonymous reviewers for their constructive feedback.  ... 
arXiv:2012.09165v3 fatcat:oadjpimlvfgvljr737yxh2oijy

Data management strategies for multinational large-scale systems biology projects

W. Wruck, M. Peuker, C. R. A. Regenbrecht
2012 Briefings in Bioinformatics  
Here, we give an overview of a selection of open-source data management systems proved to be employed successfully in large-scale projects.  ...  The Guardian 2012]. By the use of high-throughput methods in many research areas from physics to systems biology, large data collections are increasingly important as raw material for research.  ...  The authors acknowledge support from the German Federal Ministry of Education and Research (BMBF GRANT 0315717A), which is a partner of the ERASysBioþ initiative supported under the EU ERA-NET Plus scheme  ... 
doi:10.1093/bib/bbs064 pmid:23047157 pmcid:PMC3896927 fatcat:ivmmnprxzzg4tckczjkthgdeoi

Semantics Centric Solutions for Application and Data Portability in Cloud Computing

Ajith Ranabahu, Amit Sheth
2010 2010 IEEE Second International Conference on Cloud Computing Technology and Science  
The issues the Cloud computing community is facing now with respect to portability of data and application logic are exactly the same issue the Semantic Web community has been trying to address for some  ...  Significant work has been done in the areas of data modeling, matching, and transformations.  ...  Greenfield argues that the inability of the OOP 1 http://www.rememberthemilk.com/ Apart from these major undertakings, there has been a large number of small scale developments based on DSLs both in academia  ... 
doi:10.1109/cloudcom.2010.48 dblp:conf/cloudcom/RanabahuS10 fatcat:z2gndybrtratdj3wj45z2fyrwq

HPC and Grid Computing for Integrative Biomedical Research

Tahsin Kurc, Shannon Hastings, Vijay Kumar, Stephen Langella, Ashish Sharma, Tony Pan, Scott Oster, David Ervin, Justin Permar, Sivaramakrishnan Narayanan, Yolanda Gil, Ewa Deelman (+4 others)
2009 The international journal of high performance computing applications  
Integrative biomedical research projects query, analyze, and integrate many different data types and make use of datasets obtained from measurements or simulations of structure and function at multiple  ...  biological scales.  ...  -0130437, #CNS-0615155, #CNS-0406386, the Ohio Board of Regents through grants BRTTC #BRTT02-0003 and AGMT TECH-04049, and the NIH through grants #R24 HL085343, #R01 LM009239, and #79077CBS10.  ... 
doi:10.1177/1094342009106192 pmid:20107625 pmcid:PMC2811341 fatcat:5p26dv5uwnbqtjjhn3lnugmqly

Time-Aware Ancient Chinese Text Translation and Inference [article]

Ernie Chang, Yow-Ting Shiue, Hui-Syuan Yeh, Vera Demberg
2021 arXiv   pre-print
%We show experimentally the efficacy of our framework in producing quality translation outputs and also validate our framework on a collected task-specific parallel corpus.  ...  We validate our framework on a parallel corpus annotated with chronology information and show experimentally its efficacy in producing quality translation outputs.  ...  Acknowledgements This research was funded in part by the German Research Foundation (DFG) as part of SFB 248 "Foundations of Perspicuous Software Systems".  ... 
arXiv:2107.03179v1 fatcat:mmzydyiohvdnvgly63n36v66tq

From the knowledge acquisition bottleneck to the knowledge acquisition overflow: A brief French history of knowledge acquisition

Nathalie Aussenac-Gilles, Fabien Gandon
2013 International Journal of Human-Computer Studies  
As a consequence of the huge amount of available data in the web, a paradigm shift occurred in the domain, from knowledge-intensive problem solving to large-scale data acquisition and management.  ...  In particular, it reports the most significant steps in the parallel evolution of the web and the knowledge acquisition paradigm, which finally converged with the project of a semantic web.  ...  As the web grew in size and diversity, the challenge to turn it into a semantic web became more complex and included the management of large data-sets, as well as the access to text and document content  ... 
doi:10.1016/j.ijhcs.2012.10.009 fatcat:gkwaq2sevrd47kc3kvwaeix4pq

Extracting Semantics from Multimedia Content: Challenges and Solutions [chapter]

Lexing Xie, Rong Yan
2008 Signals and Communication Technology  
In this chapter, we present a review on extracting semantics from a large amount of multimedia data as a statistical learning problem.  ...  The lack of effective indexes to describe the content of multimedia data is a main hurdle to multimedia search, and extracting semantics from multimedia content is the bottleneck for multimedia indexing  ...  : rare semantics, sparseness of labels in an abundance of unlabeled data, scaling to large datasets and large sets of semantics; accounting for the the natural dependencies in data with structured input  ... 
doi:10.1007/978-0-387-76569-3_2 fatcat:jul6fw7esfaurct6erjnvpcq6q
« Previous Showing results 1 — 15 out of 16,020 results