Cloud-Centric Assured Information Sharing [chapter]

Bhavani Thuraisingham, Vaibhav Khadilkar, Jyothsna Rachapalli, Tyrone Cadenhead, Murat Kantarcioglu, Kevin Hamlen, Latifur Khan, Farhan Husain
2012 Lecture Notes in Computer Science  
In particular, we will describe our current implementation of a centralized cloud-based assured information sharing system and the design of a decentralized hybrid cloud-based assured information sharing  ...  Our goal is for coalition organizations to share information stored in multiple clouds and enforce appropriate policies.  ...  Based on the lessons learned from the implementation of CAISS we will then carry out a detailed design of CAISS++ and subsequently implement the system that will be the first of its kind for cloud-based  ... 
doi:10.1007/978-3-642-30428-6_1 fatcat:isjiaenyljdv3fqdenntqpfpzu

Cloud-Centric Assured Information Sharing [chapter]

2013 Developing and Securing the Cloud  
In particular, we will describe our current implementation of a centralized cloud-based assured information sharing system and the design of a decentralized hybrid cloud-based assured information sharing  ...  Our goal is for coalition organizations to share information stored in multiple clouds and enforce appropriate policies.  ...  Based on the lessons learned from the implementation of CAISS we will then carry out a detailed design of CAISS++ and subsequently implement the system that will be the first of its kind for cloud-based  ... 
doi:10.1201/b15433-41 fatcat:m27n53wcb5es5iszmsz2ysnvky

Designing a Document Retrieval Method for University Digital Libraries Based on Hadoop Technology

Haixia He
2021 Journal of Contemporary Educational Research  
This article uses Hadoop algorithm to extract semantic keywords and then calculates semantic similarity based on the literature retrieval keyword calculation process.  ...  method for university digital libraries based on Hadoop technology.  ...  When the text is in the processing stage, the Hadoop algorithm of the text data source needs to be processed in advance to calculate the weight of the documents under the keywords one by one.  ... 
doi:10.26689/jcer.v5i12.2821 fatcat:hyubmkv4nrbj3a2paujit4iuve

A Semantic-Based Approach for Managing Healthcare Big Data: A Survey

Rafat Hammad, Malek Barhoush, Bilal H. Abed-alguni, Saverio Maietta
2020 Journal of Healthcare Engineering  
In this paper, we review the state of the art on the semantic web for the healthcare industry.  ...  In the recent few years, a large number of organizations and companies have shown enthusiasm for using semantic web technologies with healthcare big data to convert data into knowledge and intelligence  ...  [86] presented a scalable management system for RDF data which is based on the Hadoop MapReduce framework. The proposed architecture is based on using a graph partitioning algorithm to store triples that  ... 
doi:10.1155/2020/8865808 pmid:33489061 pmcid:PMC7787845 fatcat:gorgipqn6famhkeffmqzr4tacq

New and Existing Approaches Reviewing of Big Data Analysis with Hadoop Tools

2022 Baghdad Science Journal  
The purpose of this paper is to analyze literature related analysis of big data of social media using the Hadoop framework for knowing almost analysis tools existing in the world under the Hadoop umbrella  ...  Social media are regarded as an important platform for sharing information, opinion, and knowledge of many subscribers.  ...  The proposed solution uses a scalable and fault tolerant framework (i.e., Hadoop) that usually uses HDFS for data storage. Data processing paradigm and map-reduce.  ... 
doi:10.21123/bsj.2022.19.4.0887 fatcat:syywdq6xgret5gfezyjuam33qy

An Effective and Efficient MapReduce Algorithm for Computing BFS-Based Traversals of Large-Scale RDF Graphs

Alfredo Cuzzocrea, Mirel Cosulschi, Roberto de Virgilio
2016 Algorithms  
Resource Description Framework (RDF) is a significant formalism and language for the so-called Semantic Web, due to the fact that a very wide family of Web entities can be naturally modeled in a graph-shaped  ...  for visiting (RDF) graphs to be decomposed and processed according to the MapReduce framework.  ...  [63] also investigates scalability issues of processing SPARQL queries over Web-scale RDF knowledge bases.  ... 
doi:10.3390/a9010007 fatcat:wxorzsnnovbjvpnoc4prrucnty

Big Data Storage [chapter]

Martin Strohbach, Jörg Daubert, Herman Ravkin, Mario Lischka
2016 New Horizons for a Data-Driven Economy  
Scalability Challenges in Graph-Based Data Stores: Processing data based on graph data structures is beneficial in an increasing amount of applications.  ...  For Treato, the impact of the Hadoop-based storage and processing infrastructure is that they obtain a scalable, reliable, and cost-effective system that may even create insights that would not have been  ... 
doi:10.1007/978-3-319-21569-3_7 fatcat:r7huxlg35fdulovdvoo2t2tfna

Big Data Semantics

Paolo Ceravolo, Antonia Azzini, Marco Angelini, Tiziana Catarci, Philippe Cudré-Mauroux, Ernesto Damiani, Alexandra Mazak, Maurice Van Keulen, Mustafa Jarrar, Giuseppe Santucci, Kai-Uwe Sattler, Monica Scannapieco (+3 others)
2018 Journal on Data Semantics  
In this paper, the third of its kind co-authored by members of IFIP WG 2.6 on Data Semantics, we propose a review of the literature addressing these topics and discuss relevant challenges for future research  ...  Indeed, multiple components and procedures must be coordinated to ensure a high level of data quality and accessibility for the application layers, e.g., data analytics and reporting.  ...  HDFS provides a foundation for several MapReduce-like data processing frameworks such as Hadoop MapReduce, Apache Spark, or Flink [137].  ... 
doi:10.1007/s13740-018-0086-2 fatcat:bhbeyntbtzdkvf5t3dcko42jpy

The state-of-the-art in web-scale semantic information processing for cloud computing [article]

Wei Yu, Junpeng Chen
2013 arXiv   pre-print
The purpose of this report is to give an overview of existing technologies for semantic information processing in cloud computing environment, to propose a research direction for addressing distributed  ...  Based on integrated infrastructure of resource sharing and computing in distributed environment, cloud computing involves the provision of dynamically scalable and provides virtualized resources as services  ...  Nowadays, more related projects are developed to support Hadoop framework and various technologies are applied. Pregel [8] is a new proposed cloud computing model for large scale graph processing.  ... 
arXiv:1305.4228v1 fatcat:duay2zecwfbqdbwzvy66fyixa4

A Review Paper on Big Data and Hadoop for Data Science

Mr. Ketan Bagade, Mrs. Anjali Gharat, Mrs. Helina Tandel
2019 Zenodo  
Hadoop is an open source framework that allows to store and process big data in a distributed environment across clusters of computers using simple programming models.  ...  Helina Tandel "A Review Paper on Big Data and Hadoop for Data Science" Published in International Journal of Trend in Scientific Research and Development (ijtsrd), ISSN: 2456-6470, Volume-4 | Issue-1 ,  ...  Hadoop Architecture Overview Apache Hadoop offers a scalable, flexible and reliable distributed computing big data framework for a cluster of systems with storage capacity and local computing power  ... 
doi:10.5281/zenodo.3610061 fatcat:ijuwfez7p5czjm5zlxfemhccsa

A Survey on Vertical and Horizontal Scaling Platforms for Big Data Analytics

Ahmed Hussein Ali (ICCI, Informatics Institute for Postgraduate Studies, Baghdad, Iraq), Mahmood Zaki Abdullah (Department of Computer Engineering, Al-Mustansiriyah University, Baghdad, Iraq)
2019 International Journal of Integrated Engineering  
Acknowledgement The authors would like to thank ICCI, Informatics Institute for Postgraduate Studies (IIPS_IRAQ) for their moral support.  ...  Special thanks to the anonymous reviewers for their valuable suggestions and constructive comments.  ...  HDFS is a UNIX-based data storage layer of Hadoop and is seen as Hadoop's own rack-aware filesystem. HDFS is based on the Google filesystem concept.  ... 
doi:10.30880/ijie.2019.11.06.015 fatcat:qbtbeq6ukbe5pmpmgkld3r33fe

A Survey of Semantics-Aware Performance Optimization for Data-Intensive Computing [article]

Bingbing Rao, Liqiang Wang
2021 arXiv   pre-print
Given that the limitation of CPU and I/O in a single computer, the mainstream approach to scalability is to distribute computations among a large number of processing nodes in a cluster or cloud.  ...  Finally, we discuss the research challenges and opportunities in the field of semantics-aware performance optimization for data-intensive computing.  ...  There is a large body of research work involved in performance optimizations based on semantics-aware technology.  ... 
arXiv:2107.11540v1 fatcat:njo5wuctovgrti4sw6wwp6tvkq

A Software Architecture for Progressive Scanning of On-line Communities

Roberto Baldoni, Fabrizio D'Amore, Massimo Mecella, Daniele Ucci
2014 2014 IEEE 34th International Conference on Distributed Computing Systems Workshops  
The paper presents a software architecture that progressively scans a set of on-line communities in order to detect such semantic causal relationships.  ...  The architecture includes a crawler, a large scale storage, a distributed indexing system and a mining system. The paper mainly focuses on crawling and indexing.  ...  supported through the Italian MIUR PRIN project TENACE - National Critical Infrastructure Protection from Cyber Threats, and Italian MIUR Smart Cities and Communities project RoMA - Resilience enhancement Of  ... 
doi:10.1109/icdcsw.2014.37 dblp:conf/icdcsw/BaldoniDMU14 fatcat:7yym2kqn75hidccihebyfgyeau


Valentina Janev, Dejan Paunović, Damien Graux, Hajira Jabeen, Emanuel Sallinger, Sahar Vahdati
2019 Zenodo  
Hence, in the LAMBDA project framework, an effort was made to develop a new set of lectures based on the education materials and courses offered by the University of Bonn and University of Oxford.  ...  As the number of Big Data related methods, tools, frameworks and solutions is growing, there is a need to systematize the knowledge about the domain.  ...  ACKNOWLEDGEMENT The research presented in this paper is partly financed by the Ministry of Science and Technological Development of the Republic of Serbia (SOFIA project, Pr.  ... 
doi:10.5281/zenodo.3555275 fatcat:gmafcchntze4znth3jbih5rlb4

Towards a New Scalable Big Data System Semantic Web Applied on Mobile Learning

Mouad Banane, Abdessamad Belangour
2020 International Journal of Interactive Mobile Technologies  
In Web 3.0, semantic data gives machines the ability to understand and process data. Resource Description Framework (RDF) is the lingua franca of Semantic Web.  ...  The results obtained in the system evaluation experiments, on a large number of servers show the efficiency, scalability, and robustness of our system if the amount of data processed is very large.  ...  This system is composed of two main layers, in the semantic knowledge layer we use an M-Learning domain ontology, and in the storage layer, we use a document-oriented NoSQL database named MongoDB for data  ... 
doi:10.3991/ijim.v14i01.10922 fatcat:iz3fpis7xfd7tfpk2ovwtaqeiu