1,755 Hits in 6.7 sec

Computing on masked data: a high performance method for improving big data veracity

Jeremy Kepner, Vijay Gadepally, Pete Michaleas, Nabil Schear, Mayank Varia, Arkady Yerukhimovich, Robert K. Cunningham
2014 2014 IEEE High Performance Extreme Computing Conference (HPEC)  
This work introduces a new technique called Computing on Masked Data (CMD), which improves data veracity by allowing computations to be performed directly on masked data and ensuring that only authorized  ...  The growing gap between data and users calls for innovative tools that address the challenges faced by big data volume, velocity and variety.  ...  These approaches are all significant steps forward in improving the veracity of a big data system.  ... 
doi:10.1109/hpec.2014.7040946 dblp:conf/hpec/KepnerGMSVYC14 fatcat:6ih6eru33zh55awm24lfguekmy

Computing on Masked Data to improve the Security of Big Data [article]

Vijay Gadepally, Braden Hancock and Benjamin Kaiser, Jeremy Kepner, Pete Michaleas, Mayank Varia, Arkady Yerukhimovich
2015 arXiv   pre-print
Much of big data computation and analytics make use of signal processing fundamentals for computation.  ...  In this article, we propose a tool called Computing on Masked Data (CMD), which combines advances in database technologies and cryptographic tools to provide a low overhead mechanism to offload certain  ...  ACKNOWLEDGEMENTS The authors would like to acknowledge the anonymous reviewers, Nabil Schear, Rob Cunningham, and the LLGrid operations team at MIT Lincoln Laboratory for their support in developing and  ... 
arXiv:1504.01287v1 fatcat:vl4o5rdioja37i6wtfzhkgmg5e

Methods for Assessing, Predicting, and Improving Data Veracity: A survey

Fatmah Assiri
2020 Advances in Distributed Computing and Artificial Intelligence Journal  
The challenges or limitations and related gaps in existing work will bediscussed, and future research directions will be proposed to address the critical issuesof data veracity in the era of big data  ...  However, recently, veracity has beenadded as the fourth dimension. Data veracity relates to the quality of the data.  ...  proposed a technique, known as the computing on masked data (CMD) system, to improve the data veracity by performing computations on masked data in which only an authorized recipient can unmask the data  ... 
doi:10.14201/adcaij202094530 fatcat:5nuibi4ningdbp7ivrbgn6dk6i

Study on Big Data Frameworks

Adriano Fernandes, Jonathan Barretto, Jonas Fernandes
2021 International Journal of Scientific Research in Science and Technology  
Big data analytics is becoming more and more popular every day as a tool for evaluating large volumes of data on demand.  ...  four big data architectures against these KPIs in a literature review.  ...  Spark has the ability to store data in memory for future iterations, which improves performance.  ... 
doi:10.32628/ijsrst218475 fatcat:qnfdtgeqgzhalpsb6lzpp3v4gi

DTRM: A new reputation mechanism to enhance data trustworthiness for high-performance cloud computing

Hui Lin, Jia Hu, Chuanfeng Xu, Jianfeng Ma, Mengyang Yu
2018 Future generations computer systems  
To enhance data veracity and thus improve the performance of big data computing in MCC, this paper proposes a Data Trustworthiness enhanced Reputation Mechanism (DTRM) which can be used to defend against  ...  Troublesome internal attacks launched by internal malicious users is one key problem that reduces data veracity and remains difficult to handle.  ...  [8] introduced a technique called Computing on Masked Data (CMD) to improve data veracity while allowing a wide range of computations and queries to be performed with low overhead.  ... 
doi:10.1016/j.future.2018.01.026 fatcat:vockpqxcdzfoheaxq2gqsdbqmu

A Review of Data Science and Big Data Computing

Wajid Ali, Muhammad Usman Shafique, Muhammad Arslan Majeed, Muhammad Faizan, Ahmad Raza
2020 Asian Journal of Research in Computer Science  
In this paper, we discussed the general concept of data science, Big data, and areas of Big data computing.  ...  Data Science emerged as an important discipline and its education is essential for success in almost every aspect of life. Here comes the age of Big data.  ...  Computing similarity is one of the data sciences principal methods.  ... 
doi:10.9734/ajrcos/2020/v6i330158 fatcat:4a2jbja4qnfntmwnad6nhyaruy

Privacy-Preserving Record Linkage for Big Data: Current Approaches and Research Challenges [chapter]

Dinusha Vatsalan, Ziad Sehili, Peter Christen, Erhard Rahm
2017 Handbook of Big Data Technologies  
2) achieving high quality results of the linkage in the presence of variety and veracity of Big Data, and (3) preserving privacy and confidentiality of the entities represented in Big Data collections.  ...  PPRL for Big Data poses several challenges, with the three major ones being (1) scalability to multiple large databases, due to their massive volume and the flow of data within Big Data applications, (  ...  Scalable Data Services and Solutions (ScaDS) Dresden/Leipzig (BMBF 01IS14014B).  ... 
doi:10.1007/978-3-319-49340-4_25 fatcat:a6p7w4sannbmdkq4ceridmoan4

Quality assurance in big data analytics: An IoT perspective

Fernandes Ann, Rupali Wagh
2019 Telfor Journal  
Emergence of IoT as one of the key data contributors in a big data application has presented new data quality challenges and has necessitated for an IoT inclusive data validation ecosystem.  ...  Standardized data quality approaches and frameworks are available for data obtained for a variety of sources like data warehouses, webblogs, social media, etc. in a big data application.  ...  Based on various processes and methods as mentioned transformations on raw IoT data are performed wherever necessary.  ... 
doi:10.5937/telfor1902114a fatcat:y2tvstn4hbchjoblqdtobc4a64

MapReduce based Text Detection using MSER in Big Data Natural Scene Videos

Parul Pathak, Nirmal Gaud
2017 International Journal of Computer Applications  
Text is one of the most important features in images and videos. It can be used for various analysis purposes.  ...  In this research work, a method is proposed to detect text in Natural scene videos using MapReduce and MSER (Maximally Stable Extremal Regions).  ...  INTRODUCTION Big data is a broad term for data sets so large or complex that traditional data processing applications are inadequate .It can be defined as high volume, high velocity & high variety information  ... 
doi:10.5120/ijca2017915448 fatcat:4nbnvqk3q5d35dpwvb2kofdyaq

Toward better data veracity in mobile cloud computing: A context-aware and incentive-based reputation mechanism

Hui Lin, Jia Hu, Youliang Tian, Li Yang, Li Xu
2017 Information Sciences  
As a promising next-generation computing paradigm, Mobile Cloud Computing (MCC) enables the large-scale collection and big data processing of personal private data.  ...  An important but often overlooked V of big data is data veracity, which ensures that the data used are trusted, authentic, accurate and protected from unauthorized access and modification.  ...  [10] introduced a new technique called Computing on Masked Data (CMD) to improve data veracity while allowing a wide range of computations and queries to be performed with low overhead by combining  ... 
doi:10.1016/j.ins.2016.12.031 fatcat:mk6p4d6yyrdsjclgmg3bcdmohi

Additive Manufacturing and Big Data

Lidong Lidong, Cheryl Ann Alexander
2016 International journal of mathematical, engineering and management sciences  
Big data in AM and Big Data analytics for AM are also presented.  ...  Big Data analytics helps analyze AM processes and facilitate AM in impacting supply chains. This paper introduces advantages, applications, and technology progress of AM.  ...  In-situ monitoring and Big Data analytics for additive manufacturing are important research topics (Rapporteur, 2014; Dehoff et al., 2015) . High performance computation (HPC) can be used in AM.  ... 
doi:10.33889/ijmems.2016.1.3-012 fatcat:bdzjb5tm6jerve42uxynzjawru

Single-cell Transcriptome Study as Big Data

Pingjian Yu, Wei Lin
2016 Genomics, Proteomics & Bioinformatics  
After extensively reviewing the available big-data applications of next-generation sequencing (NGS)-based studies, we propose a workflow that accounts for the unique characteristics of scRNA-seq data and  ...  Big-data technology provides a framework that facilitates the comprehensive discovery of biological signals from inter-institutional scRNA-seq datasets.  ...  Carson Harrod for editing the manuscript.  ... 
doi:10.1016/j.gpb.2016.01.005 pmid:26876720 pmcid:PMC4792842 fatcat:g6zc5bsl4vhynhimaxpoi3tatq

Big Data: Ideology vs. Enlightenment

Hartmut Will Hartmut Will
2019 International Journal of Computer Auditing  
<p> &ldquo;Big Data&rdquo; is a technological term with a seemingly cognitive connotation that masks an ideological orientation of those attempting to be benevolently, criminally of even &ldquo;innocently  ...  An enlightened framework for data governance is overdue in the &ldquo;digital big data age!&rdquo;</p>  ...  To hope for improved objectivity, transparency and trust with (ill-defined) big data is also an illusion .  ... 
doi:10.53106/256299802019120101002 fatcat:mencxp3xircfdfdl4g63jijlqy

DaLiF: a data lifecycle framework for data-driven governments

Syed Iftikhar Hussain Shah, Vassilios Peristeras, Ioannis Magnisalis
2021 Journal of Big Data  
AbstractThe public sector, private firms, business community, and civil society are generating data that is high in volume, veracity, velocity and comes from a diversity of sources.  ...  From the above, the Government Big Data Ecosystem (GBDE) emerges. Managing big data throughout its lifecycle becomes a challenging task for governmental organizations.  ...  Acknowledgements The European Union-funded Project: Digital Europe for All (DE4A), Horizon 2020-the Framework Programme for Research and Innovation  ... 
doi:10.1186/s40537-021-00481-3 fatcat:c2c7kozsazf6jeptgcb43sctsi

Technical Research Priorities for Big Data [chapter]

Edward Curry, Sonja Zillner, Andreas Metzger, Arne J. Berre, Sören Auer, Ray Walshe, Marija Despenic, Milan Petkovic, Dumitru Roman, Walter Waterfeld, Robert Seidl, Souleiman Hasan (+2 others)
2021 The Elements of Big Data Value  
The process also highlighted the important role of data standardisation, data engineering and DevOps for Big Data.  ...  This chapter details a community-driven initiative to identify and characterise the key technical research priorities for research and development in data technologies.  ...  big data and high-performance computing architecture: Efficient hybrid architectures that optimise the mixture of big data (i.e.  ... 
doi:10.1007/978-3-030-68176-0_5 fatcat:c2ydwkla6ngrxfatohkuxqnfm4
« Previous Showing results 1 — 15 out of 1,755 results