Distributed gene clinical decision support system based on cloud computing

Bo Xu, Changlong Li, Hang Zhuang, Jiali Wang, Qingfeng Wang, Chao Wang, Xuehai Zhou
2018 BMC Medical Genomics  
To boost the data processing of GCDSS, we propose CloudBWA, which is a novel distributed read mapping algorithm to leverage batch processing technique in mapping stage using Apache Spark platform.  ...  At the same time, we present CloudBWA which is a novel distributed read mapping algorithm leveraging batch processing strategy to map reads on Apache Spark.  ...  Published: 20 November 2018  ... 
doi:10.1186/s12920-018-0415-1 fatcat:mbayqg4wtjb65afd56afhcgmdi

A Brief Review on scheduling algorithms of MapReduce Optimization Techniques

R Lavanya, Jeevanshu Malhotra, Rajeshwari Swaminathan
2019 Journal of Physics, Conference Series  
Scheduling algorithms of the MapReduce model using Hadoop vary in design and behaviour, and are used for handling many issues like data locality, awareness with resource, energy and time.  ...  With the increase in size and complexity of modern datasets, the world is faced with new challenges in the automation and scalability of the very large data sets.  ...  Map Reduce framework has become a manageable and scalable technology for a fault free processing of big data.  ... 
doi:10.1088/1742-6596/1362/1/012001 fatcat:ikw7wxziybbm7e2eph2xkgtif4

Design of a scalable InfiniBand topology service to enable network-topology-aware placement of processes

H. Subramoni, S. Potluri, K. Kandalla, B. Barth, J. Vienne, J. Keasler, K. Tomko, K. Schulz, A. Moody, D. K. Panda
2012 2012 International Conference for High Performance Computing, Networking, Storage and Analysis  
Experimental results show that the new neighbor joining algorithm is able to significantly reduce the network topology discovery time.  ...  In this paper, we design a novel and scalable method to detect the InfiniBand network topology by using Neighbor-Joining techniques (NJ).  ...  We accelerated the initial distance matrix construction using OpenMP constructs, achieving 87% parallel efficiency on 6 cores on a Westmere node, further reducing discovery cost.  ... 
doi:10.1109/sc.2012.47 dblp:conf/sc/SubramoniPKBVKTSMP12 fatcat:dd5mkhn4drdyrjsht7ti4ka7ne

Map-Reduce Parallelization of Motif Discovery [article]

Umang Vipul
2014 arXiv   pre-print
To achieve this, we have decided to use sub-sampling and the Map Reduce model. At each Map node, a sub-sampled version of the input DNA sequences is used as input to HOMER.  ...  The output of the map phase and the input of the reduce phase is a list of Motifs discovered using the sub-sampled sequences.  ...  These results clash with our initial aim of running HOMER in parallel using map-reduce to obtain better performance and scalability.  ... 
arXiv:1405.0354v1 fatcat:i3xttwz5evcsncnrqa2fasdxzy

Towards a Scalable Architecture for Smart Villages: The Discovery Phase

Vijaya Kumar Murty, Sukarmina Singh Shankar
2020 Sustainability  
In this paper we outline the discovery phase, which will lay the foundation for developing our framework of scalable smart villages. The Discovery Phase is a research process where the community learns about  ...  Alleviating poverty, reducing inequality, and achieving economic prosperity and well-being is a global challenge.  ...  The process of discovery has started in India, and our experience has helped us to abstract the process to some extent.  ... 
doi:10.3390/su12187580 fatcat:2rdqz6obqbg2jpg3hxbpdjhlka


Hamid Mushtaq, Frank Liu, Carlos Costa, Gang Liu, Peter Hofstee, Zaid Al-Ars
2017 Proceedings of the 8th ACM International Conference on Bioinformatics, Computational Biology, and Health Informatics - ACM-BCB '17  
This implementation is highly scalable and capable of parallelizing computation by utilizing data-level parallelism as well as load balancing techniques.  ...  In order to reduce the analysis cost, SparkGA can run on nodes with as little memory as 16GB.  ...  Within a map task, the unmodified DNA mapping tool is called using Java's process package.  ... 
doi:10.1145/3107411.3107438 dblp:conf/bcb/MushtaqLCLHA17 fatcat:6pvahsud7jckfgz7ihbszhkaxi

Similarity Flooding for Efficient Distributed Discovery of OWL-S Process Model in P2P Networks

Adel Boukhadra, Karima Benatchba, Amar Balla
2015 Procedia Computer Science  
The approach exploits a scalable epidemic algorithm that uses different sources of network knowledge, such as exponential distribution, to fulfill the users' requirements in order to ensure high recall,  ...  In order to improve the applicability of the scalable epidemic algorithm for discovering SWs, we propose the semantic matching of OWL-S process model which improves the recall while keeping an acceptable  ...  The approach improves the discovery and composition process by using distributed bidirectional search.  ... 
doi:10.1016/j.procs.2015.07.214 fatcat:5ldkg2ymsvcsvmtviuhpam44hm

Event Correlation Analytics: Scaling Process Mining Using Mapreduce-Aware Event Correlation Discovery Techniques

Hicham Reguieg, Boualem Benatallah, Hamid R. Motahari Nezhad, Farouk Toumani
2015 IEEE Transactions on Services Computing  
This paper introduces a scalable process event analysis approach, including parallel algorithms, to support efficient event correlation for big process data.  ...  It proposes a two-stages approach for finding potential event relationships, and their verification over big event datasets using MapReduce framework.  ...  The map outputs are then processed by reduce function.  ... 
doi:10.1109/tsc.2015.2476463 fatcat:z7k4bwcq3ncpzhnhfqdluurr5q

MRSim: A discrete event based MapReduce simulator

Suhel Hammoud, Maozhen Li, Yang Liu, Nasullah Khalid Alham, Zelong Liu
2010 2010 Seventh International Conference on Fuzzy Systems and Knowledge Discovery  
The simulator on one hand allows us to measure scalability of MapReduce based applications easily and quickly, on the other hand captures the effects of different configurations of Hadoop setup on MapReduce  ...  Recently, the MapReduce programming model has become popular for large scale data intensive distributed applications due to its efficiency, simplicity and ease of use.  ...  The following processes take place: • Once a map task completes, several reducers try to get the map output using several shuffling processes per reducer. • Data are shuffled to the reducer memory buffer the  ... 
doi:10.1109/fskd.2010.5569086 dblp:conf/fskd/HammoudLLAL10 fatcat:way7gsjpbfaqjnlmeyd2jf3mba

Bernard Wong, Emin Gün Sirer
2006 ACM SIGOPS Operating Systems Review  
...  is an accurate, scalable, and backwards-compatible service for mapping clients to a nearby server.  ...  A shared system for performing such a mapping amortizes the administration and implementation costs of proximity-based server selection.  ...  Selection schemes based on databases of IP address to geographical region [15] mappings can reduce the closest node discovery latency to that of a database lookup, but are fragile to IP assignment changes  ... 
doi:10.1145/1113361.1113373 fatcat:z4xk4oritzfsxdktknkqr7x5xy


Mahdi MollaMotalebi, Raheleh Maghami, Abdul Samad Ismail, Alireza Poshtkohi
2014 Cybernetics and systems  
Resource discovery is one of the most important services that significantly affects the efficiency of grid computing systems.  ...  The inherent dynamic and large-scale characteristics of grid environments make their resource discovery a challenging task.  ...  Using the hierarchical structure can reduce the bottleneck probability and subsequently increase the scalability.  ... 
doi:10.1080/01969722.2014.972100 fatcat:5x7z4f5qwve6fglakc5czkv5p4

Parallel Processing Framework on a P2P System Using Map and Reduce Primitives

Kyungyong Lee, Tae Woong Choi, Arijit Ganguly, David I. Wolinsky, P. Oscar Boykin, Renato Figueiredo
2011 2011 IEEE International Symposium on Parallel and Distributed Processing Workshops and Phd Forum  
A parallel processing task is expressed using Map and Reduce primitives inspired by functional programming models.  ...  The Map and Reduce tasks are distributed to a subset of nodes within a P2P network for execution by using a self-organizing multicast tree.  ...  In this paper we present a decentralized parallel processing framework which uses Map and Reduce primitives on a structured P2P network for applications such as network status monitoring, resource discovery  ... 
doi:10.1109/ipdps.2011.315 dblp:conf/ipps/LeeCGWBF11 fatcat:ebm34nhdnfdqbjkgz3rqvv2vgu

Using Mapreduce to Scale Events Correlation Discovery for Business Processes Mining [chapter]

Hicham Reguieg, Farouk Toumani, Hamid Reza Motahari-Nezhad, Boualem Benatallah
2012 Lecture Notes in Computer Science  
The map outputs are then processed by reduce function.  ...  Map and reduce functions can be implemented using any general-purpose programming language.  ...  We propose to partition the space of candidates in such a way that a given partition can be handled by a unique reducer.  ... 
doi:10.1007/978-3-642-32885-5_22 fatcat:w23ob4eduffd7fucuauaixwsme

Accelerating TauDEM as a Scalable Hydrological Terrain Analysis Service on XSEDE

Ye Fan, Yan Liu, Shaowen Wang, David Tarboton, Ahmet Yildirim, Nancy Wilkins-Diehr
2014 Proceedings of the 2014 Annual Conference on Extreme Science and Engineering Discovery Environment - XSEDE '14  
the number of processes needed by an output file • No collective IO • Parallel file system: use as many OSTs on the Lustre file system • Scalability results • Scalability tests: processors: up  ...  • Data processing: study area clipping; multi-file generation; reprojection; GDAL library ( ; high-performance map reprojection • Collaborative work with USGS • Analysis  ... 
doi:10.1145/2616498.2616510 dblp:conf/xsede/FanLWTYW14 fatcat:s25x3unak5cqhlp7mzkaok5ygi

A Review on Latest Technologies in Big Data Analysis

Castro S, Pushpalakshmi R
2018 International Journal of Engineering & Technology  
In addition to the map reduce process, it validates SQL queries, data streaming, processing of graph data and machine learning.  ...  To process huge datasets, map reduce is employed, which is a programming model based on a divide-and-conquer strategy.  ... 
doi:10.14419/ijet.v7i3.1.16806 fatcat:fvxtqfgysrhtbiakgnvowceyhm