A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2020; you can also visit the original URL.
The file type is application/pdf
.
Filters
Responsive Parallelism with Futures and State
[article]
2020
arXiv
pre-print
Motivated by the increasing shift to multicore computers, recent work has developed language support for responsive parallel applications that mix compute-intensive tasks with latency-sensitive, usually ...
These developments include calculi that allow assigning priorities to threads, type systems that can rule out priority inversions, and accompanying cost models for predicting responsiveness. ...
CONCLUSION This paper bridges cooperative and competitive threading models by bringing together a classic threading construct, futures, with priorities and mutable state. ...
arXiv:2004.02870v1
fatcat:p62xx2g6rnfn5ptje32ympig7y
Preliminary Experiments with XKaapi on Intel Xeon Phi Coprocessor
2013
2013 25th International Symposium on Computer Architecture and High Performance Computing
Performance evaluation shows our XKaapi data-flow parallel programming environment exposes the lowest overhead of all and is highly competitive with native OpenMP and CilkPlus environments on Xeon Phi. ...
We compare Intel OpenMP, Intel CilkPlus and XKaapi together on the same benchmark suite and we provide comparisons between an Intel Xeon Phi coprocessor and a Sandy Bridge Xeon-based machine. ...
ACKNOWLEDGMENTS This work has been partially supported by the ANR-11-BS02-013 HPAC Project, the ANR 09-COSI-011-05 Project Repdyn, CAPES/Brazil, CNPq/Brazil, and FAPERGS/Brazil. ...
doi:10.1109/sbac-pad.2013.28
dblp:conf/sbac-pad/LimaBGR13
fatcat:i2ie22b7ubcahcj3p2g6dhuweq
Models and languages for parallel computation
1998
ACM Computing Surveys
Both kinds of models raise the possibility of parallelism as part of the mainstream of computing. ...
At the other end of the spectrum, low-level models make all of the messy issues of parallel programming explicit (how many threads, how to place them, how to express communication, and how to schedule ...
ACKNOWLEDGMENTS We are grateful to Dave Dove and Luigi Palopoli for their comments on a draft of this article and to the anonymous referees for their helpful comments. ...
doi:10.1145/280277.280278
fatcat:ynjiydoq2baonp7hubvz3ajuky
vCUDA: GPU accelerated high performance computing in virtual machines
2009
2009 IEEE International Symposium on Parallel & Distributed Processing
Our evaluation shows that GPU acceleration for HPC applications in VMs is feasible and competitive with those running in a native, nonvirtualized environment. ...
With API interception and redirection, applications in VMs can access graphics hardware device and achieve high performance computing in a transparent way. ...
Acknowledgments The authors would like to thank the anonymous reviewers for their useful suggestions and comments on this paper. ...
doi:10.1109/ipdps.2009.5161020
dblp:conf/ipps/ShiCS09
fatcat:4w3tatkvjnc3jk42f37fzqezo4
Running EveryWare on the computational grid
1999
Proceedings of the 1999 ACM/IEEE conference on Supercomputing (CDROM) - Supercomputing '99
The goal of a Computational Grid is to aggregate ensembles of shared, heterogeneous, and distributed resources (potentially controlled by separate organizations) to provide computational "power" to an ...
The Computational Grid [10] has recently been proposed for the implementation of high-performance applications using widely dispersed computational resources. ...
In particular, we thank Fran Berman for her moral support during the effort, and Marcio Faerman, Walfredo Cirne, and Dimitri Zagorod for launching EveryWare on every conceivable public email and Java workstation ...
doi:10.1145/331532.331538
dblp:conf/sc/WolskiBKOSS99
fatcat:fhhmdmqtnnayneia3ptsztw5ie
vCUDA: GPU-Accelerated High-Performance Computing in Virtual Machines
2012
IEEE transactions on computers
Our evaluation shows that GPU acceleration for HPC applications in VMs is feasible and competitive with those running in a native, nonvirtualized environment. ...
With API interception and redirection, applications in VMs can access graphics hardware device and achieve high performance computing in a transparent way. ...
Acknowledgments The authors would like to thank the anonymous reviewers for their useful suggestions and comments on this paper. ...
doi:10.1109/tc.2011.112
fatcat:t2x5vrzwlffltho2n7rhf77kfq
Adaptive Scheduling Framework for Multi - Core Systems Based on the Task - Parallel Programming Model
2016
Journal of Engineering Science and Technology Review
However, the most updated multi-core parallel programming models have defects, such as poor scalability and intensive competition in processor core resources. ...
Experimental results indicated that the proposed adaptive scheduling method and a dynamic perception of core resources could effectively improve mutual competition between inter-core tasks and shared resources ...
The concepts of parallel programming and computation emerged early. However, even after several years, parallel computing has not become the mainstream of pervasive computing. ...
doi:10.25103/jestr.096.12
fatcat:h5vgipeojfdnnbxgkw6a6fdjba
Strategies for preparing computer science students for the multicore world
2010
Proceedings of the 2010 ITiCSE working group reports on Working group reports - ITiCSE-WGR '10
Multicore computers have become standard, and the number of cores per computer is rising rapidly. How does the new demand for understanding of parallel computing impact computer science education? ...
(iii) What resources will support computer science educators, including non-specialists, to teach parallel computing? (iv) What systemic obstacles impede this change, and how might they be overcome? ...
as task parallelism (where a computation is accomplished using cooperating functional units or tasks), data parallelism (where a given computation is applied in parallel to multiple data sets), pipeline ...
doi:10.1145/1971681.1971689
fatcat:6lpc6w7i55a65gyzv7nyoo4vc4
Fast VMM-based overlay networking for bridging the cloud and high performance computing
2013
Cluster Computing
It is now feasible to allow VMs hosting high performance computing (HPC) applications to seamlessly bridge distributed cloud resources and tightly-coupled supercomputing and cluster resources. ...
A collection of virtual machines (VMs) interconnected with an overlay network with a layer 2 abstraction has proven to be a powerful, unifying abstraction for adaptive distributed and parallel computing ...
-The number of packet dispatcher threads n dispatchers that are instantiated. -The yield model and parameters for the bridge thread, the packet dispatch threads, and the VMM's halt handler. ...
doi:10.1007/s10586-013-0274-7
fatcat:wstd6bama5d2fbttpojnpzw72q
Relative debugging for a highly parallel hybrid computer system
2015
Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis on - SC '15
In this paper we extend relative debugging to support porting stencil computation on a hybrid computer. ...
Relative debugging is particularly effective when code is migrated from one platform to another, and this is of significant interest for hybrid computer architectures containing CPUs accelerators or coprocessors ...
To bridge the variety of data partitioning schemes used in practice at different levels of parallelism, we propose a generic data model for general stencil computations on a hybrid computer. ...
doi:10.1145/2807591.2807605
dblp:conf/sc/RoseGVMADJ15
fatcat:pjq7qzkylzewzlphqoermywo64
Cluster Computing White Paper
[article]
2001
arXiv
pre-print
It is, however, evident that there is a growing interest in its usage in all areas where applications have traditionally used parallel or distributed computing platforms. ...
Cluster computing is not a new area of computing. ...
This is especially true with respect to its seamless integration with other facets of cluster computing such as high performance computing, scalability, and standardization of application programming interfaces ...
arXiv:cs/0004014v2
fatcat:cx3lbz6fcbcmrgicye7ure7u3u
Conceptual and Technical Challenges for High Performance Computing
[article]
2020
arXiv
pre-print
High Performance Computing (HPC) aims at providing reasonably fast computing solutions to scientific and real life problems. ...
The advent of multicore architectures is noticeable in the HPC history, because it has brought the underlying parallel programming concept into common considerations. ...
Thus, the answer for more efficient processing was either a more powerful single-thread processor or an aggregation of cooperative computer systems. ...
arXiv:2010.02769v1
fatcat:jdndyb3w6ncqjbriax25u6zofe
Key computational modeling issues in Integrated Computational Materials Engineering
2013
Computer-Aided Design
Another class of approaches advocates simulation-based design to exploit computational materials science and physics in accelerating the discovery of new materials, computing structure and properties using ...
One involves selection of materials, emphasizing population of databases and efficient search algorithms for properties or responses that best suit a set of specified performance indices [1], often using ...
Distinguished Chair in Metals Processing, as well as the NSF Industry/University Cooperative Research Center for Computational Materials Design (CCMD), supported by CCMD members, through grants NSF IIP ...
doi:10.1016/j.cad.2012.06.006
fatcat:ujbgljntf5emfatguyzcxoy5oa
Enhanced parallel Differential Evolution algorithm for problems in computational systems biology
2015
Applied Soft Computing
Many key problems in computational systems biology and bioinformatics can be formulated and solved using a global optimization framework. ...
Here we consider this latter class of problems and present several enhancements to DE based on the introduction of additional algorithmic steps and the exploitation of parallelism. ...
It has been also partially supported by the Spanish Ministerio de Ciencia e Innovación (Project TIN2013-42148-P) and by the Galician Government (consolidation program of competitive reference groups GRC2013 ...
doi:10.1016/j.asoc.2015.04.025
fatcat:2txrmdawsvfzrprmnmxa56lzpe
GPU acceleration for evolutionary topology optimization of continuum structures using isosurfaces
2017
Computers & structures
Evolutionary topology optimization of three-dimensional continuum structures is a computationally demanding task in terms of memory consumption and processing time. ...
This work aims to alleviate these constraints proposing a well-suited strategy for Graphics Processing Unit (GPU) computing. ...
Such a work has also been supported by the research support programmes of Ministry of Economy and Competitiveness under the contract DPI2016-77538-R and "Fundación Séneca -Agencia de Ciencia y Tecnología ...
doi:10.1016/j.compstruc.2016.10.018
fatcat:7hxl2vtzavdvvemndy5qrpb7ny
« Previous
Showing results 1 — 15 out of 8,346 results