Filters








8,346 Hits in 6.6 sec

Responsive Parallelism with Futures and State [article]

Stefan K. Muller and Kyle Singer and Noah Goldstein and Umut A. Acar and Kunal Agrawal and I-Ting Angelina Lee
2020 arXiv   pre-print
Motivated by the increasing shift to multicore computers, recent work has developed language support for responsive parallel applications that mix compute-intensive tasks with latency-sensitive, usually  ...  These developments include calculi that allow assigning priorities to threads, type systems that can rule out priority inversions, and accompanying cost models for predicting responsiveness.  ...  CONCLUSION This paper bridges cooperative and competitive threading models by bringing together a classic threading construct, futures, with priorities and mutable state.  ... 
arXiv:2004.02870v1 fatcat:p62xx2g6rnfn5ptje32ympig7y

Preliminary Experiments with XKaapi on Intel Xeon Phi Coprocessor

Joao V.F. Lima, Francois Broquedis, Thierry Gautier, Bruno Raffin
2013 2013 25th International Symposium on Computer Architecture and High Performance Computing  
Performance evaluation shows our XKaapi data-flow parallel programming environment exposes the lowest overhead of all and is highly competitive with native OpenMP and CilkPlus environments on Xeon Phi.  ...  We compare Intel OpenMP, Intel CilkPlus and XKaapi together on the same benchmark suite and we provide comparisons between an Intel Xeon Phi coprocessor and a Sandy Bridge Xeon-based machine.  ...  ACKNOWLEDGMENTS This work has been partially supported by the ANR-11-BS02-013 HPAC Project, the ANR 09-COSI-011-05 Project Repdyn, CAPES/Brazil, CNPq/Brazil, and FAPERGS/Brazil.  ... 
doi:10.1109/sbac-pad.2013.28 dblp:conf/sbac-pad/LimaBGR13 fatcat:i2ie22b7ubcahcj3p2g6dhuweq

Models and languages for parallel computation

David B. Skillicorn, Domenico Talia
1998 ACM Computing Surveys  
Both kinds of models raise the possibility of parallelism as part of the mainstream of computing.  ...  At the other end of the spectrum, low-level models make all of the messy issues of parallel programming explicit (how many threads, how to place them, how to express communication, and how to schedule  ...  ACKNOWLEDGMENTS We are grateful to Dave Dove and Luigi Palopoli for their comments on a draft of this article and to the anonymous referees for their helpful comments.  ... 
doi:10.1145/280277.280278 fatcat:ynjiydoq2baonp7hubvz3ajuky

vCUDA: GPU accelerated high performance computing in virtual machines

Lin Shi, Hao Chen, Jianhua Sun
2009 2009 IEEE International Symposium on Parallel & Distributed Processing  
Our evaluation shows that GPU acceleration for HPC applications in VMs is feasible and competitive with those running in a native, nonvirtualized environment.  ...  With API interception and redirection, applications in VMs can access graphics hardware device and achieve high performance computing in a transparent way.  ...  Acknowledgments The authors would like to thank the anonymous reviewers for their useful suggestions and comments on this paper.  ... 
doi:10.1109/ipdps.2009.5161020 dblp:conf/ipps/ShiCS09 fatcat:4w3tatkvjnc3jk42f37fzqezo4

Running EveryWare on the computational grid

Rich Wolski, John Brevik, Chandra Krintz, Graziano Obertelli, Neil Spring, Alan Su
1999 Proceedings of the 1999 ACM/IEEE conference on Supercomputing (CDROM) - Supercomputing '99  
The goal of a Computational Grid is to aggregate ensembles of shared, heterogeneous, and distributed resources (potentially controlled by separate organizations) to provide computational "power" to an  ...  The Computational Grid [10] has recently been proposed for the implementation of high-performance applications using widely dispersed computational resources.  ...  In particular, we thank Fran Berman for her moral support during the effort, and Marcio Faerman, Walfredo Cirne, and Dimitri Zagorod for launching EveryWare on every conceivable public email and Java workstation  ... 
doi:10.1145/331532.331538 dblp:conf/sc/WolskiBKOSS99 fatcat:fhhmdmqtnnayneia3ptsztw5ie

vCUDA: GPU-Accelerated High-Performance Computing in Virtual Machines

Lin Shi, Hao Chen, Jianhua Sun, Kenli Li
2012 IEEE transactions on computers  
Our evaluation shows that GPU acceleration for HPC applications in VMs is feasible and competitive with those running in a native, nonvirtualized environment.  ...  With API interception and redirection, applications in VMs can access graphics hardware device and achieve high performance computing in a transparent way.  ...  Acknowledgments The authors would like to thank the anonymous reviewers for their useful suggestions and comments on this paper.  ... 
doi:10.1109/tc.2011.112 fatcat:t2x5vrzwlffltho2n7rhf77kfq

Adaptive Scheduling Framework for Multi - Core Systems Based on the Task - Parallel Programming Model

H. M. LU, School of Computer Science and Engineering, Changchun University of Technology, Changchun 1 30012 , China, Y. J. CAO, J. J. SONG, T. Y. DI, H. Y. SUN, X. M. HAN, School of Software , Zhengzhou University, Zhengzhou 450 000, China, School of Computer Science and Engineering, Changchun University of Technology, Changchun 1 30012 , China, School of Computer Science and Engineering, Changchun University of Technology, Changchun 1 30012 , China, School of Computer Engineering, Nanyang Technological Univers ity, Singapore 639798 , Singapore, School of Computer Science and Engineering, Changchun University of Technology, Changchun 1 30012 , China
2016 Journal of Engineering Science and Technology Review  
However, the most updated multi-core parallel programming models have defects, such as poor scalability and intensive competition in processor core resources.  ...  Experimental results indicated that the proposed adaptive scheduling method and a dynamic perception of core resources could effectively improve mutual competition between inter-core tasks and shared resources  ...  The concepts of parallel programming and computation emerged early. However, even after several years, parallel computing has not become the mainstream of pervasive computing.  ... 
doi:10.25103/jestr.096.12 fatcat:h5vgipeojfdnnbxgkw6a6fdjba

Strategies for preparing computer science students for the multicore world

Richard Brown, Elizabeth Shoop, Joel Adams, Curtis Clifton, Mark Gardner, Michael Haupt, Peter Hinsbeeck
2010 Proceedings of the 2010 ITiCSE working group reports on Working group reports - ITiCSE-WGR '10  
Multicore computers have become standard, and the number of cores per computer is rising rapidly. How does the new demand for understanding of parallel computing impact computer science education?  ...  (iii) What resources will support computer science educators, including non-specialists, to teach parallel computing? (iv) What systemic obstacles impede this change, and how might they be overcome?  ...  as task parallelism (where a computation is accomplished using cooperating functional units or tasks), data parallelism (where a given computation is applied in parallel to multiple data sets), pipeline  ... 
doi:10.1145/1971681.1971689 fatcat:6lpc6w7i55a65gyzv7nyoo4vc4

Fast VMM-based overlay networking for bridging the cloud and high performance computing

Lei Xia, Zheng Cui, John Lange, Yuan Tang, Peter Dinda, Patrick Bridges
2013 Cluster Computing  
It is now feasible to allow VMs hosting high performance computing (HPC) applications to seamlessly bridge distributed cloud resources and tightly-coupled supercomputing and cluster resources.  ...  A collection of virtual machines (VMs) interconnected with an overlay network with a layer 2 abstraction has proven to be a powerful, unifying abstraction for adaptive distributed and parallel computing  ...  -The number of packet dispatcher threads n dispatchers that are instantiated. -The yield model and parameters for the bridge thread, the packet dispatch threads, and the VMM's halt handler.  ... 
doi:10.1007/s10586-013-0274-7 fatcat:wstd6bama5d2fbttpojnpzw72q

Relative debugging for a highly parallel hybrid computer system

Luiz DeRose, Andrew Gontarek, Aaron Vose, Robert Moench, David Abramson, Minh Ngoc Dinh, Chao Jin
2015 Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis on - SC '15  
In this paper we extend relative debugging to support porting stencil computation on a hybrid computer.  ...  Relative debugging is particularly effective when code is migrated from one platform to another, and this is of significant interest for hybrid computer architectures containing CPUs accelerators or coprocessors  ...  To bridge the variety of data partitioning schemes used in practice at different levels of parallelism, we propose a generic data model for general stencil computations on a hybrid computer.  ... 
doi:10.1145/2807591.2807605 dblp:conf/sc/RoseGVMADJ15 fatcat:pjq7qzkylzewzlphqoermywo64

Cluster Computing White Paper [article]

Mark Baker, et. al
2001 arXiv   pre-print
It is, however, evident that there is a growing interest in its usage in all areas where applications have traditionally used parallel or distributed computing platforms.  ...  Cluster computing is not a new area of computing.  ...  This is especially true with respect to its seamless integration with other facets of cluster computing such as high performance computing, scalability, and standardization of application programming interfaces  ... 
arXiv:cs/0004014v2 fatcat:cx3lbz6fcbcmrgicye7ure7u3u

Conceptual and Technical Challenges for High Performance Computing [article]

Claude Tadonki
2020 arXiv   pre-print
High Performance Computing (HPC) aims at providing reasonably fast computing solutions to scientific and real life problems.  ...  The advent of multicore architectures is noticeable in the HPC history, because it has brought the underlying parallel programming concept into common considerations.  ...  Thus, the answer for more efficient processing was either a more powerful single-thread processor or an aggregation of cooperative computer systems.  ... 
arXiv:2010.02769v1 fatcat:jdndyb3w6ncqjbriax25u6zofe

Key computational modeling issues in Integrated Computational Materials Engineering

Jitesh H. Panchal, Surya R. Kalidindi, David L. McDowell
2013 Computer-Aided Design  
Another class of approaches advocates simulation-based design to exploit computational materials science and physics in accelerating the discovery of new materials, computing structure and properties using  ...  One involves selection of materials, emphasizing population of databases and efficient search algorithms for properties or responses that best suit a set of specified performance indices [1], often using  ...  Distinguished Chair in Metals Processing, as well as the NSF Industry/University Cooperative Research Center for Computational Materials Design (CCMD), supported by CCMD members, through grants NSF IIP  ... 
doi:10.1016/j.cad.2012.06.006 fatcat:ujbgljntf5emfatguyzcxoy5oa

Enhanced parallel Differential Evolution algorithm for problems in computational systems biology

D.R. Penas, J.R. Banga, P. González, R. Doallo
2015 Applied Soft Computing  
Many key problems in computational systems biology and bioinformatics can be formulated and solved using a global optimization framework.  ...  Here we consider this latter class of problems and present several enhancements to DE based on the introduction of additional algorithmic steps and the exploitation of parallelism.  ...  It has been also partially supported by the Spanish Ministerio de Ciencia e Innovación (Project TIN2013-42148-P) and by the Galician Government (consolidation program of competitive reference groups GRC2013  ... 
doi:10.1016/j.asoc.2015.04.025 fatcat:2txrmdawsvfzrprmnmxa56lzpe

GPU acceleration for evolutionary topology optimization of continuum structures using isosurfaces

Jesús Martínez-Frutos, David Herrero-Pérez
2017 Computers & structures  
Evolutionary topology optimization of three-dimensional continuum structures is a computationally demanding task in terms of memory consumption and processing time.  ...  This work aims to alleviate these constraints proposing a well-suited strategy for Graphics Processing Unit (GPU) computing.  ...  Such a work has also been supported by the research support programmes of Ministry of Economy and Competitiveness under the contract DPI2016-77538-R and "Fundación Séneca -Agencia de Ciencia y Tecnología  ... 
doi:10.1016/j.compstruc.2016.10.018 fatcat:7hxl2vtzavdvvemndy5qrpb7ny
« Previous Showing results 1 — 15 out of 8,346 results