Early Science on Theta

Timothy J. Williams
2018 Computing in science & engineering (Print)  
The author gratefully acknowledges Laura Wolf of Argonne National Laboratory for her assistance in the production of this article. ABOUT THE AUTHOR Timothy J.  ...  His research interests include plasma physics, particle-in-cell simulation of tokamak plasmas in particular, and wide-ranging applications of large-scale supercomputing in science and applied math.  ...  Aurora, expected in 2021, will be a capable exascale platform equally suited for large-scale simulation, deep learning, and data analysis applications.  ... 
doi:10.1109/mcse.2018.03202630 fatcat:e34ytdoxtfe6vc3vlbh2xrqwou

Keeneland: Bringing Heterogeneous GPU Computing to the Computational Science Community

Jeffrey S. Vetter, Richard Glassbrook, Jack Dongarra, Karsten Schwan, Bruce Loftis, Stephen McNally, Jeremy Meredith, James Rogers, Philip Roth, Kyle Spafford, Sudhakar Yalamanchili
2011 Computing in science & engineering (Print)  
Acknowledgments Keeneland is funded by the US National Science Foundation's Office of Cyberinfrastructure under award 0910735.  ...  research interests include performance analysis, prediction, and tools, with special emphases on scalability, automation, and emerging architectures. Roth has a PhD in computer science from the University  ...  Meredith is a computer scientist in the Future Technologies Group at Oak Ridge National Laboratory. His research interests include emerging computing architectures and large-scale visualization and analysis  ... 
doi:10.1109/mcse.2011.83 fatcat:dhp6upupevffzfpuj77u7i3czm

Software Design for Petascale Climate Science [chapter]

Philip Jones, Mariana Vertenstein, Patrick Worley, John Drake, James White III
2007 Chapman & Hall/CRC Computational Science  
These strategies for performance portability of CCSM components are well suited for the emerging petascale architectures.  ...  Performance scalability in CAM is not yet adequate for petascale computation.  ... 
doi:10.1201/9781584889106.ch7 fatcat:65n245k3abgfxgqog3od67h4am

Building a software infrastructure for computational science applications

Osni Marques, Tony Drummond
2005 Proceedings of the second international workshop on Software engineering for high performance computing system applications - SE-HPCS '05  
ACTS is a set of DOE-developed software tools, sometimes in collaboration with other funding agencies, that make it easier to write high performance codes for computational science applications.  ...  The development of high performance engineering and scientific applications is an expensive process that often requires specialized support and adequate information about the available computational resources  ...  portability, scalability and robustness.  ... 
doi:10.1145/1145319.1145332 fatcat:maqmkmr42rehdoz2p7qsmwh3d4

How ECP Software Technologies and Math Libraries are Working Toward Performance Portability at Exascale [article]

Lois Curfman McInnes
2021 figshare.com  
algorithms and data structures for efficient and scalable performance.  ...  scalable mathematics, visualization, and analytics, as well as tools for performance analysis and tuning.  ...  mechanism for ECP math libraries' continual advancements toward predictive science ECP Math libraries Performance on new node architectures Extreme strong scalability Advanced, coupled  ... 
doi:10.6084/m9.figshare.14156903.v2 fatcat:6l4gtrlvnjdoxesj5wedcephlu

How ECP Software Technologies and Math Libraries are Working Toward Performance Portability at Exascale [article]

Lois Curfman McInnes
2021 figshare.com  
algorithms and data structures for efficient and scalable performance.  ...  scalable mathematics, visualization, and analytics, as well as tools for performance analysis and tuning.  ...  mechanism for ECP math libraries' continual advancements toward predictive science ECP Math libraries Performance on new node architectures Extreme strong scalability Advanced, coupled  ... 
doi:10.6084/m9.figshare.14156903.v1 fatcat:t34u2qaltng65dr6eb5trdbavm

IMPROVING PERFORMANCE IN HPC SYSTEM UNDER POWER CONSUMPTIONS LIMITATIONS

Muhammad Usman Ashraf
2019 International Journal of Advanced Research in Computer Science  
towards everyday life.  ...  Leading to objectives, the current study presents a comprehensive analysis of existing strategies that can be considered to enhance performance and reducing power for emerging Exascale computing system  ...  In Table 1, we have done the critical analysis on all the above-discussed approaches to deciding the promising approach for future Exascale systems.  ... 
doi:10.26483/ijarcs.v10i2.6397 fatcat:k3l3lk5kuzhnldn5b2qzkh4eia

Energy-Efficient Computing for Extreme-Scale Science

David Donofrio, Leonid Oliker, John Shalf, Michael F. Wehner, Chris Rowen, Jens Krueger, Shoaib Kamil, Marghoob Mohiyuddin
2009 Computer  
The challenge of moving high-performance computing architecture toward exaflops has staggering economic and political ramifications.  ...  To that end, we have developed Green Flash, an application-driven design that combines a many-core processor with novel alternatives to cache coherence and autotuning to improve the kernels' computational  ...  We thank Dave Randall's modeling group in the Department of Atmospheric Science at Colorado State University for early access to their icosahedral model.  ... 
doi:10.1109/mc.2009.353 fatcat:qerwoknemnaivcn2j55l2oc7iu

Performance Evaluation of MPI, UPC and OpenMP on Multicore Architectures [chapter]

Damián A. Mallón, Guillermo L. Taboada, Carlos Teijeiro, Juan Touriño, Basilio B. Fraguela, Andrés Gómez, Ramón Doallo, J. Carlos Mouriño
2009 Lecture Notes in Computer Science  
of data locality, the key factor for performance in these systems.  ...  Regarding UPC, although it exploits efficiently the data layout in memory, it suffers from remote shared memory accesses, whereas OpenMP usually lacks efficient data locality support and is restricted  ...  We gratefully thank Jim Bovay and Brian Wibecan at HP for their valuable support, and CESGA for providing access to the Finis Terrae supercomputer.  ... 
doi:10.1007/978-3-642-03770-2_24 fatcat:c4crntkoqrelhp4yxov7aee2ii

Research and Education in Computational Science and Engineering [article]

Ulrich Rüde, Karen Willcox, Lois Curfman McInnes, Hans De Sterck, George Biros, Hans Bungartz, James Corones, Evin Cramer, James Crowley, Omar Ghattas, Max Gunzburger, Michael Hanke, Robert Harrison (+20 others)
2018 arXiv   pre-print
However, a combination of disruptive developments---including the architectural complexity of extreme-scale computing, the data revolution that engulfs the planet, and the specialization required to follow  ...  Over the past two decades the field of computational science and engineering (CSE) has penetrated both basic and applied research in academia, industry, and laboratories to advance discovery, optimize  ...  Acknowledgments This report is an outcome of a workshop in August 2014 on Future Directions in CSE Education and Research, sponsored by the Society for Industrial and Applied Mathematics  ... 
arXiv:1610.02608v4 fatcat:7qiqiajd7vcstditmztqxwwtx4

Research and Education in Computational Science and Engineering

Ulrich Rüde, Karen Willcox, Lois Curfman McInnes, Hans De Sterck
2018 SIAM Review  
and data science.  ...  However, a combination of disruptive developments-including the architectural complexity of extreme-scale computing, the data revolution and increased attention to data-driven discovery, and the specialization  ...  This report is an outcome of a workshop in August 2014 on Future Directions in CSE Education and Research, sponsored by the Society for Industrial and Applied Mathematics (http://www.siam.org) and the  ... 
doi:10.1137/16m1096840 pmid:30287973 pmcid:PMC6168210 fatcat:uogrxxc4bvdxlnmeo7racudchi

The Omni Macroprogramming Environment for Sensor Networks [chapter]

Asad Awan, Ahmed Sameh, Ananth Grama
2006 Lecture Notes in Computer Science  
The Omni architecture is designed to be a flexible, extensible, scalable, and portable system, upon which a wide variety of DDDAS applications can be built.  ...  In this paper, we provide a high-level overview of the Omni architecture, its salient features, and implementation details.  ...  The Omni architecture provides a flexible, portable, and scalable platform over which a number of DDDAS applications can be built.  ... 
doi:10.1007/11758532_62 fatcat:qcbqkhwg2ngolmmtesngchcuii

How the common component architecture advances computational science

G Kumfert, D E Bernholdt, T G W Epperly, J A Kohl, L C McInnes, S Parker, J Ray
2006 Journal of Physics, Conference Series  
Computational chemists are using Common Component Architecture (CCA) technology to increase the parallel scalability of their application ten-fold.  ...  Combustion researchers are publishing science faster because the CCA manages software complexity for them.  ...  Research at the University of Utah is also sponsored by the National Science Foundation under contract ACI0113829, and the DOE ASC Program.  ... 
doi:10.1088/1742-6596/46/1/066 fatcat:wbmt5n32jvgsxkbp7pmgtqumhu

Aeras: A Next Generation Global Atmosphere Model

William F. Spotz, Thomas M. Smith, Irina P. Demeshko, Jeffrey A. Fike
2015 Procedia Computer Science  
We present early UQ and performance portability results for the shallow water equations.  ...  Embedded uncertainty quantification (UQ) is an original design capability of Albany, and performance portability is a recent upgrade.  ...  Gather routines collect all data necessary to compute element integration kernels from a global vector and provide it in an element centric pattern.  ... 
doi:10.1016/j.procs.2015.05.478 fatcat:sznqy2jkarh5vhnxvsh5zuzh7a

Scaling FMM with Data-Driven OpenMP Tasks on Multicore Architectures [chapter]

Abdelhalim Amer, Satoshi Matsuoka, Miquel Pericàs, Naoya Maruyama, Kenjiro Taura, Rio Yokota, Pavan Balaji
2016 Lecture Notes in Computer Science  
Poor scalability on parallel architectures can be attributed to several factors, among which idle times, data movement, and runtime overhead are predominant.  ...  Work units operate on tiled computational patterns and serve as building blocks in an OpenMP task-based data-driven execution.  ...  Department of Energy, Office of Science, under Contract DE-AC02-06CH11357, and by JST, CREST (Research Areas: Advanced Core Technologies for Big Data Integration; Development of System Software Technologies  ... 
doi:10.1007/978-3-319-45550-1_12 fatcat:r6ehuceyc5b4vo47oz254xm2su