310 Hits in 5.3 sec

Code Generation for High-Level Synthesis of Multiresolution Applications on FPGAs [article]

Moritz Schmid, Oliver Reiche, Christian Schmitt, Frank Hannig, Jürgen Teich
2014 arXiv   pre-print
Recently, the HIPAcc framework was proposed as a means for automatic code generation of image processing algorithms, based on a Domain-Specific Language (DSL).  ...  Multiresolution Analysis (MRA) is a mathematical method that is based on working on a problem at different scales.  ...  ACKNOWLEDGMENT We thank Richard Membarth for providing the tested multigrid implementation in HIPA cc .  ... 
arXiv:1408.4721v1 fatcat:qoj4yo6gejac7gj7jjhc6qex6u

POEMS: end-to-end performance design of large parallel adaptive computational systems

M.K. Vernon, P.J. Teller, D.J. Sundaram-Stukel, R. Sakellariou, J.R. Rice, E.N. Houstis, A. Dube, E. Deelman, J.C. Browne, R. Bagrodia, V.S. Adve
2000 IEEE Transactions on Software Engineering  
Sophisticated parallelizing compiler techniques allow this representation to be generated automatically for a given parallel program.  ...  AbstractÐThe POEMS project is creating an environment for end-to-end performance modeling of complex parallel and distributed systems, spanning the domains of application software, runtime and operating  ...  ACKNOWLEDGMENTS A number of people from the member institutions represented by the POEMS team contributed to the work.  ... 
doi:10.1109/32.881716 fatcat:w47k2yff2jde3jrnvcf3rrvidu

Metascalable molecular dynamics simulation of nano-mechano-chemistry

F Shimojo, R K Kalia, A Nakano, K Nomura, P Vashishta
2008 Journal of Physics: Condensed Matter  
We have also used the framework for automated execution of adaptive hybrid DFT/MD simulation on a grid of six supercomputers in the US and Japan, in which the number of processors changed dynamically on  ...  We have developed a metascalable (or 'design once, scale on new architectures') parallel application-development framework for first-principles based simulations of nano-mechano-chemical processes on emerging  ...  FS acknowledges the support of a Grant-in-Aid for Scientific Research on Priority Area 'Nanoionics (439)' from MEXT, Japan.  ... 
doi:10.1088/0953-8984/20/29/294204 fatcat:u7vvl52cgvdoxckgzehtlrgdam


Seapahn Meguerdichian, Farinaz Koushanfar, Advait Morge, Dusan Petranovic, Miodrag Potkonjak
2001 Proceedings of the 38th conference on Design automation - DAC '01  
In order to further enhance performance, efficiency, and flexibility of IP design, we have developed a new approach for designing hardware and software IP called MetaCores.  ...  From the algorithmic viewpoint, the main contribution is the introduction of multiresolution search in algorithm optimization and synthesis process.  ...  Trimaran provides a compiler and hardware platform for parallel programmable VLIW and Superscalar architectures.  ... 
doi:10.1145/378239.379028 dblp:conf/dac/MeguerdichianKMPP01 fatcat:4tqiuk3zwndyxnqjtbwso6xzwq

Quantum Molecular Dynamics in the Post-Petaflops Era

Nichols A. Romero, Aiichiro Nakano, Katherine M. Riley, Fuyuki Shimojo, Rajiv K. Kalia, Priya Vashishta, Paul C. Messina
2015 Computer  
Q uantum molecular dynamics (QMD) simulations are a critical tool in various scienti c domains including condensed-matter physics, materials science, chemistry, and biology.  ...  As the scale of QMD simulations has grown beyond the canonical metrics of time and system size, there has been a paradigm shift to more "naturally" parallel methods such as divide-and-conquer, equilibrium  ...  For example, MADNESS (Multiresolution Adaptive Numerical Evaluation for Scientific Simulation) aims to facilitate application development by providing a higher-level abstraction for node parallelism.  ... 
doi:10.1109/mc.2015.337 fatcat:rxm2uiybgjdhpfppnkvdwk5fbu

Using High-Level RTOS Models for HW/SW Embedded Architecture Exploration: Case Study on Mobile Robotic Vision

François Verdier, Benoît Miramond, Mickaël Maillard, Emmanuel Huck, Thomas Lefebvre
2008 EURASIP Journal on Embedded Systems  
We also describe our executable RTOS model written in SystemC allowing an early simulation of our application on top of its specific scheduling layer.  ...  Based on this model, a methodology is discussed and results are given on the exploration and validation of a distributed platform adapted to this vision system.  ...  In our case, the use of an RTOS becomes essential in a domain where applications exhibit dynamic and adaptive behaviours.  ... 
doi:10.1155/2008/349465 fatcat:iioaiect6rciph3blai2jka3ni

PPM – A highly efficient parallel particle–mesh library for the simulation of continuum systems

I.F. Sbalzarini, J.H. Walther, M. Bergdorf, S.E. Hieber, E.M. Kotsalis, P. Koumoutsakos
2006 Journal of Computational Physics  
The coexistence of meshes and particles, allows for the development of a consistent and adaptive numerical method, but it presents a set of challenging parallelization issues that have hindered in the  ...  The present library solves the key parallelization issues involving particle-mesh interpolations and the balancing of processor particle loading, using a novel adaptive tree for mixed domain decompositions  ...  For purely particle-based simulations a number of application-specific parallel software libraries is also available, such as PARTI for Monte-Carlo simulations [12] , or the parallel utilities library  ... 
doi:10.1016/ fatcat:jkvh4xqjpbgvlike56zcvsp4sq

Ultrascale Visualization of Climate Data

Dean N. Williams, Timo Bremer, Charles Doutriaux, John Patchett, Sean Williams, Galen Shipman, Ross Miller, David R. Pugmire, Brian Smith, Chad Steed, E. Wes Bethel, Hank Childs (+15 others)
2013 Computer  
F ueled by exponential increases in the computational and storage capabilities of high-performance computing platforms, climate simulations are evolving toward higher numerical fidelity, complexity, volume  ...  Working in close collaboration with domain experts, the Ultrascale Visualization Climate Data Analysis Tools (UV-CDAT) project aims to provide high-level solutions to a variety of climate data analysis  ...  Complexity stems from the need to incorporate a broad nexus of climate and other related science domains such as climate adaptation and mitigation for water, energy, and agriculture conservation. • Flexible  ... 
doi:10.1109/mc.2013.119 fatcat:7kd3wgswf5bdfljj5gnifzl45u

A metascalable computing framework for large spatiotemporal-scale atomistic simulations

Ken-ichi Nomura, Richard Seymour, Weiqiang Wang, Hikmet Dursun, Rajiv K. Kalia, Aiichiro Nakano, Priya Vashishta, Fuyuki Shimojo, Lin H. Yang
2009 2009 IEEE International Symposium on Parallel & Distributed Processing  
A metascalable (or "design once, scale on new architectures") parallel computing framework has been developed for large spatiotemporal-scale atomistic simulations of materials based on spatiotemporal data  ...  parallel (STEP) approach based on temporal locality to predict longtime dynamics, while introducing multiple parallelization axes; and (3) a tunable hierarchical cellular decomposition (HCD) parallelization  ...  We thank the staff of the Argonne Leadership Computing Facility for their help on the BlueGene/P benchmark.  ... 
doi:10.1109/ipdps.2009.5160992 dblp:conf/ipps/NomuraSWDKNVSY09 fatcat:rbb2ngnybvak5peg3qu23xfxni

The Richness of Modeling and Simulation and an Index of Its Body of Knowledge [chapter]

Tuncer Ören
2014 Advances in Intelligent Systems and Computing  
The increasing importance of modeling and simulation (M&S) is emphasized. Richness and stakeholders of M&S are documented. Three aspects of professionalism of M&S are clarified.  ...  The conclusions section terminates with the fact that wide-spread application and ever increasing importance of modelling and simulation necessitate the preservation of the integrityof the M&S discipline  ...  It is the ac-cepted ontology for a specific domain" (Wiki-BoK). It is worthwhile underlining the fact that a BoK of a specific domain or discipline is its ontology.  ... 
doi:10.1007/978-3-319-03581-9_1 fatcat:googllzqjvfylbaod5rlhmguzi

Far voxels

Enrico Gobbetti, Fabio Marton
2005 ACM Transactions on Graphics  
hardware occlusion queries for culling occluded subtrees, and asynchronous I/O for detecting and avoiding data access latencies.  ...  Since the granularity of the multiresolution structure is coarse, data management, traversal and occlusion culling cost is amortized over many graphics primitives.  ...  Introduction Many important application domains, including 3D scanning, computer aided design, and numerical simulation, require the interactive inspection of huge geometric models.  ... 
doi:10.1145/1073204.1073277 fatcat:7mahgiqe5zbyhafyhykwar55rm

Far voxels

Enrico Gobbetti, Fabio Marton
2005 ACM SIGGRAPH 2005 Papers on - SIGGRAPH '05  
hardware occlusion queries for culling occluded subtrees, and asynchronous I/O for detecting and avoiding data access latencies.  ...  Since the granularity of the multiresolution structure is coarse, data management, traversal and occlusion culling cost is amortized over many graphics primitives.  ...  Introduction Many important application domains, including 3D scanning, computer aided design, and numerical simulation, require the interactive inspection of huge geometric models.  ... 
doi:10.1145/1186822.1073277 fatcat:fbzic5ydfjgutlu7csvkcs7xxa

Trends in Data Locality Abstractions for HPC Systems

Didem Unat, Anshu Dubey, Torsten Hoefler, John Shalf, Mark Abraham, Mauro Bianco, Bradford L. Chamberlain, Romain Cledat, H. Carter Edwards, Hal Finkel, Karl Fuerlinger, Frank Hannig (+9 others)
2017 IEEE Transactions on Parallel and Distributed Systems  
However, with the increasing complexity of the memory hierarchy and higher parallelism in emerging HPC systems, locality management has acquired a new urgency.  ...  Support for expression of data locality has been explored in the past, but those efforts have had only modest success in being adopted in HPC applications for various reasons.  ...  An embedded domain-specific language (DSL) is a technique for delivering a languagebased solution within a host general-purpose language.  ... 
doi:10.1109/tpds.2017.2703149 fatcat:vjalwrujhrex7cibod3qerf3z4

Achieving Portable Performance For Wavelet Compression Using Data Parallel Primitives [article]

Shaomeng Li, Nicole Marsaglia, Vincent Chen, Christopher Sewell, John Clyne, Hank Childs
2017 Eurographics Symposium on Parallel Graphics and Visualization  
We contribute a new implementation of the wavelet transform algorithm that uses data parallel primitives from the VTK-m library.  ...  We also study the efficacy of this implementation over multiple architectures against hardware-specific comparators.  ...  National Center for Atmospheric Research (NCAR) also wishes to thank our primary sponsor, the National Science Foundation.  ... 
doi:10.2312/pgv.20171095 dblp:conf/egpgv/LiMCSCC17 fatcat:uokc2nxn5rhdhhsemrz42vevhu

Distributed Information and Computation in Scientific and Engineering Environments

Nicholas M. Patrikalakis, Paul J. Fortier, Yannis E. Ioannidis, Christos N. Nikolaou, Allan R. Robinson, Jarek R. Rossignac, Alvar Vinacua, Stephen L. Abrams
1999 D-Lib Magazine  
The NSF Invitational Workshop on Distributed Information, Computation, and Process Management for Scientific and Engineering Environments (DICPM) brought together domain specialists from engineering and  ...  The workshop brought together domain specialists from engineering and the ocean, atmospheric, and space sciences involved in the development and use of simulations of complex systems, and computer scientists  ...  ., numeric values, dates, and format or protocol specifications).  ... 
doi:10.1045/april99-abrams fatcat:5krbkkwwubdsznmkjcr4ppxe7m
« Previous Showing results 1 — 15 out of 310 results