
Towards a heterogeneous simulation kernel for system-level models: a SystemC kernel for synchronous data flow models

H.D. Patel, S.K. Shukla
2005 IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems  
As SystemC gains popularity as a modeling language of choice for system-on-chip (SOC) designs, heterogeneous modeling in SystemC and efficient simulation become increasingly important.  ...  In an SOC model, different components may need to be naturally expressible in different Models Of Computations.  ...  This is because heterogeneous Models Of Computation [9] are better suited for different parts of a system design.  ... 
doi:10.1109/tcad.2005.850819 fatcat:egq3bkmijrc7zpsmzxy4jjbeuq

Towards a heterogeneous simulation kernel for system level models

Hiren D. Patel, Sandeep K. Shukla
2004 Proceedings of the 14th ACM Great Lakes symposium on VLSI - GLSVLSI '04  
As SystemC gains popularity as a modeling language of choice for system-on-chip (SOC) designs, heterogeneous modeling in SystemC and efficient simulation become increasingly important.  ...  In an SOC model, different components may need to be naturally expressible in different Models Of Computations.  ...  This is because heterogeneous Models Of Computation [9] are better suited for different parts of a system design.  ... 
doi:10.1145/988952.989013 dblp:conf/glvlsi/PatelS04 fatcat:mh7b4aqzhrdw7i5yoszxanbldq

KBERG: A MatLab toolbox for nonlinear kernel-based regularization and system identification

M. Mazzoleni, M. Scandella, F. Previdi
2020 IFAC-PapersOnLine  
We present KBERG, a MatLab package for nonlinear Kernel-BasEd ReGularization and system identification.  ...  The toolbox provides a complete environment for running experiments on simulated and experimental data from both static and dynamical systems.  ...  Peculiar features of KBERG are: (i) the definition of an all-in-one environment for testing kernel-based nonlinear system identification approaches; (ii) the possibility to easily combine experiments (i.e  ... 
doi:10.1016/j.ifacol.2020.12.1340 fatcat:gq3hcgxvnfepba2u5zwm4gbz3a

Power profiling and optimization for heterogeneous multi-core systems

Kuen Hung Tsoi, Wayne Luk
2011 SIGARCH Computer Architecture News  
Processing speed and energy efficiency are two of the most critical issues for computer systems.  ...  This paper presents a systematic approach for profiling the power and performance characteristics of application targeting heterogeneous multi-core computing platforms.  ...  Since our measurements are in the scale of seconds, the experiments may need to repeat the kernels for a prolonged execution time.  ... 
doi:10.1145/2082156.2082159 fatcat:o5cxa2rgjzdcnffu4aei7lqv4i

Systolic-CNN: An OpenCL-defined Scalable Run-time-flexible FPGA Accelerator Architecture for Accelerating Convolutional Neural Network Inference in Cloud/Edge Computing [article]

Akshay Dua, Yixing Li, Fengbo Ren
2020 arXiv   pre-print
Systolic-CNN is also run-time-flexible in the context of multi-tenancy cloud/edge computing, which can be time-shared to accelerate a variety of CNN models at run time without the need of recompiling the  ...  The existing OpenCL-defined FPGA accelerators for CNN inference are insufficient due to limited flexibility for supporting multiple CNN models at run time and poor scalability resulting in underutilized  ...  The host kernel should invoke the corresponding computation kernel in Systolic-CNN just once for mapping each layer of a CNN model depending on the CNN model structure.  ... 
arXiv:2012.03177v1 fatcat:h5alzshjybhv7kmpmeb46an3qm

Performance modeling in CUDA streams — A means for high-throughput data processing

Hao Li, Di Yu, Anand Kumar, Yi-Cheng Tu
2014 2014 IEEE International Conference on Big Data (Big Data)  
Understanding the performance of operations as a result of resource consumption is thus a premise in the design of G-SDMS.  ...  Specifically, we explore the connection between performance and resource occupancy of compute-bound kernels and develop a model that can predict the performance of such kernels.  ...  The experimental setup is the same as that described in Section III-C except we design a series of kernels for our experiments in this section.  ... 
doi:10.1109/bigdata.2014.7004245 pmid:26566545 pmcid:PMC4640924 dblp:conf/bigdataconf/LiY0T14 fatcat:2e2p4kjmgfecta3h4yfozebxgm

High performance MRI simulations of motion on multi-GPU systems

Christos G Xanthis, Ioannis E Venetis, Anthony H Aletras
2014 Journal of Cardiovascular Magnetic Resonance  
The displacement of the anatomical model isochromats was calculated within the Graphics Processing Unit (GPU) kernel for every timestep of the pulse sequence.  ...  Last, MRISIMUL demonstrated an almost linear scalable performance with the increasing number of available GPU cards, in both single-node and multi-node multi-GPU computer systems.  ...  To test for scalability in multi-GPU experiments, the execution times of the computational kernel were recorded for a total of 1, 2, 3, 4, 5 and 6 GPU cards.  ... 
doi:10.1186/1532-429x-16-48 pmid:24996972 pmcid:PMC4107941 fatcat:3l4spbwnnrfslnsoiz44vhwcze

Kernel Method Based Human Model for Enhancing Interactive Evolutionary Optimization

Yan Pei, Qiangfu Zhao, Yong Liu
2015 The Scientific World Journal  
In this paper, we propose a method to establish a human model in projected high dimensional search space by kernel classification for enhancing IEC search.  ...  The human model is established by this method for predicting potential perceptual knowledge of human. With the human model, we design an evolution control method to enhance IEC search.  ...  Conflict of Interests The authors declare that there is no conflict of interests regarding the publication of this paper.  ... 
doi:10.1155/2015/185860 pmid:25879050 pmcid:PMC4386548 fatcat:u5pehbib3bfkfbt4n4digwpiaq

Parallelizing MPEG Decoder with Scalable Streaming Computation Kernels

Josip Knezović, Igor Čavrak, Daniel Hofman
2014 Automatika  
In this paper, we describe a scalable and portable parallelized implementation of a MPEG decoder using a streaming computation paradigm, tailored to new generations of multi-core systems.  ...  An architecture-independent StreamIt language is used for design, optimization and implementation of parallelized segments, while the developed STREAMGATE interface provides a communication mechanism between  ...  RELATED WORK The idea of streaming model of computation has been present in computer science for quite a long time [9] [10] .  ... 
doi:10.7305/automatika.2014.12.617 fatcat:sa6ebofcvfbznitpzvehtpktia

A GPU-Based Simulation Kernel within Heterogeneous Collaborative Computation on Large-Scale Artificial Society

Li Zhen, Qiuxiao Gang, Guo Gang, Chen Bin
2014 International Journal of Modeling and Optimization  
But there are lots of problems in GPU-based ABS. The paper proposes a GPU-based conservative parallel discrete event simulation algorithm for ABS.  ...  The graphic processing unit (GPU) offers strong computing ability with relatively low energy and monetary cost, and it has been widely used in the field of large-scale simulation and computation.  ...  EXPERIMENT In order to verify the performance of the GPU-based simulation kernel, the paper designs a Game of Life model to be scheduled and processed on the GPU.  ... 
doi:10.7763/ijmo.2014.v4.374 fatcat:hsdsarc4rjhzbkbtgeqy67xz3q

On the design of LQR kernels for efficient controller learning

Alonso Marco, Philipp Hennig, Stefan Schaal, Sebastian Trimpe
2017 2017 IEEE 56th Annual Conference on Decision and Control (CDC)  
For a first-order system, we construct two kernels that specifically leverage the structure of the well-known Linear Quadratic Regulator (LQR), yet retain the flexibility of Bayesian nonparametric learning  ...  Simulations of uncertain linear and nonlinear systems demonstrate that the LQR kernels yield superior learning performance.  ...  In [5] , a kernel is designed to model information from simulation and physical experiments, in order to leverage both sources of information for RL.  ... 
doi:10.1109/cdc.2017.8264429 dblp:conf/cdc/MarcoHST17 fatcat:6mo3g4oqf5fhfmdeplc5ehnf4y

Quingo: A Programming Framework for Heterogeneous Quantum-Classical Computing with NISQ Features [article]

The Quingo Development Team
2021 arXiv   pre-print
We believe the Quingo framework could contribute to the clarification of key techniques in the design of future HQCC systems.  ...  We propose a six-phase quantum program life-cycle model matching the refined HQCC model, which is implemented by a runtime system.  ...  ACKNOWLEDGMENTS We would like to thank Center for Quantum Computing, Peng Cheng Laboratory for their generous support.  ... 
arXiv:2009.01686v2 fatcat:2ap3h7s2xfd73daptwkqbxf6f4

A Reference Model of Cloud Operating and Open Source Software Implementation Mapping

Wenke Ji, Jiangbo Ma, Xiaoyong Ji
2009 2009 18th IEEE International Workshops on Enabling Technologies: Infrastructures for Collaborative Enterprises  
In this article, a reference model is proposed. The model divides the cloud computing system with various components into a 3-layer hierarchy called infrastructure, platform and application.  ...  Also the open source software implementation for the components in the model is addressed. Keywords: Reference Model, Cloud Computing, Open Source Software, FCAPS I.  ...  DETAIL DESCRIPTION OF THE REFERENCE MODEL In the model, the cloud system is clarified with a 3-layer hierarchy as follows: Cloud computing infrastructure layer provides a cluster of hardware resource such  ... 
doi:10.1109/wetice.2009.28 dblp:conf/wetice/JiMJ09 fatcat:l3gljizjgvg3pgoh3gbwksmbou

DOSA: Design Optimizer for Scientific Applications

David A. Bader, Viktor K. Prasanna
2007 2007 IEEE International Parallel and Distributed Processing Symposium  
The run-time system is a portable interface that enables dynamic application optimization by interfacing with the output of DOSA.  ...  for speed (or power) at design-time and use a run-time optimizer.  ...  In the DOSA framework, at design time, the designer models the kernels and architectures of her/his application.  ... 
doi:10.1109/ipdps.2007.370494 dblp:conf/ipps/BaderP07 fatcat:vuso7sdnfrhytfnxc4njwzqm6m

A kernel-based method to determine optimal sampling times for the simultaneous estimation of the parameters of rival mathematical models

Brecht M. R. Donckels, Dirk J.W. De Pauw, Peter A. Vanrolleghem, Bernard De Baets
2009 Journal of Computational Chemistry  
In this article, a method was presented to design an experiment to each of the nine models (Fig. 13).  ...  In this article, a kernel-based method is presented to determine optimal sampling times to simultaneously estimate the parameters of rival models in a single experiment.  ... 
doi:10.1002/jcc.21171 pmid:19165773 fatcat:jm55aty35jfctomp7kh6leth44