Filters








36,245 Hits in 4.9 sec

An efficient profile-analysis framework for data-layout optimizations

Shai Rubin, Rastislav Bodík, Trishul Chilimbi
2002 Proceedings of the 29th ACM SIGPLAN-SIGACT symposium on Principles of programming languages - POPL '02  
We propose a parameterizable framework for data-layout optimization of generalpurpose applications.  ...  Acknowledging that finding an optimal layout is not only NP-hard, but also poorly approximable, our framework finds a good layout by searching the space of possible layouts, with the help of profile feedback  ...  The work was started when Shai Rubin was an intern at Microsoft Research.  ... 
doi:10.1145/503272.503287 dblp:conf/popl/RubinBC02 fatcat:jldcejcfonhdvld34luzzsan2e

An efficient profile-analysis framework for data-layout optimizations

Shai Rubin, Rastislav Bodík, Trishul Chilimbi
2002 SIGPLAN notices  
We propose a parameterizable framework for data-layout optimization of generalpurpose applications.  ...  Acknowledging that finding an optimal layout is not only NP-hard, but also poorly approximable, our framework finds a good layout by searching the space of possible layouts, with the help of profile feedback  ...  The work was started when Shai Rubin was an intern at Microsoft Research.  ... 
doi:10.1145/565816.503287 fatcat:oyncsxtkmvck5p6cspuooe767e

Optimizing Memory Efficiency for Deep Convolutional Neural Networks on GPUs [article]

Chao Li, Yi Yang, Min Feng, Srimat Chakradhar, Huiyang Zhou
2016 arXiv   pre-print
In this work, we study the memory efficiency of various CNN layers and reveal the performance implication from both data layouts and memory access patterns.  ...  Experiments show the universal effect of our proposed optimizations on both single layers and various networks, with up to 27.9x for a single layer and up to 5.6x on the whole networks.  ...  The results labeled 'Opt+Optimized Transform' show the one with our optimized transformation. Their comparison highlights the impact of an efficient data layout transformation.  ... 
arXiv:1610.03618v1 fatcat:r2estvyawfaute3citbu42odm4

Multi-objective Human-computer Co-operative Co-evolutionary Method Based on Non-dominated Sorting Strategy

Chen Jing, Huo Junzhou, Zhou Jianjun, Wu Hanyang
2014 Open Electrical & Electronic Engineering Journal  
set for engineers to choose from.  ...  optimization idea is integrated to avoid the "flooding" phenomenon that occurs during the combination of the artificial solutions and the algorithm solutions.  ...  In addition, Murthy [7] proposed an asynchronous team framework which substituted the algorithm operators.  ... 
doi:10.2174/1874129001408010213 fatcat:mxtsqshhsja7rgatgotupqswim

On the integrated design of curvilinearly grid-stiffened panel with non-uniform distribution and variable stiffener profile

Dachuan Liu, Peng Hao, Kunpeng Zhang, Kuo Tian, Bo Wang, Gang Li, Weixiu Xu
2020 Materials & design  
Wang et al. [13] proposed a two-stage optimization framework for cylindrical or flat stiffened panels. In the second stage, the location of each stiffener in orthogrid configuration was optimized.  ...  In addition to the layout design, the variable profile design method of grid-stiffened structures is also proposed in this paper.  ...  Then, they [32] proposed an efficient optimization framework of cylindrical stiffened shells with reinforced cutouts by curvilinear stiffeners.  ... 
doi:10.1016/j.matdes.2020.108556 fatcat:br44lncdmjcbdkrf7z64niyyma

Adaptive GPU Array Layout Auto-Tuning

Nicolas Weber, Michael Goesele
2016 Proceedings of the ACM Workshop on Software Engineering Methods for Parallel and High Performance Applications - SEM4HPC '16  
Our adaptive optimization requires in principle profiling data for an extremely high number of scenarios which cannot be exhaustively evaluated for complex applications.  ...  We present an auto-tuner which optimizes array layouts in CUDA applications. Depending on the data and program parameters, kernels can have varying optimal configurations.  ...  The required data analysis can furthermore introduce an overhead depending on the data analysis actually performed.  ... 
doi:10.1145/2916026.2916031 dblp:conf/hpdc/WeberG16 fatcat:ca54f3ccnfcwzl3l5kiwbdtrna

JITProf: pinpointing JIT-unfriendly JavaScript code

Liang Gong, Michael Pradel, Koushik Sen
2015 Proceedings of the 2015 10th Joint Meeting on Foundations of Software Engineering - ESEC/FSE 2015  
This paper presents JITPROF, a profiling framework to dynamically identify code locations that prohibit profitable JIT optimizations.  ...  Most modern JavaScript engines use just-in-time (JIT) compilation to translate parts of JavaScript code into efficient machine code at runtime.  ...  The authors would like to thank Luca Della Toffola for his valuable feedback.  ... 
doi:10.1145/2786805.2786831 dblp:conf/sigsoft/GongPS15 fatcat:gqjuyz2gvfhefib5zlzzkdjbwi

Parametric Optimization and Decision Support Model Framework for Life Cycle Cost Analysis and Life Cycle Assessment of Flexible Industrial Building Structures Integrating Production Planning

Julia Reisinger, Stefan Kugler, Iva Kovacic, Maximilian Knoll
2022 Buildings  
This paper presents a parametric optimization and decision support (POD) model framework that enables automated structural analysis and simultaneous calculation of life cycle cost (LCC), life cycle assessment  ...  an early design stage.  ...  In fact, an integrated decision support framework that optimizes building structures and layouts towards improved sustainability and flexibility while taking into account production layout scenarios is  ... 
doi:10.3390/buildings12020162 fatcat:kat4npnahzhbvebaf2wcn4l7mq

Data Structures and Transformations for Physically Based Simulation on a GPU [chapter]

Perhaad Mistry, Dana Schaa, Byunghyun Jang, David Kaeli, Albert Dvornik, Dwight Meglan
2011 Lecture Notes in Computer Science  
These data structures allow us to maintain a physics-based abstraction of the underlying data, reduce programmer effort and obtain 6x-8x speedup over previously implemented GPU kernels.  ...  Implementing physically-based simulation on data-parallel hardware requires preprocessing overhead which affects application performance.  ...  For the CPU layout, function f () was applied as in the loop nest in Figure 4 . For the loop in Figure 4 , data layout 1 would be optimal.  ... 
doi:10.1007/978-3-642-19328-6_17 fatcat:bp6z6lxv25abbic7f7fstxhu3e

Author index

2012 2012 19th International Conference on High Performance Computing  
data layout for multithreaded stencil codes on CPUs and GPUs Jeelani, SMD Profiling and scalability of the high resolution NCEP model for Weather and Climate Simulations Jensen, Scott Visualization of  ...  Big Data Analytics with Apache Hadoop Houngkaew, Charuwat Scalable Performance of ScaleGraph for Large Scale Graph Analysis Huang, Tao ElastiCat: A load rebalancing framework for cloud-based key-value  ... 
doi:10.1109/hipc.2012.6507473 fatcat:7k6al4ozjbecrjykky7kyd5b7e

Text graphic separation in Indian newspapers

Ritu Garg, Anukriti Bansal, Santanu Chaudhury, Sumantra Dutta Roy
2013 Proceedings of the 4th International Workshop on Multilingual OCR - MOCR '13  
In this paper we propose a novel framework for learning optimal parameters for text graphic separation in the presence of complex layouts.  ...  The learning problem has been formulated as an optimization problem using EM algorithm to learn optimal parameters depending on the nature of the document content.  ...  The problem of layout analysis on newspaper data has been addressed by few researchers [8, 7, 9, 10, 17, 18, 20, 16] .  ... 
doi:10.1145/2505377.2505393 dblp:conf/icdar/GargBCR13 fatcat:f6pnvpn2rnh5tbxlotbsaxid64

Thoth

Mayuresh Kunjir, Prajakta Kalmegh, Shivnath Babu
2014 Proceedings of the VLDB Endowment  
(ii) Data Layout Recommender app will allow searching for the optimal data layout in the multi-system setting.  ...  of resources, designing a multi-system-aware optimizer, tuning of configuration parameters on each system, data storage, and layout schemes.  ...  For instance, log analysis can be performed on parallel databases like Teradata/Greenplum, batch systems like Hadoop, column-oriented systems for efficient OLAP analysis like Vertica, or in-memory analytics  ... 
doi:10.14778/2733004.2733062 fatcat:53gzqnjcrfajnhkjd4cuommcj4

Front Matter: Volume 9427

2015 Design-Process-Technology Co-optimization for Manufacturability IX  
The publisher is not responsible for the validity of the information or for any outcomes resulting from reliance thereon.  ...  The complete citation is used on the first page, and an abbreviated version on subsequent pages.  ...  on the design and process for N10/N7 metal layers [9427-8] 9427 0A An efficient auto TPT stitch guidance generation for optimized standard cell design [9427-9] 9427 0B Yield-aware mask assignment  ... 
doi:10.1117/12.2193828 fatcat:7rndz5faybdp3k3r2cednynm24

Design space exploration for flexibility assessment and decision making support in integrated industrial building design

Julia Reisinger, Maximilian Knoll, Iva Kovacic
2021 Optimization and Engineering  
Results confirmed the efficiency of the process that an evolutionary multi-objective optimization algorithm can be implemented in future research to enable multidisciplinary design optimization for flexible  ...  To address these issues, this paper presents a parametric design process for efficient design space exploration in IIBD.  ...  the FFG BRIDGE (30th call for tenders) programme (Grant No. 877159).  ... 
doi:10.1007/s11081-021-09614-2 fatcat:csflpwssibdevfedanrywtp5iy

Dynamic Compilation for Reducing Energy Consumption of I/O-Intensive Applications [chapter]

Seung Woo Son, Guangyu Chen, Mahmut Kandemir, Alok Choudhary
2006 Lecture Notes in Computer Science  
This paper presents the details of a dynamic compilation framework developed specifically for I/O-intensive large-scale applications.  ...  Amongst these, I/O is becoming a major bottleneck, impeding the expected pace of scientific discovery and analysis of data.  ...  An illustration of performance optimization rules incorporated for data access strategies for efficient I/O.  ... 
doi:10.1007/978-3-540-69330-7_32 fatcat:lomfjpasjngldecgl4yajn7wwa
« Previous Showing results 1 — 15 out of 36,245 results