Filters








18,054 Hits in 17.2 sec

A New Frontier of Kernel Design for Structured Data

Kilho Shin
2013 International Conference on Machine Learning  
Many kernels for discretely structured data in the literature are designed within the framework of the convolution kernel and its generalization, the mapping kernel.  ...  On the other hand, the recent theory of partitionable kernels reveals that the known kernels only take advantage of a very small portion of the potential of the framework.  ...  Secondly, the research of partitionable kernels of hidden degree two is significantly attractive for two reasons: It is an exploration to a new frontier of kernel design, and we have good opportunities  ... 
dblp:conf/icml/Shin13 fatcat:iyrvx4zoafdb7gru6hm4jmu3ju

WebCore

Yuhao Zhu, Vijay Janapa Reddi
2014 SIGARCH Computer Architecture News  
In this paper, we first perform design-space exploration to identify appropriate general-purpose architectures that uniquely fit the characteristics of a popular Web browsing engine.  ...  ., WebKit) are also growing in importance because they provide a common substrate for developing various mobile Web applications.  ...  Altogether, they push the energy-efficiency frontier of general-purpose mobile processor designs to a new level for mobile Web browsing workloads.  ... 
doi:10.1145/2678373.2665749 fatcat:7zitexf4qngahppnzppbsa2yeu

Gunrock: a high-performance graph processing library on the GPU

Yangzihao Wang, Andrew Davidson, Yuechao Pan, Yuduo Wu, Andy Riffel, John D. Owens
2015 Proceedings of the 20th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming - PPoPP 2015  
The inherent irregularity of graph data structures leads to irregularity in data access and control flow, making an efficient implementation on GPUs a significant challenge.  ...  instead implements a novel data-centric abstraction centered on operations on a vertex or edge frontier.  ...  Acknowledgments We thank Joe Mako for providing the speedup chart design.  ... 
doi:10.1145/2688500.2688538 dblp:conf/ppopp/WangDPWRO15 fatcat:utd3goviwva2ta34pi3qorbpwu

An experimental Nexos laboratory using Virtual Xinu

Paul Ruth, Dennis Brylow
2011 2011 Frontiers in Education Conference (FIE)  
This paper describes an extension to the Nexos laboratory that includes a new target platform composed of Qemu virtual machines.  ...  The Nexos Project is a joint effort between Marquette University, the University of Buffalo, and the University of Mississippi to build curriculum materials and a supporting experimental laboratory for  ...  for experimentation with the internal data structures and flow control of fully functional kernels.  ... 
doi:10.1109/fie.2011.6143069 dblp:conf/fie/RuthB11 fatcat:qdvnlxr5bvh6tczkhk5nd4u73e

WebCore: Architectural support for mobile Web browsing

Yuhao Zhu, Vijay Janapa Reddi
2014 2014 ACM/IEEE 41st International Symposium on Computer Architecture (ISCA)  
In this paper, we first perform design-space exploration to identify appropriate general-purpose architectures that uniquely fit the characteristics of a popular Web browsing engine.  ...  ., WebKit) are also growing in importance because they provide a common substrate for developing various mobile Web applications.  ...  Altogether, they push the energy-efficiency frontier of general-purpose mobile processor designs to a new level for mobile Web browsing workloads.  ... 
doi:10.1109/isca.2014.6853239 dblp:conf/isca/ZhuR14 fatcat:slbtljkrwvhpnd2bq5fiqtsk7q

Optimizing General-Purpose CPUs for Energy-Efficient Mobile Web Computing

Yuhao Zhu, Vijay Janapa Reddi
2017 ACM Transactions on Computer Systems  
We see the domain-specific specialized architecture as a promising approach for future mobile CPU designs.  ...  The general-purpose programmability is a particular necessity for web technologies because they involve large pieces of software that are written in a combination of different general-purpose programming  ...  Altogether, they push the energy efficiency frontier of general-purpose mobile processor designs to a new level for mobile web browsing workloads.  ... 
doi:10.1145/3041024 fatcat:iv4yj6huufgpzir6hgt5rch7jm

Gunrock

Yangzihao Wang, Andrew Davidson, Yuechao Pan, Yuduo Wu, Andy Riffel, John D. Owens
2016 Proceedings of the 21st ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming - PPoPP '16  
"Gunrock", our graph-processing system designed specifically for the GPU, uses a high-level, bulk-synchronous, data-centric abstraction focused on operations on a vertex or edge frontier.  ...  For large-scale graph analytics on the GPU, the irregularity of data access and control flow, and the complexity of programming GPUs have been two significant challenges for developing a programmable high-performance  ...  Acknowledgments We thank Joe Mako for providing the speedup chart design.  ... 
doi:10.1145/2851141.2851145 dblp:conf/ppopp/WangDPWRO16 fatcat:72lqk4hh3ve3hdpusd43emajoi

Test schedules for VLSI circuits having built-in test hardware

M.S. Abadir, M.A. Breuer
1987 Computers and Mathematics with Applications  
We introduce the new concept of an I-path which is used to transfer data from one place in a circuit to another, without modifying the data.  ...  A~traet--Numerous built-in test techniques exist for testing structures within a VLSI chip.  ...  The format of a data transfer action is as follows: Transfer (source --* destination) A data processing action calls for propagating data through a structure.  ... 
doi:10.1016/0898-1221(87)90080-0 fatcat:op53wclg5jahfm6ksva33xi23e

A K Nearest Classifier design

Y. Prudent, A. Ennaji
2005 ELCVIA Electronic Letters on Computer Vision and Image Analysis  
This paper presents a multi-classifier system design controlled by the topology of the learning data. Our work also introduces a training algorithm for an incremental self-organizing map (SOM).  ...  This SOM is used to distribute classification tasks to a set of classifiers. Thus, the useful classifiers are activated when new data arrives.  ...  (a) Training data (b) Decision frontiers generated by a SVM (c) Decision frontiers generated by our approach Closing of the decision frontiers The frontiers designed by SVM with a Gaussian kernel is based  ... 
doi:10.5565/rev/elcvia.96 fatcat:uzzjnbu2rrg5xhmrzan55kovz4

Performance Characterization of High-Level Programming Models for GPU Graph Analytics

Yuduo Wu, Yangzihao Wang, Yuechao Pan, Carl Yang, John D. Owens
2015 2015 IEEE International Symposium on Workload Characterization  
We show that efficient building block operators enable more powerful operations for fast information propagation and result in fewer device kernel invocations, less data movement, and fewer global synchronizations  ...  We analyze the impact of these critical factors through three GPU graph analytic frameworks, Gunrock, MapGraph, and VertexAPI2.  ...  GR supports three ways to manipulate the current frontier: advance generates a new frontier by visiting the neighbors of the current vertex frontier; filter generates a new frontier by choosing a subset  ... 
doi:10.1109/iiswc.2015.13 dblp:conf/iiswc/WuWPYO15 fatcat:ueeyfurjrjcjjiyfnqkxkjaxzq

Gunrock: GPU Graph Analytics [article]

Yangzihao Wang, Yuechao Pan, Andrew Davidson, Yuduo Wu, Carl Yang, Leyuan Wang, Muhammad Osama, Chenshan Yuan, Weitang Liu, Andy T. Riffel and John D. Owens
2017 arXiv   pre-print
"Gunrock", our graph-processing system designed specifically for the GPU, uses a high-level, bulk-synchronous, data-centric abstraction focused on operations on a vertex or edge frontier.  ...  For large-scale graph analytics on the GPU, the irregularity of data access and control flow, and the complexity of programming GPUs, have presented two significant challenges to developing a programmable  ...  Thanks to the Altair and Vega-lite teams in the Interactive Data Lab at the University of Washington for graphing help. Joe Mako provided the speedup chart design.  ... 
arXiv:1701.01170v1 fatcat:kgx3yuxsrzegvkbo6x7tz5jbba

Editorial: New Trends and Developments on Structural Control & Health Monitoring

Ersin Aydin, Baki Ozturk, Ehsan Noroozinejad Farsangi, Aleksandra Bogdanovic
2020 Frontiers in Built Environment  
AUTHOR CONTRIBUTIONS All authors listed have made a substantial, direct and intellectual contribution to the work, and approved it for publication. BO wrote the original draft of the Editorial.  ...  ., within the fifth paper, have extended a kernel-based procedure for the modeling of non-harmonic periodic phenomena in Bayesian dynamic linear models.  ...  Six research papers are parts of the Special Issue named "New Trends and Developments on Structural Control & Health Monitoring." A brief description of the findings and scope is presented.  ... 
doi:10.3389/fbuil.2020.00053 fatcat:ajytafmd7rayzkmqvv3gk35ere

GPU Parallel algorithm for the generation of polygonal meshes based on terminal-edge regions [article]

Sergio Salinas, José Ojeda, Nancy Hitschfeld, Alejandro Ortiz-Bernardin
2022 arXiv   pre-print
This paper presents a GPU parallel algorithm to generate a new kind of polygonal meshes obtained from Delaunay triangulations.  ...  We also run a very simple test to show that these meshes can be useful for the virtual element method.  ...  DATA STRUCTURE The GPU algorithm receives as input a triangulation.  ... 
arXiv:2204.05438v1 fatcat:7a5kvgpwxvf4llw7qkx3tamx3u

Combining data reuse exploitationwith data-level parallelization for FPGA targeted hardware compilation: A geometric programming framework

Qiang Liu, George A. Constantinides, Konstantinos Masselos, Peter Y.K. Cheung
2008 2008 International Conference on Field Programmable Logic and Applications  
We expose the dependence between data reuse and data-level parallelization and explore both problems under the on-chip memory constraint for performance-optimal designs within a single optimization step  ...  A geometric programming framework is proposed in this paper to automate exploration of the design space consisting of data reuse (buffering) exploitation and loop-level parallelization, in the context  ...  Clearly, there are the similar descending trends of the frontiers in (a) and (b) over the number of on-chip RAM blocks for three kernels.  ... 
doi:10.1109/fpl.2008.4629928 dblp:conf/fpl/LiuCMC08 fatcat:he4zwbkusvge5lbomnajwvrxum

Accelerating Bayesian Optimization for Biological Sequence Design with Denoising Autoencoders [article]

Samuel Stanton, Wesley Maddox, Nate Gruver, Phillip Maffettone, Emily Delaney, Peyton Greenside, Andrew Gordon Wilson
2022 arXiv   pre-print
However, its adoption for drug design has been hindered by the discrete, high-dimensional nature of the decision variables.  ...  We develop a new approach (LaMBO) which jointly trains a denoising autoencoder with a discriminative multi-task Gaussian process head, allowing gradient-based optimization of multi-objective acquisition  ...  We would like to thank Sait Cakmak, Andres Potapczynski and Sanyam Kapoor for helpful discussions.  ... 
arXiv:2203.12742v2 fatcat:6pgx2qriinbkhe4r42wt3h63de
« Previous Showing results 1 — 15 out of 18,054 results