Filters








42,504 Hits in 5.1 sec

BSGP

Qiming Hou, Kun Zhou, Baining Guo
2008 ACM Transactions on Graphics  
We present BSGP, a new programming language for general purpose computation on the GPU. A BSGP program looks much the same as a sequential C program.  ...  A well-designed BSGP compiler converts BSGP programs to kernels and combines them using optimally allocated temporary streams.  ...  Acknowledgements The authors would like to thank Matt Scott for his help with video production. We are also grateful to the anonymous reviewers for their helpful comments.  ... 
doi:10.1145/1360612.1360618 fatcat:hg3lyfxyc5hilipu4p27loenge

BSGP

Qiming Hou, Kun Zhou, Baining Guo
2008 ACM SIGGRAPH 2008 papers on - SIGGRAPH '08  
We present BSGP, a new programming language for general purpose computation on the GPU. A BSGP program looks much the same as a sequential C program.  ...  A well-designed BSGP compiler converts BSGP programs to kernels and combines them using optimally allocated temporary streams.  ...  Acknowledgements The authors would like to thank Matt Scott for his help with video production. We are also grateful to the anonymous reviewers for their helpful comments.  ... 
doi:10.1145/1399504.1360618 fatcat:niob3f53vbgbvivca2gus2hj6e

Computation of Similarity Measures for Sequential Data using Generalized Suffix Trees

Konrad Rieck, Pavel Laskov, Sören Sonnenburg
2006 Neural Information Processing Systems  
Experiments with network intrusion detection, DNA analysis and text processing applications demonstrate the utility of distances and similarity coefficients for sequences as alternatives to classical kernel  ...  The algorithm uses generalized suffix trees for efficient calculation of various kernel, distance and non-metric similarity functions.  ...  Acknowledgments The authors gratefully acknowledge the funding from Bundesministerium für Bildung und Forschung under the project MIND (FKZ 01-SC40A) and would like to thank Klaus-Robert Müller and Mikio Braun for  ... 
dblp:conf/nips/RieckLS06 fatcat:qpycv6tqybh3vkp6dxc22lsvkm

PageRank Pipeline Benchmark: Proposal for a Holistic System Benchmark for Big-Data Platforms

Patrick Dreher, Chansup Byun, Chris Hill, Vijay Gadepally, Bradley Kuszmaul, Jeremy Kepner
2016 2016 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW)  
The surrounding kernels provide the context for each kernel that allows rigorous definition of both the input and the output for each kernel.  ...  The PageRank pipeline benchmark builds on existing prior scalable benchmarks (Graph500, Sort, and PageRank) to create a holistic benchmark with multiple integrated kernels that can be run together or independently  ...  Figure 5 . 5 Benchmark results for Kernel 1 showing the performance for reading files generated in Kernel 0, sorting them by the start vertex and re-writing the sorted data back to nonvolatile storage.  ... 
doi:10.1109/ipdpsw.2016.89 dblp:conf/ipps/DreherBHGKK16 fatcat:kv4qrs5yprc3bdfxhmwmle5xrq

Convex Kernelized Sorting

Nemanja Djuric, Mihajlo Grbovic, Slobodan Vucetic
2021 PROCEEDINGS OF THE THIRTIETH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE AND THE TWENTY-EIGHTH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE  
Kernelized sorting is a method for aligning objects across two domains by considering within-domain similarity, without a need to specify a cross-domain similarity measure.  ...  In this paper we present the Convex Kernelized Sorting method where, unlike in the previous approaches, the cross-domain object matching is formulated as a convex optimization problem, leading to simpler  ...  Acknowledgments The authors thank Vladan Radosavljevic for helpful discussions, Qingqing Cai and Xin Li for their help during preprocessing of the data for the NLP task, and also acknowledge help from  ... 
doi:10.1609/aaai.v26i1.8314 fatcat:qqicwpo3ovbmfhrl3ggiazqxki

Accelerating steady-state genetic algorithms based on CUDA architecture

Masashi Oiso, Toshiyuki Yasuda, Kazuhiro Ohkura, Yoshiyuki Matumura
2011 2011 IEEE Congress of Evolutionary Computation (CEC)  
CUDA is a general-purpose computation environment for GPUs. The major characteristic of this study is that a steady-state GA is implemented on a GPU based on concurrent kernel execution.  ...  Parallel processing using graphic processing units (GPUs) have attracted much research interest in recent years.  ...  The kernel allocates an individual to a block and a gene to a thread as the sorting kernel. 4) Sorting: The sorting process sorts the population based on the fitness of the individuals; however, the  ... 
doi:10.1109/cec.2011.5949685 dblp:conf/cec/OisoYOM11 fatcat:c7m4is3pr5cuxctuwvtlrpag3a

Look into details

Mohammad H. Foroozannejad, Matin Hashemi, Trevor L. Hodges, Soheil Ghiasi
2010 Proceedings of the ACM SIGPLAN/SIGBED 2010 conference on Languages, compilers, and tools for embedded systems - LCTES '10  
window of this sequence  Moves forward in data space 5 5 2 6 4 1 8 9 3 input output -1 7 2 0.4 7.2 1 //53°around the z axis const R[3][3]={ {0.6,-0.8, 0.0}, {0.8, 0.6, 0.0}, {0.0, 0.0, 1.0}} Rotation3D { for  ...  (i=0; i<3; i++) for (j=0; j<3; j++) B[i] += R[i][j] * A[j] } Application Model  Data Flow Graph  Vertices or Actors  functions, computations  Edges  data dependency, communication between actors  ...   Benchmark Applications:  Two sorting algorithms: Bitonic Sort, Insertion Sort  Two different implementation of the Fast Fourier Transform  Time Delay Estimation kernel  Matrix Multiplication kernel  ... 
doi:10.1145/1755888.1755894 dblp:conf/lctrts/ForoozannejadHHG10 fatcat:ghwlwutplzb4demmnuu73byyyq

Red Fox

Haicheng Wu, Gregory Diamos, Tim Sheard, Molham Aref, Sean Baxter, Michael Garland, Sudhakar Yalamanchili
2014 Proceedings of Annual IEEE/ACM International Symposium on Code Generation and Optimization - CGO '14  
Red Fox is comprised of i) a language front-end for LogiQL which is a commercial query language, ii) an RA to GPU compiler, iii) optimized GPU implementation of RA operators, and iv) a supporting runtime  ...  Modern enterprise applications represent an emergent application arena that requires the processing of queries and computations over massive amounts of data.  ...  The declarative nature of LogiQL is the key reason for its suitability for rapid application development.  ... 
doi:10.1145/2581122.2544166 fatcat:piwvqur6ubdgbobceu4uoafnnq

Protein Remote Homology Detection by Combining Profile-based Protein Representation with Local Alignment Kernel

Bin Liu, Xiaolong Wang, Ruifeng Xu, Buzhou Tang
2014 Journal of Medical and Bioengineering  
) is combined with these profile-based proteins for the prediction.  ...  In this approach, the frequency profiles calculated from the multiple sequence alignments outputted by PSI-BLAST are converted into several profile-based proteins and then the local alignment kernel (LA  ...  The natural language processing techniques have been applied to this field. These methods are based on the similarities between protein sequences and natural languages.  ... 
doi:10.12720/jomb.3.1.17-22 fatcat:jlxqwyg7jng5tlx6ucafghryty

Red Fox

Haicheng Wu, Gregory Diamos, Tim Sheard, Molham Aref, Sean Baxter, Michael Garland, Sudhakar Yalamanchili
2014 Proceedings of Annual IEEE/ACM International Symposium on Code Generation and Optimization  
Red Fox is comprised of i) a language front-end for LogiQL which is a commercial query language, ii) an RA to GPU compiler, iii) optimized GPU implementation of RA operators, and iv) a supporting runtime  ...  Modern enterprise applications represent an emergent application arena that requires the processing of queries and computations over massive amounts of data.  ...  research was supported by the National Science Foundation under grant CCF 0905459, by LogicBlox Corporation, by an NVIDIA Graduate Fellowship, and by Intel via the Intel Science and Technology Center for  ... 
doi:10.1145/2544137.2544166 fatcat:p6mzg2ugcrgi5odbrg6vz7wtwa

Counting and occurrence sort for GPUs using an embedded language

Josef David Svenningsson, Bo Joel Svensson, Mary Sheeran
2013 Proceedings of the 2nd ACM SIGPLAN workshop on Functional high-performance computing - FHPC '13  
The algorithms are implemented in Obsidian, a high-level domain specific language for GPU programming.  ...  The duplicate removing variation turns out to have a natural functional, dataparallel implementation which makes it particularly interesting for GPUs.  ...  Acknowledgments This research has been funded by the Swedish Foundation for Strategic Research (which funds the Resource Aware Functional Programming (RAW FP) Project) and by the Swedish Research Council  ... 
doi:10.1145/2502323.2502325 dblp:conf/icfp/SvenningssonSS13 fatcat:7xsny4aizbcgzmudfyrdxnbmai

Kernelized Sorting

N Quadrianto, A J Smola, Le Song, T Tuytelaars
2010 IEEE Transactions on Pattern Analysis and Machine Intelligence  
This problem can be cast as one of maximizing a quadratic assignment problem with special structure and we present a simple algorithm for finding a locally optimal solution.  ...  TT is supported by a postdoctoral research grant of the Fund for Scientific Research Flanders.  ...  Now for each source language and the target language we can compute a kernel matrix based on abag-of-words kernel; and the ij-th entry of this kernel matrix is the similarity between document i in the  ... 
doi:10.1109/tpami.2009.184 pmid:20724758 fatcat:qw3t2shhxneqpopn3tyr6ilmge

Convolutional Neural Networks over Tree Structures for Programming Language Processing [article]

Lili Mou, Ge Li, Lu Zhang, Tao Wang, Zhi Jin
2015 arXiv   pre-print
In this paper, we propose a novel tree-based convolutional neural network (TBCNN) for programming language processing, in which a convolution kernel is designed over programs' abstract syntax trees to  ...  Programming language processing (similar to natural language processing) is a hot research topic in the field of software engineering; it has also aroused growing interest in the artificial intelligence  ...  Acknowledgments We would like to thank anonymous reviewers for insightful comments; we also thank Xiaowei Sun for annotating bubble sort programs, Yuxuan Liu for data processing, and Weiru Liu for discussion  ... 
arXiv:1409.5718v2 fatcat:w5x3irx75bbahc6gzfofg4p6ki

A reliable approach to customizing linux kernel using custom build tool-chain for ARM architecture and application to agriculture

Mahendra Swain, Rajesh Singh, Anita Gehlot, Md Farukh Hashmi, Shiv Kumar, Manish Parmar
2019 International Journal of Electrical and Computer Engineering (IJECE)  
The framework has been used for the development of a customized kernel for Raspberry Pi on Ubuntu 14.04 host computer.  ...  The proposed techniques uses different tool chains for the kernel customization.  ...  The ABI represents higher level language to machine level language. For different targets Linux kernel get updated with tool chains for different application.  ... 
doi:10.11591/ijece.v9i6.pp4920-4928 fatcat:evthva4yxre2dmwzwsvdxdwwxi

A Recurrent Neural Network Based Patch Recommender for Linux Kernel Bugs [article]

Anusha Bableshwar and Arun Ravindran and Manoj Iyer
2020 arXiv   pre-print
We present an initial design of such an automated tool that uses Recurrent Neural Network (RNN) based Natural Language Processing to generate patch recommendations from user generated bug reports.  ...  We describe the typical bug fixing process employed in the Linux kernel, and motivate the need for a new automated tool flow to fix bugs.  ...  We present an initial design of such an automated tool that uses Recurrent Neural Network (RNN) based Natural Language Processing to generate patch recommendations from user generated bug reports.  ... 
arXiv:2002.08454v1 fatcat:fum2wtigwjhltob3x4yobbkuf4
« Previous Showing results 1 — 15 out of 42,504 results