
High-speed string searching against large dictionaries on the Cell/B.E. Processor

Daniele Paolo Scarpazza, Oreste Villa, Fabrizio Petrini
<span title="">2008</span> <i title="IEEE"> <a target="_blank" rel="noopener" href="https://fatcat.wiki/container/5vsih2yegrfubf7el6ncng3mgq" style="color: black;">Proceedings, International Parallel and Distributed Processing Symposium (IPDPS)</a> </i> &nbsp;
We have parallelized a popular string searching algorithm, Aho-Corasick, on the IBM Cell/B.E. processor, with the goal of performing exact string matching against large dictionaries.  ...  In this article we propose a novel approach to fully exploit the DMA-based communication mechanisms of the Cell/B.E. to provide an unprecedented level of aggregate performance with irregular access patterns  ...  The advent of multi-core architectures, such as the Cell/B.E. processor, is adding an important player to the game.  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1109/ipdps.2008.4536300">doi:10.1109/ipdps.2008.4536300</a> <a target="_blank" rel="external noopener" href="https://dblp.org/rec/conf/ipps/ScarpazzaVP08.html">dblp:conf/ipps/ScarpazzaVP08</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/jlrw5pfcwzfupn4zf2s3e2436m">fatcat:jlrw5pfcwzfupn4zf2s3e2436m</a> </span>

Optimized On-Chip-Pipelined Mergesort on the Cell/B.E [chapter]

Rikard Hultén, Christoph W. Kessler, Jörg Keller
<span title="">2010</span> <i title="Springer Berlin Heidelberg"> <a target="_blank" rel="noopener" href="https://fatcat.wiki/container/2w3awgokqne6te4nvlofavy5a4" style="color: black;">Lecture Notes in Computer Science</a> </i> &nbsp;
for larger data sets.  ...  Limited bandwidth to off-chip main memory is a performance bottleneck in chip multiprocessors for streaming computations, such as Cell/B.E., and this will become even more problematic with an increasing  ...  We thank Niklas Dahl and his colleagues from IBM Sweden for giving us access to their QS20 blade server.  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1007/978-3-642-15291-7_19">doi:10.1007/978-3-642-15291-7_19</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/lvnixmfiebg5jg3fhwdyhcbydu">fatcat:lvnixmfiebg5jg3fhwdyhcbydu</a> </span>

High performance combinatorial algorithm design on the Cell Broadband Engine processor

David A. Bader, Virat Agarwal, Kamesh Madduri, Seunghwa Kang
<span title="">2007</span> <i title="Elsevier BV"> <a target="_blank" rel="noopener" href="https://fatcat.wiki/container/sv4mpg7lmfaqdp24ohp5qqiobm" style="color: black;">Parallel Computing</a> </i> &nbsp;
We design efficient parallel algorithms for these combinatorial kernels, and exploit concurrency at multiple levels on the Cell/B.E. processor.  ...  While the Cell/B.E. processor is architected for multimedia applications with regular processing requirements, we are interested in its performance on problems with non-uniform memory access patterns.  ...  We also acknowledge Georgia Institute of Technology, its Sony-Toshiba-IBM Center of Competence, and the National Science Foundation, for the use of Cell Broadband Engine resources that have contributed  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1016/j.parco.2007.09.005">doi:10.1016/j.parco.2007.09.005</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/lyg43stjnbapthceoyaw4h66su">fatcat:lyg43stjnbapthceoyaw4h66su</a> </span>

3D Seismic Imaging through Reverse-Time Migration on Homogeneous and Heterogeneous Multi-Core Processors

Mauricio Araya-Polo, Félix Rubio, Raúl de la Cruz, Mauricio Hanzich, José María Cela, Daniele Paolo Scarpazza
<span title="">2009</span> <i title="Hindawi Limited"> <a target="_blank" rel="noopener" href="https://fatcat.wiki/container/fw4azkpu65d2thmrwfkoawyxse" style="color: black;">Scientific Programming</a> </i> &nbsp;
In this paper, we present a mapping of the RTM computational kernel to the IBM Cell/B.E. processor that reaches close-to-optimal performance.  ...  Also, the RTM-Cell/B.E. combination proves to be a strong competitor in the seismic arena.  ...  The authors thank Jizhu Lu and Michael Perrone from the IBM T.J. Watson Research Center for their valuable contributions to the optimization of the Cell/B.E. RTM kernel.  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1155/2009/382638">doi:10.1155/2009/382638</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/ih5zuzjxtzbvxedne24c6xozwq">fatcat:ih5zuzjxtzbvxedne24c6xozwq</a> </span>

Streaming Model Based Volume Ray Casting Implementation for Cell Broadband Engine

Jusub Kim, Joseph JaJa
<span title="">2009</span> <i title="Hindawi Limited"> <a target="_blank" rel="noopener" href="https://fatcat.wiki/container/fw4azkpu65d2thmrwfkoawyxse" style="color: black;">Scientific Programming</a> </i> &nbsp;
However the recent introduction of the Cell Broadband Engine (Cell B.E.) processor, which consists of 9 heterogeneous cores designed to handle extremely demanding computations with large streams of data  ...  The implementation is designed to take full advantage of the computational power and memory bandwidth of the Cell B.E. using an intricate orchestration of the ray casting computation on the available heterogeneous  ...  David Bader for generously allowing us to use Cell Processors at the STI Cell Center of Competence at Georgia Tech for this research.  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1155/2009/248465">doi:10.1155/2009/248465</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/6zqtsik54bddvgidfjzelfprra">fatcat:6zqtsik54bddvgidfjzelfprra</a> </span>

The reverse-acceleration model for programming petascale hybrid systems

S. Pakin, M. Lang, D. J. Kerbyson
<span title="">2009</span> <i title="IBM"> <a target="_blank" rel="noopener" href="https://fatcat.wiki/container/cr766v23pncdhc7hmikak4m7pi" style="color: black;">IBM Journal of Research and Development</a> </i> &nbsp;
The typical model for programming such systems is host-centric: The general-purpose processor orchestrates the computation, offloading performance-critical work to the accelerator, and data are communicated  ...  Points on curves represent measured data. Memory flow controller (MFC) DMA commands [43] are used for SPE-to-SPE and SPE-to-main memory data transfers.  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1147/jrd.2009.5429074">doi:10.1147/jrd.2009.5429074</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/vaxso4lh35hddmdfdbfbacpww4">fatcat:vaxso4lh35hddmdfdbfbacpww4</a> </span>

Solving dense linear systems on platforms with multiple hardware accelerators

Gregorio Quintana-Ortí, Francisco D. Igual, Enrique S. Quintana-Ortí, Robert A. van de Geijn
<span title="">2008</span> <i title="ACM Press"> <a target="_blank" rel="noopener" href="https://fatcat.wiki/container/a3hx753rrfdorizx3a3ovuee4y" style="color: black;">Proceedings of the 14th ACM SIGPLAN symposium on Principles and practice of parallel programming - PPoPP &#39;09</a> </i> &nbsp;
hardware accelerators (GPUs, Cell B.E., etc.), each with its own local memory, resulting in a platform more reminiscent of a heterogeneous distributed-memory system.  ...  In this paper we provide further evidence that this approach solves the programmability problem for this domain by targeting a more complex architecture, composed of a multicore processor and multiple  ...  Additional support came from the J. Tinsley Oden Faculty Fellowship Research Program of the Institute for Computational Engineering and Sciences (ICES) at UT-Austin.  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1145/1504176.1504196">doi:10.1145/1504176.1504196</a> <a target="_blank" rel="external noopener" href="https://dblp.org/rec/conf/ppopp/Quintana-OrtiIQG09.html">dblp:conf/ppopp/Quintana-OrtiIQG09</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/hk2uw4hnkve7lfv5yf7k733rxi">fatcat:hk2uw4hnkve7lfv5yf7k733rxi</a> </span>

Solving dense linear systems on platforms with multiple hardware accelerators

Gregorio Quintana-Ortí, Francisco D. Igual, Enrique S. Quintana-Ortí, Robert A. van de Geijn
<span title="2009-02-14">2009</span> <i title="Association for Computing Machinery (ACM)"> <a target="_blank" rel="noopener" href="https://fatcat.wiki/container/xu5bk2lj5rbdxlx6222nw7tsxi" style="color: black;">SIGPLAN notices</a> </i> &nbsp;
hardware accelerators (GPUs, Cell B.E., etc.), each with its own local memory, resulting in a platform more reminiscent of a heterogeneous distributed-memory system.  ...  In this paper we provide further evidence that this approach solves the programmability problem for this domain by targeting a more complex architecture, composed of a multicore processor and multiple  ...  Additional support came from the J. Tinsley Oden Faculty Fellowship Research Program of the Institute for Computational Engineering and Sciences (ICES) at UT-Austin.  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1145/1594835.1504196">doi:10.1145/1594835.1504196</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/6xan7rjzkffetczdtrau3yg5w4">fatcat:6xan7rjzkffetczdtrau3yg5w4</a> </span>

CellSs: Scheduling Techniques to Better Exploit Memory Hierarchy

Pieter Bellens, Josep M. Perez, Felipe Cabarcas, Alex Ramirez, Rosa M. Badia, Jesus Labarta
<span title="">2009</span> <i title="Hindawi Limited"> <a target="_blank" rel="noopener" href="https://fatcat.wiki/container/fw4azkpu65d2thmrwfkoawyxse" style="color: black;">Scientific Programming</a> </i> &nbsp;
The CellSs scheduler takes an extension of the memory hierarchy for Cell/B.E. into account, with a cache memory shared between the SPEs.  ...  Cell Superscalar's (CellSs) main goal is to provide a simple, flexible and easy programming approach for the Cell Broadband Engine (Cell/B.E.) that automatically exploits the inherent concurrency of the  ...  High Level Scholarships for Latin America, scholarship No. E05D058240CO.  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1155/2009/561672">doi:10.1155/2009/561672</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/eswzt3dopba6pddrszmjjbhb4e">fatcat:eswzt3dopba6pddrszmjjbhb4e</a> </span>

OPELL and PM: A Case Study on Porting Shared Memory Programming Models to Accelerators Architectures [chapter]

Joseph B. Manzano, Ge Gan, Juergen Ributzka, Sunil Shrestha, Guang R. Gao
<span title="">2013</span> <i title="Springer Berlin Heidelberg"> <a target="_blank" rel="noopener" href="https://fatcat.wiki/container/2w3awgokqne6te4nvlofavy5a4" style="color: black;">Lecture Notes in Computer Science</a> </i> &nbsp;
Even though the Cell processor is very well known for its accomplishments, it is also well known for its low programmability.  ...  An example of this architectural design is the Cell processor which exhibits both a heavy core and a group of simple cores designed as a computational engine.  ...  The ALF and DaCS frameworks are designed to facilitate the creation of tasks and data communication respectively for the Cell B.E.  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1007/978-3-642-36036-7_8">doi:10.1007/978-3-642-36036-7_8</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/zfab4wlweveslmvfr4p4lxfo6u">fatcat:zfab4wlweveslmvfr4p4lxfo6u</a> </span>

An overview of selected hybrid and reconfigurable architectures

S. Stojanovic, D. Bojic, M. Bojovic, M. Valero, V. Milutinovic
<span title="">2012</span> <i title="IEEE"> 2012 IEEE International Conference on Industrial Technology </i> &nbsp;
Engine Architecture (CBEA), the ClearSpeed processor, the field programmable gate array (FPGA) accelerator solutions from Maxeler MaxNodes (MAX), the SGI systems (RASC), and the Convey Hybrid-Core Computer  ...  We present a review of the hardware, available software tools for each solution, a quantitative and a qualitative comparison of the architectures, and we give our view on the future of heterogeneous computing  ...  To compile code for the Cell/B.E., one can use either the IBM xlc or the GNU gcc (ppu-gcc and spu-gcc) compilers.  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1109/icit.2012.6209978">doi:10.1109/icit.2012.6209978</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/dfhopdozpzgkdfc5ewbrphdroy">fatcat:dfhopdozpzgkdfc5ewbrphdroy</a> </span>

Parallel Programming Models for Heterogeneous Many-Cores : A Survey [article]

Jianbin Fang, Chun Huang, Tao Tang, Zheng Wang
<span title="2020-05-05">2020</span> <i > arXiv </i> &nbsp; <span class="release-stage" >pre-print</span>
In this article, we provide a comprehensive survey for parallel programming models for heterogeneous many-core architectures and review the compiling techniques of improving programmability and portability  ...  While heterogeneous many-core design offers the potential for energy-efficient high-performance, such potential can only be unlocked if the application programs are suitably parallel and can be made to  ...  As we have mentioned in Section 2.1.2, programming the Cell/B.E. processor is challenging.  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener" href="https://arxiv.org/abs/2005.04094v1">arXiv:2005.04094v1</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/e2psrdnyajh3hih3znnjjbezae">fatcat:e2psrdnyajh3hih3znnjjbezae</a> </span>

Parallel programming models for heterogeneous many-cores: a comprehensive survey

Jianbin Fang, Chun Huang, Tao Tang, Zheng Wang
<span title="2020-07-31">2020</span> <i title="Springer Science and Business Media LLC"> <a target="_blank" rel="noopener" href="https://fatcat.wiki/container/oyzrfqv3i5ghbipholgpvqu37y" style="color: black;">CCF Transactions on High Performance Computing</a> </i> &nbsp;
In this article, we provide a comprehensive survey for parallel programming models for heterogeneous many-core architectures and review the compiling techniques of improving programmability and portability  ...  While heterogeneous many-core design offers the potential for energy-efficient high-performance, such potential can only be unlocked if the application programs are suitably parallel and can be made to  ...  As we have mentioned in Sect. 2.1.2, programming the Cell/B.E. processor is challenging.  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1007/s42514-020-00039-4">doi:10.1007/s42514-020-00039-4</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/nn56xhjm6rcu7kya6gfnyjg66q">fatcat:nn56xhjm6rcu7kya6gfnyjg66q</a> </span>

High-Performance Reverse Time Migration on GPU

J Cabezas, M Araya-Polo, I Gelado, N Navarro, E Morancho, J M Cela
<span title="">2009</span> <i title="IEEE"> <a target="_blank" rel="noopener" href="https://fatcat.wiki/container/23mvuta6mrfapnkuowvcwult5u" style="color: black;">2009 International Conference of the Chilean Computer Science Society</a> </i> &nbsp;
Due to GPU characteristics, the parallelism paradigm shifts from the classical threads plus SIMD to Single Program Multiple Data (SPMD).  ...  This seismic imaging (Geophysics) algorithm is widely used in the oil industry. GPUs are natural contenders in the aftermath of the clock race, in particular for High-performance Computing (HPC).  ...  ACKNOWLEDGMENT The authors thank the Barcelona Supercomputing Center for their permission to publish the material reported in this article.  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1109/sccc.2009.19">doi:10.1109/sccc.2009.19</a> <a target="_blank" rel="external noopener" href="https://dblp.org/rec/conf/sccc/CabezasAGNMC09.html">dblp:conf/sccc/CabezasAGNMC09</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/wkj22rbi2rdyhmc4mjirrfwolm">fatcat:wkj22rbi2rdyhmc4mjirrfwolm</a> </span>

Unleashing the high-performance and low-power of multi-core DSPs for general-purpose HPC

Francisco D. Igual, Murtaza Ali, Arnon Friedmann, Eric Stotzer, Timothy Wentz, Robert A. van de Geijn
<span title="">2012</span> <i title="IEEE"> <a target="_blank" rel="noopener" href="https://fatcat.wiki/container/zigbcra6rjdivda6lkzknwuo5q" style="color: black;">2012 International Conference for High Performance Computing, Networking, Storage and Analysis</a> </i> &nbsp;
The potential for HPC is clear: It promises 128 GFLOPS (single precision) for 10 Watts; It is used in millions of network related devices and hence benefits from economies of scale; It should be simpler  ...  Take a multicore Digital Signal Processor (DSP) chip designed for cellular base stations and radio network controllers, add floating-point capabilities to support 4G networks, and out of thin air a HPC  ...  Source: [13] (Intel multi-core) or specific purpose (Nvidia GPUs, Cell B.E. or FPGAs) in terms of GFLOPS/Watt.  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1109/sc.2012.109">doi:10.1109/sc.2012.109</a> <a target="_blank" rel="external noopener" href="https://dblp.org/rec/conf/sc/IgualAFSWG12.html">dblp:conf/sc/IgualAFSWG12</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/gnkl5kwf2bad7axiszdfzeu2hq">fatcat:gnkl5kwf2bad7axiszdfzeu2hq</a> </span>