Optimal broadcast on parallel locality models

Ben Juurlink, Petr Kolman, Friedhelm Meyer auf der Heide, Ingo Rieping
<span title="">2003</span> <i title="Elsevier BV"> <a target="_blank" rel="noopener" href="https://fatcat.wiki/container/usw2n4yaarcurchx7i4on2iqea" style="color: black;">Journal of Discrete Algorithms</a> </i> &nbsp;
In this paper matching upper and lower bounds for broadcast on general purpose parallel computation models that exploit network locality are proven.  ...  Both upper and lower bounds apply for other parallel locality models like Y-PRAM, D-BSP and E-BSP, too.  ...  For the following problems upper bounds have been already given on parallel locality models: broadcast and prefix operations (the optimal algorithms in the present paper), FFT graph, matrix multiplication  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1016/s1570-8667(03)00023-6">doi:10.1016/s1570-8667(03)00023-6</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/ruakgex4evgopdiy2tg4dy5kxi">fatcat:ruakgex4evgopdiy2tg4dy5kxi</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20190418180642/https://core.ac.uk/download/pdf/81956423.pdf" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext"> <button class="ui simple right pointing dropdown compact black labeled icon button serp-button"> <i class="icon ia-icon"></i> Web Archive [PDF] <div class="menu fulltext-thumbnail"> <img src="https://blobs.fatcat.wiki/thumbnail/pdf/ec/62/ec62948353a01dbd2aac835ccc65772bd15931bb.180px.jpg" alt="fulltext thumbnail" loading="lazy"> </div> </button> </a> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1016/s1570-8667(03)00023-6"> <button class="ui left aligned compact blue labeled icon button serp-button"> <i class="external alternate icon"></i> elsevier.com </button> </a>

Broadcast with mask on a massively parallel processing on a chip

Hana Krichene, Mouna Baklouti, Mohamed Abid, Philippe Marquet, Jean Luc Dekeyser
<span title="">2012</span> <i title="IEEE"> <a target="_blank" rel="noopener" href="https://fatcat.wiki/container/hfatt7tua5aatozu2id5rdeuti" style="color: black;">2012 International Conference on High Performance Computing &amp; Simulation (HPCS)</a> </i> &nbsp;
This paper describes the design of a communication model called broadcast with mask.  ...  The delay of instructions broadcast has a significant impact on the performance of Single Instruction Multiple Data (SIMD) architecture.  ...  Fig. 1. Broadcast with mask model in massively parallel processing on chip architecture. Fig. 2. Red-Black broadcast. Fig. 3. Influence of broadcast models on bandwidth. Table I. Mask &amp; broadcast  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1109/hpcsim.2012.6266924">doi:10.1109/hpcsim.2012.6266924</a> <a target="_blank" rel="external noopener" href="https://dblp.org/rec/conf/ieeehpcs/KricheneBAMD12.html">dblp:conf/ieeehpcs/KricheneBAMD12</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/pwzwzrwzcfa5rfei3i67eizdpm">fatcat:pwzwzrwzcfa5rfei3i67eizdpm</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20170923184725/https://hal.inria.fr/hal-00688418/document" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext"> <button class="ui simple right pointing dropdown compact black labeled icon button serp-button"> <i class="icon ia-icon"></i> Web Archive [PDF] <div class="menu fulltext-thumbnail"> <img src="https://blobs.fatcat.wiki/thumbnail/pdf/e9/6e/e96e2ca41b29c2694c0357986cf1b308aef48cd6.180px.jpg" alt="fulltext thumbnail" loading="lazy"> </div> </button> </a> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1109/hpcsim.2012.6266924"> <button class="ui left aligned compact blue labeled icon button serp-button"> <i class="external alternate icon"></i> ieee.com </button> </a>

Communication optimizations for parallel computing using data access information

Martin C. Rinard
<span title="">1995</span> <i title="ACM Press"> <a target="_blank" rel="noopener" href="https://fatcat.wiki/container/zigbcra6rjdivda6lkzknwuo5q" style="color: black;">Proceedings of the 1995 ACM/IEEE conference on Supercomputing (CDROM) - Supercomputing &#39;95</a> </i> &nbsp;
Broadcasting widely accessed data has a significant performance impact on one application; other optimizations such as concurrently fetching remote data and overlapping computation with communication have  ...  Given the large communication overheads characteristic of modern parallel machines, optimizations that eliminate, hide or parallelize communication may improve the performance of parallel computations.  ...  Locality optimizations had a significant impact on two of the four applications, and the adaptive broadcast optimization had an effect on one of the applications.  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1145/224170.224413">doi:10.1145/224170.224413</a> <a target="_blank" rel="external noopener" href="https://dblp.org/rec/conf/sc/Rinard95.html">dblp:conf/sc/Rinard95</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/j4aqqk5xjnecbd5o6ylycg4eze">fatcat:j4aqqk5xjnecbd5o6ylycg4eze</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20170813183326/http://people.csail.mit.edu/rinard/paper/sc95.pdf" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext"> <button class="ui simple right pointing dropdown compact black labeled icon button serp-button"> <i class="icon ia-icon"></i> Web Archive [PDF] <div class="menu fulltext-thumbnail"> <img src="https://blobs.fatcat.wiki/thumbnail/pdf/09/65/0965323d49ad271691364df6d0f41e4d73b524ca.180px.jpg" alt="fulltext thumbnail" loading="lazy"> </div> </button> </a> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1145/224170.224413"> <button class="ui left aligned compact blue labeled icon button serp-button"> <i class="external alternate icon"></i> acm.org </button> </a>

Linear Algebra Computation Benchmarks on a Model Grid Platform [chapter]

Loriano Storchi, Carlo Manuali, Osvaldo Gervasi, Giuseppe Vitillaro, Antonio Laganà, Francesco Tarantelli
<span title="">2003</span> <i title="Springer Berlin Heidelberg"> <a target="_blank" rel="noopener" href="https://fatcat.wiki/container/2w3awgokqne6te4nvlofavy5a4" style="color: black;">Lecture Notes in Computer Science</a> </i> &nbsp;
Local broadcast tree in an 8-node cluster.  ...  to be optimal on a high bandwidth network where each node is connected inde-  ...  Communication benchmarks and computational tests based on parallel linear algebra routines widely used in computational chemistry applications have been carried out on a model Grid infrastructure composed  ...  After the inter-cluster broadcast, three parallel local broadcasts, one on each cluster.  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1007/3-540-44862-4_32">doi:10.1007/3-540-44862-4_32</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/ji65qwzgpvchrabug3n26l4nbq">fatcat:ji65qwzgpvchrabug3n26l4nbq</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20170810092720/http://www.thch.unipg.it/~franc/Reprints/2003_lncs_2658_297-306.pdf" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext"> <button class="ui simple right pointing dropdown compact black labeled icon button serp-button"> <i class="icon ia-icon"></i> Web Archive [PDF] <div class="menu fulltext-thumbnail"> <img src="https://blobs.fatcat.wiki/thumbnail/pdf/7e/1a/7e1a405abfc529fc243f5811a8cb867f0bb2fbc5.180px.jpg" alt="fulltext thumbnail" loading="lazy"> </div> </button> </a> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1007/3-540-44862-4_32"> <button class="ui left aligned compact blue labeled icon button serp-button"> <i class="external alternate icon"></i> springer.com </button> </a>

Optimal broadcast and summation in the LogP model

Richard M. Karp, Abhijit Sahay, Eunice E. Santos, Klaus Erik Schauser
<span title="">1993</span> <i title="ACM Press"> <a target="_blank" rel="noopener" href="https://fatcat.wiki/container/tewj77cuufbzbgbk265bb462ga" style="color: black;">Proceedings of the fifth annual ACM symposium on Parallel algorithms and architectures - SPAA &#39;93</a> </i> &nbsp;
We also devise an (absolutely) optimal algorithm for summing a list of elements (using a non-commutative operation) using one of the optimal broadcast algorithms.  ...  We consider several natural broadcasting problems for the LogP model of distributed memory machines recently proposed by Culler et al.  ...  Some problems that we are currently engaged in studying are optimal broadcasting of multiple messages and optimal parallel prefix computation.  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1145/165231.165250">doi:10.1145/165231.165250</a> <a target="_blank" rel="external noopener" href="https://dblp.org/rec/conf/spaa/KarpSSS93.html">dblp:conf/spaa/KarpSSS93</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/rhlsdjzsmfe7hklkfrgv7s6x2q">fatcat:rhlsdjzsmfe7hklkfrgv7s6x2q</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20060918233517/http://www.eecs.berkeley.edu/Pubs/TechRpts/1992/CSD-92-721.pdf" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext"> <button class="ui simple right pointing dropdown compact black labeled icon button serp-button"> <i class="icon ia-icon"></i> Web Archive [PDF] <div class="menu fulltext-thumbnail"> <img src="https://blobs.fatcat.wiki/thumbnail/pdf/99/c8/99c813907945a8d142289e1d69342f0bba8dc559.180px.jpg" alt="fulltext thumbnail" loading="lazy"> </div> </button> </a> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1145/165231.165250"> <button class="ui left aligned compact blue labeled icon button serp-button"> <i class="external alternate icon"></i> acm.org </button> </a>

Subject Index

<span title="">2003</span> <i title="Elsevier BV"> <a target="_blank" rel="noopener" href="https://fatcat.wiki/container/usw2n4yaarcurchx7i4on2iqea" style="color: black;">Journal of Discrete Algorithms</a> </i> &nbsp;
Regular expression searching on compressed text, 423 Broadcast Optimal broadcast on parallel locality models, 151 BSP model Optimal broadcast on parallel locality models, 151 Byzantine failures  ...  independent permutations, 11 Parallel computing Optimal broadcast on parallel locality models, 151 Parameterized complexity An efficient fixed-parameter algorithm for 3-Hitting Set, 89 Planar  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1016/s1570-8667(03)00075-3">doi:10.1016/s1570-8667(03)00075-3</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/icg7if3uingibmjwy2mjud4rwe">fatcat:icg7if3uingibmjwy2mjud4rwe</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20190311153435/https://core.ac.uk/download/pdf/82483795.pdf" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext"> <button class="ui simple right pointing dropdown compact black labeled icon button serp-button"> <i class="icon ia-icon"></i> Web Archive [PDF] <div class="menu fulltext-thumbnail"> <img src="https://blobs.fatcat.wiki/thumbnail/pdf/3b/7f/3b7fe42ca1330b396c00c8956e750c1a8971d791.180px.jpg" alt="fulltext thumbnail" loading="lazy"> </div> </button> </a> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1016/s1570-8667(03)00075-3"> <button class="ui left aligned compact blue labeled icon button serp-button"> <i class="external alternate icon"></i> elsevier.com </button> </a>

A Multilevel Approach to Topology-Aware Collective Operations in Computational Grids [article]

N. T. Karonis, B. de Supinski, I. Foster, W. Gropp, E. Lusk
<span title="2002-06-24">2002</span> <i > arXiv </i> &nbsp; <span class="release-stage" >pre-print</span>
Initial efforts produced "optimal" trees based on network communication models that assumed equal point-to-point latencies between any two processes.  ...  In response, more recent work has focused on creating topology-aware trees for collective operations that minimize communication across slower channels (e.g., a wide-area network).  ...  Under models that expand the telephone model to account for message latency, such as the postal [1] or LogP [4] models, the communication topology of an optimal broadcast algorithm becomes a generalized  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener" href="https://arxiv.org/abs/cs/0206038v1">arXiv:cs/0206038v1</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/riqgiqeebvby3mibbixegfbwee">fatcat:riqgiqeebvby3mibbixegfbwee</a> </span>
<a target="_blank" rel="noopener" href="https://archive.org/download/arxiv-cs0206038/cs0206038.pdf" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext"> <button class="ui simple right pointing dropdown compact black labeled icon button serp-button"> <i class="icon ia-icon"></i> File Archive [PDF] <div class="menu fulltext-thumbnail"> <img src="https://blobs.fatcat.wiki/thumbnail/pdf/81/46/8146a28245f583f64e1f7b72810dec4e88d8e41c.180px.jpg" alt="fulltext thumbnail" loading="lazy"> </div> </button> </a> <a target="_blank" rel="external noopener" href="https://arxiv.org/abs/cs/0206038v1" title="arxiv.org access"> <button class="ui compact blue labeled icon button serp-button"> <i class="file alternate outline icon"></i> arxiv.org </button> </a>

Parallel application experience with replicated method invocation

Jason Maassen, Thilo Kielmann, Henri E. Bal
<span title="">2001</span> <i title="Wiley"> <a target="_blank" rel="noopener" href="https://fatcat.wiki/container/o454xj4tdjfllm3oragfkdrffi" style="color: black;">Concurrency and Computation</a> </i> &nbsp;
Our programming model allows the programmer to define groups of objects that can be replicated and updated as a whole, using reliable, totally ordered broadcast to send update methods to all machines containing  ...  shared objects without taking locality into account.  ...  We thank Grégory Mounié for his comments on the paper. We thank Kees Verstoep and John Romein for keeping the DAS in good shape.  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1002/cpe.581">doi:10.1002/cpe.581</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/hzv7yrvv5rgxpkw4rm36a5wlbm">fatcat:hzv7yrvv5rgxpkw4rm36a5wlbm</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20061012013200/http://www.cs.vu.nl:80/~rob/papers/cpe01-repmi.pdf" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext"> <button class="ui simple right pointing dropdown compact black labeled icon button serp-button"> <i class="icon ia-icon"></i> Web Archive [PDF] <div class="menu fulltext-thumbnail"> <img src="https://blobs.fatcat.wiki/thumbnail/pdf/52/de/52def3a6a2bd317ad96f058d7696c21ca61d9c93.180px.jpg" alt="fulltext thumbnail" loading="lazy"> </div> </button> </a> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1002/cpe.581"> <button class="ui left aligned compact blue labeled icon button serp-button"> <i class="external alternate icon"></i> wiley.com </button> </a>

Exponential Moving Average Model in Parallel Speech Recognition Training [article]

Xu Tian, Jun Zhang, Zejun Ma, Yi He, Juan Wei
<span title="2017-03-03">2017</span> <i > arXiv </i> &nbsp; <span class="release-stage" >pre-print</span>
It is a non-interference strategy that the exponential moving average model is not broadcasted to distributed workers to update their local models after model synchronization in the training process, and  ...  moving average method in large-scale parallel training of neural network model.  ...  Each GPU optimizes local model in parallel with one split of training dataset.  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener" href="https://arxiv.org/abs/1703.01024v1">arXiv:1703.01024v1</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/pasewukyxrgrhbi4qbtgofurfi">fatcat:pasewukyxrgrhbi4qbtgofurfi</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20200930144829/https://arxiv.org/pdf/1703.01024v1.pdf" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext"> <button class="ui simple right pointing dropdown compact black labeled icon button serp-button"> <i class="icon ia-icon"></i> Web Archive [PDF] <div class="menu fulltext-thumbnail"> <img src="https://blobs.fatcat.wiki/thumbnail/pdf/62/d5/62d588d2200ab2cbee153a8998a7fa98c30cf7bb.180px.jpg" alt="fulltext thumbnail" loading="lazy"> </div> </button> </a> <a target="_blank" rel="external noopener" href="https://arxiv.org/abs/1703.01024v1" title="arxiv.org access"> <button class="ui compact blue labeled icon button serp-button"> <i class="file alternate outline icon"></i> arxiv.org </button> </a>

Optimizing the DFCN Broadcast Protocol with a Parallel Cooperative Strategy of Multi-Objective Evolutionary Algorithms [chapter]

Carlos Segura, Alejandro Cervantes, Antonio J. Nebro, María Dolores Jaraíz-Simón, Eduardo Segredo, Sandra García, Francisco Luna, Juan Antonio Gómez-Pulido, Gara Miranda, Cristóbal Luque, Enrique Alba, Miguel Ángel Vega-Rodríguez (+2 others)
<span title="">2009</span> <i title="Springer Berlin Heidelberg"> <a target="_blank" rel="noopener" href="https://fatcat.wiki/container/2w3awgokqne6te4nvlofavy5a4" style="color: black;">Lecture Notes in Computer Science</a> </i> &nbsp;
The optimization lies in searching for the best configurations of the DFCN broadcast protocol for a given MANET scenario.  ...  This work presents the application of a parallel cooperative optimization approach to the broadcast operation in mobile ad-hoc networks (MANETs).  ...  The model is based on the hybridization of parallel island-based evolutionary algorithms and hyperheuristics.  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1007/978-3-642-01020-0_26">doi:10.1007/978-3-642-01020-0_26</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/ijmtk4ytanbvparxad2ez3qhxq">fatcat:ijmtk4ytanbvparxad2ez3qhxq</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20171117172758/https://core.ac.uk/download/pdf/29403737.pdf" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext"> <button class="ui simple right pointing dropdown compact black labeled icon button serp-button"> <i class="icon ia-icon"></i> Web Archive [PDF] <div class="menu fulltext-thumbnail"> <img src="https://blobs.fatcat.wiki/thumbnail/pdf/a5/04/a504f476bb2146ac5390824bb5b92d31c56fc6b0.180px.jpg" alt="fulltext thumbnail" loading="lazy"> </div> </button> </a> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1007/978-3-642-01020-0_26"> <button class="ui left aligned compact blue labeled icon button serp-button"> <i class="external alternate icon"></i> springer.com </button> </a>

Massively parallel data analysis with PACTs on Nephele

Alexander Alexandrov, Max Heimel, Volker Markl, Dominic Battré, Fabian Hueske, Erik Nijkamp, Stephan Ewen, Odej Kao, Daniel Warneke
<span title="2010-09-01">2010</span> <i title="VLDB Endowment"> <a target="_blank" rel="noopener" href="https://fatcat.wiki/container/p6rqwwpkkjbcldejepcehaalby" style="color: black;">Proceedings of the VLDB Endowment</a> </i> &nbsp;
The PACT Programming Model The PACT programming model is a generalization of map/reduce [4] . It is based on a key/value data model and the concept of Parallelization Contracts (PACTs).  ...  Optimizing a PACT program Similar as in query optimization for relational DBMS, the optimal execution strategy for a PACT program cannot be found by combining the locally optimal choices for all PACTs.  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.14778/1920841.1921056">doi:10.14778/1920841.1921056</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/qkb3dlwwmrdbnggys7pvykbk4y">fatcat:qkb3dlwwmrdbnggys7pvykbk4y</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20151031191313/http://www.vldb.org/pvldb/vldb2010/pvldb_vol3/D28.pdf" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext"> <button class="ui simple right pointing dropdown compact black labeled icon button serp-button"> <i class="icon ia-icon"></i> Web Archive [PDF] <div class="menu fulltext-thumbnail"> <img src="https://blobs.fatcat.wiki/thumbnail/pdf/68/32/6832204b5c72ca807e3aea8fa86f5bff1fe8795a.180px.jpg" alt="fulltext thumbnail" loading="lazy"> </div> </button> </a> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.14778/1920841.1921056"> <button class="ui left aligned compact blue labeled icon button serp-button"> <i class="external alternate icon"></i> Publisher / doi.org </button> </a>

Performance Modeling and Optimization of a High Energy Colliding Beam Simulation Code

Hongzhang Shan, Erich Strohmaier, Ji Qiang, David Bailey, Kathy Yelick
<span title="">2006</span> <i title="IEEE"> ACM/IEEE SC 2006 Conference (SC&#39;06) </i> &nbsp;
On torus-based systems errors of 29% are higher but optimized performance can again be predicted within 8% in some cases.  ...  BeamBeam3D was the first parallel code that can be used to study this interaction fully self-consistently on high-performance computing platforms.  ...  Performance Optimization Strategies The dominant communication phases of BeamBeam3D are the parallel grid reduction and the parallel grid broadcast.  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1109/sc.2006.48">doi:10.1109/sc.2006.48</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/wo2nidbxrbgx3laklmjozkxeii">fatcat:wo2nidbxrbgx3laklmjozkxeii</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20170829194405/http://crd-legacy.lbl.gov/~dhbailey/dhbpapers/BeamBeam3D.pdf" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext"> <button class="ui simple right pointing dropdown compact black labeled icon button serp-button"> <i class="icon ia-icon"></i> Web Archive [PDF] <div class="menu fulltext-thumbnail"> <img src="https://blobs.fatcat.wiki/thumbnail/pdf/db/0c/db0cfda0c4733a4360d05bdef845bfa5f16bd8b4.180px.jpg" alt="fulltext thumbnail" loading="lazy"> </div> </button> </a> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1109/sc.2006.48"> <button class="ui left aligned compact blue labeled icon button serp-button"> <i class="external alternate icon"></i> ieee.com </button> </a>

Performance analysis and optimization of MPI collective operations on multi-core clusters

Bibo Tu, Jianping Fan, Jianfeng Zhan, Xiaofang Zhao
<span title="2009-04-22">2009</span> <i title="Springer Nature"> <a target="_blank" rel="noopener" href="https://fatcat.wiki/container/qhbautqnvzgwvm3vvdvieylhwq" style="color: black;">Journal of Supercomputing</a> </i> &nbsp;
Many parallel computation models are used to predict performance of collective operation on given parallel platform, and they are useful for the performance analysis of optimal collective operations.  ...  This paper proposes new parallel computation model to unitedly abstract memory hierarchy on multi-core clusters in vertical and horizontal levels.  ...  Parallel computation models Previous work on parallel computation models may be classified as hardware-parameterized models and software-parameterized models.  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1007/s11227-009-0296-3">doi:10.1007/s11227-009-0296-3</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/6oxs7kwoenbjpnw6nynd2bc7x4">fatcat:6oxs7kwoenbjpnw6nynd2bc7x4</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20150926002233/http://prof.ict.ac.cn/jfzhan/papers/Tu_jsc_09.pdf" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext"> <button class="ui simple right pointing dropdown compact black labeled icon button serp-button"> <i class="icon ia-icon"></i> Web Archive [PDF] <div class="menu fulltext-thumbnail"> <img src="https://blobs.fatcat.wiki/thumbnail/pdf/c6/ea/c6ea00eb2208f9fdc177ecc61332365c19ce54ad.180px.jpg" alt="fulltext thumbnail" loading="lazy"> </div> </button> </a> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1007/s11227-009-0296-3"> <button class="ui left aligned compact blue labeled icon button serp-button"> <i class="external alternate icon"></i> springer.com </button> </a>

Optimizing UPC Programs for Multi-Core Systems

Yili Zheng
<span title="">2010</span> <i title="Hindawi Limited"> <a target="_blank" rel="noopener" href="https://fatcat.wiki/container/fw4azkpu65d2thmrwfkoawyxse" style="color: black;">Scientific Programming</a> </i> &nbsp;
The Partitioned Global Address Space (PGAS) model of Unified Parallel C (UPC) can help users express and manage application data locality on non-uniform memory access (NUMA) multi-core shared-memory systems  ...  Second, we use two numerical computing kernels, parallel matrix–matrix multiplication and parallel 3-D FFT, to demonstrate the end-to-end development and optimization for UPC applications.  ...  . • Use optimized BLAS library for local dgemm. • Overlap non-blocking one-sided communication with computation. • Use team collective communication for row broadcast and column broadcast.  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1155/2010/646829">doi:10.1155/2010/646829</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/q63ngpj47jblhfzbfcdehsmuyi">fatcat:q63ngpj47jblhfzbfcdehsmuyi</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20200214052143/http://downloads.hindawi.com/journals/sp/2010/646829.pdf" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext"> <button class="ui simple right pointing dropdown compact black labeled icon button serp-button"> <i class="icon ia-icon"></i> Web Archive [PDF] <div class="menu fulltext-thumbnail"> <img src="https://blobs.fatcat.wiki/thumbnail/pdf/3b/27/3b2797736aed9eafb9f92fedc0cc6a2b87e86c0e.180px.jpg" alt="fulltext thumbnail" loading="lazy"> </div> </button> </a> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1155/2010/646829"> <button class="ui left aligned compact blue labeled icon button serp-button"> <i class="unlock alternate icon" style="background-color: #fb971f;"></i> hindawi.com </button> </a>