Filters








597 Hits in 5.5 sec

Performance evaluation of R with Intel Xeon Phi coprocessor

Yaakoub El-Khamra, Niall Gaffney, David Walling, Eric Wernert, Weijia Xu, Hui Zhang
2013 2013 IEEE International Conference on Big Data  
In this paper, we evaluated approaches to speed up R computations with the utilization of the Intel Math Kernel Library and automatic offloading to Intel Xeon Phi SE10P Co-processor.  ...  Offloading to Phi co-processor further improves the performance.  ...  In the case of the Intel Xeon Phi SE10P Co-processor used in this paper, each coprocessor chip has a peak performance of roughly 1070 GFLOPS, approximately six times the peak performance of a single Xeon  ... 
doi:10.1109/bigdata.2013.6691695 dblp:conf/bigdataconf/KhamraGWWXZ13 fatcat:vansjz4x5bddfnkae46xh4n7py

Bit-Parallel Approximate Pattern Matching on the Xeon Phi Coprocessor

Tuan Tu Tran, Simon Schindel, Yongchao Liu, Bertil Schmidt
2014 2014 IEEE 26th International Symposium on Computer Architecture and High Performance Computing  
-Data-Parallelism on the many-core coprocessorPerformance Evaluation • Conclusions and Perspectives Xeon Phi Architecture• A coprocessor running Linux • Connected to a Host CPU via PCIe • Run in either  ...  Phi, intrinsic data and functions • wmAutoVec: Xeon Phi, array of 16 uints, automatical vectorization • wmHost: multicore CPU, array of 16 uints • Introduction • • Performance Evaluation • Conclusions  ... 
doi:10.1109/sbac-pad.2014.37 dblp:conf/sbac-pad/TranSLS14 fatcat:cazay7utjffuhoq4vgl6nw4tmy

Accelerating DNA Sequence Analysis Using Intel(R) Xeon Phi(TM)

Suejb Memeti, Sabri Pllana
2015 2015 IEEE Trustcom/BigDataSE/ISPA  
While considerable research has addressed DNA analysis using GPUs, so far not much attention has been paid to the Intel Xeon Phi coprocessor.  ...  The experimental results on Intel Xeon Phi show speed-ups of up to 10× compared to a sequential implementation running on an Intel Xeon processor E5.  ...  We used the Intel Vtune Amplifier 2015 for performance data collection. To evaluate our algorithm the experiments were performed on an Intel Xeon Phi 7120P coprocessor.  ... 
doi:10.1109/trustcom.2015.636 dblp:conf/trustcom/MemetiP15 fatcat:g6j3djqts5f2ret6x7y3xgikea

Accelerating DNA Sequence Analysis using Intel Xeon Phi [article]

Suejb Memeti, Sabri Pllana
2015 arXiv   pre-print
While considerable research has addressed DNA analysis using GPUs, so far not much attention has been paid to the Intel Xeon Phi coprocessor.  ...  The experimental results on Intel Xeon Phi show speed-ups of up to 10x compared to a sequential implementation running on an Intel Xeon processor E5.  ...  We used the Intel Vtune Amplifier 2015 for performance data collection. To evaluate our algorithm the experiments were performed on an Intel Xeon Phi 7120P coprocessor.  ... 
arXiv:1506.08612v1 fatcat:nty74rv63bhn7pkilptqj6sava

The Potential of the Intel (R) Xeon Phi for Supervised Deep Learning

Andre Viebke, Sabri Pllana
2015 2015 IEEE 17th International Conference on High Performance Computing and Communications, 2015 IEEE 7th International Symposium on Cyberspace Safety and Security, and 2015 IEEE 12th International Conference on Embedded Software and Systems  
Our approach is evaluated on the Intel Xeon Phi 7120P using the MNIST dataset of handwritten digits for various thread counts and CNN architectures.  ...  While numerous research groups have addressed the training of CNNs using GPUs, so far not much attention has been paid to the Intel Xeon Phi coprocessor.  ...  Evaluation performed on an Intel Xeon Phi 5110P resulted with a speed up of 7 to 10 times compared to an Intel Xeon E5620.  ... 
doi:10.1109/hpcc-css-icess.2015.45 dblp:conf/hpcc/ViebkeP15 fatcat:mdcts776ofhh3jytkpxgxx3mfq

Porting Feastflow To The Intel Xeon Phi: Lessons Learned

Georgios Goumas
2014 Zenodo  
In this paper we report our experiences in porting the FEASTFLOW software infrastructure to the Intel Xeon Phi coprocessor.  ...  Our experimental results on these building blocks indicate the Xeon Phi can serve as a promising accelerator for our software infrastructure.  ...  In our second set of experiments we evaluated the impact of parallelization and vectorization on the Intel Xeon Phi coprocessor and compare the results with the best performing version on Sandy Bridge.  ... 
doi:10.5281/zenodo.822670 fatcat:qmcxfe6z2fhsnltprheuewrwwa

Programming Xeon Phi [chapter]

Rezaur Rahman
2013 Intel® Xeon Phi™ Coprocessor Architecture and Tools  
The objective of this book is to introduce you to Intel Xeon Phi architecture in as much as it affects software performance through programming.  ...  Intel Xeon Phi Execution Models Intel Xeon Phi cores are Pentium cores and work as coprocessors to the host processor.  ... 
doi:10.1007/978-1-4302-5927-5_2 fatcat:c33wyuvworacnnj2hkpy7s66ae

The Potential of the Intel Xeon Phi for Supervised Deep Learning [article]

Andre Viebke, Sabri Pllana
2015 arXiv   pre-print
Our approach is evaluated on the Intel Xeon Phi 7120P using the MNIST dataset of handwritten digits for various thread counts and CNN architectures.  ...  While numerous research groups have addressed the training of CNNs using GPUs, so far not much attention has been paid to the Intel Xeon Phi coprocessor.  ...  Evaluation performed on an Intel Xeon Phi 5110P resulted with a speed up of 7 to 10 times compared to an Intel Xeon E5620.  ... 
arXiv:1506.09067v1 fatcat:txivrpxupjh6lb2x3fiannfaja

Using the Xeon Phi Platform to Run Speculatively-Parallelized Codes

Alvaro Estebanez, Diego R. Llanos, Arturo Gonzalez-Escribano
2016 International journal of parallel programming  
In this article we evaluate the performance delivered by an Intel Xeon Phi coprocessor when using a software, state-of-the-art thread-level speculative parallelization library in the execution of well-known  ...  Intel Xeon Phi accelerators are one of the newest devices used in the field of parallel computing.  ...  One of the most recent approaches is the Intel R Xeon Phi TM [3, 11, 20] , a coprocessor with more than 60 cores able to execute both offloaded and native codes.  ... 
doi:10.1007/s10766-016-0421-x fatcat:ja6kzfpk3fcnzmdd2cz3caws2m

Optimizing legacy molecular dynamics software with directive-based offload

W. Michael Brown, Jan-Michael Y. Carrillo, Nitin Gavhane, Foram M. Thakkar, Steven J. Plimpton
2015 Computer Physics Communications  
We provide results for LAMMPS benchmarks and for production molecular dynamics simulations using the Stampede hybrid supercomputer with both Intel R Xeon Phi TM coprocessors and Nvidia GPUs.  ...  The optimizations are available as part of the "Intel package" supplied with LAMMPS. (W. Michael Brown), carrillojy@ornl.gov (Jan-Michael Y.  ...  , Intel R Xeon Phi TM performance was better.  ... 
doi:10.1016/j.cpc.2015.05.004 fatcat:xbyof4cbcngi7cv5damtslie2a

Parallel Mutual Information Based Construction of Whole-Genome Networks on the Intel (R) Xeon Phi (TM) Coprocessor

Sanchit Misra, Kiran Pamnany, Srinivas Aluru
2014 2014 IEEE 28th International Parallel and Distributed Processing Symposium  
In this paper, we present a solution on the Intel R Xeon Phi TM coprocessor, taking advantage of its multi-level parallelism including many x86-based cores, multiple threads per core, and vector processing  ...  We also present a solution on the Intel R Xeon R processor.  ...  ACKNOWLEDGMENTS The work of Srinivas Aluru is supported in part by the U.S. National Science Foundation under IOS-1257631 and a Swarnajayanti Fellowship from the Government of India.  ... 
doi:10.1109/ipdps.2014.35 dblp:conf/ipps/MisraPA14 fatcat:zbkz7kentrbarpefrzjf7ruuba

First evaluation of the CPU, GPGPU and MIC architectures for real time particle tracking based on Hough transform at the LHC

V Halyo V Halyo, P LeGresley, P Lujan, V Karpusenko, A Vladimirov
2014 Journal of Instrumentation  
In this article, a new tracking algorithm based on the Hough transform will be evaluated for the first time on a multi-core Intel Xeon E5-2697v2 CPU, an NVIDIA Tesla K20c GPU, and an Intel 7120 coprocessor  ...  At the same time, it is crucial to explore the performance limits achievable on the latest generation multicore CPUs with the use of the best software optimization methods.  ...  Because of the similarities of Intel i7 and Intel Xeon CPUs with Xeon Phi coprocessors, we developed a single code for all three platforms.  ... 
doi:10.1088/1748-0221/9/04/p04005 fatcat:hu7sgx4efnblfcchvlbo7irqxa

Efficient Hybrid Execution of C++ Applications using Intel(R) Xeon Phi(TM) Coprocessor [article]

Jiri Dokulil, Enes Bajrovic, Siegfried Benkner, Sabri Pllana, Martin Sandrieser, Beverly Bachmayer
2012 arXiv   pre-print
The introduction of Intel(R) Xeon Phi(TM) coprocessors opened up new possibilities in development of highly parallel applications.  ...  on Xeon Phi.  ...  INTRODUCTION The Intel R Xeon Phi TM coprocessor is a new contender in the HPC market.  ... 
arXiv:1211.5530v1 fatcat:f3vyuiywozbonjsnfg2i3ebexq

Training Large Scale Deep Neural Networks on the Intel Xeon Phi Many-Core Coprocessor

Lei Jin, Zhaokang Wang, Rong Gu, Chunfeng Yuan, Yihua Huang
2014 2014 IEEE International Parallel & Distributed Processing Symposium Workshops  
Also, we ran the fully-optimized code on both the Intel Xeon Phi coprocessor and an expensive Intel Xeon CPU.  ...  Our method on the Intel Xeon Phi coprocessor is 7 to 10 times faster than the Intel Xeon CPU for this application.  ...  ACKNOWLEDGMENT This work is funded in part by China NSF Grants (No. 61223003), and the USA Intel Labs University Resea rch Program.  ... 
doi:10.1109/ipdpsw.2014.194 dblp:conf/ipps/JinWGYH14 fatcat:tpmcupt4indklfoj3dtwm65rze

MrPhi: An Optimized MapReduce Framework on Intel Xeon Phi Coprocessors

Mian Lu, Yun Liang, Huynh Phung Huynh, Zhongliang Ong, Bingsheng He, Rick Siow Mong Goh
2015 IEEE Transactions on Parallel and Distributed Systems  
Additionally, the performance of four applications is able to achieve linear scalability on a platform equipped with up to four Xeon Phi coprocessors.  ...  We first focus on employing advanced features of the Xeon Phi to achieve high performance on a single coprocessor.  ...  This work was partially supported by the National Natural Science Foundation of China (No. 61300005). Bingsheng He is partly supported by a MoE AcRF Tier 2 grant (MOE2012-T2-2-067) in Singapore.  ... 
doi:10.1109/tpds.2014.2365784 fatcat:o2tktcwqxrdk3e4qkjg7zgs7vu
« Previous Showing results 1 — 15 out of 597 results