Filters








444 Hits in 6.2 sec

Transparent Runtime Migration of Loop-Based Traces of Processor Instructions to Reconfigurable Processing Units

João Bispo, Nuno Paulino, João M. P. Cardoso, João Canas Ferreira
2013 International Journal of Reconfigurable Computing  
The ability to map instructions running in a microprocessor to a reconfigurable processing unit (RPU), acting as a coprocessor, enables the runtime acceleration of applications and ensures code and possibly  ...  In this work, we focus on the mapping of loop-based instruction traces (called Megablocks) to RPUs.  ...  mapped to DIM Detect repeating patterns of instructions in the execution trace and migrate those loops to an RPU Coupling Loose RPU/GPP coupling, shared instruction and data memory Tight RPU coupling  ... 
doi:10.1155/2013/340316 fatcat:nyypx6imyvcinksmma7rvv7ksa

Selected Papers from the 2011 International Conference on Reconfigurable Computing and FPGAs (ReConFig 2011)

René Cumplido, Peter Athanas, Jürgen Becker
2013 International Journal of Reconfigurable Computing  
We would like to thank all the reviewers for their valuable time and effort in the review process, and to provide constructive feedbacks to authors.  ...  We hope that you will find in this special issue a valuable source of information to your future research.  ...  Finally, one paper is within the area of reconfiguration techniques. In "Transparent runtime migration of loop-based traces of processor instructions to reconfigurable processing units," J.  ... 
doi:10.1155/2013/597323 fatcat:l3v7jtnc5fcsthqi7zluutdnqa

Architecture for Transparent Binary Acceleration of Loops with Memory Accesses [chapter]

Nuno Paulino, João Canas Ferreira, João M. P. Cardoso
2013 Lecture Notes in Computer Science  
This paper presents an extension to a hardware/software system architecture in which repetitive instruction traces, called Megablocks, are accelerated by a Reconfigurable Processing Unit (RPU).  ...  Switching between hardware and software execution is done transparently, without modifications to source code or executable binaries.  ...  Conclusion This paper presented a general-purpose computing architecture based on a General Purpose Processor (GPP) and a Reconfigurable Processing Unit (RPU) automatically generated offline from instruction  ... 
doi:10.1007/978-3-642-36812-7_12 fatcat:p4kifbrcmngerorcyu3jotyx7q

Low-Complexity Online Synthesis for AMIDAR Processors

Stefan Döbrich, Christian Hochberger
2010 International Journal of Reconfigurable Computing  
Thus, we believe that online synthesis that takes place during the execution of an application is one way to broaden the applicability of reconfigurable architectures.  ...  Reconfigurable logic has often been promoted as a solution to these problems. Today, it can be found in two varieties: field programmable gate arrays or coarse-grained reconfigurable arrays.  ...  Thus, the process of creating new functional units is transparent to the processor. Hence, a runtime prediction is not possible yet.  ... 
doi:10.1155/2010/953693 fatcat:peb7ybgujjcjjegwb7o2ualsau

Supporting runtime reconfigurable VLIWs cores through dynamic binary translation

Simon Rokicki, Erven Rohou, Steven Derrien
2018 2018 Design, Automation & Test in Europe Conference & Exhibition (DATE)  
To preserve the single ISA programming model, we resort to Dynamic Binary Translation, and use this technique to enable dynamic code specialization for Runtime Reconfigurable VLIWs cores.  ...  Such architectures combine Out of Order cores with smaller in-order ones to offer different power/energy profiles.  ...  inter-block transformations are performed to merge blocks, build traces 3 and unroll loops.  ... 
doi:10.23919/date.2018.8342160 dblp:conf/date/RokickiRD18 fatcat:g5hf657vx5b3xb736kmwnf4fi4

DeSyRe: On-demand system reliability

I. Sourdis, C. Strydis, A. Armato, C.S. Bouganis, B. Falsafi, G.N. Gaydadjiev, S. Isaza, A. Malek, R. Mariani, D. Pnevmatikatos, D.K. Pradhan, G. Rauwerda (+6 others)
2013 Microprocessors and microsystems  
In our attempt to reduce the overheads of fault-tolerance, only a small fraction of the chip is built to be fault-free.  ...  This fault-free part is then employed to manage the remaining fault-prone resources of the SoC.  ...  To one extent, the Middleware makes all its actions transparent to the Runtime system in order to provide well-functioning hardware; that is in order to ensure the reconfiguration process is performed  ... 
doi:10.1016/j.micpro.2013.08.008 fatcat:jg623r4hmngthk652u7qi6ibme

Trace-Based Reconfigurable Acceleration with Data Cache and External Memory Support

Nuno Miguel Cardanha Paulino, Joao Canas Ferreira, Joao Manuel Paiva Cardoso
2014 2014 IEEE International Symposium on Parallel and Distributed Processing with Applications  
This paper presents a binary acceleration approach based on extending a General Purpose Processor (GPP) with a Reconfigurable Processing Unit (RPU), both sharing an external data memory.  ...  In this approach repeating sequences of GPP instructions are migrated to the RPU. The RPU resources are selected and organized off-line using execution trace information.  ...  Our previous work presented a binary acceleration approach in which the execution of frequently executed loops is transparently migrated at run-time to a Reconfigurable Processing Unit (RPU), a tailored  ... 
doi:10.1109/ispa.2014.29 dblp:conf/ispa/PaulinoFC14 fatcat:u3mcgsqss5cf5lzno7pzbustga

A Reconfigurable Architecture for Binary Acceleration of Loops with Memory Accesses

Nuno Paulino, João Canas Ferreira, João M. P. Cardoso
2014 ACM Transactions on Reconfigurable Technology and Systems  
A Reconfigurable Processing Unit (RPU) is used as a coprocessor of the General Purpose Processor (GPP) to accelerate the execution of repetitive instruction sequences called Megablocks.  ...  A toolchain detects Megablocks from instruction traces and generates customized RPU implementations.  ...  The WARP processor is a MicroBlaze-based system that is able to migrate the innermost loops to an FPGA-based reconfigurable logic fabric connected to the MicroBlaze [Lysecky and Vahid 2009].  ... 
doi:10.1145/2629468 fatcat:eogxo4p4yrb4tdehmgfxa7yvra

Reconfigurable Acceleration with Binary Compatibility for General Purpose Processors [chapter]

Antonio Carlos Schneider Beck, Luigi Carro
2009 IFIP International Federation for Information Processing  
Based on all these facts, this work proposes a new Binary Translation algorithm, implemented in hardware and working in parallel to the processor, responsible for transforming sequences of instructions  ...  Therefore, we can take advantage of using pure combinational logic to optimize even control-flow oriented code in a totally transparent process, without any modification in the source code or binary.  ...  of instructions at run time to be executed on a reconfigurable array, in a totally transparent process: there is no necessity of changing the code before its execution at all.  ... 
doi:10.1007/978-0-387-89558-1_15 fatcat:njmcai5ndfgwvkkn6lgflczp3y

Analyzing Behavior Specialized Acceleration

Tony Nowatzki, Karthikeyan Sankaralingam
2016 ACM SIGOPS Operating Systems Review  
For example, a 2-wide OOO processor with three BSAs matches the performance of a conventional 6-wide OOO core, has 40% lower area, and is 2.6× more energy efficient.  ...  To study the potential of BSAs, we propose a novel modeling technique called the Transformable Dependence Graph (TDG) -a higher level alternative to the time-consuming traditional compiler+simulator approach  ...  At run-time, depending on the affinity of the code, the execution may migrate to one of the BSAs, a process which is transparent to the programmer.  ... 
doi:10.1145/2954680.2872412 fatcat:66uy7l3ggbh6ze2mp33wtgmbtm

Message from the Organizers

2012 2012 Third Workshop on Applications for Multi-Core Architecture  
We suggest that schedulers designed with an understanding of the requirements of all process classes and their mixes, as well the abilities of the underlying architecture, might be the solution to this  ...  Specifically, we are interested in parallel process scheduling, which has been a topic of significant study in the supercomputing community, but so far little of this research has trickled down to the  ...  ACKNOWLEDGEMENTS We would like to thank the Xilinx University Program for its valuable help on partial reconfiguration and its hardware and software donations.  ... 
doi:10.1109/wamca.2012.4 fatcat:weftcii3gnfdrcgqqhop72muqq

Message from Organizers

2015 2015 Third International Symposium on Computing and Networking (CANDAR)  
We suggest that schedulers designed with an understanding of the requirements of all process classes and their mixes, as well the abilities of the underlying architecture, might be the solution to this  ...  Specifically, we are interested in parallel process scheduling, which has been a topic of significant study in the supercomputing community, but so far little of this research has trickled down to the  ...  ACKNOWLEDGEMENTS We would like to thank the Xilinx University Program for its valuable help on partial reconfiguration and its hardware and software donations.  ... 
doi:10.1109/candar.2015.4 fatcat:fcvejkemfjaczbcxgjrviru3pu

Message from the Organizers

2007 Fourth International Workshop on Model-Based Methodologies for Pervasive and Embedded Software (MOMPES'07)  
We suggest that schedulers designed with an understanding of the requirements of all process classes and their mixes, as well the abilities of the underlying architecture, might be the solution to this  ...  Specifically, we are interested in parallel process scheduling, which has been a topic of significant study in the supercomputing community, but so far little of this research has trickled down to the  ...  ACKNOWLEDGEMENTS We would like to thank the Xilinx University Program for its valuable help on partial reconfiguration and its hardware and software donations.  ... 
doi:10.1109/mompes.2007.11 fatcat:xxlvjyydibg53oonip25ubnz4q

Message from the Organizers

2008 2008 5th International Workshop on Model-based Methodologies for Pervasive and Embedded Software  
We suggest that schedulers designed with an understanding of the requirements of all process classes and their mixes, as well the abilities of the underlying architecture, might be the solution to this  ...  Specifically, we are interested in parallel process scheduling, which has been a topic of significant study in the supercomputing community, but so far little of this research has trickled down to the  ...  ACKNOWLEDGEMENTS We would like to thank the Xilinx University Program for its valuable help on partial reconfiguration and its hardware and software donations.  ... 
doi:10.1109/mompes.2008.4 fatcat:ctsxrgwiibd5fcgvenehisljs4

Message from the Organizers

2012 2012 Third International Conference on Networking and Computing  
We suggest that schedulers designed with an understanding of the requirements of all process classes and their mixes, as well the abilities of the underlying architecture, might be the solution to this  ...  Specifically, we are interested in parallel process scheduling, which has been a topic of significant study in the supercomputing community, but so far little of this research has trickled down to the  ...  ACKNOWLEDGEMENTS We would like to thank the Xilinx University Program for its valuable help on partial reconfiguration and its hardware and software donations.  ... 
doi:10.1109/icnc.2012.5 fatcat:mmkxzhoj2fbctfhufxjeepr6jm
« Previous Showing results 1 — 15 out of 444 results