Advertisement

© 2016

Instruction Level Parallelism

Book
  • 4.4k Downloads

Table of contents

  1. Front Matter
    Pages i-xxi
  2. Alex Aiken, Utpal Banerjee, Arun Kejariwal, Alexandru Nicolau
    Pages 1-7
  3. Alex Aiken, Utpal Banerjee, Arun Kejariwal, Alexandru Nicolau
    Pages 9-42
  4. Alex Aiken, Utpal Banerjee, Arun Kejariwal, Alexandru Nicolau
    Pages 43-77
  5. Alex Aiken, Utpal Banerjee, Arun Kejariwal, Alexandru Nicolau
    Pages 79-116
  6. Alex Aiken, Utpal Banerjee, Arun Kejariwal, Alexandru Nicolau
    Pages 117-132
  7. Alex Aiken, Utpal Banerjee, Arun Kejariwal, Alexandru Nicolau
    Pages 133-165
  8. Alex Aiken, Utpal Banerjee, Arun Kejariwal, Alexandru Nicolau
    Pages 167-203
  9. Alex Aiken, Utpal Banerjee, Arun Kejariwal, Alexandru Nicolau
    Pages 205-208
  10. Back Matter
    Pages 209-255

About this book

Introduction

Since its introduction decades ago, Instruction Level Parallelism (ILP) has gradually become ubiquitous and is now featured in virtually every processor built today, from general purpose CPUs to application-specific and embedded processors.  Because these architectures could not exist or (in the case of superscalar machines) cannot achieve their full potential without specific sophisticated compilation techniques to exploit ILP, the development of architectures that support ILP has proceeded hand-in-hand with the development of sophisticated compiler technology, such as Trace Scheduling and Software Pipelining. While essential for achieving the full potential of ILP, in both performance as well as power consumption management, these techniques are still not widely known, in part because of their intricacy and in part because the only widely available references for ILP techniques are the primary resources, with the brevity of introduction common to conference proceedings.

This book precisely formulates, and simplifies the presentation of Instruction Level Parallelism (ILP) compilation techniques. It uniquely offers consistent and uniform descriptions of the code transformations involved. Due to the ubiquitous nature of ILP in virtually every processor built today, from general purpose CPUs to application-specific and embedded processors, this book is useful to the student, the practitioner and also the researcher of advanced compilation techniques.  With an emphasis on fine-grain instruction level parallelism, this book will also prove interesting to researchers and students of parallelism at large, in as much as the techniques described yield insights that go beyond superscalar and VLIW (Very Long Instruction Word) machines compilation and are more widely applicable to optimizing compilers in general. ILP techniques have found wide and crucial application in Design Automation, where they have been used extensively in the optimization of performance as well as area and power minimization of computer designs.




Keywords

Parallelism Instruction-level parallelism VLIW Superscalar Loop parallelization Compilers Optimization Scheduling Trace scheduling Percolation scheduling Modulo scheduling Software pipelining Performance GPUs GPU architecture scheduling algorithms Parallel processing Parallel computing

Authors and affiliations

  1. 1.Department of Computer ScienceStanford UniversityStanfordUSA
  2. 2.Department of Computer ScienceUniversity of California at IrvineIrvineUSA
  3. 3.MZ Inc.Palo AltoUSA
  4. 4.Center for Embedded Computer SystemsUniversity of California at IrvineIrvineUSA

About the authors

Alex Aiken is the Alcatel-Lucent Professor and the current chair of the Computer Science Department at Stanford. His research interests include most areas of programming languages and compilers and particularly automated methods of analysis for both high performance and high reliability.


Utpal Banerjee has a PhD in mathematics from Carnegie-Mellon University and a PhD in computer science from the University of Illinois at Urbana-Champaign. He has taught at the University of Cincinnati, Arizona State University and the University of Illinois. Dr. Banerjee has served as a research staff member at Honeywell, Fairchild, Control Data and Intel corporations. His current affiliation is with the Department of Computer Science, University of California at Irvine. He has published a number of papers and books on restructuring compilers, including encyclopedia articles and a series of books on loop transformations. He is a fellow of the IEEE and a fellow of the ACM. 


Arun Kejariwal is a Statistical Learning Principal at Machine Zone. He co-founded MZ Research and currently manages a team of research scientists. He is leading the research and development of novel algorithms for fraud detection, anomaly detection in security and operational data. Prior to joining Machine Zone, he was a lead in the Data Fidelity Team at Twitter and open sourced standalone R packages for anomaly detection and breakout detection. He received Ph.D. in Computer Science from UC Irvine and is a Senior Member of IEEE and ACM.


Alexandru Nicolau’s research is in the areas of Parallel Processing/ILP, and Embedded Systems/Design Automation. His interests focus on Computer Performance/power tradeoffs, parallelizing compilers, GPUs. His current work involves collaborations both within and outside UCI, most recently with researchers at Stanford, University of Michigan, UCLA, UCSD as part of a flagship NSF Expedition project, and a separate grant with UIUC. He authored over 300 peer-reviewed papers and several books. He is the Editor-in-Chief of the International Journal of Parallel Processing, and an IEEE Fellow.

Bibliographic information

Reviews