Buch, Englisch, 356 Seiten, Format (B × H): 151 mm x 235 mm, Gewicht: 550 g
Buch, Englisch, 356 Seiten, Format (B × H): 151 mm x 235 mm, Gewicht: 550 g
Reihe: Chapman & Hall/CRC Computational Science
ISBN: 978-1-4398-1192-4
Verlag: CRC Press
The text first introduces the architecture of modern cache-based microprocessors and discusses their inherent performance limitations, before describing general optimization strategies for serial code on cache-based architectures. It next covers shared- and distributed-memory parallel computer architectures and the most relevant network topologies. After discussing parallel computing on a theoretical level, the authors show how to avoid or ameliorate typical performance problems connected with OpenMP. They then present cache-coherent nonuniform memory access (ccNUMA) optimization techniques, examine distributed-memory parallel programming with message passing interface (MPI), and explain how to write efficient MPI code. The final chapter focuses on hybrid programming with MPI and OpenMP.
Users of high performance computers often have no idea what factors limit time to solution and whether it makes sense to think about optimization at all. This book facilitates an intuitive understanding of performance limitations without relying on heavy computer science knowledge. It also prepares readers for studying more advanced literature.
Read about the authors’ recent honor: Informatics Europe Curriculum Best Practices Award for Parallelism and Concurrency
Zielgruppe
Academic and Professional Practice & Development
Autoren/Hrsg.
Fachgebiete
- Interdisziplinäres Wissenschaften Wissenschaften: Forschung und Information Kybernetik, Systemtheorie, Komplexe Systeme
- Mathematik | Informatik EDV | Informatik Technische Informatik Grid-Computing & Paralleles Rechnen
- Mathematik | Informatik EDV | Informatik Programmierung | Softwareentwicklung Programmier- und Skriptsprachen
- Mathematik | Informatik Mathematik Mathematik Interdisziplinär Systemtheorie
- Mathematik | Informatik EDV | Informatik Informatik Rechnerarchitektur
Weitere Infos & Material
Modern Processors. Basic Optimization Techniques for Serial Code. Data Access Optimization. Parallel Computers. Basics of Parallelization. Shared-Memory Parallel Programming with OpenMP. Efficient OpenMP Programming. Locality Optimizations on ccNUMA Architectures. Distributed-Memory Parallel Programming with MPI. Efficient MPI Programming. Hybrid Parallelization with MPI and OpenMP. Appendices. Bibliography. Index.