عنوان

Accelerator programming using Directives :

پدید آورنده

Sunita Chandrasekaran, Guido Juckeland, Sandra Wienke (eds.).

موضوع

Computer programming, Congresses.,High performance computing, Congresses.,Computer programming.,High performance computing.

رده

QA76
.
751

کتابخانه

کتابخانه مطالعات اسلامی به زبان های اروپایی

محل استقرار

استان: قم ـ شهر: قم

تماس با کتابخانه : 32910706-025

3030122743

9783030122744

9783030122737

Accelerator programming using Directives :

[Book]

5th International Workshop, WACCPD 2018, Dallas, TX, USA, November 11-17, 2018, Proceedings /

Sunita Chandrasekaran, Guido Juckeland, Sandra Wienke (eds.).

Cham, Switzerland :

Springer,

2019.

1 online resource (ix, 137 pages) :

illustrations (some color)

Lecture notes in computer science ;

LNCS sublibrary. SL 2, Programming and software engineering

11381

Includes author index.

International conference proceedings.

Intro; 2018: 5th Workshop on Accelerator Programming Using Directives (WACCPD) http://waccpd.org/; Organization; Contents; Applications; Heterogeneous Programming and Optimization of Gyrokinetic Toroidal Code Using Directives; Abstract; 1 Introduction; 2 Simulation Platforms: Titan, SummitDev, and Summit; 3 Scientific Methods of GTC; 4 Porting and Optimization Strategy; 5 GPU Porting Status; 6 Performance; 6.1 Solver Performance Improvement; 6.2 Scaling Performance; 6.3 Tests on SummitDev; 6.4 Performance and Scalability on Summit; 7 Conclusion; Acknowledgments; References

2 Background on Warp Specialization and Elision3 Fission of Multiple-Parallel-Region Target Regions; 4 Overlapping Data Transfer and Split Kernel Execution; 5 Pipelining Data Transfer and Parallel Loop Execution; 6 Custom Grid Geometry; 7 Estimating Potential Benefits of Transformations; 7.1 Combining Kernel Splitting with Elision Improves Performance; 7.2 Elision Amplifies Benefits of Custom Grid Geometry; 7.3 Pipelining Improves Performance for High Trip Counts; 8 Related Work; 9 Conclusion

6.1 Use of OpenACC for the Squared Distance Calculation: GPU6.2 Comparison to CUDA Kernel; 6.3 OpenACC on the CPU; 6.4 Comparison to a Purely BLAS-Based Algorithm: Lowest Programming Knowledge Required; 7 Programming Effort; 8 Conclusions; A Artifact Description Appendix: Using Compiler Directives for Performance Portability in Scientific Computing: Kernels from Molecular Simulation; A.1 Abstract; A.2 Description; References; Using OpenMP; OpenMP Code Offloading: Splitting GPU Kernels, Pipelining Communication and Computation, and Selecting Better Grid Geometries; 1 Introduction

A Artifact Description Appendix: OpenMP Target Offloading: Splitting GPU Kernels, Pipelining Communication and Computation, and Selecting Better Grid GeometriesA. 1 Abstract; A.2 Description; A.3 Installation; A.4 Experiment Workflow; A.5 Evaluation and Expected Results; A.6 Experiment Customization; A.7 Notes; References; A Case Study for Performance Portability Using OpenMP 4.5; 1 Introduction; 2 The GPP Kernel and Its Baseline CPU Implementation; 2.1 GPP Kernel; 2.2 Baseline CPU Implementation; 3 GPU Implementations of the GPP Kernel; 3.1 Implementation Groundwork; 3.2 OpenMP 4.5

Using Compiler Directives for Performance Portability in Scientific Computing: Kernels from Molecular Simulation1 Introduction; 2 Background; 2.1 Performance Portability; 2.2 Molecular Dynamics; 3 Portability Goals: Timings and Architectures; 4 Designing the Kernels; 4.1 The Programming Model and Its Portable Subset; 4.2 Modular Format and Kernels; 5 Binning Module (Neighbor-List Updates): Bin-Assign, Bin-Count, and Bin Sorting; 5.1 Bin-Assign, Bin-Count; 5.2 Parallel Algorithm Design for Bin Count and Gather; 6 The Squared Pairwise Distance Calculation: Performance, Portability, and Effort

This book constitutes the refereed post-conference proceedings of the 5th International Workshop on Accelerator Programming Using Directives, WACCPD 2018, held in Dallas, TX, USA, in November 2018. The 6 full papers presented have been carefully reviewed and selected from 12 submissions. The papers share knowledge and experiences to program emerging complex parallel computing systems. They are organized in the following three sections: applications; using openMP; and program evaluation.

Springer Nature

com.springer.onix.9783030122744

9783030122737

9783030122751

WACCPD 2018

Computer programming, Congresses.

High performance computing, Congresses.

Computer programming.

High performance computing.

COM051010

UMC

UMX

005

QA76

751

Chandrasekaran, Sunita

Juckeland, Guido

Wienke, Sandra

WACCPD (Workshop)(5th :2018 :, Dallas, Tex.)

20200823082910.0

[Book]

عنوان Accelerator programming using Directives :

پدید آورنده Sunita Chandrasekaran, Guido Juckeland, Sandra Wienke (eds.).

موضوع Computer programming, Congresses.,High performance computing, Congresses.,Computer programming.,High performance computing.

رده QA76.751

کتابخانه کتابخانه مطالعات اسلامی به زبان های اروپایی