عنوان

Accelerator programming using Directives :

پدید آورنده

Sunita Chandrasekaran, Guido Juckeland, Sandra Wienke (eds.).

موضوع

Computer programming, Congresses.,High performance computing, Congresses.,Computer programming.,High performance computing.

رده

QA76
.
751

کتابخانه

Center and Library of Islamic Studies in European Languages

محل استقرار

استان: Qom ـ شهر: Qom

تماس با کتابخانه : 32910706-025

INTERNATIONAL STANDARD BOOK NUMBER

(Number (ISBN

3030122743

(Number (ISBN

9783030122744

Erroneous ISBN

9783030122737

TITLE AND STATEMENT OF RESPONSIBILITY

Title Proper

Accelerator programming using Directives :

General Material Designation

[Book]

Other Title Information

5th International Workshop, WACCPD 2018, Dallas, TX, USA, November 11-17, 2018, Proceedings /

First Statement of Responsibility

Sunita Chandrasekaran, Guido Juckeland, Sandra Wienke (eds.).

.PUBLICATION, DISTRIBUTION, ETC

Place of Publication, Distribution, etc.

Cham, Switzerland :

Name of Publisher, Distributor, etc.

Springer,

Date of Publication, Distribution, etc.

2019.

PHYSICAL DESCRIPTION

Specific Material Designation and Extent of Item

1 online resource (ix, 137 pages) :

Other Physical Details

illustrations (some color)

SERIES

Series Title

Lecture notes in computer science ;

Series Title

LNCS sublibrary. SL 2, Programming and software engineering

Volume Designation

11381

GENERAL NOTES

Text of Note

Includes author index.

Text of Note

International conference proceedings.

CONTENTS NOTE

Text of Note

Intro; 2018: 5th Workshop on Accelerator Programming Using Directives (WACCPD) http://waccpd.org/; Organization; Contents; Applications; Heterogeneous Programming and Optimization of Gyrokinetic Toroidal Code Using Directives; Abstract; 1 Introduction; 2 Simulation Platforms: Titan, SummitDev, and Summit; 3 Scientific Methods of GTC; 4 Porting and Optimization Strategy; 5 GPU Porting Status; 6 Performance; 6.1 Solver Performance Improvement; 6.2 Scaling Performance; 6.3 Tests on SummitDev; 6.4 Performance and Scalability on Summit; 7 Conclusion; Acknowledgments; References

Text of Note

2 Background on Warp Specialization and Elision3 Fission of Multiple-Parallel-Region Target Regions; 4 Overlapping Data Transfer and Split Kernel Execution; 5 Pipelining Data Transfer and Parallel Loop Execution; 6 Custom Grid Geometry; 7 Estimating Potential Benefits of Transformations; 7.1 Combining Kernel Splitting with Elision Improves Performance; 7.2 Elision Amplifies Benefits of Custom Grid Geometry; 7.3 Pipelining Improves Performance for High Trip Counts; 8 Related Work; 9 Conclusion

Text of Note

6.1 Use of OpenACC for the Squared Distance Calculation: GPU6.2 Comparison to CUDA Kernel; 6.3 OpenACC on the CPU; 6.4 Comparison to a Purely BLAS-Based Algorithm: Lowest Programming Knowledge Required; 7 Programming Effort; 8 Conclusions; A Artifact Description Appendix: Using Compiler Directives for Performance Portability in Scientific Computing: Kernels from Molecular Simulation; A.1 Abstract; A.2 Description; References; Using OpenMP; OpenMP Code Offloading: Splitting GPU Kernels, Pipelining Communication and Computation, and Selecting Better Grid Geometries; 1 Introduction

Text of Note

A Artifact Description Appendix: OpenMP Target Offloading: Splitting GPU Kernels, Pipelining Communication and Computation, and Selecting Better Grid GeometriesA. 1 Abstract; A.2 Description; A.3 Installation; A.4 Experiment Workflow; A.5 Evaluation and Expected Results; A.6 Experiment Customization; A.7 Notes; References; A Case Study for Performance Portability Using OpenMP 4.5; 1 Introduction; 2 The GPP Kernel and Its Baseline CPU Implementation; 2.1 GPP Kernel; 2.2 Baseline CPU Implementation; 3 GPU Implementations of the GPP Kernel; 3.1 Implementation Groundwork; 3.2 OpenMP 4.5

Text of Note

Using Compiler Directives for Performance Portability in Scientific Computing: Kernels from Molecular Simulation1 Introduction; 2 Background; 2.1 Performance Portability; 2.2 Molecular Dynamics; 3 Portability Goals: Timings and Architectures; 4 Designing the Kernels; 4.1 The Programming Model and Its Portable Subset; 4.2 Modular Format and Kernels; 5 Binning Module (Neighbor-List Updates): Bin-Assign, Bin-Count, and Bin Sorting; 5.1 Bin-Assign, Bin-Count; 5.2 Parallel Algorithm Design for Bin Count and Gather; 6 The Squared Pairwise Distance Calculation: Performance, Portability, and Effort

SUMMARY OR ABSTRACT

Text of Note

This book constitutes the refereed post-conference proceedings of the 5th International Workshop on Accelerator Programming Using Directives, WACCPD 2018, held in Dallas, TX, USA, in November 2018. The 6 full papers presented have been carefully reviewed and selected from 12 submissions. The papers share knowledge and experiences to program emerging complex parallel computing systems. They are organized in the following three sections: applications; using openMP; and program evaluation.

ACQUISITION INFORMATION NOTE

Source for Acquisition/Subscription Address

Springer Nature

Stock Number

com.springer.onix.9783030122744

OTHER EDITION IN ANOTHER MEDIUM

International Standard Book Number

9783030122737

International Standard Book Number

9783030122751

PARALLEL TITLE PROPER

Parallel Title

WACCPD 2018

TOPICAL NAME USED AS SUBJECT

Computer programming, Congresses.

High performance computing, Congresses.

Computer programming.

High performance computing.

(SUBJECT CATEGORY (Provisional

COM051010

UMC

UMX

DEWEY DECIMAL CLASSIFICATION

Number

005

Edition

LIBRARY OF CONGRESS CLASSIFICATION

Class number

QA76

751

PERSONAL NAME - ALTERNATIVE RESPONSIBILITY

Chandrasekaran, Sunita

Juckeland, Guido

Wienke, Sandra

CORPORATE BODY NAME - PRIMARY RESPONSIBILITY

WACCPD (Workshop)(5th :2018 :, Dallas, Tex.)

ORIGINATING SOURCE

Date of Transaction

20200823082910.0

Cataloguing Rules (Descriptive Conventions))

ELECTRONIC LOCATION AND ACCESS

Electronic name

[Book]

عنوان Accelerator programming using Directives :

پدید آورنده Sunita Chandrasekaran, Guido Juckeland, Sandra Wienke (eds.).

موضوع Computer programming, Congresses.,High performance computing, Congresses.,Computer programming.,High performance computing.

رده QA76.751

کتابخانه Center and Library of Islamic Studies in European Languages

محل استقرار استان: Qom ـ شهر: Qom

INTERNATIONAL STANDARD BOOK NUMBER

TITLE AND STATEMENT OF RESPONSIBILITY

.PUBLICATION, DISTRIBUTION, ETC

PHYSICAL DESCRIPTION

SERIES

GENERAL NOTES

CONTENTS NOTE

SUMMARY OR ABSTRACT

ACQUISITION INFORMATION NOTE

OTHER EDITION IN ANOTHER MEDIUM

PARALLEL TITLE PROPER

TOPICAL NAME USED AS SUBJECT

(SUBJECT CATEGORY (Provisional

DEWEY DECIMAL CLASSIFICATION

LIBRARY OF CONGRESS CLASSIFICATION

PERSONAL NAME - ALTERNATIVE RESPONSIBILITY

CORPORATE BODY NAME - PRIMARY RESPONSIBILITY

ORIGINATING SOURCE

ELECTRONIC LOCATION AND ACCESS

عنوان

Accelerator programming using Directives :

پدید آورنده

Sunita Chandrasekaran, Guido Juckeland, Sandra Wienke (eds.).

موضوع

Computer programming, Congresses.,High performance computing, Congresses.,Computer programming.,High performance computing.

رده

QA76
.
751

کتابخانه

Center and Library of Islamic Studies in European Languages

محل استقرار

استان: Qom ـ شهر: Qom