LNCS sublibrary. SL 2, Programming and software engineering
Volume designation
11381
General note
Note text
Includes author index.
Note text
International conference proceedings.
Contents notes
Note text
Intro; 2018: 5th Workshop on Accelerator Programming Using Directives (WACCPD) http://waccpd.org/; Organization; Contents; Applications; Heterogeneous Programming and Optimization of Gyrokinetic Toroidal Code Using Directives; Abstract; 1 Introduction; 2 Simulation Platforms: Titan, SummitDev, and Summit; 3 Scientific Methods of GTC; 4 Porting and Optimization Strategy; 5 GPU Porting Status; 6 Performance; 6.1 Solver Performance Improvement; 6.2 Scaling Performance; 6.3 Tests on SummitDev; 6.4 Performance and Scalability on Summit; 7 Conclusion; Acknowledgments; References
Note text
2 Background on Warp Specialization and Elision; 3 Fission of Multiple-Parallel-Region Target Regions; 4 Overlapping Data Transfer and Split Kernel Execution; 5 Pipelining Data Transfer and Parallel Loop Execution; 6 Custom Grid Geometry; 7 Estimating Potential Benefits of Transformations; 7.1 Combining Kernel Splitting with Elision Improves Performance; 7.2 Elision Amplifies Benefits of Custom Grid Geometry; 7.3 Pipelining Improves Performance for High Trip Counts; 8 Related Work; 9 Conclusion
Note text
6.1 Use of OpenACC for the Squared Distance Calculation: GPU; 6.2 Comparison to CUDA Kernel; 6.3 OpenACC on the CPU; 6.4 Comparison to a Purely BLAS-Based Algorithm: Lowest Programming Knowledge Required; 7 Programming Effort; 8 Conclusions; A Artifact Description Appendix: Using Compiler Directives for Performance Portability in Scientific Computing: Kernels from Molecular Simulation; A.1 Abstract; A.2 Description; References; Using OpenMP; OpenMP Code Offloading: Splitting GPU Kernels, Pipelining Communication and Computation, and Selecting Better Grid Geometries; 1 Introduction
Note text
A Artifact Description Appendix: OpenMP Target Offloading: Splitting GPU Kernels, Pipelining Communication and Computation, and Selecting Better Grid Geometries; A.1 Abstract; A.2 Description; A.3 Installation; A.4 Experiment Workflow; A.5 Evaluation and Expected Results; A.6 Experiment Customization; A.7 Notes; References; A Case Study for Performance Portability Using OpenMP 4.5; 1 Introduction; 2 The GPP Kernel and Its Baseline CPU Implementation; 2.1 GPP Kernel; 2.2 Baseline CPU Implementation; 3 GPU Implementations of the GPP Kernel; 3.1 Implementation Groundwork; 3.2 OpenMP 4.5
Note text
Using Compiler Directives for Performance Portability in Scientific Computing: Kernels from Molecular Simulation; 1 Introduction; 2 Background; 2.1 Performance Portability; 2.2 Molecular Dynamics; 3 Portability Goals: Timings and Architectures; 4 Designing the Kernels; 4.1 The Programming Model and Its Portable Subset; 4.2 Modular Format and Kernels; 5 Binning Module (Neighbor-List Updates): Bin-Assign, Bin-Count, and Bin Sorting; 5.1 Bin-Assign, Bin-Count; 5.2 Parallel Algorithm Design for Bin Count and Gather; 6 The Squared Pairwise Distance Calculation: Performance, Portability, and Effort
Summary or abstract notes
Note text
This book constitutes the refereed post-conference proceedings of the 5th International Workshop on Accelerator Programming Using Directives, WACCPD 2018, held in Dallas, TX, USA, in November 2018. The 6 full papers presented were carefully reviewed and selected from 12 submissions. The papers share knowledge and experience in programming emerging complex parallel computing systems. They are organized in three sections: Applications; Using OpenMP; and Program Evaluation.
Acquisition notes
Source of acquisition / subscription address
Springer Nature
Stock number
com.springer.onix.9783030122744
Other edition of the work in another medium
International Standard Book and Music Number
9783030122737
International Standard Book and Music Number
9783030122751
Main title in another language
Main title in another language
WACCPD 2018
Subject (topical term or general noun phrase)
Uncontrolled subject term
Computer programming, Congresses.
Uncontrolled subject term
High performance computing, Congresses.
Uncontrolled subject term
Computer programming.
Uncontrolled subject term
High performance computing.
Subject category
Uncontrolled subject term
COM051010
Uncontrolled subject term
UMC
Uncontrolled subject term
UMX
Dewey classification
Number
005.13
Edition
23
Library of Congress classification
Class number
QA76.751
Personal name (equal intellectual responsibility)
Unverified personal name authority
Chandrasekaran, Sunita
Unverified personal name authority
Juckeland, Guido
Unverified personal name authority
Wienke, Sandra
Corporate body name as main entry (primary intellectual responsibility)