عنوان

General-purpose graphics processor architectures /

پدید آورنده

Tor M. Aamodt, Wilson Wai Lun Fung, Timothy G. Rogers.

موضوع

Computer architecture.,Graphics processing units.,Computer architecture.,COMPUTERS-- General.,Graphics processing units.

رده

T385
.
A243

2018

کتابخانه

Center and Library of Islamic Studies in European Languages

محل استقرار

استان: Qom ـ شهر: Qom

تماس با کتابخانه : 32910706-025

INTERNATIONAL STANDARD BOOK NUMBER

(Number (ISBN

1627056181

(Number (ISBN

9781627056182

Erroneous ISBN

9781627059237

Erroneous ISBN

9781681733586

TITLE AND STATEMENT OF RESPONSIBILITY

Title Proper

General-purpose graphics processor architectures /

General Material Designation

[Book]

First Statement of Responsibility

Tor M. Aamodt, Wilson Wai Lun Fung, Timothy G. Rogers.

.PUBLICATION, DISTRIBUTION, ETC

Place of Publication, Distribution, etc.

[San Rafael, California] :

Name of Publisher, Distributor, etc.

Morgan & Claypool Publishers,

Date of Publication, Distribution, etc.

[2018]

Date of Publication, Distribution, etc.

PHYSICAL DESCRIPTION

Specific Material Designation and Extent of Item

1 online resource (xvii, 122 pages) :

Other Physical Details

illustrations

SERIES

Series Title

Synthesis lectures on computer architecture,

Volume Designation

#44

ISSN of Series

1935-3243 ;

INTERNAL BIBLIOGRAPHIES/INDEXES NOTE

Text of Note

Includes bibliographical references (pages 103-119).

CONTENTS NOTE

Text of Note

1. Introduction -- 1.1 The landscape of computation accelerators -- 1.2 GPU hardware basics -- 1.3 A brief history of GPUs -- 1.4 Book outline.

Text of Note

2. Programming model -- 2.1 Execution model -- 2.2 GPU instruction set architectures -- 2.2.1 NVIDIA GPU instruction set architectures -- 2.2.2 AMD graphics core next instruction set architecture.

Text of Note

3. The SIMT core: instruction and register data flow -- 3.1 One-loop approximation -- 3.1.1 SIMT execution masking -- 3.1.2 SIMT deadlock and stackless SIMT architectures -- 3.1.3 Warp scheduling -- 3.2 Two-loop approximation -- 3.3 Three-loop approximation -- 3.3.1 Operand collector -- 3.3.2 Instruction replay: handling structural hazards -- 3.4 Research directions on branch divergence -- 3.4.1 Warp compaction -- 3.4.2 Intra-warp divergent path management -- 3.4.3 Adding MIMD capability -- 3.4.4 Complexity-effective divergence management -- 3.5 Research directions on scalarization and affine execution -- 3.5.1 Detection of uniform or affine variables -- 3.5.2 Exploiting uniform or affine variables in GPU -- 3.6 Research directions on register file architecture -- 3.6.1 Hierarchical register file -- 3.6.2 Drowsy state register file -- 3.6.3 Register file virtualization -- 3.6.4 Partitioned register file -- 3.6.5 RegLess.

Text of Note

4. Memory system -- 4.1 First-level memory structures -- 4.1.1 Scratchpad memory and L1 data cache -- 4.1.2 L1 texture cache -- 4.1.3 Unified texture and data cache -- 4.2 On-chip interconnection network -- 4.3 Memory partition unit -- 4.3.1 L2 cache -- 4.3.2 Atomic operations -- 4.3.3 Memory access scheduler -- 4.4 Research directions for GPU memory systems -- 4.4.1 Memory access scheduling and interconnection network design -- 4.4.2 Caching effectiveness -- 4.4.3 Memory request prioritization and cache bypassing -- 4.4.4 Exploiting inter-warp heterogeneity -- 4.4.5 Coordinated cache bypassing -- 4.4.6 Adaptive cache management -- 4.4.7 Cache prioritization -- 4.4.8 Virtual memory page placement -- 4.4.9 Data placement -- 4.4.10 Multi-chip-module GPUs.

Text of Note

5. Crosscutting research on GPU computing architectures -- 5.1 Thread scheduling -- 5.1.1 Research on assignment of threadblocks to cores -- 5.1.2 Research on cycle-by-cycle scheduling decisions -- 5.1.3 Research on scheduling multiple kernels -- 5.1.4 Fine-grain synchronization aware scheduling -- 5.2 Alternative ways of expressing parallelism -- 5.3 Support for transactional memory -- 5.3.1 Kilo TM -- 5.3.2 Warp TM and temporal conflict detection -- 5.4 Heterogeneous systems.

Text of Note

Bibliography -- Authors' biographies.

SUMMARY OR ABSTRACT

Text of Note

Originally developed to support video games, graphics processor units (GPUs) are now increasingly used for general-purpose (non-graphics) applications ranging from machine learning to mining of cryptographic currencies. GPUs can achieve improved performance and efficiency versus central processing units (CPUs) by dedicating a larger fraction of hardware resources to computation. In addition, their general-purpose programmability makes contemporary GPUs appealing to software developers in comparison to domain-specific accelerators. This book provides an introduction to those interested in studying the architecture of GPUs that support general-purpose computing. It collects together information currently only found among a wide range of disparate sources. The authors led development of the GPGPU-Sim simulator widely used in academic research on GPU architectures. The first chapter of this book describes the basic hardware structure of GPUs and provides a brief overview of their history. Chapter 2 provides a summary of GPU programming models relevant to the rest of the book. Chapter 3 explores the architecture of GPU compute cores. Chapter 4 explores the architecture of the GPU memory system. After describing the architecture of existing systems, Chapters 3 and 4 provide an overview of related research. Chapter 5 summarizes cross-cutting research impacting both the compute core and memory system. This book should provide a valuable resource for those wishing to understand the architecture of graphics processor units (GPUs) used for acceleration of general-purpose applications and to those who want to obtain an introduction to the rapidly growing body of research exploring how to improve the architecture of these GPUs.

OTHER EDITION IN ANOTHER MEDIUM

International Standard Book Number

9781681733586

TOPICAL NAME USED AS SUBJECT

Computer architecture.

Graphics processing units.

Computer architecture.

COMPUTERS-- General.

Graphics processing units.

(SUBJECT CATEGORY (Provisional

COM-- 000000

DEWEY DECIMAL CLASSIFICATION

Number

006

6869

Edition

LIBRARY OF CONGRESS CLASSIFICATION

Class number

T385

Book number

A243

2018

PERSONAL NAME - PRIMARY RESPONSIBILITY

Aamodt, Tor M.

PERSONAL NAME - ALTERNATIVE RESPONSIBILITY

Fung, Wilson Wai Lun

Rogers, Timothy G.

ORIGINATING SOURCE

Date of Transaction

20200823051201.0

Cataloguing Rules (Descriptive Conventions))

ELECTRONIC LOCATION AND ACCESS

Electronic name

[Book]

عنوان General-purpose graphics processor architectures /

پدید آورنده Tor M. Aamodt, Wilson Wai Lun Fung, Timothy G. Rogers.

موضوع Computer architecture.,Graphics processing units.,Computer architecture.,COMPUTERS-- General.,Graphics processing units.

رده T385 .A243 2018

کتابخانه Center and Library of Islamic Studies in European Languages

محل استقرار استان: Qom ـ شهر: Qom

INTERNATIONAL STANDARD BOOK NUMBER

TITLE AND STATEMENT OF RESPONSIBILITY

.PUBLICATION, DISTRIBUTION, ETC

PHYSICAL DESCRIPTION

SERIES

INTERNAL BIBLIOGRAPHIES/INDEXES NOTE

CONTENTS NOTE

SUMMARY OR ABSTRACT

OTHER EDITION IN ANOTHER MEDIUM

TOPICAL NAME USED AS SUBJECT

(SUBJECT CATEGORY (Provisional

DEWEY DECIMAL CLASSIFICATION

LIBRARY OF CONGRESS CLASSIFICATION

PERSONAL NAME - PRIMARY RESPONSIBILITY

PERSONAL NAME - ALTERNATIVE RESPONSIBILITY

ORIGINATING SOURCE

ELECTRONIC LOCATION AND ACCESS

عنوان

General-purpose graphics processor architectures /

پدید آورنده

Tor M. Aamodt, Wilson Wai Lun Fung, Timothy G. Rogers.

موضوع

Computer architecture.,Graphics processing units.,Computer architecture.,COMPUTERS-- General.,Graphics processing units.

رده

T385
.
A243

2018

کتابخانه

Center and Library of Islamic Studies in European Languages

محل استقرار

استان: Qom ـ شهر: Qom