عنوان

Hands-On Reinforcement Learning with Python :

پدید آورنده

موضوع

Machine learning.,Artificial intelligence.,Computers-- Intelligence (AI) & Semantics.,Computers-- Neural Networks.,Computers-- Social Aspects-- Human-Computer Interaction.,Human-computer interaction.,Machine learning.,Neural networks & fuzzy systems.

رده

Q325
.
5
.
R385

2018eb

کتابخانه

مرکز و کتابخانه مطالعات اسلامی به زبان‌های اروپایی

محل استقرار

استان: قم ـ شهر: قم

تماس با کتابخانه : 32910706-025

شابک

1788836529

شابک

178883691X

شابک

9781788836524

شابک

9781788836913

عنوان و نام پديدآور

عنوان اصلي

Hands-On Reinforcement Learning with Python :

نام عام مواد

[Book]

ساير اطلاعات عنواني

Master Reinforcement and Deep Reinforcement Learning Using OpenAI Gym and TensorFlow.

وضعیت نشر و پخش و غیره

محل نشرو پخش و غیره

Birmingham :

نام ناشر، پخش کننده و غيره

Packt Publishing Ltd,

تاریخ نشرو بخش و غیره

2018.

مشخصات ظاهری

نام خاص و کميت اثر

1 online resource (309 pages)

يادداشت کلی

متن يادداشت

Activation functions.

یادداشتهای مربوط به مندرجات

متن يادداشت

Cover; Title Page; Copyright and Credits; Dedication; Packt Upsell; Contributors; Table of Contents; Preface; Chapter 1: Introduction to Reinforcement Learning; What is RL?; RL algorithm; How RL differs from other ML paradigms; Elements of RL; Agent; Policy function; Value function; Model; Agent environment interface; Types of RL environment; Deterministic environment; Stochastic environment; Fully observable environment; Partially observable environment; Discrete environment; Continuous environment; Episodic and non-episodic environment; Single and multi-agent environment; RL platforms.

متن يادداشت

ConstantsPlaceholders; Computation graph; Sessions; TensorBoard; Adding scope; Summary; Questions; Further reading; Chapter 3: The Markov Decision Process and Dynamic Programming; The Markov chain and Markov process; Markov Decision Process; Rewards and returns; Episodic and continuous tasks; Discount factor; The policy function; State value function; State-action value function (Q function); The Bellman equation and optimality; Deriving the Bellman equation for value and Q functions; Solving the Bellman equation; Dynamic programming; Value iteration; Policy iteration.

متن يادداشت

OpenAI Gym and UniverseDeepMind Lab; RL-Glue; Project Malmo; ViZDoom; Applications of RL; Education; Medicine and healthcare; Manufacturing; Inventory management; Finance; Natural Language Processing and Computer Vision; Summary; Questions; Further reading; Chapter 2: Getting Started with OpenAI and TensorFlow; Setting up your machine; Installing Anaconda; Installing Docker; Installing OpenAI Gym and Universe; Common error fixes; OpenAI Gym; Basic simulations; Training a robot to walk; OpenAI Universe; Building a video game bot; TensorFlow; Variables, constants, and placeholders; Variables.

متن يادداشت

Solving the frozen lake problemValue iteration; Policy iteration; Summary; Questions; Further reading; Chapter 4: Gaming with Monte Carlo Methods; Monte Carlo methods; Estimating the value of pi using Monte Carlo; Monte Carlo prediction; First visit Monte Carlo; Every visit Monte Carlo; Let's play Blackjack with Monte Carlo; Monte Carlo control; Monte Carlo exploration starts; On-policy Monte Carlo control; Off-policy Monte Carlo control; Summary; Questions; Further reading; Chapter 5: Temporal Difference Learning; TD learning; TD prediction; TD control; Q learning.

متن يادداشت

Solving the taxi problem using Q learningSARSA; Solving the taxi problem using SARSA; The difference between Q learning and SARSA; Summary; Questions; Further reading; Chapter 6: Multi-Armed Bandit Problem; The MAB problem; The epsilon-greedy policy; The softmax exploration algorithm; The upper confidence bound algorithm; The Thompson sampling algorithm; Applications of MAB; Identifying the right advertisement banner using MAB; Contextual bandits; Summary; Questions; Further reading; Chapter 7: Deep Learning Fundamentals; Artificial neurons; ANNs; Input layer; Hidden layer; Output layer.

بدون عنوان

یادداشتهای مربوط به خلاصه یا چکیده

متن يادداشت

Reinforcement learning is a self-evolving type of machine learning that takes us closer to achieving true artificial intelligence. This easy-to-follow guide explains everything from scratch using rich examples written in Python.

یادداشتهای مربوط به سفارشات

منبع سفارش / آدرس اشتراک

01201872

شماره انبار

B09792

ویراست دیگر از اثر در قالب دیگر رسانه

عنوان

Hands-On Reinforcement Learning with Python : Master Reinforcement and Deep Reinforcement Learning Using OpenAI Gym and TensorFlow.

شماره استاندارد بين المللي کتاب و موسيقي

9781788836524

موضوع (اسم عام یاعبارت اسمی عام)

موضوع مستند نشده

Machine learning.

موضوع مستند نشده

Artificial intelligence.

موضوع مستند نشده

Computers-- Intelligence (AI) & Semantics.

موضوع مستند نشده

Computers-- Neural Networks.

موضوع مستند نشده

Computers-- Social Aspects-- Human-Computer Interaction.

موضوع مستند نشده

Human-computer interaction.

موضوع مستند نشده

Machine learning.

موضوع مستند نشده

Neural networks & fuzzy systems.

رده بندی ديویی

شماره

006

ويراست

رده بندی کنگره

شماره رده

Q325

نشانه اثر

R385

2018eb

نام شخص به منزله سر شناسه - (مسئولیت معنوی درجه اول )

مستند نام اشخاص تاييد نشده

Ravichandiran, Sudharsan.

مبدا اصلی

تاريخ عمليات

20200823060459.0

قواعد فهرست نويسي ( بخش توصيفي )

دسترسی و محل الکترونیکی

نام الکترونيکي

اطلاعات رکورد کتابشناسی

نوع ماده

[Book]

اطلاعات دسترسی رکورد

تكميل شده

عنوان Hands-On Reinforcement Learning with Python :

پدید آورنده

موضوع Machine learning.,Artificial intelligence.,Computers-- Intelligence (AI) & Semantics.,Computers-- Neural Networks.,Computers-- Social Aspects-- Human-Computer Interaction.,Human-computer interaction.,Machine learning.,Neural networks & fuzzy systems.

رده Q325.5 .R385 2018eb

کتابخانه مرکز و کتابخانه مطالعات اسلامی به زبان‌های اروپایی

محل استقرار استان: قم ـ شهر: قم

شابک

عنوان و نام پديدآور

وضعیت نشر و پخش و غیره

مشخصات ظاهری

يادداشت کلی

یادداشتهای مربوط به مندرجات

یادداشتهای مربوط به خلاصه یا چکیده

یادداشتهای مربوط به سفارشات

ویراست دیگر از اثر در قالب دیگر رسانه

موضوع (اسم عام یاعبارت اسمی عام)

رده بندی ديویی

رده بندی کنگره

نام شخص به منزله سر شناسه - (مسئولیت معنوی درجه اول )

مبدا اصلی

دسترسی و محل الکترونیکی

اطلاعات رکورد کتابشناسی

اطلاعات دسترسی رکورد

عنوان

Hands-On Reinforcement Learning with Python :

موضوع

Machine learning.,Artificial intelligence.,Computers-- Intelligence (AI) & Semantics.,Computers-- Neural Networks.,Computers-- Social Aspects-- Human-Computer Interaction.,Human-computer interaction.,Machine learning.,Neural networks & fuzzy systems.

رده

Q325
.
5
.
R385

2018eb

کتابخانه

مرکز و کتابخانه مطالعات اسلامی به زبان‌های اروپایی

محل استقرار

استان: قم ـ شهر: قم