Chonnam National University Central Library

Python Reinforcement Learning [electronic resource] : Solve Complex Real-World Problems by Mastering Reinforcement Learning Algorithms Using OpenAI Gym and TensorFlow

Detailed information
Material type: e-Book
Title / Statement of responsibility: Python Reinforcement Learning [electronic resource]: Solve Complex Real-World Problems by Mastering Reinforcement Learning Algorithms Using OpenAI Gym and TensorFlow / Sudharsan Ravichandiran, Sean Saito, Rajalingappaa Shanmugamani and Yang Wenzhuo.
Personal authors: Ravichandiran, Sudharsan, author.
Saito, Sean, author.
Shanmugamani, Rajalingappaa, author.
Wenzhuo, Yang, author.
Publication: Birmingham: Packt Publishing, Limited, 2019.
Physical description: 1 online resource (484 pages).
Other format entry: Print version: Ravichandiran, Sudharsan. Python Reinforcement Learning : Solve Complex Real-World Problems by Mastering Reinforcement Learning Algorithms Using OpenAI Gym and TensorFlow. Birmingham : Packt Publishing, Limited, ©2019 9781838649777
ISBN: 9781838640149
General note: Implementation of the Atari emulator
Contents note: Cover; Title Page; Copyright and Credits; About Packt; Contributors; Table of Contents; Preface; Chapter 1: Introduction to Reinforcement Learning; What is RL?; RL algorithm; How RL differs from other ML paradigms; Elements of RL; Agent; Policy function; Value function; Model; Agent environment interface; Types of RL environment; Deterministic environment; Stochastic environment; Fully observable environment; Partially observable environment; Discrete environment; Continuous environment; Episodic and non-episodic environment; Single and multi-agent environment; RL platforms;
OpenAI Gym and Universe; DeepMind Lab; RL-Glue; Project Malmo; ViZDoom; Applications of RL; Education; Medicine and healthcare; Manufacturing; Inventory management; Finance; Natural Language Processing and Computer Vision; Summary; Questions; Further reading; Chapter 2: Getting Started with OpenAI and TensorFlow; Setting up your machine; Installing Anaconda; Installing Docker; Installing OpenAI Gym and Universe; Common error fixes; OpenAI Gym; Basic simulations; Training a robot to walk; OpenAI Universe; Building a video game bot; TensorFlow; Variables, constants, and placeholders; Variables;
Constants; Placeholders; Computation graph; Sessions; TensorBoard; Adding scope; Summary; Questions; Further reading; Chapter 3: The Markov Decision Process and Dynamic Programming; The Markov chain and Markov process; Markov Decision Process; Rewards and returns; Episodic and continuous tasks; Discount factor; The policy function; State value function; State-action value function (Q function); The Bellman equation and optimality; Deriving the Bellman equation for value and Q functions; Solving the Bellman equation; Dynamic programming; Value iteration; Policy iteration;
Solving the frozen lake problem; Value iteration; Policy iteration; Summary; Questions; Further reading; Chapter 4: Gaming with Monte Carlo Methods; Monte Carlo methods; Estimating the value of pi using Monte Carlo; Monte Carlo prediction; First visit Monte Carlo; Every visit Monte Carlo; Let's play Blackjack with Monte Carlo; Monte Carlo control; Monte Carlo exploration starts; On-policy Monte Carlo control; Off-policy Monte Carlo control; Summary; Questions; Further reading; Chapter 5: Temporal Difference Learning; TD learning; TD prediction; TD control; Q learning;
Solving the taxi problem using Q learning; SARSA; Solving the taxi problem using SARSA; The difference between Q learning and SARSA; Summary; Questions; Further reading; Chapter 6: Multi-Armed Bandit Problem; The MAB problem; The epsilon-greedy policy; The softmax exploration algorithm; The upper confidence bound algorithm; The Thompson sampling algorithm; Applications of MAB; Identifying the right advertisement banner using MAB; Contextual bandits; Summary; Questions; Further reading; Chapter 7: Playing Atari Games; Introduction to Atari games; Building an Atari emulator; Getting started
Summary: Reinforcement learning and deep reinforcement learning are the trending and most promising branches of artificial intelligence. This Learning Path will enable you to master not only the basic reinforcement learning algorithms but also the advanced deep reinforcement learning algorithms and their limitations.
Subject headings: Python (Computer program language)
Reinforcement learning.
Classification (DDC): 006.31
Language: English

Holdings

No.: 1
Registration no.: E190843
Call number: EB 006.31
Location: Central Library [Main Building] / E-Book
Status: Available for loan

77 Yongbong-ro, Buk-gu, Gwangju 500-757   TEL 062)530-3571~2 (Circulation Desk)   FAX 062)530-3529