Cargando…

Deep reinforcement learning hands-on : apply modern RL methods, with deep Q-networks, value iteration, policy gradients, TRPO, AlphaGo Zero and more /

This book is a practical, developer-oriented introduction to deep reinforcement learning (RL). Explore the theoretical concepts of RL, before discovering how deep learning (DL) methods and tools are making it possible to solve more complex and challenging problems than ever before. Apply deep RL met...

Descripción completa

Detalles Bibliográficos
Clasificación:Libro Electrónico
Autor principal: Lapan, Maxim (Autor)
Formato: Electrónico eBook
Idioma:Inglés
Publicado: Birmingham, UK : Packt Publishing, 2018.
Temas:
Acceso en línea:Texto completo
Texto completo

MARC

LEADER 00000cam a2200000 i 4500
001 EBSCO_on1046682461
003 OCoLC
005 20231017213018.0
006 m o d
007 cr unu||||||||
008 180731s2018 enka ob 001 0 eng d
040 |a UMI  |b eng  |e rda  |e pn  |c UMI  |d STF  |d TOH  |d OCLCF  |d EBLCP  |d N$T  |d MERUC  |d ZCU  |d NLE  |d TEFOD  |d CEF  |d UKMGB  |d OCLCQ  |d G3B  |d S9I  |d UAB  |d C6I  |d OCLCQ  |d UX1  |d K6U  |d OCLCQ  |d OCLCO  |d AAA  |d OCLCQ  |d PSYSI  |d OCLCQ  |d OCLCO 
016 7 |a 018936109  |2 Uk 
019 |a 1042318736  |a 1175638157 
020 |a 9781788839303 
020 |a 1788839307 
020 |a 1788834240 
020 |a 9781788834247 
020 |z 9781788834247 
024 3 |a 9781788834247 
029 1 |a AU@  |b 000067301454 
029 1 |a GBVCP  |b 1029873801 
029 1 |a UKMGB  |b 018936109 
035 |a (OCoLC)1046682461  |z (OCoLC)1042318736  |z (OCoLC)1175638157 
037 |a CL0500000982  |b Safari Books Online 
050 4 |a Q325.5 
072 7 |a COM  |x 000000  |2 bisacsh 
082 0 4 |a 006.31  |2 23 
049 |a UAMI 
100 1 |a Lapan, Maxim,  |e author. 
245 1 0 |a Deep reinforcement learning hands-on :  |b apply modern RL methods, with deep Q-networks, value iteration, policy gradients, TRPO, AlphaGo Zero and more /  |c Maxim Lapan. 
264 1 |a Birmingham, UK :  |b Packt Publishing,  |c 2018. 
300 |a 1 online resource (1 volume) :  |b illustrations 
336 |a text  |b txt  |2 rdacontent 
337 |a computer  |b c  |2 rdamedia 
338 |a online resource  |b cr  |2 rdacarrier 
347 |a data file 
588 0 |a Online resource; title from cover (Safari, viewed July 30, 2018). 
500 |a "Expert insight." 
504 |a Includes bibliographical references and index. 
505 0 |a Table of ContentsWhat is Reinforcement Learning?OpenAI GymDeep Learning with PyTorchThe Cross-Entropy MethodTabular Learning and the Bellman EquationDeep Q-NetworksDQN ExtensionsStocks Trading Using RLPolicy Gradients -- An AlternativeThe Actor-Critic MethodAsynchronous Advantage Actor-CriticChatbots Training with RL Web NavigationContinuous Action SpaceTrust Regions -- TRPO, PPO, and ACKTRBlack-Box Optimization in RLBeyond Model-Free -- ImaginationAlphaGo Zero. 
520 |a This book is a practical, developer-oriented introduction to deep reinforcement learning (RL). Explore the theoretical concepts of RL, before discovering how deep learning (DL) methods and tools are making it possible to solve more complex and challenging problems than ever before. Apply deep RL methods to training your agent to beat arcade ... 
590 |a eBooks on EBSCOhost  |b EBSCO eBook Subscription Academic Collection - Worldwide 
590 |a O'Reilly  |b O'Reilly Online Learning: Academic/Public Library Edition 
650 0 |a Reinforcement learning. 
650 0 |a Machine learning. 
650 0 |a Natural language processing (Computer science) 
650 0 |a Artificial intelligence. 
650 2 |a Natural Language Processing 
650 2 |a Artificial Intelligence 
650 2 |a Machine Learning 
650 6 |a Apprentissage par renforcement (Intelligence artificielle) 
650 6 |a Apprentissage automatique. 
650 6 |a Traitement automatique des langues naturelles. 
650 6 |a Intelligence artificielle. 
650 7 |a artificial intelligence.  |2 aat 
650 7 |a COMPUTERS  |x General.  |2 bisacsh 
650 7 |a Artificial intelligence  |2 fast 
650 7 |a Machine learning  |2 fast 
650 7 |a Natural language processing (Computer science)  |2 fast 
650 7 |a Reinforcement learning  |2 fast 
776 0 8 |i Print version:  |a Lapan, Maxim.  |t Deep Reinforcement Learning Hands-On : Apply Modern RL Methods, with Deep Q-Networks, Value Iteration, Policy Gradients, TRPO, AlphaGo Zero and More.  |d Birmingham : Packt Publishing Ltd, ©2018  |z 9781788834247 
856 4 0 |u https://ebsco.uam.elogim.com/login.aspx?direct=true&scope=site&db=nlebk&AN=1837369  |z Texto completo 
856 4 0 |u https://learning.oreilly.com/library/view/~/9781788834247/?ar  |z Texto completo 
938 |a EBL - Ebook Library  |b EBLB  |n EBL5434975 
938 |a EBSCOhost  |b EBSC  |n 1837369 
994 |a 92  |b IZTAP