Cargando…

Reinforcement learning and dynamic programming using function approximators /

From household appliances to applications in robotics, engineered systems involving complex dynamics can only be as effective as the algorithms that control them. While Dynamic Programming (DP) has provided researchers with a way to optimally solve decision and control problems involving complex dyn...

Descripción completa

Detalles Bibliográficos
Clasificación:Libro Electrónico
Otros Autores: Busoniu, Lucian
Formato: Electrónico eBook
Idioma:Inglés
Publicado: Boca Raton, FL : CRC Press, [2010]
Colección:Automation and control engineering.
Temas:
Acceso en línea:Texto completo

MARC

LEADER 00000cam a2200000 i 4500
001 EBOOKCENTRAL_ocn666378166
003 OCoLC
005 20240329122006.0
006 m o d
007 cr |n|||||||||
008 100928s2010 flua ob 001 0 eng d
010 |z  2010010207 
040 |a CUS  |b eng  |e rda  |e pn  |c CUS  |d N$T  |d EBLCP  |d IL4J6  |d OCLCQ  |d E7B  |d CDX  |d IDEBK  |d OCLCQ  |d DEBSZ  |d OCLCQ  |d OCLCF  |d YDXCP  |d CRCPR  |d OCLCQ  |d CDN  |d OCLCQ  |d PIFBY  |d OTZ  |d OCLCQ  |d MERUC  |d UAB  |d OCLCQ  |d ERL  |d CEF  |d NLE  |d AU@  |d OCLCQ  |d UKMGB  |d WYU  |d YDX  |d LEAUB  |d OCLCQ  |d UHL  |d LOA  |d UKAHL  |d OCLCQ  |d VT2  |d SFB  |d OCLCO  |d OCLCQ  |d OCLCO  |d OCLCQ  |d OCLCL 
015 |a GBB7B0042  |2 bnb 
016 7 |a 018392675  |2 Uk 
019 |a 669515938  |a 680628488  |a 712994361  |a 741351085  |a 991896883  |a 994951696  |a 1031041668  |a 1065934179  |a 1122511267  |a 1129361127  |a 1135541711  |a 1228600550  |a 1260357290 
020 |a 9781439821091 
020 |a 1439821097 
020 |z 9781439821084  |q (hardcover ;  |q alk. paper) 
020 |z 1439821089  |q (hardcover ;  |q alk. paper) 
020 |a 9781315217932  |q (electronic bk.) 
020 |a 1315217937  |q (electronic bk.) 
029 1 |a AU@  |b 000065168825 
029 1 |a DEBSZ  |b 372814433 
029 1 |a DEBSZ  |b 430895453 
029 1 |a DEBSZ  |b 449208834 
029 1 |a DEBSZ  |b 454894848 
029 1 |a HEBIS  |b 228931878 
029 1 |a NZ1  |b 13761393 
029 1 |a UKMGB  |b 018392675 
035 |a (OCoLC)666378166  |z (OCoLC)669515938  |z (OCoLC)680628488  |z (OCoLC)712994361  |z (OCoLC)741351085  |z (OCoLC)991896883  |z (OCoLC)994951696  |z (OCoLC)1031041668  |z (OCoLC)1065934179  |z (OCoLC)1122511267  |z (OCoLC)1129361127  |z (OCoLC)1135541711  |z (OCoLC)1228600550  |z (OCoLC)1260357290 
037 |a TANDF_211565  |b Ingram Content Group 
050 4 |a TJ223.M53  |b R44 2010 
072 7 |a TEC  |x 004000  |2 bisacsh 
082 0 4 |a 629.8/9  |2 22 
084 |a SK 880  |2 rvk 
084 |a ST 304  |2 rvk 
049 |a UAMI 
245 0 0 |a Reinforcement learning and dynamic programming using function approximators /  |c Lucian Buðsoniu [and others]. 
264 1 |a Boca Raton, FL :  |b CRC Press,  |c [2010] 
300 |a 1 online resource (xiii, 270 pages) :  |b illustrations 
336 |a text  |b txt  |2 rdacontent 
337 |a computer  |b c  |2 rdamedia 
338 |a online resource  |b cr  |2 rdacarrier 
347 |a data file  |2 rda 
490 1 |a Automation and control engineering 
504 |a Includes bibliographical references and index. 
505 0 |a ch. 1. Introduction -- ch. 2. An introduction to dynamic programming and reinforcement learning -- ch. 3. Dynamic programming and reinforcement learning in large and continuous spaces -- ch. 4. Approximate value iteration with a fuzzy representation -- ch. 5. Approximate policy iteration for online learning and continuous-action control -- ch. 6. Approximate policy search with cross-entropy optimization of basis functions. 
588 0 |a Print version record. 
520 |a From household appliances to applications in robotics, engineered systems involving complex dynamics can only be as effective as the algorithms that control them. While Dynamic Programming (DP) has provided researchers with a way to optimally solve decision and control problems involving complex dynamic systems, its practical value was limited by algorithms that lacked the capacity to scale up to realistic problems. However, in recent years, dramatic developments in Reinforcement Learning (RL), the model-free counterpart of DP, changed our understanding of what is possible. Those dev. 
590 |a ProQuest Ebook Central  |b Ebook Central Academic Complete 
650 0 |a Digital control systems. 
650 0 |a Dynamic programming. 
650 6 |a Commande numérique. 
650 6 |a Programmation dynamique. 
650 7 |a TECHNOLOGY & ENGINEERING  |x Automation.  |2 bisacsh 
650 7 |a Digital control systems  |2 fast 
650 7 |a Dynamic programming  |2 fast 
700 1 |a Busoniu, Lucian. 
758 |i has work:  |a Reinforcement learning and dynamic programming using function approximators (Work)  |1 https://id.oclc.org/worldcat/entity/E39PD3yHPq6QtxppxP86x7k8P3  |4 https://id.oclc.org/worldcat/ontology/hasWork 
776 0 8 |i Print version:  |t Reinforcement learning and dynamic programming using function approximators.  |d Boca Raton, FL : CRC Press, ©2010  |z 9781439821084  |w (DLC) 2010010207  |w (OCoLC)406174370 
830 0 |a Automation and control engineering. 
856 4 0 |u https://ebookcentral.uam.elogim.com/lib/uam-ebooks/detail.action?docID=589872  |z Texto completo 
938 |a Askews and Holts Library Services  |b ASKH  |n AH33084637 
938 |a Askews and Holts Library Services  |b ASKH  |n AH24133798 
938 |a Coutts Information Services  |b COUT  |n 16838932 
938 |a CRC Press  |b CRCP  |n CRC0KE10992PDF 
938 |a ProQuest Ebook Central  |b EBLB  |n EBL589872 
938 |a ebrary  |b EBRY  |n ebr10419897 
938 |a EBSCOhost  |b EBSC  |n 339010 
938 |a ProQuest MyiLibrary Digital eBook Collection  |b IDEB  |n 290296 
938 |a YBP Library Services  |b YANK  |n 15930057 
938 |a YBP Library Services  |b YANK  |n 3492747 
994 |a 92  |b IZTAP