|
|
|
|
LEADER |
00000nam a22000005i 4500 |
001 |
978-3-642-27645-3 |
003 |
DE-He213 |
005 |
20220116230947.0 |
007 |
cr nn 008mamaa |
008 |
120305s2012 gw | s |||| 0|eng d |
020 |
|
|
|a 9783642276453
|9 978-3-642-27645-3
|
024 |
7 |
|
|a 10.1007/978-3-642-27645-3
|2 doi
|
050 |
|
4 |
|a Q342
|
072 |
|
7 |
|a UYQ
|2 bicssc
|
072 |
|
7 |
|a TEC009000
|2 bisacsh
|
072 |
|
7 |
|a UYQ
|2 thema
|
082 |
0 |
4 |
|a 006.3
|2 23
|
245 |
1 |
0 |
|a Reinforcement Learning
|h [electronic resource] :
|b State-of-the-Art /
|c edited by Marco Wiering, Martijn van Otterlo.
|
250 |
|
|
|a 1st ed. 2012.
|
264 |
|
1 |
|a Berlin, Heidelberg :
|b Springer Berlin Heidelberg :
|b Imprint: Springer,
|c 2012.
|
300 |
|
|
|a XXXIV, 638 p.
|b online resource.
|
336 |
|
|
|a text
|b txt
|2 rdacontent
|
337 |
|
|
|a computer
|b c
|2 rdamedia
|
338 |
|
|
|a online resource
|b cr
|2 rdacarrier
|
347 |
|
|
|a text file
|b PDF
|2 rda
|
490 |
1 |
|
|a Adaptation, Learning, and Optimization,
|x 1867-4542 ;
|v 12
|
505 |
0 |
|
|a Continous State and Action Spaces -- Relational and First-Order Knowledge Representation -- Hierarchical Approaches -- Predictive Approaches -- Multi-Agent Reinforcement Learning -- Partially Observable Markov Decision Processes (POMDPs) -- Decentralized POMDPs (DEC-POMDPs) -- Features and Function Approximation -- RL as Supervised Learning (or batch learning) -- Bounds and complexity -- RL for Games -- RL in Robotics -- Policy Gradient Techniques -- Least Squares Value Iteration -- Models and Model Induction -- Model-based RL -- Transfer Learning in RL -- Using of and extracting Knowledge in RL -- Biological or Psychological Background -- Evolutionary Approaches -- Closing chapter, prospects, future issues.
|
520 |
|
|
|a Reinforcement learning encompasses both a science of adaptive behavior of rational beings in uncertain environments and a computational methodology for finding optimal behaviors for challenging problems in control, optimization and adaptive behavior of intelligent agents. As a field, reinforcement learning has progressed tremendously in the past decade. The main goal of this book is to present an up-to-date series of survey articles on the main contemporary sub-fields of reinforcement learning. This includes surveys on partially observable environments, hierarchical task decompositions, relational knowledge representation and predictive state representations. Furthermore, topics such as transfer, evolutionary methods and continuous spaces in reinforcement learning are surveyed. In addition, several chapters review reinforcement learning methods in robotics, in games, and in computational neuroscience. In total seventeen different subfields are presented by mostly young experts in those areas, and together they truly represent a state-of-the-art of current reinforcement learning research. Marco Wiering works at the artificial intelligence department of the University of Groningen in the Netherlands. He has published extensively on various reinforcement learning topics. Martijn van Otterlo works in the cognitive artificial intelligence group at the Radboud University Nijmegen in The Netherlands. He has mainly focused on expressive knowledge representation in reinforcement learning settings.
|
650 |
|
0 |
|a Computational intelligence.
|
650 |
|
0 |
|a Artificial intelligence.
|
650 |
1 |
4 |
|a Computational Intelligence.
|
650 |
2 |
4 |
|a Artificial Intelligence.
|
700 |
1 |
|
|a Wiering, Marco.
|e editor.
|4 edt
|4 http://id.loc.gov/vocabulary/relators/edt
|
700 |
1 |
|
|a van Otterlo, Martijn.
|e editor.
|4 edt
|4 http://id.loc.gov/vocabulary/relators/edt
|
710 |
2 |
|
|a SpringerLink (Online service)
|
773 |
0 |
|
|t Springer Nature eBook
|
776 |
0 |
8 |
|i Printed edition:
|z 9783642276446
|
776 |
0 |
8 |
|i Printed edition:
|z 9783642446856
|
776 |
0 |
8 |
|i Printed edition:
|z 9783642276460
|
830 |
|
0 |
|a Adaptation, Learning, and Optimization,
|x 1867-4542 ;
|v 12
|
856 |
4 |
0 |
|u https://doi.uam.elogim.com/10.1007/978-3-642-27645-3
|z Texto Completo
|
912 |
|
|
|a ZDB-2-ENG
|
912 |
|
|
|a ZDB-2-SXE
|
950 |
|
|
|a Engineering (SpringerNature-11647)
|
950 |
|
|
|a Engineering (R0) (SpringerNature-43712)
|