|
|
|
|
LEADER |
00000cam a2200000Ma 4500 |
001 |
EBOOKCENTRAL_on1225551708 |
003 |
OCoLC |
005 |
20240329122006.0 |
006 |
m o d |
007 |
cr cnu---unuuu |
008 |
201205s2020 xx o 000 0 eng d |
040 |
|
|
|a EBLCP
|b eng
|c EBLCP
|d N$T
|d EBLCP
|d N$T
|d OCLCO
|d OCLCF
|d OCLCO
|d OCLCQ
|d OCLCO
|d OCLCL
|
020 |
|
|
|a 1119698995
|
020 |
|
|
|a 9781119698999
|q (electronic bk.)
|
035 |
|
|
|a (OCoLC)1225551708
|
050 |
|
4 |
|a Q325.6
|
082 |
0 |
4 |
|a 006.3/1
|2 23
|
049 |
|
|
|a UAMI
|
100 |
1 |
|
|a Sadhu, Arup Kumar.
|
245 |
1 |
0 |
|a Multi-Agent Coordination
|h [electronic resource] :
|b A Reinforcement Learning Approach.
|
260 |
|
|
|a Newark :
|b John Wiley & Sons, Incorporated,
|c 2020.
|
300 |
|
|
|a 1 online resource (321 p.).
|
490 |
1 |
|
|a Wiley - IEEE Ser.
|
500 |
|
|
|a Description based upon print version of record.
|
505 |
0 |
|
|a Cover -- Title Page -- Copyright Page -- Contents -- Preface -- Acknowledgments -- Chapter 1 Introduction: Multi-agent Coordination by Reinforcement Learning and Evolutionary Algorithms -- 1.1 Introduction -- 1.2 Single Agent Planning -- 1.2.1 Terminologies Used in Single Agent Planning -- 1.2.2 Single Agent Search-Based Planning Algorithms -- 1.2.2.1 Dijkstra's Algorithm -- 1.2.2.2 A* (A-star) Algorithm -- 1.2.2.3 D* (D-star) Algorithm -- 1.2.2.4 Planning by STRIPS-Like Language -- 1.2.3 Single Agent RL -- 1.2.3.1 Multiarmed Bandit Problem -- 1.2.3.2 DP and Bellman Equation
|
505 |
8 |
|
|a 1.2.3.3 Correlation Between RL and DP -- 1.2.3.4 Single Agent Q-Learning -- 1.2.3.5 Single Agent Planning Using Q-Learning -- 1.3 Multi-agent Planning and Coordination -- 1.3.1 Terminologies Related to Multi-agent Coordination -- 1.3.2 Classification of MAS -- 1.3.3 Game Theory for Multi-agent Coordination -- 1.3.3.1 Nash Equilibrium -- 1.3.3.2 Correlated Equilibrium -- 1.3.3.3 Static Game Examples -- 1.3.4 Correlation Among RL, DP, and GT -- 1.3.5 Classification of MARL -- 1.3.5.1 Cooperative MARL -- 1.3.5.2 Competitive MARL -- 1.3.5.3 Mixed MARL -- 1.3.6 Coordination and Planning by MAQL
|
505 |
8 |
|
|a 1.3.7 Performance Analysis of MAQL and MAQL-Based Coordination -- 1.4 Coordination by Optimization Algorithm -- 1.4.1 PSO Algorithm -- 1.4.2 Firefly Algorithm -- 1.4.2.1 Initialization -- 1.4.2.2 Attraction to Brighter Fireflies -- 1.4.2.3 Movement of Fireflies -- 1.4.3 Imperialist Competitive Algorithm -- 1.4.3.1 Initialization -- 1.4.3.2 Selection of Imperialists and Colonies -- 1.4.3.3 Formation of Empires -- 1.4.3.4 Assimilation of Colonies -- 1.4.3.5 Revolution -- 1.4.3.6 Imperialistic Competition -- 1.4.4 Differential Evolution Algorithm -- 1.4.4.1 Initialization -- 1.4.4.2 Mutation
|
505 |
8 |
|
|a 1.4.4.3 Recombination -- 1.4.4.4 Selection -- 1.4.5 Off-line Optimization -- 1.4.6 Performance Analysis of Optimization Algorithms -- 1.4.6.1 Friedman Test -- 1.4.6.2 Iman-Davenport Test -- 1.5 Summary -- References -- Chapter 2 Improve Convergence Speed of Multi-Agent Q-Learning for Cooperative Task Planning -- 2.1 Introduction -- 2.2 Literature Review -- 2.3 Preliminaries -- 2.3.1 Single Agent Q-learning -- 2.3.2 Multi-agent Q-learning -- 2.4 Proposed MAQL -- 2.4.1 Two Useful Properties -- 2.5 Proposed FCMQL Algorithms and Their Convergence Analysis -- 2.5.1 Proposed FCMQL Algorithms
|
505 |
8 |
|
|a 2.5.2 Convergence Analysis of the Proposed FCMQL Algorithms -- 2.6 FCMQL-Based Cooperative Multi-agent Planning -- 2.7 Experiments and Results -- 2.8 Conclusions -- 2.9 Summary -- 2.A More Details on Experimental Results -- 2.A.1 Additional Details of Experiment 2.1 -- 2.A.2 Additional Details of Experiment 2.2 -- 2.A.3 Additional Details of Experiment 2.4 -- References -- Chapter 3 Consensus Q-Learning for Multi-agent Cooperative Planning -- 3.1 Introduction -- 3.2 Preliminaries -- 3.2.1 Single Agent Q-Learning -- 3.2.2 Equilibrium-Based Multi-agent Q-Learning -- 3.3 Consensus
|
590 |
|
|
|a ProQuest Ebook Central
|b Ebook Central Academic Complete
|
650 |
|
0 |
|a Reinforcement learning.
|
650 |
|
0 |
|a Multiagent systems.
|
650 |
|
6 |
|a Apprentissage par renforcement (Intelligence artificielle)
|
650 |
|
6 |
|a Systèmes multiagents (Intelligence artificielle)
|
650 |
|
7 |
|a Multiagent systems
|2 fast
|
650 |
|
7 |
|a Reinforcement learning
|2 fast
|
700 |
1 |
|
|a Konar, Amit.
|
758 |
|
|
|i has work:
|a Multi-Agent Coordination (Text)
|1 https://id.oclc.org/worldcat/entity/E39PCFxfTp9BP6CCMTg6ccVqDC
|4 https://id.oclc.org/worldcat/ontology/hasWork
|
776 |
0 |
8 |
|i Print version:
|a Sadhu, Arup Kumar
|t Multi-Agent Coordination : A Reinforcement Learning Approach
|d Newark : John Wiley & Sons, Incorporated,c2020
|z 9781119699033
|
830 |
|
0 |
|a Wiley - IEEE Ser.
|
856 |
4 |
0 |
|u https://ebookcentral.uam.elogim.com/lib/uam-ebooks/detail.action?docID=6413914
|z Texto completo
|
938 |
|
|
|a ProQuest Ebook Central
|b EBLB
|n EBL6413914
|
938 |
|
|
|a EBSCOhost
|b EBSC
|n 2692266
|
994 |
|
|
|a 92
|b IZTAP
|