Adaptive Dynamic Programming for Discrete-Time Zero-Sum Games

Abstract

In this paper, a novel adaptive dynamic programming (ADP) algorithm, called "iterative zero-sum ADP algorithm," is developed to solve infinite-horizon discrete-time two-player zero-sum games of nonlinear systems. The present iterative zero-sum ADP algorithm permits arbitrary positive semidefinite functions to initialize the upper and lower iterations. A novel convergence analysis is developed to guarantee the upper and lower iterative value functions to converge to the upper and lower optimums, respectively. When the saddle-point equilibrium exists, it is emphasized that both the upper and lower iterative value functions are proved to converge to the optimal solution of the zero-sum game, where the existence criteria of the saddle-point equilibrium are not required. If the saddle-point equilibrium does not exist, the upper and lower optimal performance index functions are obtained, respectively, where the upper and lower performance index functions are proved to be not equivalent. Finally, simulation results and comparisons are shown to illustrate the performance of the present method.

References

Page 1

	Year	Citations
L/sub 2/-gain analysis of nonlinear systems and nonlinear state-feedback H/sub infinity / control Arjan van der Schaft IEEE Transactions on Automatic Control Nonlinear ControlNonlinear AnalogEngineeringSmooth Nonlinear SystemsLyapunov Analysis	1992	1.6K
H/sup ∞/-0ptimal Control and Related Minimax Design Problems: A Dynamic Game Approach Tamer Başar, P. Bernhard IEEE Transactions on Automatic Control Mathematical ProgrammingControl TheoryEngineeringRobust ControlH/sup ∞/-0Ptimal Control	1996	1.1K
Discrete-Time Nonlinear HJB Solution Using Approximate Dynamic Programming: Convergence Proof A. Al-Tamimi, Frank L. Lewis, Murad Abu-Khalaf IEEE Transactions on Systems Man and Cybernetics Part B (Cybernetics) Mathematical ProgrammingNumerical AnalysisNonlinear ControlOptimal ControlEngineering	2008	1.1K
Reinforcement Learning and Feedback Control: Using Natural Decision Methods to Design Optimal Adaptive Controllers Frank L. Lewis, Draguna Vrabie, Kyriakos G. Vamvoudakis IEEE Control Systems Artificial IntelligenceOptimal ControlEngineeringOptimal Control PoliciesMathematical Control Theory	2012	1.1K
Computational adaptive optimal control for continuous-time linear systems with completely unknown dynamics Yu Jiang, Zhong‐Ping Jiang Automatica Nonlinear ControlUnknown DynamicsRobust ControlMathematical Control TheoryProcess Control	2012	1K
Online learning control by association and reinforcement Jennie Si, Yu-tsung Wang IEEE Transactions on Neural Networks Artificial IntelligenceGeneric OnlineCognitive ScienceEngineeringMachine Learning	2001	773
Data-Driven Robust Approximate Optimal Tracking Control for Unknown General Nonlinear Systems Using Adaptive Dynamic Programming Method Huaguang Zhang, Lili Cui, Xin Zhang, IEEE Transactions on Neural Networks Nonlinear ControlUnknown System DynamicsAdp MethodRobust ControlMathematical Control Theory	2011	627
Multi-agent differential graphical games: Online adaptive learning solution for synchronization with optimality Kyriakos G. Vamvoudakis, Frank L. Lewis, Greg Hudas Automatica Artificial IntelligenceDifferential GameEngineeringStochastic GameGame Theory	2012	541
A Novel Infinite-Time Optimal Tracking Control Scheme for a Class of Discrete-Time Nonlinear Systems via the Greedy HDP Iteration Algorithm Huaguang Zhang, Qinglai Wei, Yanhong Luo IEEE Transactions on Systems Man and Cybernetics Part B (Cybernetics) Nonlinear ControlPerformance IndexOptimal Tracking ProblemIteration AlgorithmLearning Control	2008	483
Near-Optimal Control for Nonzero-Sum Differential Games of Continuous-Time Nonlinear Systems Using Single-Network ADP Huaguang Zhang, Lili Cui, Yanhong Luo IEEE Transactions on Cybernetics Differential GameNonlinear ControlEngineeringDynamic OptimizationOptimal Control Policies	2012	452

Page 1