参考文献
[1] Werbos P J. Advanced Forecasting Methods for Global Crisis Warning and Models of Intelligence[J].General Systems Yearbook,1977,22(6):25-38.
[2] Richard. Dynamic Programming[M].Princeton University Press,1957.
[3] Wei Q,Liu D,Lin H. Value Iteration Adaptive Dynamic Programming for Optimal Control of Discrete-Time Nonlinear Systems[J].IEEE Transactions on Cybernetics,2016,46(3):840-853.
[4] Wang D,Liu D,Zhang Q,et al. Data-Based Adaptive Critic Designs for Nonlinear Robust Optimal Control With Uncertain Dynamics[J].IEEE Transactions on Systems Man & Cybernetics Systems,2016,46(11):1544- 1555.
[5] Wang Z,Ding S,Huang Z,et al. Exponential Stability and Stabilization of Delayed Memristive Neural Networks Based on Quadratic Convex Combination Method[J].IEEE Transactions on Neural Networks & Learning Systems,
2015,27(11):2337-2350.
[6] Sun Q,Zhang Y,He H,et al. A Novel Energy Function-Based Stability Evaluation and Nonlinear Control Approach for Energy Internet[J].IEEE Transactions on Smart Grid,2017,8(3):1195-1210.
[7] Xu X,Huang Z,Zuo L,et al. Manifold-Based Reinforcement Learning via Locally Linear Reconstruction[J]. IEEE Transactions on Neural Networks & Learning Systems,2017,28(4):934-947.
[8] Cai H,Lewis F L,Hu G,et al. The Adaptive Distributed Observer Approach to the Cooperative Output Regulation of Linear Multi-Agent Systems[J].Automatica,2017,75:299-305.
[9] Nasirian V,Shafiee Q,Guerrero J M,et al. Droop-Free Distributed Control for AC Microgrids[J].IEEE Transactions on Power Electronics,2016,31(2):1600-1617.
[10] Sahoo A,Xu H,Jagannathan S. Adaptive Neural Network-Based Event-Triggered Control of Single-Input Single- Output Nonlinear Discrete-Time Systems[J].IEEE Transactions on Neural Networks & Learning Systems, 2016,27(1):151-164.
[11] Narayanan V,Jagannathan S. Event-Triggered Distributed Approximate Optimal State and Output Control of Affine Nonlinear Interconnected Systems[J].IEEE Transactions on Neural Networks & Learning Systems,2017,PP(99): 1-11.
[12] Lin Q,Wei Q,Liu D. A Novel Optimal Tracking Control Scheme for a Class of Discrete-Time Nonlinear Systems Using Generalised Policy Iteration Adaptive Dynamic Programming Algorithm[J].International Journal of Systems Science,2017,48(3):1-10.
[13] Zhu Y,Zhao D,He H,et al. Event-Triggered Optimal Control for Partially-Unknown Constrained-Input Systems via Adaptive Dynamic Programming[J].IEEE Transactions on Industrial Electronics,2016,PP(99):1-1.
[14] Liu D,Wei Q. Finite-Approximation-Error-Based Optimal Control Approach for Discrete-Time Nonlinear Systems
[J].IEEE Transactions on Cybernetics,2012,43(2):779-789.
[15] Wei Q,Song R,Yan P. Data-Driven Zero-Sum Neuro-Optimal Control for a Class of Continuous-Time Unknown Nonlinear Systems With Disturbance Using ADP[J].IEEE Transactions on Neural Networks & Learning Systems, 2016,27(2):444-458.
[16] Song R,Lewis F L,Wei Q. Off-Policy Integral Reinforcement Learning Method to Solve Nonlinear Continuous- Time Multiplayer Nonzero-Sum Game[s J].IEEE Transactions on Neural Networks & Learning Systems,2016(99): 1-10.
[17] Luo B,Liu D,Huang T,et al. Model-Free Optimal Tracking Control via Critic-Only Q-Learning[J].IEEE Transactions on Neural Networks & Learning Systems,2016,27(10):2134-2144.
[18] Zhang H,Jiang H,Luo Y,et al. Data-Driven Optimal Consensus Control for Discrete-Time Multi-Agent Systems With Unknown Dynamics Using Reinforcement Learning Method[J].IEEE Transactions on Industrial Electronics,2017,64(5):4091-4100.
[19] Yan J,He H,Zhong X,et al. Q-Learning-Based Vulnerability Analysis of Smart Grid Against Sequential Topology Attacks[J].IEEE Transactions on Information Forensics & Security,2017,12(1):200-210.
[20] Gao W,Jiang Z P,Ozbay K. Data-Driven Adaptive Optimal Control of Connected Vehicles[J].IEEE Transactions on Intelligent Transportation Systems,2017,18(5):1122-1133.
[21] Zhang Q,Zhao D,Wang D. Event-Based Robust Control for Uncertain Nonlinear Systems Using Adaptive Dynamic Programming[J].IEEE Transactions on Neural Networks & Learning Systems,2016,PP(99):1-14.
[22] Dong L,Sun C,He H. Dual heuristic dynamic programming based event-triggered control for nonlinear continuous-time systems[C]// International Joint Conference on Neural Networks. 2016:4241-4248.
[23] Wang D,He H,Liu D. Improving the Critic Learning for Event-Based Nonlinear H ∞ Control Design[J].IEEE Transactions on Cybernetics,2017:1-12.
[24] Wang D,Mu C,Zhang Q,et al. Event-based input-constrained nonlinear H ∞ state feedback with adaptive critic and neural implementation[J].Neurocomputing,2016,214:848-856.
[25] Wang D,He H,Zhong X,et al. Event-Driven Nonlinear Discounted Optimal Regulation Involving A Power System Application[J].IEEE Transactions on Industrial Electronics,2017,PP(99):1-10.
[26] Cui X,Zhang H,Luo Y,et al. Adaptive Dynamic Programming for H ∞ Tracking Design of Uncertain Nonlinear Systems with Disturbances and Input Constraints[J].International Journal of Adaptive Control & Signal Processing,2017(5).
[27] Liu Y,Zhang H,Luo Y,et al. ADP Based Optimal Tracking Control for A Class of Linear Discrete-Time System with Multiple Delays[J].Journal of the Franklin Institute,2016,353(9):2117-2136.
[28] Zhang K,Zhang H,Jiang H,et al. Data-driven Optimal Control for a Class of Unknown Continuous-Time Nonlinear System Using a Novel ADP Method[C]// International Conference on Intelligent Control & Information Processing. IEEE,2017:117-124.
[29] Qu Q,Zhang H,Feng T,et al. Decentralized Adaptive Tracking Control Scheme for Nonlinear Large-Scale Interconnected Systems via Adaptive Dynamic Programming[J].Neurocomputing,2017,225:1-10.
[30] Jiang H,Zhang H,Liu Y,et al. Neural-Network-Based Control Scheme for a Class of Nonlinear Systems with Actuator Faults via Data-Driven Reinforcement Learning Method[J].Neurocomputing,2017,239:1-8.
[31] Wei Q,Lewis F L,Liu D,et al. Discrete-Time Local Value Iteration Adaptive Dynamic Programming: Convergence Analysis[J].IEEE Transactions on Systems Man & Cybernetics Systems,2016(99):1-17.
[32] Wei Q,Lewis F L,Sun Q,et al. Discrete-Time Deterministic $Q$ -Learning:A Novel Convergence Analysis[J]. IEEE Transactions on Cybernetics,2016,47(5):1224-1237.
[33] Wei Q,Liu D,Lin Q,et al. Adaptive Dynamic Programming for Discrete-Time Zero-Sum Games[J].IEEE Transactions on Neural Networks & Learning Systems,2017(99):1-13.
[34] Wei Q,Liu D,Lewis F L,et al. Mixed Iterative Adaptive Dynamic Programming for Optimal Battery Energy Control in Smart Residential Microgrids[J].IEEE Transactions on Industrial Electronics,2017,64(5): 4110-4120.
[35] Zhao B,Liu D,Li Y. Observer Based Adaptive Dynamic Programming for Fault Tolerant Control of a Class of Nonlinear Systems[J].Information Sciences,2016,384:21-33.
[36] Huang Y,Wang D,Liu D. Bounded Robust Control Design for Uncertain Nonlinear Systems Using Single-Network Adaptive Dynamic Programming[J].Neurocomputing,2017,266:128-140.
[37] Luo B,Liu D,Wu H N,et al. Policy Gradient Adaptive Dynamic Programming for Data-Based Optimal Control[J]. IEEE Transactions on Cybernetics,2017(99):1-14.
[38] Jiang Y,Jiang Z P. Global Adaptive Dynamic Programming for Continuous-Time Nonlinear Systems[J].IEEE Transactions on Automatic Control,2015,60(11):2917-2929.
[39] Zhang S,Xiong R. Adaptive Energy Management of a Plug-In Hybrid Electric Vehicle Based on Driving Pattern Recognition and Dynamic Programming[J].Applied Energy,2015,155:68-78.
[40] Gao W,Jiang Y,Jiang Z P,et al. Output-feedback Adaptive Optimal Control of Interconnected Systems Based on Robust Adaptive Dynamic Programming[J].Automatica,2016,72:37-45.
[41] Xie S,Zhong W,Xie K,et al. Fair Energy Scheduling for Vehicle-to-Grid Networks Using Adaptive Dynamic Programming[J].IEEE Transactions on Neural Networks & Learning Systems,2016,27(8):1697-1707.
[42] Zhong X,Ni Z,He H. Convergence Analysis of GrDHP-based Optimal Control for Discrete-Time Nonlinear System[C]// International Joint Conference on Neural Networks. 2016:4557-4564.
[43] Mu C,Tang Y,He H. Improved Sliding Mode Design for Load Frequency Control of Power System Integrated an Adaptive Learning Strategy[J].IEEE Transactions on Industrial Electronics,2017,64(8):6742-6751.
[44] Wang D,He H,Mu C,et al. Intelligent Critic Control With Disturbance Attenuation for Affine Dynamics Including an Application to a Microgrid System[J].IEEE Transactions on Industrial Electronics,2017,64(6):4935- 4944.
[45] Song R,Lewis F L,Wei Q. Off-Policy Integral Reinforcement Learning Method to Solve Nonlinear Continuous- Time Multiplayer Nonzero-Sum Game[s J].IEEE Transactions on Neural Networks & Learning Systems,2016(99): 1-10.
[46] Modares H,Nageshrao S P,Lopes G A D,et al. Optimal model-free output synchronization of heterogeneous systems using off-policy reinforcement learning[J].Automatica,2016,71(C):334-341.