Discrete-Time Self-Learning Parallel Control
Wei, Qinglai [1,2,3]; Wang, Lingxiao [1,2,3]; Lu, Jingwei [1,2,3]; Wang, Fei-Yue [1,2,3]
Journal: IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS
Year: 2022
Volume: 52; Issue: 1; Pages: 192-204
Keywords: Optimal control; Nonlinear systems; Time-varying systems; Performance analysis; Complex systems; Biological neural networks; ACP; adaptive dynamic programming (ADP); approximate dynamic programming; nonlinear systems; optimal control; parallel control; reinforcement learning
ISSN: 2168-2216
DOI: 10.1109/TSMC.2020.2995646
Corresponding author: Wei, Qinglai (qinglai.wei@ia.ac.cn)
Abstract: In this article, a new self-learning parallel control method, based on the adaptive dynamic programming (ADP) technique, is developed for solving the optimal control problem of discrete-time time-varying nonlinear systems. It aims to obtain an approximate optimal control law sequence while simultaneously guaranteeing the convergence of the value function. After the time-varying artificial system is established by neural networks over a certain time horizon, a control-sequence-improvement ADP algorithm is developed to obtain the control law sequence. For the first time, criteria for the parallel execution are presented, under which the value function is proven to converge to a finite neighborhood of the optimal performance index function. Finally, numerical results and analysis are presented to demonstrate the effectiveness of the parallel control method.
Funding: National Natural Science Foundation of China [61722312]; National Natural Science Foundation of China [61533017]; National Key Research and Development Program of China [2018YFB1702300]
WOS keywords: TRACKING CONTROL; SYSTEMS
WOS research areas: Automation & Control Systems; Computer Science
Language: English
Publisher: IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
WOS accession number: WOS:000731147700025
Funding organizations: National Natural Science Foundation of China; National Key Research and Development Program of China
Content type: Journal article
Source URL: http://ir.ia.ac.cn/handle/173211/47136
Collection: Institute of Automation_State Key Laboratory of Management and Control for Complex Systems_Advanced Control and Automation Team
Author affiliations:
1. Univ Chinese Acad Sci, Sch Artificial Intelligence, Beijing 100049, Peoples R China
2. Qingdao Acad Intelligent Ind, Parallel Intelligence Innovat Ctr, Qingdao 266109, Peoples R China
3. Chinese Acad Sci, Inst Automat, State Key Lab Management & Control Complex Syst, Beijing 100190, Peoples R China
Recommended citation
GB/T 7714: Wei, Qinglai, Wang, Lingxiao, Lu, Jingwei, et al. Discrete-Time Self-Learning Parallel Control[J]. IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2022, 52(1): 192-204.
APA: Wei, Qinglai, Wang, Lingxiao, Lu, Jingwei, & Wang, Fei-Yue. (2022). Discrete-Time Self-Learning Parallel Control. IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 52(1), 192-204.
MLA: Wei, Qinglai, et al. "Discrete-Time Self-Learning Parallel Control." IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS 52.1 (2022): 192-204.