的个人主页 http://faculty.nuaa.edu.cn/cyh3/zh_CN/index.htm
点击次数:
所属单位:自动化学院
发表刊物:INTERNATIONAL JOURNAL OF CONTROL AUTOMATION AND SYSTEMS
关键字:Adaptive dynamic programming attitude control system multi-mission constraints on-orbit reconfiguration reinforcement learning
摘要:For the on-orbit reconfiguration problem of spacecraft attitude control systems under multi-mission constraints, the idea of a reinforcement-learning algorithm is adopted, and an adaptive dynamic programming algorithm for on-orbit reconfiguration decision-making that is based on a dual optimization index is proposed. Two optimization objectives, total mission reward and total control cost (energy consumption), are defined to obtain the optimal reconfiguration policy of the spacecraft attitude control system reconfiguration, and the on-orbit reconfiguration model for multi-mission constraints is established. Then, based on the Bellman optimality principle, the optimal reconfiguration policy formulated by the discrete HJB equation is obtained. Since the HJB equation is difficult to solve accurately, a method of bi-objective adaptive dynamic programming is proposed to obtain the optimal reconfiguration policy. This method constructs a mission network and an energy network. The method then adopts a Q-learning-based algorithm to train the networks to estimate the values of total mission reward and total control cost to achieve the on-orbit optimal reconfiguration decision under multi-mission constraints. Simulation results for different cases demonstrate the validity and rationality of the proposed method.
ISSN号:1598-6446
是否译文:否
发表时间:2019-04-01
合写作者:姜斌,Li, Huan,黄现礼,Han, Xiao-dong
通讯作者:程月华,姜斌