基于零和微分博弈的仿射非线性系统预设时间容错控制

杨朋昕; 张爽; 于欣波

doi:10.13374/j.issn2095-9389.2025.04.18.001

基于零和微分博弈的仿射非线性系统预设时间容错控制

Prescribed-time fault-tolerant control for affine nonlinear systems based on zero-sum differential games

摘要

摘要: 针对一类带有执行器故障的仿射非线性系统，本文提出了一种基于零和微分博弈的预设时间最优容错控制策略. 该方法通过辅助函数构建具有时间以及空间约束性能的状态方程. 基于此状态方程，将控制信号以及偏置故障作为博弈双方，构建微分博弈模型. 结合纳什–庞特里亚金最大最小原理，系统地推导了Hamilton–Jacobi–Isaacs（HJI）方程，以求解鞍点平衡，从而获得最优控制策略和偏差故障的边界值. 为了解决求解高阶偏微分方程时固有的“维数灾难”，基于神经网络技术提出了自适应动态规划算法. 设计的最优容错控制策略可以保证系统在执行器故障的情况下具有预设时间稳定性以及最优性能，并且该预设时间是显性的，可以由用户进行自行调整. 仿真结果表明了本文设计算法的可行性与有效性.

Abstract: A prescribed-time fault-tolerant control strategy based on zero-sum differential games is proposed for affine nonlinear systems with actuator faults. The goal is to address the critical issue that current fault-tolerant control methods fail to balance control optimality with explicit time adjustability. As intelligent systems have become more prevalent in such fields as power grids, high-speed railways, and deep-space and deep-sea exploration, safety concerns arising from actuator faults have become increasingly prominent. Traditional methods, such as fuzzy control, adaptive control, and sliding mode control, ensure system stability but struggle to achieve optimal control performance. Moreover, the convergence time typically depends on the initial state or controller parameters, limiting flexibility for user customization. To address this, a differential game framework using an auxiliary function with time and space constraint characteristics was developed. The fault-tolerant control problem is transformed into an adversarial optimization problem, and neural-network technology is employed to mitigate the “curse of dimensionality.” Ultimately, the system achieves both stability and optimal control performance within the prescribed time. For the affine nonlinear system model, an auxiliary function \xi (t) with time–space constraints is introduced. It transforms the original state into a new state constrained by the prescribed time T_\textp and desired accuracy \varepsilon . When the new state function \xi (t) is bounded, the system state converges to the designated accuracy range within a user-specified time. Based on this, the control signal u_\texts and actuator bias fault u_\text0 are modeled as opposing sides of a zero-sum differential game. Using the Nash–Pontryagin maximum–minimum principle, the Hamilton–Jacobi–Isaacs (HJI) equation is derived to solve for the game equilibrium point, yielding the optimal control strategy and boundary values for the bias fault. To address the curse-of-dimensionality problem in solving the HJI equation, an adaptive dynamic programming framework is constructed using a single-critic neural network, which approximates the optimal cost function through online learning, significantly reducing the algorithm complexity. In addition, a real-time adaptive estimation of the fault coefficient matrix is incorporated to enhance the ability of the system to adapt to multiplicative faults. This method not only explicitly defines the convergence time T_\textp but also synchronously optimizes control performance. A simulation using a two-link robotic manipulator demonstrated that, under actuator faults and bias faults, the system can still converge the tracking error within the desired accuracy in the prescribed time. The system exhibits stable tracking performance across different initial states, controller parameters, and prescribed times. Furthermore, the control torque adjusts rapidly during the fault period, thereby suppressing disturbances. Comparative simulations confirmed that the system convergence time is positively correlated with the prescribed time value and independent of the initial state and controller parameters of the system, thereby validating the effectiveness and superiority of the algorithm. The contributions of this study are threefold. First, a zero-sum differential game framework is employed to design an optimal fault-tolerant control strategy, ensuring both the stability of the affine nonlinear system and optimal control performance. This stability is achieved in finite time, in contrast to the asymptotic stability often discussed in differential game theory, which typically assumes infinite time. Second, unlike algorithms that consider only a single fault, in this study, the coexistence of actuator partial faults and bias faults is accounted for because the extreme values of the bias faults are derived based on the game strategy. Furthermore, to address the curse of dimensionality in solving the HJI equation, a single-critic network is constructed using an adaptive dynamic programming algorithm that is further combined with the estimated values of the coefficient fault matrix for integrated updates, thus reducing the complexity of the control algorithm. Third, unlike finite-time and fixed-time control methods, where the convergence time depends on the initial state or controller parameters of the system, the proposed algorithm enables the user to predesign the convergence time, ensuring that the system state converges within a finite time, unaffected by the initial state or controller parameters.

HTML全文

参考文献(35)

施引文献

资源附件(0)