MODDPG-NSGA2双层架构:面向动态场景的物流无人机多目标优化方法

Research on Key Technologies of Logistics Unmanned Aerial Vehicle Routes Based on the Double-layer MODDPG-NSGA2 Architecture

  • 摘要: 为有效解决物流无人机货物运输中路径长度、时效达标率、能耗及多约束条件下的多目标优化难题,针对传统NSGA2算法在动态场景中缺乏“环境变化感知-约束动态整合-策略实时调整”的闭环能力,本研究提出MODDPG-NSGA2(Multi Objective Deep Deterministic Policy Gradient-Non-dominated Sorting Genetic Algorithm)双层架构算法。上层NSGA2算法采用改进的非支配排序策略,结合精英保留机制,构建多目标优化模型,生成覆盖路径长度、时效达标率和能耗的全局初始帕累托最优解集。下层MODDPG算法实时感知动态环境订单、负载均衡等状态信息,通过深度神经网络逼近状态-动作价值函数,根据环境变化动态调整路径策略,实现局部动态环境的重规划。双层架构的深度交互协同,解决了局部失衡与短视的问题。实验表明,对比传统 NSGA2 算法,MODDPG-NSGA2算法突发场景完成任务时效达标率提升了24.9%,配送路径和能耗均降低13.4%以上,多目标协同能力提升效果较为明显。为了进一步验证实验的鲁棒性,引入5种经典的多目标优化算法进一步对比,结果显示该算法在路径、时效及能耗优化上比均值高了23.48%以上,跨模态优化能力更优,为复杂城市环境中提升物流无人机运输效率、降低成本及增强多目标动态环境适应性具有重要意义,也为多目标优化算法在动态复杂系统中的应用提供了新的思路。

     

    Abstract: In order to effectively solve the multi-objective optimization problems of path length, time compliance, energy consumption and multi-constraints in logistics UAV cargo transportation, and to address the lack of closed-loop capability of “environment change perception - constraints dynamic integration-real-time adjustment of policy” of the traditional NSGA2 algorithm in dynamic scenarios, the present study proposes MODDPG-NSGA2(Multi Objective Deep Deterministic Policy), which is a multi-objective optimization algorithm for logistics UAV cargo transportation. NSGA2 (Multi Objective Deep Deterministic Policy In this study, we propose MODDPG-NSGA2 (Multi Objective Deep Deterministic Policy Gradient-Non-dominated Sorting Genetic Algorithm) two-layer architecture algorithm. The upper layer NSGA2 algorithm adopts an improved non-dominated sorting policy combined with an elite retention mechanism to construct a multi-objective optimization model to generate a global initial Pareto-optimal solution set covering path lengths, time-to-compliance rates and energy consumption. The lower layer MODDPG algorithm senses the dynamic environment orders, load balancing and other state information in real time, approximates the state-action value function through deep neural network, and dynamically adjusts the path strategy according to the environmental changes to realize the replanning of the local dynamic environment. The deep interactive synergy of the two-layer architecture solves the problem of local imbalance and short-sightedness. The experiments show that compared with the traditional NSGA2 algorithm, the MODDPG-NSGA2 algorithm bursty scenario completes the task time compliance rate by 24.9%, the distribution path and energy consumption are reduced by more than 13.4%, and the effect of multi-objective synergy ability enhancement is more obvious. In order to further verify the robustness of the experiment, five classical multi-objective optimization algorithms are introduced for further comparison, and the results show that the algorithm is more than 23.48% higher than the average value in the optimization of path, time and energy consumption, and the cross-modal optimization ability is better, which is of great significance for improving the efficiency of logistics UAV transportation in complex urban environments, reducing the cost, and enhancing the multi-objective dynamic environment adaptability, and it also provides an opportunity for the application of the multi-objective optimization algorithms in dynamic complex systems. It also provides a new idea for the application of multi-objective optimization algorithms in dynamic complex systems.

     

/

返回文章
返回