中文核心期刊
CSCD来源期刊
中国科技核心期刊
RCCSE中国核心学术期刊

Journal of Chongqing Jiaotong University(Natural Science) ›› 2024, Vol. 43 ›› Issue (9): 50-58.DOI: 10.3969/j.issn.1674-0696.2024.09.07

• Transportation+Big Data & Artificial Intelligence • Previous Articles    

Reinforcement Learning Algorithm for Airport Flight Delay Recovery

DING Jianli, LIU Dekang   

  1. (School of Computer Science and Technology, Civil Aviation University of China, Tianjin 300300, China)
  • Received:2023-10-10 Revised:2024-03-05 Published:2024-09-25

机场航班延误恢复的强化学习算法

丁建立,刘德康   

  1. (中国民航大学 计算机科学与技术学院, 天津 300300)
  • 作者简介:丁建立(1963—),男,河南洛阳人,教授,博士,主要从事智能仿生算法及民航应用、民航大数据及智能信息服务等方面的研究。E-mail:jlding@cauc.edu.cn 通信作者:刘德康(1999—),男,山东济宁人,硕士,主要从事航班延误恢复、强化学习方面的研究。E-mail:2021052071@cauc.edu.cn
  • 基金资助:
    国家自然基金民航联合研究基金重点支持项目(U2033205, U2233214)

Abstract: Flight delays at airports resulted in aircraft and passengers being stranded at the airport, and improper recovery and scheduling of flight delays can exacerbate the losses caused by delays. Aiming to the issue of minimizing losses in flight delay recovery scheduling, a target function was formulated to calculate the total delay loss, a Markov decision-making process was constructed for flight delay recovery, and an airport flight delay recovery rescheduling model was established. To address computational complexity, a deep learning neural network parameterized policy function was employed to parameterize the strategy of reducing the delay loss objective function value, which was trained by the reward function and advantage function. A reinforcement learning algorithm for airport flight delay recovery was proposed. The research results show that the proposed model can reduce the total loss of flight delays by 7.83% and the duration of passenger delays by 7.23%. The proposed deep reinforcement learning algorithm outperforms other algorithms in both time and performance.

Key words: traffic and transportation engineering; flight delay recovery; delay losses; flight rescheduling; Markov decision; deep reinforcement learning

摘要: 机场出现航班延误会导致飞行器和乘客滞留机场,若航班延误恢复调度不当会扩大延误造成的损失。针对航班延误恢复调度的损失最小化问题,设计了延误总损失计算的目标函数,构建航班延误恢复马尔科夫决策过程,建立了机场航班延误恢复重排班模型。为了解决计算的复杂性问题,采用深度学习神经网络参数化策略函数对减小延误损失目标函数值的策略进行参数化,利用奖励函数和优势函数对其进行训练,提出了一种机场航班延误恢复强化学习算法。研究结果表明:该算法能够将航班延误总损失降低7.83%,将旅客延误时长降低7.23%,相比于其他算法,该算法在时间和性能上均取得优势。

关键词: 交通运输工程;航班延误恢复;延误损失;航班重排班;马尔科夫决策;深度强化学习

CLC Number: