Reinforcement Learning Algorithm for Airport Flight Delay Recovery

doi:10.3969/j.issn.1674-0696.2024.09.07

Abstract

Abstract: Flight delays at airports resulted in aircraft and passengers being stranded at the airport, and improper recovery and scheduling of flight delays can exacerbate the losses caused by delays. Aiming to the issue of minimizing losses in flight delay recovery scheduling, a target function was formulated to calculate the total delay loss, a Markov decision-making process was constructed for flight delay recovery, and an airport flight delay recovery rescheduling model was established. To address computational complexity, a deep learning neural network parameterized policy function was employed to parameterize the strategy of reducing the delay loss objective function value, which was trained by the reward function and advantage function. A reinforcement learning algorithm for airport flight delay recovery was proposed. The research results show that the proposed model can reduce the total loss of flight delays by 7.83% and the duration of passenger delays by 7.23%. The proposed deep reinforcement learning algorithm outperforms other algorithms in both time and performance.

Key words: traffic and transportation engineering; flight delay recovery; delay losses; flight rescheduling; Markov decision; deep reinforcement learning

摘要： 机场出现航班延误会导致飞行器和乘客滞留机场，若航班延误恢复调度不当会扩大延误造成的损失。针对航班延误恢复调度的损失最小化问题，设计了延误总损失计算的目标函数，构建航班延误恢复马尔科夫决策过程，建立了机场航班延误恢复重排班模型。为了解决计算的复杂性问题，采用深度学习神经网络参数化策略函数对减小延误损失目标函数值的策略进行参数化，利用奖励函数和优势函数对其进行训练，提出了一种机场航班延误恢复强化学习算法。研究结果表明：该算法能够将航班延误总损失降低7.83%，将旅客延误时长降低7.23%，相比于其他算法，该算法在时间和性能上均取得优势。

关键词: 交通运输工程；航班延误恢复；延误损失；航班重排班；马尔科夫决策；深度强化学习

CLC Number:

U8
TP399

DING Jianli, LIU Dekang. Reinforcement Learning Algorithm for Airport Flight Delay Recovery[J]. Journal of Chongqing Jiaotong University(Natural Science), 2024, 43(9): 50-58.

丁建立，刘德康. 机场航班延误恢复的强化学习算法[J]. 重庆交通大学学报（自然科学版）, 2024, 43(9): 50-58.

References

［1］ ?ＡＦＡＫ ?，ＧüＲＥＬＳ，ＡＫＴüＲＫＭＳ. Integrated aircraft-path assign-ment and robust schedule design with cruise speed control ［J］. Computers & Operations Research, 2017, 84: 127-145.
［2］ WU Chenglung, LAW K. Modelling the delay propagation effects of multiple resource connections in an airline network using a Bayesian network model ［J］. Transportation Research Part E: Logistics and Transportation Review, 2019, 122: 62-77.
［3］何坚, 果红艳, 姚远, 等. 基于有效中转时间预测的不正常航班恢复技术［J］. 北京航空航天大学学报, 2022, 48(3): 384-393.
HE Jian, GUO Hongyan, YAO Yuan, et al. Irregular flight recovery tech-nique based on accurate transit time prediction ［J］. Journal of Beijing University of Aeronautics and Astronautics, 2022, 48(3): 384-393.
［4］丁建立, 王新茹, 徐涛. 航班延误恢复调度的混合粒子群算法［J］. 交通运输工程学报, 2008, 8(2): 90-95.
DING Jianli, WANG Xinru, XU Tao. Hybrid particle swarm optimi-zation arithmetic for recovery scheduling of flight delays ［J］. Journal of Traffic and Transportation Engineering, 2008, 8(2): 90-95.
［5］ XU Bo. An efficient Ant Colony algorithm based on wake-vortex mode-ling method for aircraft scheduling problem ［J］. Journal of Computa-tional and Applied Mathematics, 2017, 317: 157-170.
［6］ IKLI S, MANCEL C, MONGEAU M, et al. An optimistic planning app-roach for the aircraft landing problem ［C］∥ Air Traffic Management and Systems IV: Selected Papers of the 6th ENRI International Workshop on ATM/CNS (EIWAC2019). Singapore: Springer, 2021: 173-188.
［7］ MUNOS R. From bandits to Monte-Carlo tree search: The optimistic pri-nciple applied to optimization and planning ［J］. Foundations and Trends in Machine Learning, 2014, 7(1): 1-129.
［8］ SOARES I B, DE HAUWERE Y M, JANUARIUS K, et al. Departure management with a reinforcement learning approach: Respecting CFMU slots ［C］ ∥2015 IEEE 18th International Conference on Intelligent Transportation Systems. IEEE, 2015: 1169-1176.
［9］李亚飞, 吴庆顺, 徐明亮, 等. 基于强化学习的舰载机保障作业实时调度方法［J］. 中国科学: 信息科学, 2021, 51(2): 247-262.
LI Yafei, WU Qingshun, XU Mingliang, et al. Real-time scheduling for carrier-borne aircraft support operations: A reinforcement learning approach ［J］. Scientia Sinica (Informationis), 2021, 51(2): 247-262.
［10］赵秀丽, 朱金福, 郭梅. 不正常航班延误调度模型及算法［J］. 系统工程理论与实践, 2008, 28(4): 129-134.
ZHAO Xiuli, ZHU Jinfu, GUO Mei. Study on modelling and algorithm of irregular flight delay operation ［J］. Systems Engineering-Theory & Practice, 2008, 28(4): 129-134.
［11］ GLOROT X, BORDES A, BENGIO Y. Deep sparse rectifier neural networks ［C］∥The 14th International Conference on Artificial Intelli-gence and Statistics (AISTATS), Fort Lauderdale, Florida, USA，2011: 315-323.
［12］ MNIH V, KAVUKCUOGLU K, SILVER D, et al. Playing Atari with deep reinforcement learning ［C］ ∥Proceedings of the NIPS Workshop on Deep Learning. Lake Tahoe: MIT Press, 2013.
［13］梁星星，冯旸赫，马扬，等. 多Agent深度强化学习综述［J］.自动化学报, 2020, 46(12): 2537-2557
LIANG Xingxing, FENG Yanghe, MA Yang, et al. Deep multi-agent reinforcement learning: A survey ［J］. Acta Automatica Sinica, 2020, 46(12): 2537-2557.
［14］ KINGA D, ADAM J B. A method for stochastic optimization ［C］∥ The 3rd International Conference on Learning Representations (ICLR), San Diego, CA, USA, 2015.
［15］ MNIH V, BADIA A P, MIRZA M, et al. Asynchronous methods for deep reinforcement learning ［C］∥ The 33rd International Conference on Machine Learning. New York, USA, 2016: 1928-1937.
［16］ MARINI F, WALCZAK B. Particle swarm optimization (PSO): A tutorial ［J］. Chemometrics and Intelligent Laboratory Systems, 2015, 149: 153-165.
［17］ NIU Huimin, ZHOU Xuesong. Optimizing urban rail timetable under time-dependent demand and oversaturated conditions ［J］. Transpor-tation Research Part C: Emerging Technologies, 2013, 36: 212-230.

[1]	ZHU Jinfu1, MA Ruixin1,2, PENG Anna1, YAN Chen1. Flight Schedule Optimization in Multi-airport System Based on Particle Swarm Optimization Algorithm [J]. Journal of Chongqing Jiaotong University(Natural Science), 2021, 40(09): 1-8.
[2]	CAO Jianqiu, XU Peng,ZHANG Guangyan. Path Planning of UAV Based on Hybrid Ant Colony Algorithm under Greedy Strategy [J]. Journal of Chongqing Jiaotong University(Natural Science), 2021, 40(09): 9-16.
[3]	JIN Huibin1, HU Zhanyao2, YU Guihua2. Pilots Attention Allocation Behavior in Single-Engine Failure Scenario [J]. Journal of Chongqing Jiaotong University(Natural Science), 2021, 40(04): 1-5.
[4]	LI Nan1, JIAO Qingyu1, ZHANG Liandong2, FAN Rui1. Taxi Time Prediction of Departure Aircraft [J]. Journal of Chongqing Jiaotong University(Natural Science), 2021, 40(03): 1-6.
[5]	LI Nan, QIANG Yigeng, FAN Rui. Aircraft Flight Trajectory Clustering Based on Trajectory Compression [J]. Journal of Chongqing Jiaotong University(Natural Science), 2021, 40(01): 1-6.
[6]	ZHAO Guihong1, QIN Zhen1, LI Jianfu2. Optimization Algorithm and Implementation of Dispatched Vehicles between Several Flights in Condition of Flights Delay [J]. Journal of Chongqing Jiaotong University(Natural Science), 2020, 39(10): 5-9.
[7]	CHU Yanchang, CHEN Feichao, HOU Yunyan. Coordinated Development of Civil Aviation in Beijing, Tianjin and Hebei Based on the Synergy Model of Composite System [J]. Journal of Chongqing Jiaotong University(Natural Science), 2020, 39(10): 18-23.
[8]	LIU Lingli1, YU Meichen1, WANG Yue1, YUN Tianyu2. Applicability of Revised SCQ-MD Scale in Airport Pilots [J]. Journal of Chongqing Jiaotong University(Natural Science), 2020, 39(09): 46-46.
[9]	WANG Lili, LIU Ziang. Reroute Planning for Parallel Route [J]. Journal of Chongqing Jiaotong University(Natural Science), 2020, 39(08): 45-50.
[10]	CHU Yanchang, CHEN Feichao. Evaluation of Operation Efficiency of China’s Airport Industry Based on Super-efficiency DEA-Malmquist Model [J]. Journal of Chongqing Jiaotong University(Natural Science), 2019, 38(12): 115-122.
[11]	ZHANG Zhaoning,CHEN Weibo. Evaluation of Terminal Area Utilization Rate Based on Entropy Method [J]. Journal of Chongqing Jiaotong University(Natural Science), 2019, 38(11): 98-103.
[12]	WANG Jiening1,2, ZHANG Congjun1,2, FANG Xiaodan1,2 , SUN Xiaomeng1,2. LEADSTO Dynamic Simulation Analysis on ATC Pressure Generation Process [J]. Journal of Chongqing Jiaotong University(Natural Science), 2019, 38(10): 116-120.
[13]	LIU Jixin, ZENG Xiaoyu, YIN Minjia, ZHU Xuehua. Stress Level Prediction of Controller Based on Cumulative Logistic Regression Model [J]. Journal of Chongqing Jiaotong University(Natural Science), 2019, 38(03): 97-102.
[14]	ZHANG Zhaoning，CAO Yueqi. Model of Route Short-Time Utilization Rate Based on Flight Level [J]. Journal of Chongqing Jiaotong University(Natural Science), 2018, 37(08): 107-111.
[15]	QIN Rui1，SHI Yaqi2，WANG Mingke3. Spatial Distribution Programming Method of ADS-B Ground Station Oriented to Low Altitude Flight Safety [J]. Journal of Chongqing Jiaotong University(Natural Science), 2018, 37(07): 100-105.