Application of Seq2Seq models for predicting the development of thunderstorm activity to enhance the pilot’s situational awareness in flight
https://doi.org/10.26467/2079-0619-2025-28-1-20-38
Abstract
The paper presents the results of application of Seq2seq models based on neural networks for nowcasting-forecasting with a lead time of up to 2 hours – of thunderstorm activity in order to increase situational awareness of aircraft crews. Various recurrent and convolutional recurrent models were created and trained on the basis of radar meteorological observations of thunderstorm cells. The results showed that convolutional recurrent neural networks (ConvRNN, ConvLSTM, ConvGRU) outperform classical recurrent models and improve the thunderstorm forecast by 25–30% in terms of RMSE (root mean square error) metric compared to the baseline model, which always selects the most recent radar image available at the time of prediction. Nevertheless, despite the fact that the convolution recurrence models can accurately represent the general trend of thunderstorm cloud shape changes, the accuracy of predicting the intensity of thunderstorm cells is usually overestimated. Application of the proposed thunderstorm activity forecasting technology can enhance the situational awareness of the flight crew improving the projection of the current situation into the near future and optimizing the decision-making process for thunderstorm avoidance by providing crew members with predictive information about thunderstorm development on the navigation display screen. Future research is expected to further optimize the model architecture and integrate the predictive technology into flight crew decision support systems.
About the Authors
G. V. KovalenkoRussian Federation
Gennadiy V. Kovalenko, Doctor of Technical Sciences, Professor, Professor of the Chair of Flight Operations and Flight Safety in Civil Aviation
Saint Petersburg
I. A. Yadrov
Russian Federation
Ilya A. Yadrov, Postgraduate Student of the Chair of Flight Operations and Flight Safety in Civil Aviation
Saint Petersburg
References
1. Bolstad, C.A., Riley, J.M. (2002). Using goal directed task analysis with Army brigade officer teams. In: Proceedings of the Human Factors and Ergonomics Society Annual Meeting. SAGE Publications, vol. 46, no. 3, pp. 472–476. DOI: 10.1177/154193120204600354
2. Stanton, N.A., Chambers, P.R.G., Piggott, J. (2001). Situational awareness and safety. Safety science, vol. 39, no. 3, pp. 189–204. DOI: 10.1016/S0925-7535(01)00010-8
3. Sarter, N.B., Woods, D.D. (1991). Situation awareness: A critical but Ill-defined phenomenon. The International Journal of Aviation Psychology, vol. 1, no. 1, pp. 45–57. DOI: 10.1207/s15327108ijap0101_4
4. Strelkov, Yu.K. (2001). Engineering and professional psychology: textbook. Moscow: Vysshaya shkola; Akademiya, 360 p. (in Russian)
5. De Gooijer, J.G., Hyndman, R.J. (2006). 25 years of time series forecasting. International Journal of Forecasting, vol. 22, no. 3, pp. 443–473. DOI: 10.1016/j.ijforecast.2006.01.001
6. Stevenson, S. (2007). A comparison of the forecasting ability of ARIMA models. Journal of Property Investment & Finance, vol. 25, no. 3, pp. 223–240. DOI: 10.1108/14635780710746902
7. Chatfield, C., Koehler, A.B., Ord, J.K., Snyder, R.D. (2001). A new look at models for exponential smoothing. Journal of the Royal Statistical Society: Series D (The Statistician), vol. 50, no. 2, pp. 147–159. DOI: 10.1111/1467-9884.00267
8. Bentéjac, C., Csörgő, A., Martínez-Muñoz, G. (2021). A comparative analysis of gradient boosting algorithms. Artificial Intelligence Review, vol. 54, pp. 1937–1967. DOI: 10.1007/s10462-020-09896-5
9. Faloutsos, C., Gasthaus, J., Januschowski, T., Wang, Y. (2019). Classical and contemporary approaches to big time series forecasting. In: SIGMOD '19: Proceedings of the 2019 International Conference on Management of Data, pp. 2042–2047. DOI: 10.1145/3299 869.3314033
10. Makridakis, S., Spiliotis, E., Assimakopoulos, V. (2018). The M4 Competition: Results, findings, conclusion and way forward. International Journal of Forecasting, vol. 34, issue 4, pp. 802–808. DOI: 10.1016/j.ijforecast.2018.06.001
11. Taieb, S.B., Sorjamaa, A., Bontempi, G. (2010). Multiple-output modeling for multi-step-ahead time series forecasting. Neurocomputing, vol. 73, issue 10–12, pp. 1950–1957. DOI: 10.1016/j.neucom.2009.11.030
12. Sutskever, I., Vinyals, O., Quoc, V.L. (2014). Sequence to sequence learning with neural networks. In: Proceedings of the 27th International Conference on Neural Information Processing Systems, no. 2, pp. 3104–3112. DOI: 10.48550/arXiv.1409.3215
13. Caterini, A.L., Chang, D.E., Caterini, A.L., Chang, D.E. (2018). Recurrent neural networks. In book: Deep Neural Networks in a Mathematical Framework, Springer Briefs in Computer Science. Springer, Cham, pp. 59–79. DOI: 10.1007/978-3-319-75304-1_5
14. Rumelhart, D.E., Hinton, G.E., Williams, R.J. (1986). Learning representations by back-propagating errors. Nature, no. 323, pp. 533–536. DOI: 10.1038/323533a0
15. Rasamoelina, A.D., Adjailia, F., Sinčák, P. (2020). A review of activation function for artificial neural network. In: 2020 IEEE 18th World Symposium on Applied Machine Intelligence and Informatics (SAMI), Herlany, Slovakia, pp. 281–286. DOI: 10.1109/SAMI48414.2020.9108717
16. Toharudin, T., Pontoh, R.S., Caraka, R.E., Zahroh, S., Lee, Y., Chen, R.C. (2023). Employing long short-term memory and Facebook prophet model in air temperature forecasting. Communications in Statistics-Simulation and Computation, vol. 52, issue 2, pp. 279–290. DOI: 10.1080/03610918.2020.1854302
17. Schmidhuber, J., Hochreiter, S. (1997). Long short-term memory. Neural Computation, vol. 9, issue 8, pp. 1735–1780. DOI: 10.1162/neco.1997.9.8.1735
18. De Mulder, W., Bethard, S., Moens, M. F. (2015). A survey on the application of recurrent neural networks to statistical language modeling. Computer Speech & Language, vol. 30, issue 1, pp. 61–98. DOI: 10.1016/j.csl.2014.09.005
19. Shi, X., Chen, Z., Wang, H., Yeung, D.Y., Wong, W.K., Woo, W.C. (2015). Convolutional LSTM network: A machine learning approach for precipitation nowcasting. In: NIPS'15: Proceedings of the 28th International Conference on Neural Information Processing Systems, vol. 1, pp. 802–810. DOI: 10.48550/arXiv.1506.04214
20. Li, Z., Liu, F., Yang, W., Peng, S., Zhou, J. (2021). A survey of convolutional neural networks: analysis, applications, and prospects. In: IEEE transactions on neural networks and learning systems, vol. 33, no. 12, pp. 6999–7019. DOI: 10.1109/TNNLS.2021.3084827
21. Ballas, N., Yao, L., Pal, C.J., Courville, A. (2016). Delving deeper into convolutional networks for learning video representations. In: 4th International Conference on Learning Representations, 2 p. DOI: 10.48550/arXiv.1511.06432 (accessed: 08.10.2024).
22. Mahafza, B.R. (2005). Radar systems analysis and design using MATLAB, 2nd ed. Chapman and Hall, CRC, 638 p. DOI: 10.1201/9781420057072
23. Van Dyk, D.A., Meng, X.L. (2001). The art of data augmentation. Journal of Computational and Graphical Statistics, vol. 10, no. 1, pp. 1–50. DOI: 10.1198/10618600152418584
24. Masters, D., Luschi, C. (2018). Revisiting small batch training for deep neural networks. Computer Science and Machine Learning, pp. 1–18. DOI: 10.48550/arXiv.1804.07612 (accessed: 08.10.2024).
25. Werbos, P.J. (1990). Backpropagation through time: what it does and how to do it. In: Proceedings of the IEEE, vol. 78, no. 10, pp. 1550–1560. DOI: 10.1109/5.58337
26. Llugsi, R., Yacoubi, S.E., Fontaine, A., Lupera, P. (2021). Comparison between Adam, AdaMax and Adam W optimizers to implement a weather forecast based on neural networks for the Andean city of Quito. In: 2021 IEEE Fifth Ecuador Technical Chapters Meeting (ETCM), pp. 1–6. DOI: 10.1109/ETCM53643.2021.9590681
27. Bejani, M.M., Ghatee, M. (2021). A systematic review on overfitting control in shallow and deep neural networks. Artificial Intelligence Review, vol. 54. Pp. 6391–6438. DOI: 10.1007/s10462-021-09975-1
28. Passos, D., Mishra, P. (2022). A tutorial on automatic hyperparameter tuning of deep spectral modelling for regression and classification tasks. Chemometrics and Intelligent Laboratory Systems, no. 223. ID: 104520. DOI: 10.1016/j.chemolab.2022.104520 (accessed: 08.10.2024).
29. Koloskova, A., Hendrikx, H., Stich, S.U. (2023). Revisiting gradient clipping: Stochastic bias and tight convergence guarantees. International Conference on Machine Learning, pp. 17343–17363. DOI: 10.48550/arXiv.2305.01588
30. Endsley, M.R. (1995). Toward a theory of situation awareness in dynamic systems. Human factors, vol. 37, no. 1, pp. 32–64. DOI: 10.1518/001872095779049543
31. Kaikkonen, L., Parviainen, T., Rahikainen, M., Uusitalo, L., Lehikoinen, A. (2021). Bayesian networks in environmental risk assessment: A review. Integrated environmental assessment and management, vol. 17, no. 1, pp. 62–78. DOI: 10.1002/ieam.4332
32. Kovalenko, G.V., Yadrov, I.A., Kuts, K.A. (2023). Intelligent adaptive flight crew decision support system for thunderstorm avoidance. Russian Aeronautics, vol. 66. Pp. 552–559. DOI: 10.3103/S1068799823030170
Review
For citations:
Kovalenko G.V., Yadrov I.A. Application of Seq2Seq models for predicting the development of thunderstorm activity to enhance the pilot’s situational awareness in flight. Civil Aviation High Technologies. 2025;28(1):20-38. (In Russ.) https://doi.org/10.26467/2079-0619-2025-28-1-20-38