Projecte llegit
Títol: Uncovering Flight Delay Patterns by using Subgroup Discovery technique
Estudiants que han llegit aquest projecte:
TORREIRO PINTO, ADRIÁN (data lectura: 15-09-2025)- Cerca aquest projecte a Bibliotècnica

Director/a: KULJANIN, JOVANA
Departament: FIS
Títol: Uncovering Flight Delay Patterns by using Subgroup Discovery technique
Data inici oferta: 24-01-2025 Data finalització oferta: 24-09-2025
Estudis d'assignació del projecte:
GR ENG SIST AEROESP
Tipus: Individual | |
Lloc de realització: EETAC | |
Paraules clau: | |
Airline Delay, Cost, Airline Sustainability | |
Descripció del contingut i pla d'activitats: | |
This thesis investigates the phenomenon of flight delays in commercial aviation through the application of advanced data analysis and unsupervised machine learning techniques. Using a large anonymised operational dataset from a European airline, the study develops a multi-stage methodology encompassing data engineering, cleaning and preparation, exploratory statistics, and the application of Subgroup Discovery to uncover hidden patterns of delays.
The research first provides an overview of the operational dataset and the main variables influencing delay performance, such as temporal factors, city-pair concentration and flight duration. Exploratory analysis identifies recurrent trends and contextualises the distribution of delays across years. Subsequently, Subgroup Discovery is applied as an innovative technique to reveal interpretable rules that explain under which conditions delays are more likely to occur, highlighting the impact of operational contexts such as time of day, seasonality and specific route profiles. The findings contribute to a better understanding of the complex drivers of flight delays, offering insights that go beyond aggregate statistics. They also demonstrate how rule-based unsupervised learning methods can support airlines in identifying critical operational scenarios and mitigating inefficiencies. Finally, the study outlines the implications of delay reduction in terms of cost savings, environmental benefits and improved passenger experience, aligning with broader sustainability and competitiveness goals in the aviation sector. |
|
Overview (resum en anglès): | |
This thesis investigates the phenomenon of flight delays in commercial aviation through the application of advanced data analysis and unsupervised machine learning techniques. Using a large anonymized operational dataset from a European airline, the study develops a multi-stage methodology encompassing data engineering, cleaning and preparation, exploratory statistics, and the application of Subgroup Discovery to uncover hidden patterns of delays.
The research first provides an overview of the operational dataset and the main variables influencing delay performance, such as temporal factors, city-pair concentration and flight duration. Exploratory analysis identifies recurrent trends and contextualises the distribution of delays across years. Subsequently, Subgroup Discovery is applied as an innovative technique to reveal interpretable rules that explain under which conditions delays are more likely to occur, highlighting the impact of operational contexts such as time of day, seasonality and specific route profiles. The findings contribute to a better understanding of the complex drivers of flight delays, offering insights that go beyond aggregate statistics. They also demonstrate how rule-based unsupervised learning methods can support airlines in identifying critical operational scenarios and mitigating inefficiencies. Finally, the study outlines the implications of delay reduction in terms of cost savings, environmental benefits and improved passenger experience, aligning with broader sustainability and competitiveness goals in the aviation sector. |