Computational & Technology Resources
an online resource for computational,
engineering & technology publications
Civil-Comp Conferences
ISSN 2753-3239
CCC: 7
PROCEEDINGS OF THE SIXTH INTERNATIONAL CONFERENCE ON RAILWAY TECHNOLOGY: RESEARCH, DEVELOPMENT AND MAINTENANCE
Edited by: J. Pombo
Paper 11.11

Freight Train Derailment Severity Estimation using Clustering and Machine Learning Techniques

Z. Saghian1 and M. Bagheri2

1School of Industrial Engineering, Iran University of Science and Technology, Tehran, Iran
2School of Railway Engineering, Iran University of Science and Technology, Tehran, Iran

Full Bibliographic Reference for this paper
Z. Saghian, M. Bagheri, "Freight Train Derailment Severity Estimation using Clustering and Machine Learning Techniques", in J. Pombo, (Editor), "Proceedings of the Sixth International Conference on Railway Technology: Research, Development and Maintenance", Civil-Comp Press, Edinburgh, UK, Online volume: CCC 7, Paper 11.11, 2024, doi:10.4203/ccc.7.11.11
Keywords: train derailment, clustering, machine learning, freight train, k-nearest neighbors, support vector machines, random forest, gradient boosting.

Abstract
This study aims to estimate freight train derailment severity using the U.S. FRA rail accident database spanning from 1997 to 2023. After preprocessing, which included data cleaning and normalization, the dataset comprised 3967 records. The data was split into training (80%) and testing (20%) sets. Using the NBclust function in the R programming environment, optimal clustering for causes was determined, resulting in four clusters based on specific criteria. Each cluster was analyzed using four machine learning techniques: K-Nearest Neighbors (KNN), Support Vector Machines (SVM), Random Forest, and Gradient Boosting. The results were aggregated based on cluster weights derived from the clustering process. Performance evaluation metrics, including RMSE and Accuracy, were used to assess the models. The findings indicate that all classifiers performed well, with KNN demonstrating superior performance, achieving an accuracy of 92.36% and an RMSE of 0.26. Additionally, the proposed model's average accuracy of 91.53% outperforms the previous benchmark study, which reported an average accuracy of 79.56%. These results suggest that the proposed model is effective in estimating derailment severity and can be a valuable tool for railway safety management.

download the full-text of this paper (PDF, 10 pages, 543 Kb)

go to the previous paper
return to the table of contents
return to the volume description