Rail break prediction and cause analysis using imbalanced in-service train data
More Info
expand_more
Abstract
Timely detection and identification of rail breaks are crucial for safety and reliability of railway networks. This paper proposes a new deep learning-based approach using the daily monitoring data from in-service trains. A time-series generative adversarial network (TimeGAN) is employed to mitigate the problem of data imbalance and preserve the temporal dynamics for generating synthetic rail breaks. A feature-level attention-based bidirectional recurrent neural networks (AM-BRNN) is proposed to enhance feature extraction and capture two-direction dependencies in sequential data for accurate prediction. The proposed approach is implemented on a three-year dataset collected from a section of railroads (up to 350 km) in Australia. A real-life validation is carried out to evaluate the prediction performance of the proposed model, where historical data is used to train the model and future ’unseen’ rail breaks along the whole track section are used for testing. The results show that the model can successfully predict 9 out of 11 rail breaks three months ahead of time with a false prediction of non-break of 8.2%. Predicting rail breaks three months ahead of time will provide railroads enough time for maintenance planning. Given the prediction results, SHAP method is employed to perform cause analysis for individual rail break. The results of cause analysis can assist railroads to plan appropriate maintenance to prevent rail breaks.