Journal article · Preprint article
Application of data clustering to railway delay pattern recognition
Department of Management Engineering, Technical University of Denmark1
Transport DTU, Department of Management Engineering, Technical University of Denmark2
Transport Modelling, Department of Management Engineering, Technical University of Denmark3
Department of Applied Mathematics and Computer Science, Technical University of Denmark4
Statistics and Data Analysis, Department of Applied Mathematics and Computer Science, Technical University of Denmark5
Management Science, Department of Management Engineering, Technical University of Denmark6
Operations Management, Management Science, Department of Management Engineering, Technical University of Denmark7
K-means clustering is employed to identify recurrent delay patterns on a high traffic railway line north of Copenhagen, Denmark. The clusters identify behavioral patterns in the very large (“big data”) data sets generated automatically and continuously by the railway signal system. The results reveal where corrective actions are necessary, showing where recurrent delay patterns take place.
Delay profiles and delay-change profiles are generated from timestamps to compare different train runs, and to partition the set of observations into groups of similar elements. K-means clustering can identify and discriminate different patterns affecting the same stations, which is otherwise difficult in previous approaches based on visual inspection.
Classical methods of univariate analysis do not reveal these patterns. The demonstrated methodology is scalable and can be applied to any system of transport.
Language: | English |
---|---|
Publisher: | Journal of Advanced Transportation |
Year: | 2018 |
Pages: | 1-18 |
ISSN: | 20423195 and 01976729 |
Types: | Journal article and Preprint article |
DOI: | 10.1155/2018/6164534 |
ORCIDs: | Cerreto, Fabrizio , Nielsen, Bo Friis , Nielsen, Otto Anker and Harrod, Steven |