Temporal Sequence Modeling for Rare Failure Prediction in Industrial Machinery Using a Hybrid CNN-LSTM Model - UTU Research Portal

A4 Refereed article in a conference publication

Temporal Sequence Modeling for Rare Failure Prediction in Industrial Machinery Using a Hybrid CNN-LSTM Model

Authors: Faraz, Mehdi; Shubina, Viktoriia; Mäkilä, Tuomas; Heikkonen, Jukka

Editors: N/A

Conference name: International Conference on System Reliability and Safety

Publication year: 2025

Book title : 2025 9th International Conference on System Reliability and Safety (ICSRS)

First page : 277

Last page: 286

ISBN: 979-8-3315-4953-4

eISBN: 979-8-3315-4952-7

DOI: https://doi.org/10.1109/ICSRS68021.2025.11422078

Publication's open availability at the time of reporting: No Open Access

Publication channel's open availability : No Open Access publication channel

Web address : https://ieeexplore.ieee.org/document/11422078

Self-archived copy’s web address: https://research.utu.fi/converis/portal/detail/Publication/515771401

Self-archived copy's version: Final draft

Abstract

Predictive maintenance is vital for enhancing industrial machinery reliability by detecting rare failures in high-dimensional, imbalanced sensor data. This study analyzes a dataset of 220320 samples, with failure events (BROKEN) constituting only 0.003%. Initial state classification using autoencoders and ensemble classifiers (Random Forest, Extreme Gradient Boosting) failed to detect these rare failures due to extreme class imbalance. To address this, we developed a hybrid Convolutional Neural Network-Long Short-Term Memory (CNN-LSTM) model for temporal sequence modeling, utilizing convolutional layers for spatial feature extraction and LSTM layers for capturing temporal dependencies within 6-sample windows, classified as Normal or Transient (failure precursors). Main contributions include a robust feature selection method for imbalanced datasets, a comprehensive comparative analysis of state classification, temporal modeling, and unsupervised anomaly detection (Isolation Forest, One-Class Support Vector Machine), insights into temporal failure precursors, and an evaluation framework using Leave-One-Out Cross-Validation (LOOCV) to ensure robust assessment of rare events, balancing accuracy and early detection. The hybrid CNN-LSTM model achieved 97.88% overall accuracy, detecting 6 out of 7 failure cases with 85.71% recall, though precision was 30% due to false positives. Compared to baseline CNN (4.82% precision, 57.14% recall), LSTM (1.02% precision, 28.57% recall), Isolation Forest (50.00% precision, 14.29% recall), and One-Class SVM (8.00% precision, 28.57% recall), the hybrid model significantly improved rare failure detection by effectively capturing spatial and temporal patterns. These results highlight the efficacy of the CNN-LSTM approach with LOOCV for proactive maintenance, offering substantial improvements in industrial reliability and safety for real-world applications with extreme class imbalance.

Index Terms—Predictive Maintenance, Convolutional Neural Network, Long Short-Term Memory, Industrial Machinery, Anomaly Detection, Rare Failure Detection

Funding information in the publication:
We gratefully acknowledge the support of Business Finland (Funding Decision No. 6819/31/2023), which made this work possible.