Data augmentation to improve the performance of ensemble learning for system failure prediction with limited observations

Shi, Guo and Liu, Bin and Walls, Lesley; (2022) Data augmentation to improve the performance of ensemble learning for system failure prediction with limited observations. In: 2022 13th International Conference on Reliability, Maintainability, and Safety (ICRMS). 13th International Conference on Reliability, Maintainability, and Safety: Reliability and Safety of Intelligent Systems, ICRMS 2022 . IEEE, Piscataway, NJ, pp. 296-300. ISBN 9781665486903 (https://doi.org/10.1109/ICRMS55680.2022.9944577)

[thumbnail of Shi-etal-ICRMS-2022-Data-augmentation-to-improve-the-performance-of-ensemble-learning]
Preview
Text. Filename: Shi_etal_ICRMS_2022_Data_augmentation_to_improve_the_performance_of_ensemble_learning.pdf
Accepted Author Manuscript
License: Strathprints license 1.0

Download (3MB)| Preview

Abstract

Ensemble learning has been widely used to improve the performance and robustness of machine learning algorithms on time series data. However, in real operational processes where the observed data is limited, it hinders the capability of ensemble learning algorithms. To address the challenge of limited observed data, this paper proposes a novel three-layer ensemble learning framework by use of data augmentation. Firstly, multiple classical time series augmentation methods are applied to increase the size of the data set. Subsequently, after pre-processing, these augmented data is trained by multiple basic learners with K-fold cross-validation as the first layer of the developed ensemble learning framework. The outputs of the first layer are integrated via LASSO to further improve the prediction performance, which serves as the second layer of the developed framework. Finally, the third-layer output is generated by averaging the prediction of the second layer and the output from an improved Long-Short Term Memory model that provides prediction based on the augmented data. A case study on a real wastewater treatment plant is used to illustrate the effectiveness of the proposed method.

ORCID iDs

Shi, Guo, Liu, Bin ORCID logoORCID: https://orcid.org/0000-0002-3946-8124 and Walls, Lesley ORCID logoORCID: https://orcid.org/0000-0001-7016-9141;