Random forest based optimal feature selection for partial discharge pattern recognition in HV cables

Peng, Xiaosheng and Li, Jinshu and Wang, Ganjun and Wu, Yiijiang and Li, Lee and Li, Zhaohui and Ahmed Bhatti, Ashfaque and Zhou, Chengke and Hepburn, Donald M. and Reid, Alistair J and Judd, Martin D. and Siew, W. H. (2019) Random forest based optimal feature selection for partial discharge pattern recognition in HV cables. IEEE Transactions on Power Delivery. ISSN 0885-8977 (In Press)

[thumbnail of Peng-etal-IEEE-TOPD-2019-Random-forest-based-optimal-feature-selection-for-partial]
Preview
Text. Filename: Peng_etal_IEEE_TOPD_2019_Random_forest_based_optimal_feature_selection_for_partial.pdf
Accepted Author Manuscript

Download (1MB)| Preview

Abstract

Optimal selection of features of Partial Discharge (PD) signals recorded from defects in High Voltage (HV) cables will contribute not only to the improvement of PD pattern recognition accuracy and efficiency but also to PD parameter visualization in HV cable condition monitoring and diagnostics. This paper presents a novel Random Forest (RF)-based feature selection algorithm for PD pattern recognition of HV cables. The algorithm is applied to feature selection of both PD signals and interference signals with the aim of obtaining the optimal features for data processing. Firstly, the experimental data acquisition and feature extraction processes are introduced. PD signals were captured from faults created in a cable to obtain the raw PD data, then a set of 3500 transient PD pulses and a set of 3500 typical interference pulses were extracted, based on which 34 PD features were extracted for further processing. Furthermore, 119 two-dimensional features and 1082 three-dimensional features were generated. The paper then discusses the basic principle of the RF algorithm. Finally, RF-based feature selection was implemented to determine the optimal features for PD pattern recognition. The results were obtained and evaluated with the Back Propagation Neural Network (BPNN) and Support Vector Machine (SVM). Results show that the proposed RF-based method is effective for PD feature selection of HV cables with the potential for application to additional HV power apparatus.