Research on the derated power data identification method of a wind turbine based on a multi-Gaussian-discrete joint probability model

Ma, Yuanchi and Liu, Yongqian and Yang, Zhiling and Yan, Jie and Tao, Tao and Infield, David (2022) Research on the derated power data identification method of a wind turbine based on a multi-Gaussian-discrete joint probability model. Sensors, 22 (22). 8891. ISSN 1424-8220 (https://doi.org/10.3390/s22228891)

[thumbnail of Ma-etal-Sensors-2022-Research-on-the-derated-power-data-identification-method]
Preview
Text. Filename: Ma_etal_Sensors_2022_Research_on_the_derated_power_data_identification_method.pdf
Final Published Version
License: Creative Commons Attribution 4.0 logo

Download (6MB)| Preview

Abstract

This paper focuses on how to identify normal, derated power and abnormal data in operation data, which is key to intelligent operation and maintenance applications such as wind turbine condition diagnosis and performance evaluation. Existing identification methods can distinguish normal data from the original data, but usually remove power curtailment data as outliers. A multi-Gaussian-discrete probability distribution model was used to characterize the joint probability distribution of wind speed and power from wind turbine SCADA data, taking the derated power of the wind turbine as a hidden random variable. The maximum expectation algorithm (EM), an iterative algorithm derived from model parameters estimation, was applied to achieve the maximum likelihood estimation of the proposed probability model. According to the posterior probability of the wind-power scatter points, the normal, derated power and abnormal data in the wind turbine SCADA data were identified. The validity of the proposed method was verified by three wind turbine operational data sets with different distribution characteristics. The results are that the proposed method has a degree of universality with regard to derated power operational data with different distribution characteristics, and in particular, it is able to identify the operating data with clustered distribution effectively.