A machine learning-driven approach to uncover the influencing factors resulting in soil mass displacement

Parasyris, Apostolos and Stankovic, Lina and Stankovic, Vladimir (2024) A machine learning-driven approach to uncover the influencing factors resulting in soil mass displacement. Geosciences, 14 (8). 220. ISSN 2076-3263 (https://doi.org/10.3390/geosciences14080220)

[thumbnail of Parasyris-etal-Gosciences-2024-A-machine-learning-driven-approach-to-uncover-the-influencing-factors-resulting-in-soil-mass-displacement]
Preview
Text. Filename: Parasyris-etal-Gosciences-2024-A-machine-learning-driven-approach-to-uncover-the-influencing-factors-resulting-in-soil-mass-displacement.pdf
Final Published Version
License: Creative Commons Attribution 4.0 logo

Download (2MB)| Preview

Abstract

For most landslides, several destabilising processes act simultaneously, leading to relative sliding along the soil or rock mass surface over time. A number of machine learning approaches have been proposed recently for accurate relative and cumulative landside displacement prediction, but researchers have limited their studies to only a few indicators of displacement. Determining which influencing factors are the most important in predicting different stages of failure is an ongoing challenge due to the many influencing factors and their inter-relationships. In this study, we take a data-driven approach to explore correlations between various influencing factors triggering slope movement to perform dimensionality reduction, then feature selection and extraction to identify which measured factors have the strongest influence in predicting slope movements via a supervised regression approach. Further, through hierarchical clustering of the aforementioned selected features, we identify distinct types of displacement. By selecting only the most effective measurands, this in turn informs the subset of sensors needed for deployment on slopes prone to failure to predict imminent failures. Visualisation of the important features garnered from correlation analysis and feature selection in relation to displacement show that no one feature can be effectively used in isolation to predict and characterise types of displacement. In particular, analysis of 18 different sensors on the active and heavily instrumented Hollin Hill Landslide Observatory in the north west UK, which is several hundred metres wide and extends two hundred metres downslope, indicates that precipitation, atmospheric pressure and soil moisture should be considered jointly to provide accurate landslide prediction. Additionally, we show that the above features from Random Forest-embedded feature selection and Variational Inflation Factor features (Soil heat flux, Net radiation, Wind Speed and Precipitation) are effective in characterising intermittent and explosive displacement.

ORCID iDs

Parasyris, Apostolos, Stankovic, Lina ORCID logoORCID: https://orcid.org/0000-0002-8112-1976 and Stankovic, Vladimir ORCID logoORCID: https://orcid.org/0000-0002-1075-2420;