Natural Hazards and Earth System Sciences | 2021

Near-real-time automated classification of seismic signals of slope failures with continuous random forests

 
 
 
 
 

Abstract


Abstract. In mountainous areas, rockfalls, rock avalanches, and debris flows constitute a risk to human life and property. Seismology has proven a useful tool to monitor such mass movements, while increasing data volumes and availability of real-time data streams demand new solutions for automatic signal classification. Ideally, seismic monitoring arrays have large apertures and record a significant number of mass movements to train detection algorithms. However, this is rarely the case, as a result of cost and time constraints and the rare occurrence of catastrophic mass movements. Here, we use the supervised random forest algorithm to classify windowed seismic data on a continuous data stream. We investigate algorithm performance for signal classification into noise\xa0(NO), slope failure\xa0(SF), and earthquake\xa0(EQ) classes and explore the influence of non-ideal though commonly encountered conditions: poor network coverage, imbalanced data sets, and low signal-to-noise ratios\xa0(SNRs). To this end we use data from two separate locations in the Swiss Alps: data set\xa0(i), recorded at Illgraben, contains signals of several dozen slope failures with low SNR; data set\xa0(ii), recorded at Pizzo Cengalo, contains only five slope failure events albeit with higher SNR. The low SNR of slope failure events in data set\xa0(i) leads to a classification accuracy of 70\u2009% for\xa0SF, with the largest confusion between\xa0NO and\xa0SF. Although data set\xa0(ii) is highly imbalanced, lowering the prediction threshold for slope failures leads to a prediction accuracy of 80\u2009% for\xa0SF, with the largest confusion between\xa0SF and\xa0EQ. Standard techniques to mitigate training data imbalance do not increase prediction accuracy. The classifier of data set\xa0(ii) is then used to train a model for the classification of 176\u2009d of continuous seismic recordings containing four slope failure events. The model classifies eight events as slope failures, of which two are snow avalanches, and one is a rock-slope failure. The other events are local or regional earthquakes. By including earthquake detection of a permanent seismic station at 131\u2009km distance to the test site into the decision-making process, all earthquakes falsely classified as slope failures can be excluded. Our study shows that, even for limited training data and non-optimal network geometry, machine learning algorithms applied to high-quality seismic records can be used to monitor mass movements automatically.

Volume 21
Pages 339-361
DOI 10.5194/NHESS-21-339-2021
Language English
Journal Natural Hazards and Earth System Sciences

Full Text