Return to search

Unsupervised Machine-Learning Applications in Seismology

Catalogs of seismic source parameters (hypocenter locations, origin times, and magnitudes) are vital for studying various Earth processes, greatly enhancing our understanding of the nature of seismic events, the structure of the Earth, and the dynamics of fault systems. Modern seismic analyses utilize supervised machine learning (ML) to build enhanced catalogs based on millions of examples of analyst-picked phase-arrivals in waveforms, yet the ability to characterize the time-varying spectral content of the waveforms underlying those catalogs remains lacking. Unsupervised machine learning (UML) methods provide powerful tools for inferring patterns from musical spectrograms with little a priori information, yet has been relatively underutilized in the field of seismology.

In this thesis, I leverage advanced tools from UML to analyze the temporal spectral content of large sets of spectrograms generated by different mechanisms in two distinct geologic settings: icequakes and tremors at Gornergletscher (a Swiss temperate glacier) and repeating earthquakes from a 10-km-long creeping segment of the San Andreas Fault. The core algorithm in this work, now known as Spectral Unsupervised Feature Extraction, or SpecUFEx, extracts time-varying frequency patterns from spectrograms and reduces them into low-dimensionality fingerprints via a combination of non-negative matrix factorization and hidden Markov Modeling (Holtzman et al. 2018), optimized for large data sets via stochastic variational inference.

This work describes the SpecUFEx algorithm and the suite of preprocessing, clustering, and visualization tools developed to create an UML workflow, SpecUFEx+, that is widely-accessible and applicable for many seismic settings. I apply theSpecUFEx+ workflow to single- and multi-station seismic data from Gornergletscher, and demonstrate how some fingerprint-clusters track diurnal tremor related to subglacial water flow, while others correspond to the onset of the subglacial and englacial components of a glacial lake outburst flood.

I also discover periods of harmonic tremor localized near the ice-bed interface that may be related to glacial stick-slip sliding. I additionally apply the SpecUFEx+ workflow to earthquakes on the San Andreas Fault to unveil far more repeating earthquake sequences than previously inferred, leading to enhanced slip-rate estimates at seismogenic depths and providing a more detailed image of seismic gaps along the fault interface. Unsupervised feature extraction is a novel tool to the field of seismology. This work demonstrates how scientific insight can be gained through the characterization of the spectral-temporal patterns of large seismic datasets within an UML-framework.

Identiferoai:union.ndltd.org:columbia.edu/oai:academiccommons.columbia.edu:10.7916/6745-zs49
Date January 2024
CreatorsSawi, Theresa
Source SetsColumbia University
LanguageEnglish
Detected LanguageEnglish
TypeTheses

Page generated in 0.002 seconds