论文标题
地震:基于机器学习的可扩展的地震监视工作流程与云计算
QuakeFlow: A Scalable Machine-learning-based Earthquake Monitoring Workflow with Cloud Computing
论文作者
论文摘要
地震监测工作流程旨在检测地震信号并确定连续波形数据的源特性。深度学习地震学的最新发展已被用来改善地震监测工作流程中的任务,这些工作流程允许快速,准确地检测到比常规目录中所存在的更小事件。为了促进机器学习算法在大批量地震记录中的应用,我们开发了一种基于云的地震监测工作流程,地震,该质量质量流,该杂物流了多个处理步骤,以从原始地震数据中生成地震目录。 Quakeflow使用深度学习模型Phasenet进行挑选P/S阶段,并使用机器学习模型Gamma进行相关,以近似地震位置和大小。质量流中的每个组件都是容器的,可以通过新的深度学习/机器学习模型以及添加新组件(例如地震重定位算法)的能力直接更新管道。我们在Kubernetes中构建了地震,以使其用于大型数据集自动尺度,并使其易于在云平台上部署,从而实现大规模并行处理。我们使用地质流从波多黎各处理了三年的连续存档数据,并发现了与以前已知的地震性相同的结构上发生的十倍以上的事件。我们将地质流应用于监测夏威夷频繁的地震,发现比标准目录中的事件多,包括许多阐明岩浆系统深层结构的事件。我们还添加了Kafka和Spark流,以提供实时地震监测结果。地质流是改善实时地震监测和采矿归档地震数据集的有效方法。
Earthquake monitoring workflows are designed to detect earthquake signals and to determine source characteristics from continuous waveform data. Recent developments in deep learning seismology have been used to improve tasks within earthquake monitoring workflows that allow the fast and accurate detection of up to orders of magnitude more small events than are present in conventional catalogs. To facilitate the application of machine-learning algorithms to large-volume seismic records, we developed a cloud-based earthquake monitoring workflow, QuakeFlow, that applies multiple processing steps to generate earthquake catalogs from raw seismic data. QuakeFlow uses a deep learning model, PhaseNet, for picking P/S phases and a machine learning model, GaMMA, for phase association with approximate earthquake location and magnitude. Each component in QuakeFlow is containerized, allowing straightforward updates to the pipeline with new deep learning/machine learning models, as well as the ability to add new components, such as earthquake relocation algorithms. We built QuakeFlow in Kubernetes to make it auto-scale for large datasets and to make it easy to deploy on cloud platforms, which enables large-scale parallel processing. We used QuakeFlow to process three years of continuous archived data from Puerto Rico, and found more than a factor of ten more events that occurred on much the same structures as previously known seismicity. We applied Quakeflow to monitoring frequent earthquakes in Hawaii and found over an order of magnitude more events than are in the standard catalog, including many events that illuminate the deep structure of the magmatic system. We also added Kafka and Spark streaming to deliver real-time earthquake monitoring results. QuakeFlow is an effective and efficient approach both for improving realtime earthquake monitoring and for mining archived seismic data sets.