Paper Title
Leveraging User Access Patterns and Advanced Cyberinfrastructure to Accelerate Data Delivery from Shared-use Scientific Observatories
Authors
Abstract
With the growing number and increasing availability of shared-use instruments and observatories, observational data is becoming an essential part of application workflows and a contributor to scientific discoveries across a range of disciplines. However, the corresponding growth in the number of users accessing these facilities, coupled with the expansion in the scale and variety of the data, is making it challenging for these facilities to ensure that their data can be accessed, integrated, and analyzed in a timely manner, and is placing significant demands on their cyberinfrastructure (CI). In this paper, we present the design of a push-based data delivery framework that leverages emerging in-network capabilities, along with data pre-fetching techniques based on a hybrid data management model. Specifically, we analyze data access traces for two large-scale observatories, the Ocean Observatories Initiative (OOI) and the Geodetic Facility for the Advancement of Geoscience (GAGE), to identify typical user access patterns and to develop a model that can be used for data pre-fetching. Furthermore, we evaluate our data pre-fetching model and the proposed framework using a simulation of the Virtual Data Collaboratory (VDC) platform, which provides in-network data staging and processing capabilities. The results demonstrate the framework's ability to significantly improve data delivery performance and reduce network traffic at the observatories' facilities.
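To make the idea of trace-driven pre-fetching concrete, the following is a minimal illustrative sketch and not the model developed in the paper. It assumes that each access trace is an ordered list of data-object identifiers requested by one user, and it uses a simple first-order transition count as the predictor. The function names `build_transition_counts` and `predict_next`, and the object identifiers in the sample traces, are hypothetical.

```python
from collections import Counter, defaultdict


def build_transition_counts(traces):
    """Count how often each object is requested immediately after another."""
    counts = defaultdict(Counter)
    for trace in traces:
        # Pair every request with the request that follows it in the same trace.
        for current, following in zip(trace, trace[1:]):
            counts[current][following] += 1
    return counts


def predict_next(counts, current, k=1):
    """Return up to k objects most often requested right after `current`."""
    if current not in counts:
        return []  # No history for this object, so nothing to pre-fetch.
    return [obj for obj, _ in counts[current].most_common(k)]


# Hypothetical traces: each inner list is one user's ordered requests.
traces = [
    ["ctd_2019_01", "ctd_2019_02", "ctd_2019_03"],
    ["ctd_2019_01", "ctd_2019_02", "adcp_2019_01"],
    ["ctd_2019_02", "ctd_2019_03"],
]

counts = build_transition_counts(traces)
prefetch_candidates = predict_next(counts, "ctd_2019_02", k=2)
```

In a push-based setting, the candidates returned by a predictor of this kind could be staged at an in-network location before the user asks for them. A realistic model would likely also account for user identity, spatial and temporal locality, and cache capacity, none of which this sketch attempts.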