当前位置: X-MOL 学术Earth Sci. Inform. › 论文详情
Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)
Enabling modern data discovery for atmospheric measurements
Earth Science Informatics ( IF 2.8 ) Pub Date : 2021-06-18 , DOI: 10.1007/s12145-021-00635-0
Kavya Guntupally , Kyle Dumas , Giri Prakash , Ranjeet Devarakonda , Wade Darnell , Maggie Davis , Richard Cederwall

The Atmospheric Radiation Measurement (ARM) user facility is a US Department of Energy Office of Science user facility that is managed and operated through a collaborative effort led by nine US Department of Energy national laboratories. The ARM Data Center, located at Oak Ridge National Laboratory, is responsible for the timely collection, processing, and delivery of data products to the scientific community. The ARM Data Center holds more than 11,000 data products, including metadata collected from field campaigns, instruments, value-added products, and principal investigator–contributed data. These data sets are checked for successful transfer (for most data, this transfer is carried out automatically via the network; however, some of the largest data sets and some of the most remote sites require manual shipping of hard disks) and both the data and metadata are processed to a standard format, which is an ARM-standardized structure, via the Network Common Data Form. The Network Common Data Form is a self-describing binary format with many compatible software tools. Once processed, the data are cataloged, stored in the ARM Data Archive, and made discoverable through association with an array of metadata-characterizing information, such as location and measurement classification. These metadata enable powerful search capabilities through the ARM Data Center Data Discovery interface. This paper discusses the workflow of how the new discovery system has been redesigned from user requirements and how the data are distributed to the scientific community.



中文翻译:

为大气测量启用现代数据发现

大气辐射测量 (ARM) 用户设施是美国能源部科学办公室的用户设施,由九个美国能源部国家实验室牵头的协作努力进行管理和运营。ARM 数据中心位于橡树岭国家实验室,负责及时收集、处理数据产品并将其交付给科学界。ARM 数据中心拥有超过 11,000 种数据产品,包括从现场活动、仪器、增值产品和首席研究员提供的数据中收集的元数据。检查这些数据集是否成功传输(对于大多数数据,此传输是通过网络自动执行的;但是,一些最大的数据集和一些最远程的站点需要手动运送硬盘),并且数据和元数据都通过网络公共数据表格处理为标准格式,这是一种 ARM 标准化结构。网络公共数据表格是一种自描述二进制格式,具有许多兼容的软件工具。处理后,数据被编目,存储在 ARM 数据存档中,并通过与一系列元数据特征信息(例如位置和测量分类)相关联而被发现。这些元数据通过 ARM 数据中心数据发现界面启用强大的搜索功能。本文讨论了如何根据用户需求重新设计新发现系统以及如何将数据分发给科学界的工作流程。

更新日期:2021-06-18
down
wechat
bug