当前位置: X-MOL 学术J Law Biosci › 论文详情
Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)
Policy-aware data lakes: a flexible approach to achieve legal interoperability for global research collaborations
Journal of Law and the Biosciences ( IF 2.5 ) Pub Date : 2020-08-19 , DOI: 10.1093/jlb/lsaa065
Adrian Thorogood 1
Affiliation  

A popular model for global scientific repositories is the data commons, which pools or connects many datasets alongside supporting infrastructure. A data commons must establish legally interoperability between datasets to ensure researchers can aggregate and reuse them. This is usually achieved by establishing a shared governance structure. Unfortunately, governance often takes years to negotiate and involves a trade-off between data inclusion and data availability. It can also be difficult for repositories to modify governance structures in response to changing scientific priorities, data sharing practices, or legal frameworks. This problem has been laid bare by the sudden shock of the COVID-19 pandemic. This paper proposes a rapid and flexible strategy for scientific repositories to achieve legal interoperability: the policy-aware data lake. This strategy draws on technical concepts of modularity, metadata, and data lakes. Datasets are treated as independent modules, which can be subject to distinctive legal requirements. Each module must, however, be described using standard legal metadata. This allows legally compatible datasets to be rapidly combined and made available on a just-in-time basis to certain researchers for certain purposes. Global scientific repositories increasingly need such flexibility to manage scientific, organizational, and legal complexity, and to improve their responsiveness to global pandemics.

中文翻译:

具备政策意识的数据湖:一种灵活的方法来实现全球研究合作的法律互操作性

全球科学资源库的一种流行模型是数据共享库,它在支持基础结构的基础上合并或连接许多数据集。数据共享者必须在数据集之间建立合法的互操作性,以确保研究人员可以汇总和重用它们。这通常是通过建立共享的治理结构来实现的。不幸的是,治理通常需要花费数年时间进行谈判,并且需要在数据包含和数据可用性之间进行权衡。对于不断变化的科学优先级,数据共享实践或法律框架,存储库也可能难以修改治理结构。COVID-19大流行的突然冲击暴露了这个问题。本文提出了一种用于科学存储库以实现法律互操作性的快速而灵活的策略:政策感知型数据湖。该策略借鉴了模块化,元数据和数据湖的技术概念。数据集被视为独立的模块,可能需要遵守特殊的法律要求。但是,必须使用标准法律元数据描述每个模块。这样可以将合法兼容的数据集快速组合起来,并为某些目的而及时提供给某些研究人员。全球科学资料库越来越需要这种灵活性来管理科学,组织和法律上的复杂性,并提高其对全球流行病的反应能力。这样可以将合法兼容的数据集快速组合起来,并为某些目的而及时提供给某些研究人员。全球科学资料库越来越需要这种灵活性来管理科学,组织和法律上的复杂性,并提高其对全球流行病的反应能力。这样可以将合法兼容的数据集快速组合起来,并为某些目的而及时提供给某些研究人员。全球科学资料库越来越需要这种灵活性来管理科学,组织和法律上的复杂性,并提高其对全球流行病的反应能力。
更新日期:2020-10-04
down
wechat
bug