当前位置: X-MOL 学术Annu. Rev. Stat. Appl. › 论文详情
Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)
Perspective on Data Science
Annual Review of Statistics and Its Application ( IF 7.9 ) Pub Date : 2022-03-07 , DOI: 10.1146/annurev-statistics-040220-013917
Roger D. Peng 1 , Hilary S. Parker 2
Affiliation  

The field of data science currently enjoys a broad definition that includes a wide array of activities which borrow from many other established fields of study. Having such a vague characterization of a field in the early stages might be natural, but over time maintaining such a broad definition becomes unwieldy and impedes progress. In particular, the teaching of data science is hampered by the seeming need to cover many different points of interest. Data scientists must ultimately identify the core of the field by determining what makes the field unique and what it means to develop new knowledge in data science. In this review we attempt to distill some core ideas from data science by focusing on the iterative process of data analysis and develop some generalizations from past experience. Generalizations of this nature could form the basis of a theory of data science and would serve to unify and scale the teaching of data science to large audiences.

中文翻译:


数据科学的观点

数据科学领域目前享有广泛的定义,其中包括从许多其他既定研究领域借来的广泛活动。在早期阶段对一个领域进行如此模糊的描述可能是很自然的,但随着时间的推移,保持如此广泛的定义变得笨拙并阻碍了进展。特别是,数据科学的教学因似乎需要涵盖许多不同的兴趣点而受到阻碍。数据科学家最终必须通过确定该领域的独特之处以及开发数据科学新知识的意义来确定该领域的核心。在这篇综述中,我们试图通过关注数据分析的迭代过程,从数据科学中提炼出一些核心思想,并从过去的经验中总结出一些概括。

更新日期:2022-03-07
down
wechat
bug