BD2K Training Coordinating Center's ERuDIte: the Educational Resource Discovery Index for Data Science
IEEE Transactions on Emerging Topics in Computing (IF 5.1), Pub Date: 2019-01-01, DOI: 10.1109/tetc.2019.2903466
José Luis Ambite 1, Lily Fierro 1, Jonathan Gordon 1, Gully A Burns 1, Florian Geigl 2, Kristina Lerman 1, John D Van Horn 3

Data science has developed to enable efficient integration and analysis of increasingly large data sets in many domains. In particular, big data in genetics, neuroimaging, mobile health, and other subfields of biomedical science promises new insights but also poses challenges. To address these challenges, the National Institutes of Health launched the Big Data to Knowledge (BD2K) initiative, including a Training Coordinating Center (TCC) tasked with developing a resource for personalized data science training for biomedical researchers. The BD2K TCC web portal is powered by ERuDIte, the Educational Resource Discovery Index, which collects training resources for data science, including online courses, videos of tutorials and research talks, textbooks, and other web-based materials. While the availability of so many potential learning resources is exciting, the resources are highly heterogeneous in quality, difficulty, format, and topic, making the field intimidating to enter and difficult to navigate. Moreover, data science is rapidly evolving, so there is a constant influx of new materials and concepts. We leverage data science techniques to build ERuDIte itself, using data extraction, data integration, machine learning, information retrieval, and natural language processing to automatically collect, integrate, describe, and organize existing online resources for learning data science.

Updated: 2019-01-01