Medical informatics labor market analysis using web crawling, web scraping, and text mining
International Journal of Medical Informatics (IF 4.9), Pub Date: 2021-04-08, DOI: 10.1016/j.ijmedinf.2021.104453
Jürgen Schedlbauer 1 , Georgios Raptis 1 , Bernd Ludwig 2

Objectives

The European University Association (EUA) defines “employability” as a major goal of higher education. Therefore, competence-based orientation is an important aspect of education. The representation of a standardized job profile in the field of medical informatics, which is based on the most common labor market requirements, is fundamental for identifying and conveying the learning goals corresponding to these competences.

Methods

To identify the most common requirements, we extracted 544 job advertisements from the German job portal STEPSTONE. We did this with a program we developed in R using the “rvest” package, combining web crawling, web extraction, and text mining. After removing duplicates and keeping only jobs that required a bachelor's degree, 147 job advertisements remained, from which we extracted qualification terms. We categorized the terms into six groups: professional expertise, soft skills, teamwork, processes, learning, and problem-solving abilities.
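
The filtering and term-extraction steps described above can be illustrated with a minimal sketch. It is written in Python rather than the authors' R/rvest program, which is not reproduced here, and it performs no web access. The sample records, the field names `title` and `text`, the stopword list, and the keyword test for "bachelor" are all hypothetical assumptions made for illustration only.

```python
import re
from collections import Counter

# Hypothetical sample records; the study's real data came from STEPSTONE.
ADS = [
    {"title": "Medical Informatics Specialist",
     "text": "Bachelor degree required. Programming experience with server systems."},
    # Same ad again, differing only in whitespace (a duplicate).
    {"title": "Medical Informatics Specialist",
     "text": "Bachelor degree required.  Programming experience with server systems."},
    {"title": "Senior Data Architect",
     "text": "PhD required. Project leadership experience."},
    {"title": "Clinical Systems Developer",
     "text": "Bachelor in medical informatics. Project experience, programming, teamwork."},
]

# Hypothetical stopword list; a real analysis would use a fuller one.
STOPWORDS = {"with", "in", "required", "degree", "bachelor"}


def normalize(text):
    """Lowercase and collapse whitespace so near-identical ads compare equal."""
    return re.sub(r"\s+", " ", text.lower()).strip()


def deduplicate(ads):
    """Keep the first occurrence of each (title, text) pair after normalization."""
    seen = set()
    unique = []
    for ad in ads:
        key = (normalize(ad["title"]), normalize(ad["text"]))
        if key not in seen:
            seen.add(key)
            unique.append(ad)
    return unique


def requires_bachelor(ad):
    """Assumed filter: the ad text mentions a bachelor's degree."""
    return re.search(r"\bbachelor", ad["text"], flags=re.IGNORECASE) is not None


def term_counts(ads, stopwords):
    """Count alphabetic tokens across all ads, skipping stopwords."""
    counts = Counter()
    for ad in ads:
        tokens = re.findall(r"[a-zäöüß]+", ad["text"].lower())
        counts.update(t for t in tokens if t not in stopwords)
    return counts


unique_ads = deduplicate(ADS)
bachelor_ads = [ad for ad in unique_ads if requires_bachelor(ad)]
counts = term_counts(bachelor_ads, STOPWORDS)

print(len(ADS), len(unique_ads), len(bachelor_ads))
print(counts.most_common(2))
```

In this sketch the order of operations mirrors the text: duplicates are dropped first, the degree filter is applied second, and terms are counted only on what remains.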

Results

The results showed that only 45% of the terms were related to professional expertise, while 55% were related to soft skills. Studies of employee soft skills have shown similar results. The most prevalent terms were programming, experience, project, and server. Our second major finding is the importance of experience, which further underlines how essential practical skills are.
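
A share such as "45% of the terms" can be computed from term counts and a term-to-category mapping. The following is a minimal Python sketch under stated assumptions: the counts, the mapping `TERM_CATEGORY`, and the function name `category_shares` are hypothetical, and the numbers are illustrative rather than the study's data.

```python
from collections import Counter

# Hypothetical mapping from extracted terms to the categories named in the text.
TERM_CATEGORY = {
    "programming": "professional expertise",
    "server": "professional expertise",
    "communication": "soft skills",
    "teamwork": "teamwork",
}

# Hypothetical term frequencies (not the study's results).
TERM_COUNTS = Counter({"programming": 3, "server": 1, "communication": 2, "teamwork": 2})


def category_shares(term_counts, term_category):
    """Return each category's share of all categorized term occurrences."""
    totals = Counter()
    for term, n in term_counts.items():
        category = term_category.get(term)
        if category is not None:  # uncategorized terms are ignored
            totals[category] += n
    grand_total = sum(totals.values())
    if grand_total == 0:
        return {}
    return {category: n / grand_total for category, n in totals.items()}


shares = category_shares(TERM_COUNTS, TERM_CATEGORY)
for category, share in sorted(shares.items()):
    print(f"{category}: {share:.0%}")
```

Whether shares are computed over term occurrences, as here, or over distinct terms is a design choice that the abstract does not specify.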

Conclusions

Previous studies used surveys and narrative descriptions. This is the first study to use web crawling, web extraction, and text mining. Our research shows that soft skills and specialist knowledge carry equal weight. The insights gained from this study may be of assistance in developing curricula for medical informatics.



Updated: 2021-04-13