当前位置: X-MOL 学术Journal of Data and Information Science › 论文详情
Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)
Priorities for Social and Humanities Projects Based on Text Analysis①
Journal of Data and Information Science ( IF 1.5 ) Pub Date : 2020-09-21 , DOI: 10.2478/jdis-2020-0036
Ülle Must 1
Affiliation  

Abstract Purpose Changes in the world show that the role, importance, and coherence of SSH (social sciences and the humanities) will increase significantly in the coming years. This paper aims to monitor and analyze the evolution (or overlapping) of the SSH thematic pattern through three funding instruments since 2007. Design/methodology/approach The goal of the paper is to check to what extent the EU Framework Program (FP) affects/does not affect research on national level, and to highlight hot topics from a given period with the help of text analysis. Funded project titles and abstracts derived from the EU FP, Slovenian, and Estonian RIS were used. The final analysis and comparisons between different datasets were made based on the 200 most frequent words. After removing punctuation marks, numeric values, articles, prepositions, conjunctions, and auxiliary verbs, 4,854 unique words in ETIS, 4,421 unique words in the Slovenian Research Information System (SICRIS), and 3,950 unique words in FP were identified. Findings Across all funding instruments, about a quarter of the top words constitute half of the word occurrences. The text analysis results show that in the majority of cases words do not overlap between FP and nationally funded projects. In some cases, it may be due to using different vocabulary. There is more overlapping between words in the case of Slovenia (SL) and Estonia (EE) and less in the case of Estonia and EU Framework Programmes (FP). At the same time, overlapping words indicate a wider reach (culture, education, social, history, human, innovation, etc.). In nationally funded projects (bottom-up), it was relatively difficult to observe the change in thematic trends over time. More specific results emerged from the comparison of the different programs throughout FP (top-down). Research limitations Only projects with English titles and abstracts were analyzed. Practical implications The specifics of SSH have to take into account—the one-to-one meaning of terms/words is not as important as, for example, in the exact sciences. Thus, even in co-word analysis, the final content may go unnoticed. Originality/value This was the first attempt to monitor the trends of SSH projects using text analysis. The text analysis of the SSH projects of the two new EU Member States used in the study showed that SSH's thematic coverage is not much affected by the EU Framework Program. Whether this result is field-specific or country-specific should be shown in the following study, which targets SSH projects in the so-called old Member States.

中文翻译:

基于文本分析的社会人文项目优先事项①

摘要目的世界的变化表明,SSH(社会科学和人文科学)的作用、重要性和连贯性将在未来几年显着增加。本文旨在监测和分析自 2007 年以来通过三种资助工具的 SSH 主题模式的演变(或重叠)。 设计/方法/方法 本文的目标是检查欧盟框架计划 (FP) 在多大程度上影响/不影响国家层面的研究,并借助文本分析突出特定时期的热点。使用了来自欧盟 FP、斯洛文尼亚语和爱沙尼亚语 RIS 的资助项目名称和摘要。不同数据集之间的最终分析和比较是基于 200 个最常用的单词进行的。在去除标点符号、数值、冠词、介词、连词和助动词后,识别出 ETIS 中的 4,854 个唯一词、斯洛文尼亚研究信息系统 (SICRIS) 中的 4,421 个唯一词和 FP 中的 3,950 个唯一词。结果 在所有资助工具中,大约四分之一的热门单词构成了单词出现的一半。文本分析结果表明,在大多数情况下,FP 和国家资助项目之间的词语不重叠。在某些情况下,可能是由于使用了不同的词汇。在斯洛文尼亚 (SL) 和爱沙尼亚 (EE) 的情况下,单词之间的重叠更多,而在爱沙尼亚和欧盟框架计划 (FP) 的情况下,单词之间的重叠较少。同时,重叠的词表示更广泛的范围(文化、教育、社会、历史、人文、创新等)。在国家资助的项目中(自下而上),随着时间的推移,观察主题趋势的变化相对困难。更具体的结果来自于整个 FP(自上而下)中不同程序的比较。研究局限 只分析了带有英文标题和摘要的项目。实际意义 必须考虑 SSH 的细节——术语/词的一对一含义并不重要,例如,在精确科学中。因此,即使在共词分析中,最终内容也可能被忽视。原创性/价值 这是第一次尝试使用文本分析来监控 SSH 项目的趋势。研究中使用的两个欧盟新成员国的 SSH 项目的文本分析表明,SSH 的主题覆盖范围受欧盟框架计划的影响不大。这个结果是针对特定领域还是针对特定国家的,应在以下研究中显示,该研究针对所谓的旧成员国的 SSH 项目。研究中使用的两个欧盟新成员国的 SSH 项目的文本分析表明,SSH 的主题覆盖范围受欧盟框架计划的影响不大。这个结果是针对特定领域还是针对特定国家的,应在以下研究中显示,该研究针对所谓的旧成员国的 SSH 项目。研究中使用的两个欧盟新成员国的 SSH 项目的文本分析表明,SSH 的主题覆盖范围受欧盟框架计划的影响不大。这个结果是针对特定领域还是针对特定国家的,应在以下研究中显示,该研究针对所谓的旧成员国的 SSH 项目。
更新日期:2020-09-21
down
wechat
bug