当前位置: X-MOL 学术IEEE Trans. Softw. Eng. › 论文详情
Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)
Automatically Categorizing Software Technologies
IEEE Transactions on Software Engineering ( IF 6.5 ) Pub Date : 2020-01-01 , DOI: 10.1109/tse.2018.2836450
Mathieu Nassif , Christoph Treude , Martin P. Robillard

Informal language and the absence of a standard taxonomy for software technologies make it difficult to reliably analyze technology trends on discussion forums and other on-line venues. We propose an automated approach called $\mathrm{Witt}$ Witt for the categorization of software technologies (an expanded version of the hypernym discovery problem). $\mathrm{Witt}$ Witt takes as input a phrase describing a software technology or concept and returns a general category that describes it (e.g., integrated development environment), along with attributes that further qualify it (commercial, php, etc.). By extension, the approach enables the dynamic creation of lists of all technologies of a given type (e.g., web application frameworks). Our approach relies on Stack Overflow and Wikipedia, and involves numerous original domain adaptations and a new solution to the problem of normalizing automatically-detected hypernyms. We compared $\mathrm{Witt}$ Witt with six independent taxonomy tools and found that, when applied to software terms, $\mathrm{Witt}$ Witt demonstrated better coverage than all evaluated alternative solutions, without a corresponding degradation in false positive rate.

中文翻译:

自动分类软件技术

非正式语言和软件技术标准分类法的缺乏使得在论坛和其他在线场所可靠地分析技术趋势变得困难。我们提出了一种自动化方法,称为$\mathrm{Witt}$ 威特 用于软件技术的分类(上位词发现问题的扩展版本)。 $\mathrm{Witt}$ 威特 将描述软件技术或概念的短语作为输入,并返回描述它的一般类别(例如,集成开发环境),以及进一步限定它的属性(商业、php 等)。通过扩展,该方法能够动态创建给定类型(例如,Web 应用程序框架)的所有技术的列表。我们的方法依赖于 Stack Overflow 和 Wikipedia,并涉及许多原始域改编和对自动检测到的上位词进行归一化问题的新解决方案。我们比较了$\mathrm{Witt}$ 威特 使用六个独立的分类工具并发现,当应用于软件术语时, $\mathrm{Witt}$ 威特 表现出比所有评估的替代解决方案更好的覆盖率,而假阳性率没有相应的下降。
更新日期:2020-01-01
down
wechat
bug